JPH0554192A

JPH0554192A - Recognition character correction method

Info

Publication number: JPH0554192A
Application number: JP3213466A
Authority: JP
Inventors: Tamotsu Maeda; 保前田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1991-08-26
Filing date: 1991-08-26
Publication date: 1993-03-05
Anticipated expiration: 2015-11-27
Also published as: JP3111522B2

Abstract

PURPOSE: To provide a recognized character correction method that reliably corrects errors in the character code relating to the character size (uppercase or lowercase form) of a character recognized by a character recognition device. CONSTITUTION: A character cutout unit 2 cuts out characters from a document pattern read by a character input unit 1. A character recognition unit 3 recognizes each character, and a maximum side calculation unit 7 calculates the maximum vertical size and the maximum horizontal size among the characters. For a character that must be judged as uppercase or lowercase, a character size determination unit 4 compares the length of the corresponding side of that character with the maximum value and decides whether the character is uppercase or lowercase. When this decision differs from the result of the character recognition unit 3, a correction unit 5 corrects the character code to the decided form.

Description

Detailed Description of the Invention

[0001]

[Industrial Field of Application] The present invention relates to a recognized character correction method that corrects errors relating to the character size of recognized characters when a character recognition device reads a character pattern and recognizes characters from it.

[0002]

[Prior Art] In recent years, character recognition devices have been widely introduced as input terminals for various kinds of equipment and are increasingly being put to practical use. The conventional method of correcting the size of recognized characters in such a device takes the vertical or horizontal length of the character pattern as a reference: if this length is smaller than a predetermined value, the character is classified as lowercase (a small kana), and if it is larger, the character is classified as uppercase (a full-size kana).
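The conventional rule described above amounts to a single fixed threshold on one side length. A minimal sketch follows, assuming a hypothetical function name and caller-supplied threshold; the source gives neither a concrete threshold nor the handling of a length exactly equal to it, so treating equality as uppercase is an assumption.

```python
def classify_by_fixed_threshold(side_length: float, threshold: float) -> str:
    """Conventional rule: compare one side length with a fixed threshold.

    `threshold` is a hypothetical tuning value; the source states no number.
    A length equal to the threshold is treated as uppercase (assumption).
    """
    return "lowercase" if side_length < threshold else "uppercase"
```

Because the threshold does not depend on the font, a small kana set in a large-bodied font can exceed it, which is the weakness the invention addresses.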

[0003]

[Problem to Be Solved by the Invention] However, while the conventional method above is effective to some extent when the target typefaces or fonts are restricted, correction becomes difficult when the range of targets is widened. This is explained with a concrete example.

[0004] FIG. 6 shows six kana characters (ャ, ヤ, ュ, ユ, ョ, ヨ) and four kanji (煙, 燕, 猿, 縁) in different typefaces and fonts but all at the same point size. Each column is set in a single font. In the figure, for example, c1 is the lowercase 「ュ」 and c2 is the uppercase 「ユ」. In this case the vertical length of the lowercase c1 is greater than that of the uppercase c2, so a conventional method that relies on size alone has difficulty correcting a recognition error in which the character code is wrong as to lowercase or uppercase.

[0005] In view of the above problem, the present invention aims to provide a recognized character correction method that reliably corrects errors in the character code relating to character size even when such errors occur during character recognition.

[0006]

[Means for Solving the Problem] To achieve the above object, the recognized character correction method of the present invention, when recognizing characters from a document pattern, calculates a first maximum value, which is the greatest vertical length among the characters of the recognized character string, and a second maximum value, which is the greatest horizontal length among them. For each recognized character that has both an uppercase and a lowercase form, the method compares the length of the character's longer side, vertical or horizontal, with the corresponding first or second maximum value, and thereby determines whether the character code relating to character size should be uppercase or lowercase.
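The calculation of the first and second maximum values can be sketched as follows. This is an illustrative sketch only: the function name `max_sides` and the representation of each character pattern as a `(height, width)` pair are assumptions, not something the source specifies.

```python
from typing import Iterable, Tuple


def max_sides(sizes: Iterable[Tuple[float, float]]) -> Tuple[float, float]:
    """Return (M, N) for one character string.

    `sizes` holds one (vertical length, horizontal length) pair per
    character pattern. M is the first maximum value (vertical) and
    N is the second maximum value (horizontal).
    """
    pairs = list(sizes)
    if not pairs:
        raise ValueError("character string contains no character patterns")
    M = max(height for height, _ in pairs)
    N = max(width for _, width in pairs)
    return M, N
```

Note that M and N may come from different characters in the string, since each maximum is taken independently.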

[0007]

[Operation] With the above procedure, the recognized character correction method of the present invention obtains the first and second maximum values, which are the maximum vertical and horizontal sizes of the characters in the character string, and compares them with the longer side of a kana character that has both uppercase and lowercase forms. This allows uppercase and lowercase to be discriminated accurately in a way that takes account of the characteristics of the typeface or font.

[0008] The principle of the present invention is explained below with reference to the drawings. FIG. 6 shows six kana characters that have both uppercase and lowercase forms (ャ, ヤ, ュ, ユ, ョ, ヨ) and four kanji (煙, 燕, 猿, 縁), in different typefaces and fonts but all at the same point size. Each column is set in a single typeface or font, that is, the same font. The following observations can be made.

[0009] (1) Within the same font, an uppercase kana is always larger than the corresponding lowercase kana.
(2) Within the same font, the uppercase and lowercase forms of a kana have nearly the same ratio of vertical to horizontal length (hereinafter, aspect ratio).

[0010] (3) Within the same font, the long sides of an uppercase kana and of a kanji are nearly equal.

The present invention classifies characters of differing typefaces and fonts effectively by using properties (1), (2), and (3). That is, when the input character is known to be a kana that may be either uppercase or lowercase (for example, 「っ」 and 「つ」), uppercase or lowercase is determined from the area of the input character relative to the area of the uppercase form, which is estimated from the vertical and horizontal lengths obtained from this kana and from a character presumed to be a kanji. Specifically, because the lowercase kana has nearly the same aspect ratio as the uppercase kana, the maximum vertical length and the maximum horizontal length are obtained over all character patterns, kanji included, in the character string to which the input character belongs.

[0011] Let M and N be the vertical and horizontal lengths of a kanji, and a and b the vertical and horizontal lengths of the input kana. The vertical and horizontal lengths of the kanji are then also the maxima, so the first maximum value is M and the second maximum value is N. To decide whether the kana is uppercase or lowercase when a < b, the area of the uppercase kana is estimated as the area a·b of the input kana multiplied by (N/b)·(N/b). Taking the ratio of the long sides in this way normalizes the characters to the same font: by property (3) the long side of the uppercase kana is close to N, and by property (2) the aspect ratio is unchanged, so the result equals the area ratio of the character sizes. The ratio of the area of the input kana to that of the uppercase kana is therefore (b/N)·(b/N). Similarly, when a ≥ b, the ratio is (a/M)·(a/M). Uppercase or lowercase can be determined from this value.
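The area-ratio rule of this paragraph can be written compactly. The following is a minimal sketch; the function name `area_ratio` is hypothetical, while the symbols a, b, M, and N follow the text.

```python
def area_ratio(a: float, b: float, M: float, N: float) -> float:
    """Estimated ratio of the input kana's area to the uppercase kana's area.

    a, b: vertical and horizontal lengths of the input kana.
    M, N: first (vertical) and second (horizontal) maximum values.
    The longer side of the input is compared with the maximum for the
    same direction, and the ratio is squared to give an area ratio.
    """
    if a >= b:
        return (a / M) ** 2
    return (b / N) ** 2
```

A value near 1 suggests the input is already as large as a full-size character in the same string, while a clearly smaller value suggests a lowercase kana; the cut-off itself is a per-character threshold held in a dictionary, as the embodiment describes.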

[0012]

[Embodiment] FIG. 1 is a functional block diagram of a character recognition device in one embodiment that uses the recognized character correction method of the present invention. As shown in FIG. 1, the device comprises the following components. Reference numeral 1 denotes a character input unit that photoelectrically converts the document pattern to be recognized and stores it in a document pattern memory as binarized data. 2 denotes a character cutout unit that cuts out characters from the binarized data. 3 denotes a character recognition unit that outputs the character code corresponding to each character pattern from the character cutout unit 2. 4 denotes a character size determination unit: when a character recognized by the character recognition unit 3 is one that has both uppercase and lowercase forms, this unit determines whether the character pattern is uppercase or lowercase from the maximum vertical and horizontal lengths of the character patterns in the same character string, supplied by the character cutout unit 2, and from the vertical and horizontal lengths of the input pattern. 5 denotes a correction unit that corrects the recognized character from the character recognition unit 3 when the result of the character size determination unit 4 shows its character size to be wrong. 6 denotes a display unit that outputs the result of the correction unit 5. 7 denotes a maximum side calculation unit that obtains, from the vertical and horizontal lengths of all characters in the character string obtained from the character cutout unit 2, the first maximum value for the vertical direction and the second maximum value for the horizontal direction.

[0013] FIG. 2 is a block diagram showing the configuration of the character recognition device of this embodiment. Reference numeral 21 denotes a scanner that reads the document to be recognized and outputs it as bit data. 22 denotes a readable and writable memory (hereinafter, RAM), which contains a document pattern area 23 that stores the bit data from the scanner 21; a character size area 27 that stores the sizes of the character patterns contained in a character string cut out from the document pattern in the document pattern area 23; a maximum side area 25 that stores the maximum vertical and horizontal lengths found in the character size area 27; a character pattern area 24 that stores character patterns; a character code area 26 that stores the character codes obtained by recognizing the character patterns; and a register area 28 used during processing. 29 denotes a read-only memory (hereinafter, ROM), which contains a character size dictionary area 30 that stores uppercase and lowercase character codes together with a threshold specific to each character code, and a program storage area 31 that stores a control program following the flowchart shown in FIG. 3. 32 denotes a processing circuit that performs processing according to the control program stored in the program storage area 31. 33 denotes a keyboard for entering data, and 34 denotes a display unit that displays the character patterns in the character pattern area 24 and the character codes in the character code area 26, or the character fonts corresponding to them.

[0014] The recognized character correction method of the present invention, using the character recognition device of this embodiment configured as above, is explained following the flowchart of FIG. 3. First, in step S1, the character cutout unit 2 cuts out characters from the document pattern read in by the character input unit 1, and the vertical and horizontal lengths of all character patterns in the same character string are written to the character size area 27. In step S2, the maximum side calculation unit 7 finds the first maximum value M and the second maximum value N, the maxima of the vertical and horizontal lengths in the character size area 27, and saves them in the maximum side area 25. In step S3, the character recognition unit 3 performs character recognition and stores the recognized characters and their character codes in the character code area 26. In step S4, it is determined whether the recognized character in the character code area 26 is registered in the character size dictionary area 30. If it is registered, the character has a lowercase form, so character size determination is performed from step S5 onward; otherwise processing ends. In step S5, the vertical length a and the horizontal length b of the input character pattern are compared; processing proceeds to step S6 when a ≥ b and to step S7 when a < b. Step S6 computes the square of (a/M), and step S7 computes the square of (b/N). In step S8, the character is judged lowercase when the result of step S6 or S7 is smaller than the predetermined value registered in the character size dictionary area 30, and uppercase when it is larger. Finally, in step S9, when this judgment differs from the character code output by the character recognition unit 3, that is, when the character size of the recognized character was wrong, the correction unit 5 corrects the contents of the character code area 26.
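Steps S2 and S4 through S9 of the flowchart can be sketched for a single character string as below. This is a sketch under stated assumptions, not the patented implementation: the `CharPattern` type, the `correct_sizes` function, and the dictionary layout (either form of a kana mapped to its uppercase form, lowercase form, and threshold) are hypothetical, and the source does not say how a value exactly equal to the threshold is handled, so it is treated as uppercase here.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class CharPattern:
    code: str      # character code produced by recognition (step S3)
    height: float  # vertical length a
    width: float   # horizontal length b


# Hypothetical stand-in for the character size dictionary area 30:
# code -> (uppercase form, lowercase form, threshold).
SizeDict = Dict[str, Tuple[str, str, float]]


def correct_sizes(chars: List[CharPattern], size_dict: SizeDict) -> str:
    """Return the character string with size errors corrected."""
    if not chars:
        return ""
    # S2: first and second maximum values over the whole string.
    M = max(c.height for c in chars)
    N = max(c.width for c in chars)
    out = []
    for c in chars:
        entry = size_dict.get(c.code)  # S4: is the character registered?
        if entry is None:
            out.append(c.code)         # no lowercase form: leave unchanged
            continue
        upper, lower, threshold = entry
        # S5 to S7: square the ratio of the longer side to its maximum.
        if c.height >= c.width:
            ratio = (c.height / M) ** 2
        else:
            ratio = (c.width / N) ** 2
        # S8: compare with the per-character threshold.
        # S9: emitting the decided form corrects any mismatch.
        out.append(lower if ratio < threshold else upper)
    return "".join(out)
```

Storing both forms of each kana as dictionary keys lets the same lookup serve whether the recognizer reported the uppercase or the lowercase code.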

[0015] The operation is now described concretely, taking the character string pattern 「ナショナルエレクトリック」 as the example to be recognized. First, the character input unit 1 inputs the character string pattern as a binary image and stores it in the document pattern area 23. Next, the character cutout unit 2 cuts out the character string and then separates it into individual characters, at which point the vertical and horizontal lengths of all character patterns in the same character string are written to the character size area 27. FIG. 4 shows the character size area 27 at this point. The maximum side calculation unit 7 finds that 「ナ」 has the greatest vertical length and 「ル」 the greatest horizontal length, giving M = 60 and N = 122 respectively. Suppose the character recognition unit 3 recognizes the example string 「ナショナルエレクトリック」 as 「ナシヨナルェレクトリツク」. The character size determination unit 4 first determines whether each recognized character is one that has both uppercase and lowercase forms. Here three characters qualify: 「ヨ」, 「ェ」, and 「ツ」. For 「ヨ」 the vertical length exceeds the horizontal, so the square of (a/M) is computed; for 「ェ」 and 「ツ」 the horizontal length is the longer, so the square of (b/N) is computed. The results are shown in FIG. 4. The character size dictionary area 30 stores the uppercase form, the lowercase form, and a predetermined value for each character, as shown in FIG. 5; if the value computed for the input character is larger than the value stored for the corresponding character, it is judged uppercase, and if smaller, lowercase. Accordingly 「ヨ」 is judged to be lowercase, 「ェ」 uppercase, and 「ツ」 lowercase, and the recognized characters shown in the bottom row of FIG. 4 are obtained. For all three characters the correction unit 5 corrects the character code, since the character size of the recognized character was wrong.
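The correction step for this example can be sketched as follows, taking the three judgments stated in the text as given. The helper name `apply_decisions` and the `PAIRS` table are hypothetical; the measured sizes and thresholds of FIG. 4 and FIG. 5 are not reproduced in the text, so they are not modeled here.

```python
from typing import Dict

# Uppercase/lowercase forms of the three kana that appear in the example.
PAIRS = {
    "ヨ": ("ヨ", "ョ"), "ョ": ("ヨ", "ョ"),
    "エ": ("エ", "ェ"), "ェ": ("エ", "ェ"),
    "ツ": ("ツ", "ッ"), "ッ": ("ツ", "ッ"),
}


def apply_decisions(recognized: str, decisions: Dict[int, str]) -> str:
    """Rewrite characters according to size decisions.

    `decisions` maps a character index to "uppercase" or "lowercase".
    """
    out = list(recognized)
    for index, size in decisions.items():
        upper, lower = PAIRS[out[index]]
        out[index] = upper if size == "uppercase" else lower
    return "".join(out)


recognized = "ナシヨナルェレクトリツク"
# Judgments stated in the text: ヨ lowercase, ェ uppercase, ツ lowercase.
decisions = {2: "lowercase", 5: "uppercase", 10: "lowercase"}
corrected = apply_decisions(recognized, decisions)
```

With these judgments, `corrected` matches the intended string 「ナショナルエレクトリック」.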

[0016] As described above, according to this embodiment, providing the maximum side calculation unit, the character size determination unit, and the correction unit makes it possible to correct character size errors among the recognized characters output from the character recognition unit accurately, even for horizontally double-width characters.

[0017] Although this embodiment has been described for kanji and kana, the same effect is naturally obtained for alphabetic and other characters, provided the typeface can be determined.

[0018]

[Effects of the Invention] As is clear from the above description, by comparing character sizes against the maximum vertical and horizontal side lengths of the recognized characters, the recognized character correction method of the present invention can accurately correct errors relating to the character size of recognized characters even for special typefaces and fonts such as horizontally widened characters.

[Brief description of drawings]

[FIG. 1] Configuration diagram showing, by function, an embodiment of the recognized character correction method of the present invention

[FIG. 2] Block diagram of a character recognition device showing the same embodiment as a hardware configuration

[FIG. 3] Flowchart showing the procedure of the same embodiment

[FIG. 4] Explanatory diagram showing the procedure of the same embodiment for a specific character string

[FIG. 5] Layout of the contents stored in the character size dictionary area of the same embodiment

[FIG. 6] Character pattern diagram for explaining the conventional recognized character correction method

[Explanation of symbols]

1 Character input unit
2 Character cutout unit
3 Character recognition unit
4 Character size determination unit
5 Correction unit
6 Display unit
7 Maximum side calculation unit

Claims

[Claims]

1. A recognized character correction method comprising: reading a document pattern to be recognized; cutting out characters from the document pattern; calculating a first maximum value, which is the maximum vertical size, and a second maximum value, which is the maximum horizontal size, among the character patterns in the character string containing the cut-out characters; obtaining a character code from each character pattern; determining whether the character code is one that may be either uppercase or lowercase; if so, comparing the vertical length and the horizontal length of the character pattern; when the vertical length is the longer, determining uppercase or lowercase from a value calculated from the vertical length and the first maximum value; when the horizontal length is the longer, determining uppercase or lowercase from a value calculated from the horizontal length and the second maximum value; and correcting the character code when the determination differs from the character code.

2. The recognized character correction method according to claim 1, wherein uppercase or lowercase is determined on the basis of the area ratio relative to the character pattern having the first or second maximum value.