JPH1166234A

JPH1166234A - Image-processing method, record medium recorded with the same and image processor thereof

Info

Publication number: JPH1166234A
Application number: JP9230896A
Authority: JP
Inventors: Nobuo Miyamoto; 信夫宮本; Teruo Akiyama; 照雄秋山; Kenji Ogura; 健司小倉
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1997-08-27
Filing date: 1997-08-27
Publication date: 1999-03-09
Anticipated expiration: 2017-08-27
Also published as: JP3368184B2

Abstract

PROBLEM TO BE SOLVED: To provide a method and a processor for image processing which can accurately put document images one over the other at a high speed. SOLUTION: A character recognition part 3 recognizes characters present at an overlap part of a partial document image, and a character code matching part 4 matches them for extracting the character code string having a maximum number of matching characters. A calculation part 5 for the quantity of displacement between partial document images finds the quantity of displacement between a couple of partial document images from the difference between character circumscribed rectangle coordinates calculated by a character circumscribed rectangle calculation part 2, and a connecting image composition part 6 puts them one over the other according to the displacement quantity. A character pattern which can have its characters recognized is normally of a size tens of pixels by tens of pixels large, so that the superposition position of the images can be found very rapidly, so that speedy superposition can be actualized. Further, the same character string will hardly appears in a document and the effects of periodic patterns can be eliminated, so that the precision of the superposition can be improved.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、重畳部分を持つ複
数の部分文書画像から連結した文書画像を合成する画像
処理方法及び装置に関するものである。[0001] 1. Field of the Invention [0002] The present invention relates to an image processing method and apparatus for synthesizing a combined document image from a plurality of partial document images having a superimposed portion.

【０００２】[0002]

【従来の技術】画像を重ね合わせる方法として従来より
提案されている方法は、画像の重畳部分を少しずつずら
しながら対応する画素間の差の総和（残差）が最小とな
る位置を求める方法が一般的である。すなわち、２つの
部分文書画像をＦ（ｘ，ｙ）、Ｇ（ｘ，ｙ）（ｘ＝１，
…，Ｘ，ｙ＝１，…，Ｙ）とするとき、両文書画像を上
下にｉ画素、左右にｊ画素、相対的にずらしたときの残
差Ｒｉｊは次式で与えられる。2. Description of the Related Art Conventionally, as a method of superimposing images, a method of obtaining a position where the sum of the differences (residuals) between corresponding pixels is minimized while shifting the superimposed portion of the image little by little. General. That is, two partial document images are represented by F (x, y) and G (x, y) (x = 1,
, X, y = 1,..., Y), the residual Rij when both document images are vertically shifted by i pixels and left and right by j pixels is given by the following equation.

【０００３】Ｒｉｊ＝Σ_S│Ｆ（ｘ，ｙ）−Ｇ（ｘ−ｉ，ｙ−ｊ）│ ここで、Ｓは重畳領域を表す。このＲｉｊをいろいろな
ｉ，ｊの組み合わせについて計算し、最小のＲｉｊを与
えるｉ，ｊの位置で文書画像を重ね合わせる。[0003] _{Rij = Σ S │F (x,} y) -G (x-i, y-j) │ where, S is representative of the overlap region. This Rij is calculated for various combinations of i and j, and the document image is superimposed at the position of i and j that gives the minimum Rij.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら従来の方
法では、計算量が膨大となるという問題があった。例え
ば、重畳部分が１００画素×１００画素、探索範囲が５
０画素×５０画素の場合であっても、減算、絶対値演
算、加算をそれぞれ１００００回行う操作を２５００回
繰り返す必要がある。そのため、大きなサイズの文書画
像を扱う場合や、高速性を要する場合には適用が困難で
あった。また、背景がｉ，ｊの探索範囲に比べて小さな
周期の模様の場合には、周期分だけずれた重ね合わせが
起こりやすいという問題もあった。However, the conventional method has a problem that the amount of calculation is enormous. For example, the overlapping portion is 100 pixels × 100 pixels, and the search range is 5
Even in the case of 0 pixels × 50 pixels, it is necessary to repeat the operation of performing subtraction, absolute value calculation and addition 10,000 times each 2500 times. Therefore, it is difficult to apply the method when handling a large-sized document image or when high speed is required. Further, in the case where the background has a pattern with a smaller cycle than the search range of i and j, there is a problem that the superposition shifted by the cycle is likely to occur.

【０００５】本発明は、上記事情に鑑みてなされたもの
で、その課題は、高速かつ正確に文書画像の重ね合わせ
を行える画像処理方法及び装置を提供することにある。SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and has as its object to provide an image processing method and apparatus which can quickly and accurately superimpose document images.

【０００６】[0006]

【課題を解決するための手段】本発明は、上記課題を解
決するため、以下の（１）〜（３）の発明を手段とす
る。Means for Solving the Problems In order to solve the above-mentioned problems, the present invention uses the following inventions (1) to (3).

【０００７】（１）重畳部分を持つ複数の部分文書画像
からこれらを連結した文書画像を合成する画像処理方法
であって、各部分文書画像中に存在する文字パタン毎
に、該文字パタンの外接矩形座標を算出する過程と、部
分文書画像毎に前記外接矩形で囲まれる各文字パタンの
認識を行い、部分文書画像毎の文字コード列を生成する
過程と、全ての部分文書画像間で前記文字コード列の照
合を行い、部分文書画像対毎に一致文字数が最大になる
ときの一致文字コード列を抽出する過程と、全ての部分
文書画像対について、前記一致文字コード列に属する文
字コードの部分文書画像内の外接矩形座標から部分文書
画像間の変位量を算出する過程と、全ての部分文書画像
対についての変位量を用いて連結文書画像を合成する過
程と、を具備することを特徴とする画像処理方法。(1) An image processing method for synthesizing a document image obtained by linking a plurality of partial document images having a superimposed portion, wherein a circumscribing of the character pattern is performed for each character pattern present in each partial document image. Calculating rectangular coordinates; recognizing each character pattern surrounded by the circumscribed rectangle for each partial document image to generate a character code string for each partial document image; A process of collating the code strings and extracting a matching character code string when the number of matching characters is maximized for each partial document image pair, and for all partial document image pairs, a portion of the character code belonging to the matching character code string A step of calculating a displacement amount between partial document images from circumscribed rectangular coordinates in the document image; and a step of combining linked document images using displacement amounts of all the partial document image pairs. Image processing method according to claim.

【０００８】（２）重畳部分を持つ複数の部分文書画像
からこれらを連結した文書画像を合成する画像処理方法
における、各部分文書画像中に存在する文字パタン毎
に、該文字パタンの外接矩形座標を算出する手順と、部
分文書画像毎に前記外接矩形で囲まれる各文字パタンの
認識を行い、部分文書画像毎の文字コード列を生成する
手順と、全ての部分文書画像間で前記文字コード列の照
合を行い、部分文書画像対毎に一致文字数が最大になる
ときの一致文字コード列を抽出する手順と、全ての部分
文書画像対について、前記一致文字コード列に属する文
字コードの部分文書画像内の外接矩形座標から部分文書
画像間の変位量を算出する手順と、全ての部分文書画像
対についての変位量を用いて連結文書画像を合成する手
順と、をコンピュータに実行させるプログラムを、該コ
ンピュータが読み取り可能な媒体に記録したことを特徴
とする画像処理方法を記録した記録媒体。(2) For each character pattern present in each partial document image in an image processing method for combining a plurality of partial document images having a superimposed portion and a document image obtained by concatenating them, the circumscribed rectangular coordinates of the character pattern And a procedure for recognizing each character pattern surrounded by the circumscribed rectangle for each partial document image and generating a character code string for each partial document image. And extracting a matching character code string when the number of matching characters is maximized for each partial document image pair; and for all partial document image pairs, a partial document image of a character code belonging to the matching character code string Calculating the amount of displacement between partial document images from the circumscribed rectangular coordinates inside the document and combining the connected document images using the amounts of displacement of all the partial document image pairs. Recording medium the program was recorded an image processing method characterized by the computer is recorded on a medium readable to execute.

【０００９】（３）重畳部分を持つ複数の部分文書画像
からこれらを連結した文書画像を合成する画像処理装置
であって、各部分文書画像中に存在する文字パタン毎
に、該文字パタンの外接矩形座標を算出する手段と、部
分文書画像毎に前記外接矩形で囲まれる各文字パタンの
認識を行い、部分文書画像毎の文字コード列を生成する
手段と、全ての部分文書画像間で前記文字コード列の照
合を行い、部分文書画像対毎に一致文字数が最大になる
ときの一致文字コード列を抽出する手段と、全ての部分
文書画像対について、前記一致文字コード列に属する文
字コードの部分文書画像内の外接矩形座標から部分文書
画像間の変位量を算出する手段と、全ての部分文書画像
対についての変位量を用いて連結文書画像を合成する手
段と、を具備することを特徴とする画像処理装置。(3) An image processing apparatus for combining a plurality of partial document images having a superimposed portion into a combined document image, and for each character pattern present in each partial document image, circumscribing the character pattern Means for calculating rectangular coordinates; means for recognizing each character pattern surrounded by the circumscribed rectangle for each partial document image to generate a character code string for each partial document image; Means for collating a code string and extracting a matching character code string when the number of matching characters is maximized for each partial document image pair, and a part of a character code belonging to the matching character code string for all partial document image pairs Means for calculating the amount of displacement between partial document images from the circumscribed rectangular coordinates in the document image, and means for synthesizing a connected document image using the amounts of displacement for all pairs of partial document images. The image processing apparatus according to claim.

【００１０】本発明では、文書画像中の文字情報を利用
して、部分文書画像の重畳部分に存在する文字を認識
し、文字コードレベルで位置の照合をとることにより、
高速かつ正確な重ね合わせを実現する。文字認識処理の
可能な文字パタンは通常、数十画素×数十画素程度の大
きさがある。そのため、例えば、１００画素×１００画
素の重畳部分の場合には、一般に数十文字しか含まれな
い。重畳部分に含まれる文字数を１０個と仮定すると、
文字コード照合回数は高々１０×１０回に過ぎず、極め
て高速に画像の重畳位置を見い出すことができ、高速に
重ね合わせを行うことができる。また、特別な文書を除
いて、文書中に同じ文字列が周期的に出現することは少
なく、また周期的な模様等の影響を排除できることか
ら、重ね合わせの精度を向上させることができる。According to the present invention, by utilizing character information in a document image, a character present in a superimposed portion of a partial document image is recognized, and the position is collated at a character code level.
Achieve high-speed and accurate overlay. A character pattern that can be subjected to character recognition processing usually has a size of about several tens of pixels × several tens of pixels. Therefore, for example, in the case of a superimposed portion of 100 pixels × 100 pixels, generally, only several tens of characters are included. Assuming that the number of characters included in the superimposed part is 10,
The number of times of character code collation is only 10 × 10 at most, and the superimposition position of the image can be found very quickly, and the superposition can be performed at high speed. Also, except for a special document, the same character string rarely appears periodically in the document, and the influence of a periodic pattern or the like can be eliminated, so that the overlay accuracy can be improved.

【００１１】[0011]

【発明の実施の形態】以下、本発明の実施形態例を図面
を参照して詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the drawings.

【００１２】図１は、本発明の一実施形態例の画像処理
装置のブロック図である。FIG. 1 is a block diagram of an image processing apparatus according to an embodiment of the present invention.

【００１３】本実施形態例の画像処理装置は、部分文書
画像格納部１と、文字外接矩形算出部２と、文字認識部
３と、文字コード列照合部４と、部分文書画像間変位量
算出部５と、連結文書画像合成部６と、連結文書画像格
納部７とで構成されている。図１において、破線で囲ん
だ部分、すなわち文字外接矩形算出部２と、文字認識部
３と、文字コード列照合部４と、部分文書画像間変位量
算出部５が、本発明で追加した部分である。The image processing apparatus according to the present embodiment includes a partial document image storage section 1, a character circumscribed rectangle calculation section 2, a character recognition section 3, a character code string collation section 4, and a displacement calculation between partial document images. It is composed of a unit 5, a connected document image synthesizing unit 6, and a connected document image storage unit 7. In FIG. 1, a portion enclosed by a broken line, that is, a portion added by the present invention to a character circumscribed rectangle calculation unit 2, a character recognition unit 3, a character code string collation unit 4, and a partial document image displacement amount calculation unit 5 It is.

【００１４】部分文書画像格納部１には、重畳部分を持
つ部分文書画像の集合が格納されている。The partial document image storage unit 1 stores a set of partial document images having a superimposed portion.

【００１５】文字外接矩形算出部２は、部分文書画像格
納部１に格納されている部分文書画像を１枚ずつ読み出
し、画像中に存在する文字パタンの各々について外接矩
形座標を算出する。The character circumscribed rectangle calculation unit 2 reads out the partial document images stored in the partial document image storage unit 1 one by one, and calculates the circumscribed rectangle coordinates for each of the character patterns existing in the image.

【００１６】文字認識部３は、文字外接矩形算出部２で
算出された外接矩形で囲まれる各文字パタンの認識を行
い、部分文書画像毎に文字コード列を生成する。The character recognition unit 3 recognizes each character pattern surrounded by the circumscribed rectangle calculated by the character circumscribed rectangle calculation unit 2 and generates a character code string for each partial document image.

【００１７】文字コード列照合部４は、全ての部分文書
画像の対について、文字認識部３で得られた文字コード
列の照合を行い、一致文字数が最大になるときの一致文
字コード列を抽出する。The character code string collating section 4 collates the character code strings obtained by the character recognizing section 3 for all the pairs of partial document images, and extracts a matching character code string when the number of matching characters is maximized. I do.

【００１８】部分文書画像間変位量算出部５は、部分文
書画像対毎に、一致文字コード列に属する文字コードの
両部分文書画像内における外接矩形座標の差分の平均値
から両部分文書画像の変位量を算出する。The partial document image displacement amount calculating section 5 calculates, for each partial document image pair, the average value of the difference between the circumscribed rectangular coordinates in both partial document images of the character codes belonging to the matching character code string, and calculates the Calculate the amount of displacement.

【００１９】連結文書画像合成部６は、部分文書画像間
変位量算出部５で算出された部分文書画像対毎の変位量
をもとに連結文書画像を合成し、連結文書画像格納部７
に格納する。The connected document image synthesizing section 6 synthesizes a connected document image based on the displacement amount of each partial document image calculated by the partial document image displacement amount calculating section 5, and generates a connected document image storage section 7.
To be stored.

【００２０】このように構成した画像処理装置の動作お
よび作用とともに、本発明の画像処理方法の一実施形態
例を説明する。図２〜図５は、図１に示した画像処理装
置の動作とともに、本発明の画像処理方法の一実施形態
例を示すフローチャートである。An embodiment of the image processing method according to the present invention will be described together with the operation and operation of the image processing apparatus configured as described above. FIGS. 2 to 5 are flowcharts showing the operation of the image processing apparatus shown in FIG. 1 and an embodiment of the image processing method of the present invention.

【００２１】まず、ステップ１０において、部分文書画
像を読み込み、ステップ１１で文字外接矩形座標を算出
する。First, in step 10, the partial document image is read, and in step 11, the coordinates of the circumscribed rectangle of the character are calculated.

【００２２】ステップ１２で外接矩形内の文字の認識を
行い、文字パタンを文字コードに変換する。この処理は
ステップ１３およびステップ１４の判定処理で示される
ように、全ての部分文書画像内の全ての文字パタンの認
識が完了するまで繰り返される。In step 12, characters in the circumscribed rectangle are recognized, and the character pattern is converted into a character code. This processing is repeated until the recognition of all the character patterns in all the partial document images is completed, as indicated by the determination processing in steps 13 and 14.

【００２３】続いてステップ１５からステップ５１まで
の第ｎ着目部分文書画像についての処理を行う。まずス
テップ１６でｎを１加算後、ステップ１７で第ｎ部分文
書画像の文字コード列Ａ₁Ａ₂…Ａ_pを読込む。続いて、
ステップ１８で第ｎ着目部分文書画像ｎに対する照合先
部分文書画像の番号ｍの初期値としてｎ＋１を設定し、
ステップ１９で第ｍ部分文書画像の文字コード列Ｂ₁Ｂ₂
…Ｂ_qを読込む。Subsequently, the processing for the n-th focused partial document image from step 15 to step 51 is performed. After first 1 adds n in step 16, it reads the character code string A ₁ A ₂ ... A _p of the n partial document image in step 17. continue,
In step 18, n + 1 is set as the initial value of the number m of the collation destination partial document image with respect to the nth focused partial document image n,
In step 19, the character code string B ₁ B ₂ of the m-th partial document image
... reads the B _q.

【００２４】ステップ２０で照合開始文字位置番号ｓに
０を設定後、ステップ２１でＡ_1+sＡ_2+s…Ａ_pとＢ₁Ｂ₂
…Ｂ_p-sの一致文字数ｋ₁（ｓ）を計数する。ステップ２
２でｓに１を加算し、ステップ２３でｓがｐ未満か否か
を判定する。ｓがｐ未満のときは、新たなｓについて一
致文字数ｋ₁（ｓ）の計数を繰り返す。[0024] After setting the 0 to the verification start character position number s in step _{_{20, A 1 + s A 2}} + s ... A p and B ₁ B ₂ in step 21
... The number k ₁ (s) of matching characters of B _ps is counted. Step 2
In step 2, 1 is added to s, and in step 23, it is determined whether s is less than p. If s is less than p, the counting of the number of matching characters k ₁ (s) is repeated for a new s.

【００２５】ｓがｐ以上になったときは、図３のステッ
プ２４で照合開始文字位置番号ｓに再び０を設定後、ス
テップ２５でＡ₁Ａ₂…Ａ_1+sとＢ_q-sＢ_q-s+1…Ｂ_qの一致
文字数ｋ₂（ｓ）を計数する。ステップ２６でｓに１を
加算し、ステップ２７でｓがｑ未満か否かを判定する。
ｓがｑ未満のときは、新たなｓについて一致文字数ｋ₂
（ｓ）の計数を繰り返す。When s is equal to or greater than p, the collation start character position number s is set to 0 again in step 24 in FIG. 3, and in step 25, A ₁ A ₂ ... A _{1 + s} and B _qs B _{q- s + 1} ... The number of matching characters k ₂ (s) of B _q is counted. In step 26, 1 is added to s, and in step 27, it is determined whether s is less than q.
If s is less than q, the number of matching characters k ₂ for the new s
The counting of (s) is repeated.

【００２６】ｓがｑ以上になったときは、ステップ２８
へ進み、一致文字数ｋ₁（ｓ）を最大にするｓをＳ₁に代
入し、ステップ２９でｋ₁（Ｓ₁）をＫ₁に代入する。同
様に、ステップ３０、ステップ３１で一致文字数ｋ
₂（ｓ）を最大にするｓをＳ₂に、ｋ₂（Ｓ₂）をＫ₂に代
入する。続いて、ステップ３２でＫ₁がＫ₂より大きいか
否かを判定する。Ｋ₁がＫ₂より大きいときは、ステップ
３３、ステップ３４でＳにＳ₁、ＫにＫ₁を代入する。Ｋ
₁がＫ₂以下のときは、ステップ３５、ステップ３６でＳ
にＳ₂、ＫにＫ₂を代入する。If s is greater than q, step 28
Then, s that maximizes the number of matching characters k ₁ (s) is substituted for S ₁ , and k ₁ (S ₁ ) is substituted for K ₁ in step 29. Similarly, in steps 30 and 31, the number of matching characters k
Substitute s that maximizes ₂ (s) into S ₂ , and substitute k ₂ (S ₂ ) into K ₂ . Then, K ₁ at step 32 is equal to or greater than K _2. When K ₁ is greater than K _2, the step 33 is substituted for K ₁ to S _1, K to S at step 34. K
₁ when the K ₂ below, steps 35, S at step 36
Substituting K ₂ to S _2, K to.

【００２７】ステップ３７でこのようにして得られたＫ
が０か否かを判定し、０のときは重畳部分無しと見な
し、図５のステップ４８へジャンプし、次の照合先部分
文書画像の処理に進む。Ｋが０でないときは、図４のス
テップ３８でＳの値をもとに、Ａ₁Ａ₂…Ａ_pとＢ₁Ｂ₂…
Ｂ_qの一致文字コード列Ｃ₁Ｃ₂…Ｃ_Kを抽出する。The K thus obtained in step 37
Is determined to be 0, and if 0, it is considered that there is no overlapping portion, and the process jumps to step 48 in FIG. 5 to proceed to the processing of the next collation destination partial document image. If K is not 0, A ₁ A ₂ ... A _p and B ₁ B ₂ .
Extracting a matching character code string C ₁ C ₂ ... C _K of B _q.

【００２８】次に、ステップ３９で一致文字コード番号
ｋに初期値として１を設定した後、ステップ４０で文字
コードＣ_kについての第ｎ部分文書画像中における外接
矩形座標（Ｘ_Sn，Ｙ_Sn）および（Ｘ_En，Ｙ_En）を取出
す。さらに、ステップ４０でＣ_kについての第ｍ部分文
書画像中における外接矩形座標（Ｘ_Sm，Ｙ_Sm）および
（Ｘ_Em，Ｙ_Em）取出し、ステップ４１で両座標値の差
分：ＤＸ_Snm＝Ｘ_Sm−Ｘ_Sn ＤＹ_Snm＝Ｙ_Sm−Ｙ_Sn ＤＸ_Enm＝Ｘ_Em−Ｘ_En ＤＹ_Enm＝Ｙ_Em−Ｙ_En を算出する。ステップ４３でｋに１を加算した後、ステ
ップ４４でｋがＫ未満か否かを判定する。ｋがＫ未満の
ときは次の一致文字コードについて座標の差分を求める
処理を繰り返す。ｋがＫ以上のときは、ステップ４５で
ＤＸ_Snm，ＤＸ_Enmの全一致文字コードについての平均値
ＤＸを算出する。同様に、ステップ４６でＤＹ_Snm，Ｄ
Ｙ_Enmの平均値ＤＹを算出する。（ＤＸ，ＤＹ）は着目
部分文書画像ｎと照合先部分文書画像ｍの平均的なずれ
と考えられるので、ステップ４７で第ｎ部分文書画像と
第ｍ部分文書画像とを（ＤＸ，ＤＹ）ずらして重ね合わ
せる。Next, in step 39, the matching character code number k is set to 1 as an initial value, and in step 40, the circumscribed rectangular coordinates (X _Sn , Y _Sn ) of the character code C _k in the nth partial document image. And (X _En , Y _En ). Further, in step 40, the circumscribed rectangular coordinates (X _Sm , Y _Sm ) and (X _Em , Y _Em ) of C _k in the m-th partial document image are extracted. In step 41, the difference between the two coordinate values: DX _Snm = X _Sm -X _Sn DY _Snm = Y _Sm -Y _Sn DX _Enm = X _Em -X _En DY _Enm = Y _Em -Y _En is calculated. After adding 1 to k in step 43, it is determined in step 44 whether k is less than K. If k is smaller than K, the process of obtaining the coordinate difference for the next matching character code is repeated. If k is _{equal to} or _larger than K, an average value DX is calculated in step 45 for all matching character codes DX _Snm and DX _Enm . Similarly, in step 46, DY _Snm , D
The average value DY of Y _Enm is calculated. Since (DX, DY) is considered to be an average deviation between the focused partial document image n and the collation destination partial document image m, in step 47, the (n, n) th partial document image and the mth partial document image are shifted by (DX, DY). And overlap.

【００２９】以上の処理が完了したら、図５のステップ
４８でｍに１を加算し、ステップ４９でｍが部分文書画
像数より大きいか否かの判定を行う。ｍが部分文書画像
数以下の時は図２のステップ１９以降の処理を繰り返
す。ｍが部分文書画像数より大きい時は、ステップ５０
でｎに１を加算し、ステップ５１でｎが部分文書画像数
より大きいか否かの判定を行う。ｎが部分文書画像数以
下の時は図１のステップ１６以降の処理を繰り返す。ｎ
が部分文書画像数より大きい時は、処理を終了する。When the above processing is completed, 1 is added to m in step 48 of FIG. 5, and it is determined in step 49 whether m is larger than the number of partial document images. If m is equal to or smaller than the number of partial document images, the processing from step 19 onward in FIG. 2 is repeated. If m is larger than the number of partial document images, step 50
In step 51, it is determined whether or not n is greater than the number of partial document images. When n is equal to or less than the number of partial document images, the processing from step 16 onward in FIG. 1 is repeated. n
Is larger than the number of partial document images, the process ends.

【００３０】以上の説明では、ｐ≦ｑの場合について説
明したが、ｐ＞ｑの場合も一致文字数計数の繰り返し回
数を切り替えることで対応可能である。また、重畳部分
に複数の文字行が存在する場合も、以上説明した処理を
行毎に行うことで容易に拡張可能である。In the above description, the case of p ≦ q has been described. However, the case of p> q can be dealt with by changing the number of repetitions of counting the number of matching characters. Further, even when a plurality of character lines exist in the superimposed portion, the processing can be easily extended by performing the above-described processing for each line.

【００３１】図３は外接矩形座標を説明する図であっ
て、文字外接矩形座標とは、文書画像中における文字の
左右端および上下端の位置を表す。言い換えると、文字
を矩形で囲んだときの左上頂点および右下頂点の座標を
表す。FIG. 3 is a diagram for explaining the circumscribed rectangular coordinates. The character circumscribed rectangular coordinates indicate the positions of the left and right ends and the upper and lower ends of the character in the document image. In other words, it represents the coordinates of the upper left vertex and the lower right vertex when a character is enclosed by a rectangle.

【００３２】図４は処理の流れを説明する図であって、
１００は着目部分文書画像、１０１は照合先部分文書画
像、１０２は着目部分文書画像の認識結果文字コード
列、１０３は照合先部分文書画像の認識結果文字コード
列、１０４は一致文字数計数範囲、１０５は一致文字
数、１０６は連結文書画像である。FIG. 4 is a diagram for explaining the flow of processing.
100 is a partial document image of interest, 101 is a partial document image of the collation target, 102 is a character code string of the recognition result of the partial document image of interest, 103 is a character code string of the recognition result of the partial document image of the collation, 104 is a matching character number counting range, 105 Is the number of matching characters, and 106 is a connected document image.

【００３３】着目部分文書画像１００、照合先部分文書
画像１０１は、文字毎の外接矩形座標を算出され、文字
認識処理により、それぞれ文字コード列１０２“神奈川
県川崎”、１０３“川県川崎市幸区”が生成される。次
に、右方向に１文字ずつずらして一致文字数１０５を計
数する処理を重畳部分がなくなるまで実行する。続い
て、左方向にも１文字ずつずらして一致文字数１０５を
計数する処理を重畳部分がなくなるまで実行する。以上
の計数処理で求めた一致文字数が最大になる位置（図４
の例では、右方向照合処理の３回目）を求め、そのとき
の一致文字“川”“県”“川”“崎”の外接矩形が一致
するように着目部分文書画像１００、照合先部分文書画
像１０１を重ね合わせた結果が連結文書画像１０６であ
る。The circumscribed rectangular coordinates of each character are calculated for the target partial document image 100 and the collation target partial document image 101, and the character code strings 102 “Kawasaki, Kanagawa” and 103 “Kawasaki, Kawasaki” are obtained by character recognition processing. Ward ”is generated. Next, a process of counting the number of matching characters 105 shifted one character at a time to the right is executed until there is no overlapped portion. Subsequently, the process of counting the number of matching characters 105 shifted one character at a time in the left direction is performed until there is no overlapped portion. The position where the number of matching characters obtained by the above counting process becomes maximum (FIG. 4)
In the example of (3), the rightward collation process is performed for the third time), and the partial document image of interest 100 and the collation target partial document such that the circumscribed rectangles of the matching characters “kawa”, “prefecture”, “kawa” and “saki” match at that time The result obtained by superimposing the images 101 is the connected document image 106.

【００３４】なお、本実施形態例は文書画像の場合につ
いて説明したが、文字パタンを含む一般画像へも適用可
能である。Although the present embodiment has been described with reference to a document image, the present embodiment is also applicable to a general image including a character pattern.

【００３５】本発明は、データを保存しそれらを自由に
読み出し可能なハードディスクやそれに準ずる装置と、
データを処理する際に必要なバッファやそれに準ずる装
置と、最終的に検出されたカット点を表示、出力するデ
ィスプレイなどの装置を備え、それらハードディスク、
バッファ及びディスプレイなどをあらかじめ定められた
手順に基いて制御する中央演算装置などを備えたコンピ
ュータやそれに準ずる装置を基に、上述した実施形態例
の処理、ないしは、図２ないし図７までの一連の図に示
した方法ないしアルゴリズムを記述した処理プログラム
やそれに準ずる物を、該コンピュータに対して与え、制
御、実行させることで実現することが可能である。ここ
で、該処理プログラムやそれに準ずる物を、コンピュー
タが実行する際に読み出しを実行できるＣＤ−ＲＯＭ、
フロッピーディスク（ＦＤ）、光磁気ディスク（ＭＯ）
あるいはそれらに準ずる記憶媒体に記録して、配布する
ことが可能である。The present invention provides a hard disk capable of storing data and freely reading them and a device similar thereto,
It is equipped with devices such as a buffer necessary for processing data and a device equivalent to it, and a display etc. that displays and outputs the finally detected cut point,
Based on a computer having a central processing unit or the like that controls a buffer, a display, and the like based on a predetermined procedure, and the like, the processing of the above-described embodiment or a series of processes shown in FIGS. The present invention can be realized by providing a computer with a processing program describing a method or an algorithm shown in the drawing or an equivalent thereof, and controlling and executing the computer. Here, a CD-ROM that can read the processing program and the equivalents when the computer executes the processing program,
Floppy disk (FD), magneto-optical disk (MO)
Alternatively, they can be recorded on a storage medium corresponding to them and distributed.

【００３６】[0036]

【発明の効果】以上説明したように、本発明によれば、
画素ではなく文書画像中に存在する文字パタンを認識
し、その位置情報をもとに文書画像の重ね合わせを行う
ので、高速かつ正確な画像重ね合わせが可能となり、大
きなサイズの文書画像を対象とする場合や、高速性を要
する場合にも適用可能な画像処理方法及び装置が実現で
きる。As described above, according to the present invention,
It recognizes character patterns that exist in the document image instead of pixels, and superimposes the document images based on the position information, so that high-speed and accurate image superposition can be performed. And an image processing method and apparatus that can be applied even when high speed is required.

[Brief description of the drawings]

【図１】本発明の一実施形態例の画像処理装置を示すブ
ロック図である。FIG. 1 is a block diagram illustrating an image processing apparatus according to an embodiment of the present invention.

【図２】本発明の一実施形態例の画像処理装置の動作と
ともに本発明での画像処理方法の一実施形態例を示すフ
ローチャート（その１）である。FIG. 2 is a flowchart (part 1) illustrating an operation of the image processing apparatus according to the embodiment of the present invention and an image processing method according to an embodiment of the present invention.

【図３】本発明の一実施形態例の画像処理装置の動作と
ともに本発明での画像処理方法の一実施形態例を示すフ
ローチャート（その２）である。FIG. 3 is a flowchart (part 2) illustrating an operation of the image processing apparatus according to the embodiment of the present invention and an embodiment of the image processing method according to the present invention.

【図４】本発明の一実施形態例の画像処理装置の動作と
ともに本発明での画像処理方法の一実施形態例を示すフ
ローチャート（その３）である。FIG. 4 is a flowchart (part 3) illustrating an operation of the image processing apparatus according to the embodiment of the present invention and an embodiment of the image processing method according to the present invention.

【図５】本発明の一実施形態例の画像処理装置の動作と
ともに本発明での画像処理方法の一実施形態例を示すフ
ローチャート（その４）である。FIG. 5 is a flowchart (part 4) illustrating an operation of the image processing apparatus according to the embodiment of the present invention and an embodiment of the image processing method according to the present invention.

【図６】上記実施形態例での外接矩形座標を説明する図
である。FIG. 6 is a diagram illustrating circumscribed rectangular coordinates in the embodiment.

【図７】上記実施形態例での処理の流れを説明する図で
ある。FIG. 7 is a diagram illustrating a flow of a process in the embodiment.

[Explanation of symbols]

１…部分文書画像格納部２…文字外接矩形算出部３…文字認識部４…文字コード列照合部５…部分文書画像間変位量算出部６…連結文書画像合成部７…連結文書画像格納部１０〜５１…ステップ１００…着目部分文書画像１０１…照合先部分文書画像１０２…着目部分文書画像の認識結果文字コード列１０３…照合先部分文書画像の認識結果文字コード列１０４…一致文字数計数範囲１０５…一致文字数１０６…連結文書画像 DESCRIPTION OF SYMBOLS 1 ... Part document image storage part 2 ... Character circumscribed rectangle calculation part 3 ... Character recognition part 4 ... Character code string collation part 5 ... Displacement amount calculation part between partial document images 6 ... Concatenated document image synthesis part 7 ... Concatenated document image storage part 10 to 51: Step 100: Partial document image of interest 101: Partial document image of collation target 102: Recognition result character code string of partial document image of interest 103: Recognition character code string of partial document image of collation 104: Matching character number counting range 105 ... Number of matching characters 106 ... Concatenated document image

Claims

[Claims]

1. An image processing method for combining a plurality of partial document images having a superimposed portion into a document image obtained by concatenating the plurality of partial document images, comprising: for each character pattern present in each partial document image, a circumscribed rectangle of the character pattern A step of calculating coordinates; a step of recognizing each character pattern surrounded by the circumscribed rectangle for each partial document image to generate a character code string for each partial document image; Collating the columns and extracting a matching character code string when the number of matching characters is maximized for each partial document image pair; and for all partial document image pairs, a partial document of a character code belonging to the matching character code string Calculating a displacement amount between partial document images from circumscribed rectangular coordinates in the image; and synthesizing a connected document image using displacement amounts of all the partial document image pairs. An image processing method characterized by the following.

2. An image processing method for combining a plurality of partial document images having a superimposed portion and a document image obtained by linking the plurality of partial document images, wherein for each character pattern present in each partial document image, a circumscribed rectangular coordinate of the character pattern is determined. A calculating procedure, a procedure for recognizing each character pattern surrounded by the circumscribed rectangle for each partial document image, and generating a character code string for each partial document image, and a procedure for calculating the character code string between all partial document images. A step of performing matching and extracting a matching character code string when the number of matching characters is maximized for each partial document image pair; and for all partial document image pairs, a partial document image of a character code belonging to the matching character code string Calculating the displacement between the partial document images from the circumscribed rectangle coordinates of the partial document image, and combining the connected document images using the displacement amounts of all the partial document image pairs. A recording medium recording an image processing method, wherein a program to be executed by the computer is recorded on a computer-readable medium.

3. An image processing apparatus for synthesizing a plurality of partial document images having a superimposed portion and combining them into a document image, comprising: for each character pattern present in each partial document image, a circumscribed rectangle of the character pattern; Means for calculating coordinates; means for recognizing each character pattern surrounded by the circumscribed rectangle for each partial document image to generate a character code string for each partial document image; Means for collating the strings and extracting a matching character code string when the number of matching characters is maximized for each partial document image pair; and for all partial document image pairs, a partial document of a character code belonging to the matching character code string Means for calculating the amount of displacement between partial document images from the circumscribed rectangular coordinates in the image, and means for combining connected document images using the amounts of displacement for all pairs of partial document images An image processing apparatus characterized by the above-mentioned.