JP2000011192A - Inter-image positioning method

Info

Publication number: JP2000011192A
Application number: JP10175245A
Authority: JP
Inventors: Katsuhiko Sato; 克彦佐藤; Hideaki Tanaka; 秀明田中; Kazuyuki Nako; 和行名古
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1998-06-23
Filing date: 1998-06-23
Publication date: 2000-01-14
Anticipated expiration: 2018-06-23
Also published as: JP3604909B2

Abstract

PROBLEM TO BE SOLVED: To provide an inter-image positioning method that yields images whose positions are aligned across the desired images. SOLUTION: An inter-image positioning method that aligns the images of consecutive pages among images input page by page comprises a step of determining whether an image is a desired image, a step of obtaining a template serving as the reference for positioning from an image determined to be a desired image, and a step of correcting the inter-image position based on the template.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an authoring system that creates electronic book content by inputting a book as images, and more particularly to an inter-image positioning method for aligning desired pages within the image data of the input book.

[0002]

[Description of the Related Art] With advances in hardware and software, electronic books are increasingly being released as a new form of book that replaces existing paper media. Although these electronic books can handle so-called multimedia data such as audio, images, video, and animation, they are basically in a text (character code) based format.

[0003] Meanwhile, books on paper media still account for some 500,000 titles per year, and their total number is enormous. Very few of these titles have been digitized, however, and most exist only on paper.

[0004]

[Problems to Be Solved by the Invention] However, because such conventional electronic books are text (character code) based, they have the following problems.

[0005] Authoring a book on paper media requires creating text data manually or by OCR. Content creation therefore takes a long time, which makes it difficult to supply a large amount of content in a timely manner.

[0006] In addition, books in which non-text data predominates, such as comics and photo magazines, are difficult to convert into content.

[0007] For these reasons, only a few hundred conventional electronic book titles exist, and they are heavily weighted toward dictionaries. As a result, the penetration rate of electronic books has not reached even 1% of that of paper books. The limited variety of content is a particularly serious problem and a major obstacle to the spread of electronic books.

[0008] One conceivable way to solve the above problems is to obtain electronic book content by inputting books as images. This approach has the following advantages.

[0009] Basically, content can be created simply by scanning existing paper books, so a large amount of content can be supplied in a short period.

[0010] Content that text-based electronic books could not provide, such as comics and photo magazines, can be supplied.

[0011] Characters that do not fit current character code systems, such as external characters (gaiji), variant characters, or characters in old documents, can be captured easily.

[0012] Because the approach does not depend on a language (character code), the viewer and the entire authoring system can easily be deployed overseas.

[0013] With these advantages, image-based electronic books can solve all of the problems of text-based electronic books.

[0014] However, obtaining electronic book content from image input involves scanning with an ADF (Auto Document Feeder) or the like and then applying various processes such as document structuring, which gives rise to the following problems.

[0015] Images become skewed or displaced during scanner input. With an electronic book, the viewer presents reference straight lines (for example, the edges of a CRT or liquid crystal display), so skew is more noticeable than on paper and is unpleasant for the user. Processing to correct it is therefore necessary. Doing this by hand requires a considerable amount of work and greatly increases authoring time.

[0016] Skew or displacement in the body pages of an electronic book is particularly unpleasant when the book is viewed in a viewer.

[0017] Furthermore, checking every page when reviewing electronic book content increases authoring time and makes it impossible to supply a large amount of content in a short period, so appropriate error handling is required.

[0018] In view of the above problems, an object of the present invention is to provide an inter-image positioning method that yields images whose positions are aligned across the desired images.

[0019]

[Means for Solving the Problems] The inter-image positioning method according to claim 1 aligns the images of consecutive pages among images input page by page, and comprises: a step of determining whether an image is a desired image; a step of obtaining, from an image determined to be a desired image, a template serving as the reference for alignment; and a step of correcting the position between images based on the template.

[0020] The inter-image positioning method according to claim 2 is the method according to claim 1, wherein the template is position information of a rectangle formed by circumscribing lines obtained from a set of edge points obtained by scanning the input image.

[0021] The inter-image positioning method according to claim 3 is the method according to claim 2, further comprising a step of generating warning data when an image is determined to be a desired image but the position information of the input image and the position information of the template are outside a predetermined range.

[0022] The inter-image positioning method according to claim 4 aligns the body images of consecutive pages among images input for each page of a book, and comprises: a step of determining whether an image is a body image; a step of obtaining, from an image determined to be a body image, a template serving as the reference for alignment; and a step of correcting the position between body images based on the template.

[0023]

[Embodiments of the Invention] (First Embodiment) FIG. 1 shows a block diagram of the inter-image positioning apparatus according to this embodiment. In this apparatus, the pages of a book that has been separated into individual sheets are read page by page as image data by an image input unit 101. A page outline detection unit 102, a page content area extraction unit 103, a skew correction unit 104, a page position correction unit 105, and a page information processing unit 107 then perform the processing described below using various buffers, and image data whose inter-page alignment has been corrected is output from an image output unit 106.

[0024] FIG. 2 is a conceptual diagram showing the structure of a novel as an example of a book input to the apparatus. As FIG. 2 shows, the book consists of a title (cover) page, table-of-contents pages, body pages (even and odd pages), index pages, and a back cover page. Most of these are body pages, and every body page has, at roughly fixed positions, a header area (in this example, showing the page number), a footer area (in this example, showing the chapter number), and a content area (body text area). The present invention uses this structural feature of books to align pages with one another.

[0025] Next, the processing of each unit in FIG. 1 is described. The image input unit 101 captures a binary or multi-level image from a scanner or the like and stores it in an input image buffer 108. The image may be monochrome or color. The images are input in approximately the correct orientation and in page order, starting from either the beginning or the end of the book. Because pages may be fed at an angle by mistake when an auto feeder is used, it is desirable for the scanner's input area to be larger than the book's page size when capturing images, and the following description assumes this case. If a page is larger than the scanner's input area, it may be input in sections and joined together in the input image buffer 108.

[0026] Next, the page outline detection unit 102 is described. Because the page is smaller than the scanner's input area, an image input by the image input unit 101 consists of the actual page area of the book and the background area behind it. The page outline detection unit 102 distinguishes the page area from the background area in the input image and extracts the outline of the page area. When an existing book is cut apart into pages, the page edge that was at the spine may be slanted or torn, so the page is often not an exact rectangle. The outline of the page is therefore assumed to be approximable by a rectangle.

[0027] Methods for detecting the outline of the page area include detecting edges in the image and verifying from the edge points that each corner of the rectangle is 90 degrees, or scanning the image, extracting a set of edge points where the luminance changes sharply, fitting straight lines to that set, and extracting the outline from those lines.

[0028] An example of a method for detecting the outline of the page area is described with reference to FIGS. 3 and 4. FIG. 3 illustrates the method, and FIG. 4 is its flowchart. Because the outline of the page area is a quadrilateral in the present invention, the method described here detects the four outermost straight lines in the image.

[0029] First, detection of the straight line forming the left edge of the page area shown in FIG. 3(a) is described. A line to scan is selected; because scanning here is horizontal, the top row is chosen first. Scanning proceeds from the left end to the right end, so the leftmost point is the initial value (i = 0). The image is scanned point by point, and the luminance of each point is read from the input image buffer 108. Each point is tested to determine whether it is an edge point. The test takes the first derivative in the horizontal direction, for example by using a Sobel filter. When a point is judged to be an edge point, its coordinates are stored, scanning of that line ends, and the next line is selected. This is repeated for every line down to the bottom row. The result is a set of edge point coordinates, most of which lie on a single straight line, and that line is then computed, typically with the Hough transform. The above processing is performed for each of the four directions of the input image (corresponding to D = 0 to 3 in the flowchart of FIG. 4; see FIG. 3(b)) to obtain four straight lines, which are detected as the outline of the page area (FIG. 3(c)).
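For illustration only, the left-edge scan and line fit of paragraph [0029] might be sketched in Python as follows. This sketch is not part of the original disclosure. The function names, the threshold value, the list-of-rows image representation, and the use of a central difference in place of a full Sobel kernel are all assumptions made to keep the example short; the Hough step is a brute-force vote over (rho, theta) bins.

```python
import math
from collections import Counter

def first_edge_points(img, threshold=50):
    """Scan each row left to right and record the first pixel whose
    horizontal central difference exceeds `threshold` (assumed stand-in
    for the Sobel response mentioned in the text)."""
    points = []
    for y, row in enumerate(img):
        for x in range(1, len(row) - 1):
            if abs(row[x + 1] - row[x - 1]) > threshold:
                points.append((x, y))
                break  # stop scanning this row at its first edge point
    return points

def hough_line(points, theta_steps=180, rho_res=1.0):
    """Return (rho, theta) of the line x*cos(theta) + y*sin(theta) = rho
    that receives the most votes from `points`."""
    votes = Counter()
    for x, y in points:
        for k in range(theta_steps):
            theta = math.pi * k / theta_steps
            rho = x * math.cos(theta) + y * math.sin(theta)
            votes[(round(rho / rho_res), k)] += 1
    (rho_bin, k), _ = votes.most_common(1)[0]
    return rho_bin * rho_res, math.pi * k / theta_steps

# Synthetic scan: dark background (0) with a bright page (255) from column 5.
img = [[0] * 5 + [255] * 25 for _ in range(20)]
points = first_edge_points(img)
rho, theta = hough_line(points)
```

Repeating the same scan from the right, top, and bottom would give the other three outline lines (D = 0 to 3 in FIG. 4).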

[0030] Next, the page content area extraction unit 103 is described with reference to FIGS. 5 and 6. The page content area extraction unit 103 extracts the page content area from the image inside the page area outline obtained by the page outline detection unit 102. FIG. 5 illustrates the method, and FIG. 6 is its flowchart.

[0031] As shown in FIG. 5(a), the image is scanned line by line and edge points are extracted first, using the same method as the page outline detection unit 102. An edge point may be an edge of a character, of a ruled line in a figure or table, or of a speech balloon in a comic; in this example, edge points of the character area are sought. When the edge points come from character strings or the like, the resulting set does not lie on a single straight line. The present invention therefore finds a circumscribing line (a straight line) of the edge point set (FIG. 5(b)) and uses it as a boundary of the page content area (FIG. 5(c)).

[0032] The procedure for finding a circumscribing line is described again with reference to FIG. 6. First, the edge point set is obtained. Two points are selected from the set, and the equation of the straight line through them is found. With the two points at (x_1, y_1) and (x_2, y_2), the line is (y_2 - y_1) x - (x_2 - x_1) y - (x_1 y_2 - x_2 y_1) = 0.

[0033] It is then determined on which side of this line each of the other, unselected edge points lies. With F(x, y) = (y_2 - y_1) x - (x_2 - x_1) y - x_1 y_2 + x_2 y_1, a point (x, y) lies on the origin side if F(x, y) < 0 and on the outer side if F(x, y) > 0.

[0034] If all the points lie on the same side, the line is a circumscribing line. Otherwise, a different pair of points is selected. Testing every pair is guaranteed to yield a circumscribing line.

[0035] Performing the above for each of the four scanning directions (corresponding to D = 0 to 3 in the flowchart of FIG. 6) extracts the page content area as the figure enclosed by the circumscribing lines (FIG. 5(c)).
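As a rough illustration of the pair test described in paragraphs [0032] to [0034], the following Python sketch evaluates the line function F for every pair of edge points and keeps the pairs for which all points fall on one side. It is an assumption-laden sketch, not the patented implementation: the function names are hypothetical, points lying exactly on the line are accepted, and all qualifying pairs are returned at once instead of one line per scanning direction.

```python
from itertools import combinations

def side(p1, p2, p):
    """Value of F(x, y) for the line through p1 and p2, evaluated at p.
    The sign tells which side of the line p lies on; 0 means on the line."""
    (x1, y1), (x2, y2), (x, y) = p1, p2, p
    return (y2 - y1) * x - (x2 - x1) * y - (x1 * y2 - x2 * y1)

def circumscribing_lines(points):
    """Return every pair of points whose connecting line leaves all
    points on one side (brute force over all pairs)."""
    lines = []
    for p1, p2 in combinations(points, 2):
        values = [side(p1, p2, p) for p in points]
        if all(v >= 0 for v in values) or all(v <= 0 for v in values):
            lines.append((p1, p2))
    return lines

# Four corners of a text block plus one interior edge point.
edge_points = [(0, 0), (4, 0), (4, 3), (0, 3), (2, 1)]
lines = circumscribing_lines(edge_points)
```

For this input the four returned pairs are the sides of the rectangle; the diagonal and the pairs involving the interior point are rejected because points fall on both sides of them. The brute-force search is cubic in the number of points, which is acceptable only for small point sets.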

[0036] Next, the skew correction unit 104 is described. The skew correction unit 104 corrects the skew of the input image by rotating it relative to the reference coordinate axes, based on the figure formed by the circumscribing lines extracted by the page content area extraction unit 103. This processing is performed on every page of the book.

[0037] FIG. 7 shows how the image is rotated. The center of the rotation is a corner of the page content area, at coordinates (cx, cy). When the input image is rotated by θ, let (x, y) be the coordinates of a point on the input image and (x', y') its coordinates on the corrected image. This rotation is expressed by the following transformation.

[0038], [0039]

[Equation 1]

[0040] In the rotation processing, this equation is applied to each pixel of the input image buffer 108, and the luminance or color is assigned to the transformed coordinates in a corrected image buffer 109.
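Equation 1 itself is not reproduced in this text. As a hedged illustration, the sketch below assumes the standard rotation about (cx, cy), namely x' = cx + (x - cx) cos θ - (y - cy) sin θ and y' = cy + (x - cx) sin θ + (y - cy) cos θ; the sign convention of the original equation may differ. The function names, the list-of-rows buffers, and the fill value are hypothetical. Following paragraph [0040], each source pixel is mapped forward into the corrected buffer.

```python
import math

def rotate_point(x, y, cx, cy, theta):
    """Rotate (x, y) by theta radians about (cx, cy) (assumed convention)."""
    dx, dy = x - cx, y - cy
    return (cx + dx * math.cos(theta) - dy * math.sin(theta),
            cy + dx * math.sin(theta) + dy * math.cos(theta))

def rotate_image(src, cx, cy, theta, fill=255):
    """Forward-map every pixel of `src` into a new buffer of the same size.
    Pixels that land outside the buffer are dropped."""
    h, w = len(src), len(src[0])
    dst = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            xr, yr = rotate_point(x, y, cx, cy, theta)
            xi, yi = round(xr), round(yr)
            if 0 <= xi < w and 0 <= yi < h:
                dst[yi][xi] = src[y][x]
    return dst

px, py = rotate_point(1, 0, 0, 0, math.pi / 2)
unchanged = rotate_image([[1, 2], [3, 4]], 0, 0, 0.0)
```

Forward mapping can leave unfilled pixels in the destination after rounding. A common alternative, not described in the source, is to iterate over destination pixels and sample the source with the inverse rotation.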

[0041] Next, the page information processing unit 107 is described. The page information processing unit 107 determines whether each input book image is a body page or some other page. One method compares the size or shape of the figure formed by the circumscribing lines extracted by the page content area extraction unit 103 and judges the page to be a body page if the size or shape falls within a predetermined range. Alternatively, because pages are input in page order and the rectangle of circumscribing lines extracted by the page content area extraction unit 103 is of nearly constant size for body pages, a run of pages with nearly constant rectangle size can be judged to be body pages starting from the first page of the run. As a further alternative, because pages are read sequentially in page order, the first and last body pages (as sheet counts) can be specified externally in advance. When the page information processing unit 107 judges a page to be a body page, it stores the position information of the rectangular page content area in a page position information buffer 110 as the template for alignment between body images, that is, between pages. If template position information indicating a reference body page for page alignment is already stored in the page position information buffer 110, processing proceeds to the page position correction unit 105.
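The second determination method of paragraph [0041], in which a run of nearly constant content rectangles marks the start of the body pages, might look like the following Python sketch. The rectangle layout (left, top, right, bottom), the tolerance, the run length, and the function names are assumptions chosen for illustration; the source does not specify them.

```python
def rect_size(rect):
    """Width and height of a (left, top, right, bottom) rectangle."""
    left, top, right, bottom = rect
    return right - left, bottom - top

def is_body_page(rect, expected_size, tolerance=10):
    """True if the rectangle's size is within `tolerance` of expected_size."""
    w, h = rect_size(rect)
    ew, eh = expected_size
    return abs(w - ew) <= tolerance and abs(h - eh) <= tolerance

def first_body_page(rects, run_length=3, tolerance=10):
    """Index of the first page that starts `run_length` consecutive pages
    of nearly constant content-rectangle size, or None if there is none."""
    for i in range(len(rects) - run_length + 1):
        base = rect_size(rects[i])
        if all(is_body_page(r, base, tolerance)
               for r in rects[i + 1:i + run_length]):
            return i
    return None

# Cover, table of contents, then three body pages of similar size.
page_rects = [
    (50, 50, 300, 200),
    (100, 400, 500, 600),
    (60, 80, 540, 780),
    (62, 79, 541, 781),
    (58, 82, 539, 779),
]
start = first_body_page(page_rects)
template = page_rects[start]  # would be stored in the page position information buffer
```

Here the rectangle of the first page in the run is taken as the template, which matches the description that the first body page found supplies the reference for the pages that follow.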

[0042] Next, the page position correction unit 105 is described. Although the skew of the input image has been corrected, the position of the page content area of the body text varies from image to image because of mechanical displacement during reading and similar causes. If body pages were output as they are, without position correction, and viewed as an electronic book in a viewer, the body pages would appear displaced from one another, which is unpleasant. The page position correction unit 105 therefore translates the skew-corrected body page image so that it matches the template position information stored in the page position information buffer 110. As a result, image data with no displacement between body pages is obtained from the image output unit 106. In other words, once a template has been obtained from a page judged to be a body page, the body pages that follow are aligned with reference to that template.
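The translation step of paragraph [0042] can be sketched in Python as below. This is an illustrative assumption, not the disclosed implementation: rectangles are (left, top, right, bottom) tuples, the offset is computed from the top-left corners only, and the function names and fill value are hypothetical.

```python
def translation_to_template(content_rect, template_rect):
    """Offset (dx, dy) that moves the page's content rectangle so that its
    top-left corner coincides with the template's top-left corner."""
    return (template_rect[0] - content_rect[0],
            template_rect[1] - content_rect[1])

def shift_image(src, dx, dy, fill=255):
    """Translate the image by (dx, dy); uncovered pixels get `fill`."""
    h, w = len(src), len(src[0])
    dst = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            xs, ys = x - dx, y - dy  # source pixel that lands on (x, y)
            if 0 <= xs < w and 0 <= ys < h:
                dst[y][x] = src[ys][xs]
    return dst

# A single dark pixel at (1, 1) should move to the template position (0, 0).
page = [[255, 255, 255],
        [255, 0, 255],
        [255, 255, 255]]
dx, dy = translation_to_template((1, 1, 2, 2), (0, 0, 1, 1))
aligned = shift_image(page, dx, dy)
```

Aligning on a single corner is the simplest choice; aligning on the rectangle center would be an equally plausible reading of the text.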

[0043] Although the template above covers the header area, footer area, and content area of the body page as a whole, the areas may be separated as shown in FIG. 8(a) and stored in the page position information buffer 110 in the form shown in FIG. 8(b). Dividing the areas in this way makes character recognition, keyword extraction, and the like easier and facilitates document structuring.

[0044] In the above description all processing is automated and performed continuously, but the processing of the page outline detection unit 102, the page content area extraction unit 103, and the skew correction unit 104 may also be performed manually.

[0045] (Second Embodiment) Book images are normally input automatically in sequence, and at input time it is not known whether the next image is a body page or something else. Moreover, even body pages can differ; for example, the page content area on the last page of a chapter may be a different size. Therefore, even when a page is judged to be a body page, if the template position information in the page position information buffer 110 differs markedly from the position information of the page content area of the newly input image (that is, the position information is outside a predetermined range), the page is treated as an error: an error bit is written for that page in an error buffer (not shown), and warning data is generated and retained. After automatic processing of the whole book is complete, the user can view the warning data from the error buffer on a display means (not shown) and immediately identify the pages that require manual correction.
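A minimal Python sketch of this warning step follows. It assumes that "outside a predetermined range" means any rectangle coordinate differing from the template by more than a fixed pixel offset; the threshold, the (left, top, right, bottom) layout, and the function names are hypothetical, and a plain list of page indices stands in for the error buffer.

```python
def needs_warning(content_rect, template_rect, max_offset=20):
    """True if any coordinate of the page's content rectangle differs from
    the template by more than `max_offset` pixels."""
    return any(abs(c - t) > max_offset
               for c, t in zip(content_rect, template_rect))

def collect_warnings(page_rects, template_rect, max_offset=20):
    """Indices of pages flagged for manual review (stand-in for the
    per-page error bits described in the text)."""
    return [i for i, rect in enumerate(page_rects)
            if needs_warning(rect, template_rect, max_offset)]

template_rect = (60, 80, 540, 780)
body_rects = [
    (62, 79, 541, 781),   # ordinary body page
    (60, 80, 540, 400),   # short last page of a chapter
    (58, 82, 539, 779),   # ordinary body page
]
flagged = collect_warnings(body_rects, template_rect)
```

Only the short chapter-ending page is flagged, so the user reviews one page instead of all three.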

[0046] The series of processes described above can be implemented as a program. The program may be stored on a computer-readable recording medium such as an optical disk or floppy disk and read out for use as needed.

[0047]

[Effects of the Invention] According to the invention of claim 1, desired consecutive images among images input page by page can be aligned in a short time.

[0048] According to the invention of claim 2, because circumscribing lines are used, an accurate template can be obtained even for character areas, which improves alignment accuracy.

[0049] According to the invention of claim 3, failures of alignment between images can be detected, which makes corrections during or after authoring more convenient.

[0050] According to the invention of claim 4, body images among images input page by page can be aligned in a short time. As a result, electronic book content can be created in a short period, and because the positions of the body images are aligned when the electronic book is viewed in a viewer, viewing discomfort is eliminated.

[Brief description of the drawings]

FIG. 1 is a block diagram of the first embodiment.

FIG. 2 is a conceptual diagram showing the structure of a book.

FIG. 3 is a diagram for explaining page outline detection.

FIG. 4 is a flowchart of page outline detection.

FIG. 5 is a diagram for explaining page content area extraction.

FIG. 6 is a flowchart of page content area extraction.

FIG. 7 is a diagram for explaining the rotation transformation.

FIG. 8 shows an example of data stored in the page position information buffer.

[Explanation of symbols]

101 Image input unit
102 Page outline detection unit
103 Page content area extraction unit
104 Skew correction unit
105 Page position correction unit
106 Image output unit
107 Page information processing unit
108 Input image buffer
109 Corrected image buffer
110 Page position information buffer

Continued from the front page

(72) Inventor: Kazuyuki Nako, c/o Sharp Corporation, 22-22 Nagaike-cho, Abeno-ku, Osaka-shi, Osaka

F-terms (reference): 5B050 BA16 DA06 EA06 EA12 EA13 EA20 FA17

Claims

[Claims]

1. An inter-image positioning method for aligning the images of consecutive pages among images input page by page, comprising: a step of determining whether an image is a desired image; a step of obtaining, from an image determined to be a desired image, a template serving as a reference for alignment; and a step of correcting the position between images based on the template.

2. The inter-image positioning method according to claim 1, wherein the template is position information of a rectangle formed by circumscribing lines obtained from a set of edge points obtained by scanning the input image.

3. The inter-image positioning method according to claim 2, further comprising a step of generating warning data when an image is determined to be a desired image and the position information of the input image and the position information of the template are outside a predetermined range.

4. An inter-image positioning method for aligning the body images of consecutive pages among images input for each page of a book, comprising: a step of determining whether an image is a body image; a step of obtaining, from an image determined to be a body image, a template serving as a reference for alignment; and a step of correcting the position between body images based on the template.