JP2005167522A

JP2005167522A - Printing system

Info

Publication number: JP2005167522A
Application number: JP2003402361A
Authority: JP
Inventors: Takeshi Shibuya; 竹志澁谷; Naoyuki Urata; 直之浦田; Tatsu Yokozawa; 横沢　　達
Original assignee: Ricoh Printing Systems Ltd; Hitachi Ltd
Current assignee: Ricoh Printing Systems Ltd; Hitachi Ltd
Priority date: 2003-12-02
Filing date: 2003-12-02
Publication date: 2005-06-23

Abstract

<P>PROBLEM TO BE SOLVED: To provide a technology of improving the reduction in a character recognition efficiency caused by an OCR incapable of recognizing semantic connections between words and sentences separated by typesetting, and a technology of attaining simultaneous optical character recognition for a plurality of pieces of printed matter. <P>SOLUTION: Regional elements of a sentence, their connection relations and sequential relations are embedded in the sentence as electronic watermark information so that the sequential relations of characters can accurately be grasped before recognition processing to enhance the character recognition efficiency. Further, a regional designation method adopts a relative coordinate reference region to attain the simultaneous optical character recognition for a plurality of pieces of printed matters while accurately grasping the correspondence to the printed matters. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、スキャナ・プリンタ・ＦＡＸなどの情報入力／印刷装置からなる印刷システムに関する。 The present invention relates to a printing system including an information input / printing apparatus such as a scanner, a printer, and a FAX.

スキャナなどによるＯＣＲ（Optical Character Reader）技術としては、下記特許文献１などが知られているが、一般に、ＯＣＲにおいては、局所的な文章の認識処理は、比較的効率良く行われるものの、テキスト領域と図表領域の切り分けや、テキスト領域どうしの意味的なつながりや、論理構造に即した読み取りの自動認識は困難であった。 As an OCR (Optical Character Reader) technique using a scanner or the like, the following Patent Document 1 is known. In general, in OCR, although local text recognition processing is performed relatively efficiently, a text region is used. It is difficult to automatically identify readings based on the logical structure of the text area, the semantic connection between the text areas, and the logical structure.

このため、組版の複雑な文章に対する簡易的なＯＣＲでは、改行による単語や文や段落の分断により、知識データベースを利用した間違い訂正が効率良く機能できないなどの問題で、認識効率が低下する問題があった。 For this reason, in simple OCR for complex text in typesetting, there is a problem that recognition efficiency decreases due to the fact that error correction using a knowledge database cannot function efficiently due to division of words, sentences and paragraphs due to line breaks. there were.

一方で、下記非特許文献１に開示されているように、２値の画像データ、特に印刷物に対して、ある程度の情報を電子透かしとして埋め込むことが可能であることが知られている。 On the other hand, as disclosed in Non-Patent Document 1 below, it is known that a certain amount of information can be embedded as a digital watermark in binary image data, particularly a printed matter.

特開平８−５０６３３号公報JP-A-8-50633

日本経済新聞：２００３年１０月３日の記事Nikkei Newspaper: October 3, 2003 article

本発明が解決しようとする課題は、組版により分断された単語や文さらには、段落のような文章の意味的なつながりが、ＯＣＲで認識することができないことによる文字認識の効率の低下を防止することである。 The problem to be solved by the present invention is to prevent a reduction in character recognition efficiency due to the fact that the semantic connection of words and sentences separated by typesetting, as well as paragraphs, cannot be recognized by OCR. It is to be.

また、上記問題によりＯＣＲ時に作業者に対して発生するＯＣＲ領域指定などの作業負荷を軽減することである。 Another object of the present invention is to reduce the work load such as OCR area designation generated for the worker during OCR due to the above problem.

上記課題は、印刷レイアウト上のテキスト領域要素と、その連結関係や順序関係などのＯＣＲ補助情報を、自動的に生成するか、印刷物作成者の作業として印刷時に生成し、印刷物上の電子透かしとして埋め込むことにより解決される。 The above problem is that the text area element on the print layout and the OCR auxiliary information such as the connection relation and the order relation thereof are automatically generated or generated at the time of printing as the work of the print creator, and used as a digital watermark on the print It is solved by embedding.

本発明のプリントシステムでは、文章領域要素中に電子透かしとして埋め込んだＯＣＲ補助情報により、組版により分断された単語・文・段落などの、文章の意味的なつながりが復元され、ＯＣＲ認識効率の低下を防止することが可能となる。 In the printing system of the present invention, the semantic connection of sentences such as words, sentences, and paragraphs divided by typesetting is restored by the OCR auxiliary information embedded as a digital watermark in the sentence area element, and the OCR recognition efficiency is reduced. Can be prevented.

また、ＯＣＲを支援する印刷物を提供するサービスを想定した場合には、ＯＣＲ補助情報を生成する作業負荷が、ＯＣＲ時から印刷物生成時に移ったことは、サービス向上のメリットとなる。 In addition, when assuming a service that provides a printed material that supports OCR, the shift in the workload for generating the OCR auxiliary information from the OCR time to the printed material generation time is a merit for improving the service.

本発明の実施例としては、電子情報に基づいて印刷画像データを生成する情報処理装置、印刷画像データをプレビューする画像表示装置、プレビュー画像上で、テキスト領域や画像領域などの領域属性を指示する領域指示手段、領域属性を電子透かし情報として画像データに埋め込む情報処理装置上の手続き、印刷画像データの印刷を行う印刷装置および、前記印刷装置により印刷された印刷物から、ＯＣＲ補助情報を抽出し、ＯＣＲにおけるヒント情報として利用するＯＣＲシステムとにより構築されるＯＣＲ補助情報埋め込み印刷システムである。 As an embodiment of the present invention, an information processing apparatus that generates print image data based on electronic information, an image display apparatus that previews print image data, and an area attribute such as a text area and an image area are indicated on the preview image OCR auxiliary information is extracted from an area instruction means, a procedure on an information processing apparatus that embeds area attributes in image data as digital watermark information, a printing apparatus that prints print image data, and a printed matter printed by the printing apparatus, This is an OCR auxiliary information embedded printing system constructed by an OCR system used as hint information in OCR.

ＯＣＲは一般には光学文字認識処理のことであるが、ここでは、図表・写真などの非テキスト領域を切り分け、ビットマップ情報として読み込む処理もＯＣＲの処理の一部として含めるものとする。 OCR is generally an optical character recognition process, but here, a process of separating non-text areas such as charts and photographs and reading it as bitmap information is also included as part of the OCR process.

図１は、Ａ４（２１０mm×２９７mm）を想定した本発明の対象となる印刷物の概念図である。図１に示すように、通常ビジネスドキュメントは、幾つかのテキスト領域(領域２〜領域１３)と、図表、写真などの非テキスト領域(領域１４)から構成される。 FIG. 1 is a conceptual diagram of a printed matter that is an object of the present invention assuming A4 (210 mm × 297 mm). As shown in FIG. 1, a normal business document is composed of several text regions (regions 2 to 13) and non-text regions (regions 14) such as charts and photographs.

本実施例では、単純化のために、それぞれの構成要素の領域は、それを取り囲む矩形要素で分離されるものとする。一般的なビジネス文書は、大抵の場合この前提を満たしている。また、図１で破線により示した各領域は、意味的な区切りではなく、物理的なレイアウトに基づく領域分割の例を示している。 In the present embodiment, for the sake of simplicity, it is assumed that each component region is separated by a surrounding rectangular element. Common business documents often meet this assumption. Further, each area indicated by a broken line in FIG. 1 is an example of area division based on a physical layout, not a semantic division.

テキスト領域である領域２〜領域１３には、一連の文脈に従う本文と、表題領域３や、図表のキャプション領域７、脚注領域９などの独立な本文の流れとは直接には関係しない浮動要素などがある。 The text areas 2 to 13 include a body according to a series of contexts, floating elements that are not directly related to the flow of independent text such as the title area 3, the caption area 7 of the chart, and the footnote area 9. There is.

このうち、一連の文脈に従う本文と、本文の流れに組み込み可能な浮動要素に対しては、文脈に従った順番を付けることが可能である。例えば、図１の場合では、日付領域２、表題領域３、執筆者名領域４、アブストラクト領域５、本文領域６、８、１０、１２、１３が、この順に順序づけ可能である。 Of these, the body according to a series of contexts and the floating elements that can be incorporated into the body flow can be given an order according to the context. For example, in the case of FIG. 1, the date area 2, title area 3, author name area 4, abstract area 5, body areas 6, 8, 10, 12, and 13 can be ordered in this order.

このうち、日付領域２、表題領域３、執筆者名領域４、アブストラクト領域５の順序は、必ずしもこの順である必要性は無いが、本文の順序関係は、正確に守る必要がある。図１では、領域の順序づけの例を各要素領域に○囲みの数字で示している。 Among these, the order of the date area 2, the title area 3, the author name area 4, and the abstract area 5 does not necessarily need to be in this order, but the order relation of the texts must be accurately observed. In FIG. 1, an example of ordering of regions is indicated by a circled number in each element region.

図２に、これらの領域情報を表現する情報テーブル４０の例を示す。図２では、先頭のヘッダ２０として、バージョン情報２３、付加情報３１へのバイトオフセット２４がそれぞれ１バイトづつ用意される。以後５バイト毎の固定長で、各領域情報エントリ２１が、各領域の順序付けに従って、第１エントリ２１ａ、第２エントリ２２ｂ、…の様に逐次記述される。 FIG. 2 shows an example of the information table 40 expressing these area information. In FIG. 2, a byte offset 24 to the version information 23 and the additional information 31 is prepared for each byte as the head header 20. Thereafter, each area information entry 21 is sequentially described as a first entry 21a, a second entry 22b,... According to the ordering of each area, with a fixed length of every 5 bytes.

また、それぞれの領域情報エントリ２１は、先頭上位４ｂｉｔの型識別子２５、先頭下位４ｂｉｔの補助情報２６、および、それに続く各１バイトの左端座標値２７、右端座標値２８、上端座標値２９、下端座標値３０、の計５バイトで構成される。これらの要素の詳細に関しては後ほど説明する。 Also, each area information entry 21 includes a top high 4 bit type identifier 25, a top low 4 bit auxiliary information 26, and a 1 byte left end coordinate value 27, a right end coordinate value 28, a top end coordinate value 29, and a bottom end. Coordinate value 30 consists of a total of 5 bytes. Details of these elements will be described later.

図２の例では付加情報３１として、４バイト文書識別番号を与えているが、この他に日時、印刷者識別コードなどを埋め込むのも有効である。付加情報量３１を情報テーブル４０の末尾に配置したのは、バージョン、あるいは用途による付加情報の変更に柔軟に対応可能とするためである。 In the example of FIG. 2, a 4-byte document identification number is given as the additional information 31, but it is also effective to embed date and time, a printer identification code, and the like. The reason why the additional information amount 31 is arranged at the end of the information table 40 is to flexibly cope with the change of the additional information depending on the version or the application.

例えば、幾つかの情報の組み合わせの型を付加情報３１として定義しておき、バージョン情報２３の値に応じて情報テーブル４０に埋め込まれる付加情報３１の実装を切り換えるようにしておくことで、文書目的や埋め込み可能データ量に応じた柔軟な対応が可能となる。 For example, by defining a combination type of some information as additional information 31 and switching the implementation of the additional information 31 embedded in the information table 40 according to the value of the version information 23, the document purpose is changed. And flexible support according to the amount of data that can be embedded.

ここで、埋め込みに要するデータ量を図１の場合で見積もると、ヘッダ（２バイト）＋（エントリ５バイト）×１３＋（文書識別番号４バイト）＝７１バイトの情報量となっている。これに対して、電子透かしとして印刷物に埋め込み可能な情報量は、非特許文献１によれば、およそ７０文字当たり８バイトであるから、６２０文字程度の文書であれば、上記情報を埋め込むことが可能なことが分かる。これは、例えば１ページ当たり２５文字×５０行×２カラムの特許公報では、実質１／３ページの文章量に相当する。 Here, when the amount of data required for embedding is estimated in the case of FIG. 1, the information amount is header (2 bytes) + (entry 5 bytes) × 13 + (document identification number 4 bytes) = 71 bytes. On the other hand, according to Non-Patent Document 1, the amount of information that can be embedded in printed matter as a digital watermark is approximately 8 bytes per 70 characters. I understand that it is possible. For example, in the patent publication of 25 characters × 50 rows × 2 columns per page, this corresponds to a sentence volume of substantially 1/3 page.

従って、付加情報３１としてさらに多く情報を付加することも文書によっては十分可能である。透かしを埋め込んだ印刷物の印字品質が問題にならず、むしろ埋め込み情報量が優先される場合には、付加情報３１に多くの情報を実装したバージョンを用意すると良い。また、透かし情報の増加に伴う画質劣化を最小限に抑える目的では、付加情報領域３１を実装せず、領域情報エントリ２１も文字認識を前提とするテキスト領域に限定し、情報をできる限り少なくするといった対応が実用上有効となる。 Accordingly, it is possible to add more information as the additional information 31 depending on the document. If the print quality of the printed matter in which the watermark is embedded does not matter, but rather the amount of embedded information is given priority, a version in which a lot of information is mounted in the additional information 31 may be prepared. Further, for the purpose of minimizing image quality degradation due to an increase in watermark information, the additional information area 31 is not implemented, and the area information entry 21 is limited to a text area premised on character recognition, and information is reduced as much as possible. Such a response is practically effective.

次に、領域情報エントリ２１の詳細を説明する。図１に示したようにそれぞれの領域は矩形で定義される。領域情報エントリ２１の各左端座標値２７、右端座標値２８、上端座標値２９、下端座標値３０は、図１の用紙左上端１を基準に１．２ｍｍを最小単位として８ｂｉｔ符号無し整数で記述する(但し、型識別子２５の値が図３のａ〜ｄの場合は、後述するように基準点が変更される)。 Next, details of the area information entry 21 will be described. As shown in FIG. 1, each area is defined by a rectangle. Each left end coordinate value 27, right end coordinate value 28, upper end coordinate value 29, and lower end coordinate value 30 of the area information entry 21 are described in 8-bit unsigned integers with 1.2 mm as the minimum unit with reference to the upper left corner 1 of FIG. (However, when the value of the type identifier 25 is a to d in FIG. 3, the reference point is changed as described later).

従って、図１の領域５を例とした場合、印刷上での領域５の左上端部１７と基準点１のｘ方向距離(mm単位)を１．２mmで割って小数点以下を切り捨てた値が情報テーブル４０の５番目のエントリ２１における左端座標値２７の値となる。 Accordingly, when the area 5 in FIG. 1 is taken as an example, the value obtained by dividing the x-direction distance (in mm) between the upper left end 17 of the area 5 and the reference point 1 by 1.2 mm on the printing and rounding down the decimal point. This is the value of the left end coordinate value 27 in the fifth entry 21 of the information table 40.

同様に、領域５の右下端部１８と基準点１のｘ方向実寸から右端座標値２８の値、領域５の左上端部１７と基準点１のｙ方向実寸から右端座標値２９の値、領域５の右下端部１７と基準点１のｙ方向実寸から右端座標値３０の値がそれぞれ決定される。 Similarly, the value of the right end coordinate value 28 from the x-direction actual size of the lower right end portion 18 and the reference point 1 of the region 5, the value of the right end coordinate value 29 from the y upper end portion 17 of the region 5 and the y-direction actual size of the reference point 1, the region The value of the right end coordinate value 30 is determined from the right lower end 17 of 5 and the actual size in the y direction of the reference point 1.

領域情報エントリ２１の型識別子２５には、各領域の内容に従って、図３に示した分類の型識別子を登録する。図３では、型識別子の値を０〜ｆの１６進数で表記し、括弧内に２進数で表記している。 In the type identifier 25 of the area information entry 21, the type identifier of the classification shown in FIG. 3 is registered according to the contents of each area. In FIG. 3, the value of the type identifier is represented by a hexadecimal number of 0 to f, and is represented by a binary number in parentheses.

図１の例の場合、日付領域２に対応する領域情報エントリ２１の型識別子２５の値として、横書きテキストを表す「２（００１０）」を対応させる。次いで表題領域３を日付領域２に継続するテキスト領域として登録するためには、表題領域３に対応する領域情報エントリ２１の型識別子２５の値として、継続テキストを表す「３（００１１）」を対応させる。 In the case of the example of FIG. 1, “2 (0010)” representing horizontal text is associated as the value of the type identifier 25 of the area information entry 21 corresponding to the date area 2. Next, in order to register the title area 3 as a text area continuing to the date area 2, “3 (0011)” representing the continuation text is supported as the value of the type identifier 25 of the area information entry 21 corresponding to the title area 3. Let

型識別子２５により継続テキスト属性を与えられた領域は、ＯＣＲ時には、先行する最後のテキスト属性(型識別子値１〜４)をもつ領域情報エントリ２１に対応する領域に継続するテキスト領域として解釈する。従って、継続テキスト属性を与えられた領域が、縦書きか横書きかは、先行するテキスト属性領域の縦・横に従う。 The region given the continuation text attribute by the type identifier 25 is interpreted as a text region continuing to the region corresponding to the region information entry 21 having the preceding last text attribute (type identifier value 1 to 4) at the time of OCR. Therefore, whether the area given the continuation text attribute is vertical writing or horizontal writing follows the vertical and horizontal directions of the preceding text attribute area.

同様に、図１の領域４〜領域６、領域８、領域１０〜領域１３には、継続テキスト属性を与え、この順番に情報テーブル４０の領域情報エントリ２１に登録する。 Similarly, the continuation text attribute is given to the area 4 to the area 6, the area 8, and the area 10 to the area 13 in FIG. 1 and registered in the area information entry 21 of the information table 40 in this order.

図表領域１４に対応する領域情報エントリ２１の型識別子の値としては、図１の例の場合、線画を表す「５（０１０１）」を指定する。型識別子値が線画属性５の領域は、ＯＣＲで取り込む際、解像度を優先した取り込みを行う。これに対して解像度よりは階調性を重視して読み込むことが適切と考えられる場合には、型式別子の値として、写真を表す「７（０１１１）」を指定する。 As the value of the type identifier of the area information entry 21 corresponding to the chart area 14, “5 (0101)” representing a line drawing is specified in the example of FIG. When the area having the type identifier value of the line drawing attribute 5 is captured by OCR, the resolution is prioritized. On the other hand, if it is considered appropriate to read with emphasis on gradation rather than resolution, “7 (0111)” representing a photograph is designated as the value of the type identifier.

この他、罫線により行列構造が明確にされている表領域の場合には、型識別子の値として表を表す「６（０１１０）」を指定する。この場合、ＯＣＲ時には、罫線認識を優先することで行列構造を解析し、認識処理のヒントとして利用する。 In addition, in the case of a table region whose matrix structure is clarified by ruled lines, “6 (0110)” representing the table is designated as the value of the type identifier. In this case, during OCR, the matrix structure is analyzed by prioritizing ruled line recognition and used as a hint for recognition processing.

図表に付随するキャプション領域７および脚注領域９は、本文の文脈には自然に順序付けて挿入されない独立な文なので、対応する領域情報エントリの型識別子としては、継続属性の無い横書きテキスト属性「１（０００１）」を指定する。 The caption area 7 and the footnote area 9 attached to the chart are independent sentences that are not inserted in a natural order in the context of the main text. Therefore, as the type identifier of the corresponding area information entry, the horizontal text attribute “1 ( 0001) ".

図３の強継続テキスト属性「４（０１００）」は、もともと一つの段落であるにもかかわらず、レイアウト構造上分断されたテキスト領域のように、文脈の結合関係が強いことを強調する場合に使用する。図１の例では、領域１０、領域１２、領域１３が、継続する直前の領域の末尾の段落がレイアウト上分断されていることを想定している。 The strong continuation text attribute “4 (0100)” in FIG. 3 is used when emphasizing that the context is strongly connected like a text area divided in the layout structure, although it is originally one paragraph. use. In the example of FIG. 1, it is assumed that the last paragraph of the area 10, the area 12, and the area 13 immediately before continuing is divided in the layout.

この場合、領域１０、領域１２、領域１３の型識別子値に「４（０１００）」を指定することで、論理的な継続を特に強調することができる。これは、特にもともとの領域分割をレイアウト基準でなく、段落境界とレイアウト境界の両方に沿った分割で与えておくと、後のＯＣＲ時に段落単位の構造を認識結果に付加することが可能になるので、特に有効な情報となる。 In this case, logical continuity can be particularly emphasized by designating “4 (0100)” as the type identifier values of the regions 10, 12, and 13. This is because, in particular, if the original area division is given not by the layout standard but by division along both the paragraph boundary and the layout boundary, it becomes possible to add the structure of the paragraph unit to the recognition result during the subsequent OCR. So it becomes particularly useful information.

領域情報エントリ２１の４ｂｉｔの補助情報２６には、レイアウト上の補助情報として、上位ｂｉｔから順に、領域の上下左右の空白状況を登録する。 In the 4-bit auxiliary information 26 of the area information entry 21, as the auxiliary information on the layout, the blank statuses of the upper, lower, left and right areas are registered in order from the upper bit.

例えば、図１の領域２の様に領域情報エントリ２１に対応する領域上方が空白の領域場合に補助情報２６の３ｂｉｔ目(最上位ｂｉｔ)を１とし、そうでなければ同ｂｉｔは０とする。同様に領域下方が空白になる領域９や領域３では、補助情報２６の２ｂｉｔ目を１とする。補助情報２６の１ｂｉｔ目(左方空白)、０ｂｉｔ目(右方空白)についても同様である。 For example, when the upper area of the area corresponding to the area information entry 21 is blank as in the area 2 of FIG. 1, the third bit (most significant bit) of the auxiliary information 26 is set to 1, otherwise the bit is set to 0. . Similarly, in the area 9 and the area 3 where the area below is blank, the second bit of the auxiliary information 26 is set to 1. The same applies to the 1st bit (left blank) and 0th bit (right blank) of the auxiliary information 26.

これらの補助情報２６は、ＯＣＲ時の原稿の移動・回転・拡大/縮小・ゆがみ等の補正ヒントとして利用される他、スキャン不要な領域を明確にして処理速度を向上させる効果もある。但し、より確実な位置決め情報が必要な場合には、位置決め基準マーカ１５、１６を併用することも有効である。特にカラーのシステムの場合には、位置決め基準マーカ１５、１６の色をイエローにすると目立たないので良い。 The auxiliary information 26 is used as a correction hint for movement, rotation, enlargement / reduction, distortion, etc. of the document during OCR, and also has an effect of clarifying an area that does not require scanning and improving processing speed. However, when more reliable positioning information is required, it is also effective to use the positioning reference markers 15 and 16 together. Particularly in the case of a color system, the color of the positioning reference markers 15 and 16 may be inconspicuous if the color is yellow.

図１の例は、既に印刷されている文書に対するＯＣＲの補助情報埋め込みの例であったが、本発明は、印刷時に印刷されない領域に対して読み取り指示を文書に埋め込む方法としても有効である。図４に例を示す。 The example of FIG. 1 is an example of embedding OCR auxiliary information in a document that has already been printed. However, the present invention is also effective as a method for embedding a reading instruction in an area that is not printed during printing. An example is shown in FIG.

図４の例では、案内文であるテキスト領域５１は、ＯＣＲの対象ではなく、文面を受け取った第三者対する写真５６貼り付け用の写真領域５５および、手書き記入用の領域５２、５３、５４をＯＣＲの対象とすることを想定している。 In the example of FIG. 4, the text area 51 that is a guide sentence is not an object of OCR, but is a photo area 55 for pasting a photograph 56 and third handwriting entry areas 52, 53, 54 for a third party who received the text. Are subject to OCR.

この場合、印刷物５０のテキスト領域５１に電子透かしとして埋め込む情報テーブル４０には、テキスト領域５１の領域情報エントリ２１の他に領域５２〜領域５５の領域情報エントリ２１を登録する。 In this case, the area information entry 21 of the areas 52 to 55 is registered in addition to the area information entry 21 of the text area 51 in the information table 40 embedded as a digital watermark in the text area 51 of the printed matter 50.

このときそれぞれの領域情報エントリ２１の型識別子２５値として、領域５１には図３の相対座標基準属性「０（００００）」を指定し、領域５２〜５４には、白黒ビットマップ相対座標読み取り領域属性「ｃ（１１００）」を指示し、領域５５には、カラービットマップ相対座標読み取り領域属性「ｄ（１１０１）」を指示する。 At this time, the relative coordinate reference attribute “0 (0000)” of FIG. 3 is designated in the area 51 as the type identifier 25 value of each area information entry 21, and the monochrome bitmap relative coordinate reading area is designated in the areas 52 to 54. The attribute “c (1100)” is instructed, and the area 55 is instructed with the color bitmap relative coordinate reading area attribute “d (1101)”.

さらに領域５２〜５４に対しては、文字認識による相対座標読み取り領域属性「ｂ（１０１１）」を指示する領域情報エントリ２１を重複させて登録することにより、画像としての読み込みと文字認識の両方を指示することができる。
ここで、図３における、相対座標読み取り領域とは、情報テーブル４０における直前の相対座標基準属性(型識別子２５の値として「０（００００）」)をもつ領域情報エントリ２１の左端座標値２７および下端座標値３０を基点とした相対座標値で領域を定義することを意味する。これに対して、絶対座標読み取り領域とは、これまでの説明にある型識別子値０〜７と同様に、座標基準点１（用紙角部）を基準とする座標値で領域を定義することである。 Furthermore, by registering the area information entry 21 for designating the relative coordinate reading area attribute “b (1011)” by character recognition for the areas 52 to 54, both reading as an image and character recognition are performed. Can be directed.
Here, the relative coordinate reading area in FIG. 3 is the left end coordinate value 27 of the area information entry 21 having the previous relative coordinate reference attribute (the value of the type identifier 25 is “0 (0000)”) in the information table 40. This means that the region is defined by relative coordinate values with the lower end coordinate value 30 as a base point. On the other hand, the absolute coordinate reading area is defined by defining an area with a coordinate value based on the coordinate reference point 1 (paper corner) as in the case of the type identifier values 0 to 7 described above. is there.

相対座標基準属性の領域は、その領域座標情報である左端座標値２７〜下端座標値３０から得られる領域サイズ情報の他に、その領域に埋め込まれた電子透かし情報の組み合わせから、座標基準点１に依存しないで領域の位置をスキャン画像の中で特定するために使用される。 In addition to the region size information obtained from the left end coordinate value 27 to the bottom end coordinate value 30 as the region coordinate information, the relative coordinate reference attribute region is obtained from the combination of the digital watermark information embedded in the region, the coordinate reference point 1 It is used to specify the position of the region in the scanned image without depending on the.

また、図４の印刷物５０を印刷する場合において、個々の印刷物毎に、電子透かしとして埋め込む情報テーブル４０の付加情報３１の文書識別番号を変えておくことにより、個々の印刷物５０と読み込みデータを１対１に対応付けることが可能となる。 Further, when printing the printed matter 50 of FIG. 4, the individual printed matter 50 and the read data are set to 1 by changing the document identification number of the additional information 31 of the information table 40 embedded as the electronic watermark for each printed matter. It becomes possible to make it correspond to a pair.

これにより、図５の様に複数の印刷物５０を１ページにレイアウトして同時にスキャンしても、ＯＣＲすべき領域を各印刷物に対応付けて正しく認識することが可能となる。 As a result, even if a plurality of printed materials 50 are laid out on one page and simultaneously scanned as shown in FIG. 5, the region to be OCR can be correctly recognized in association with each printed material.

なお、先の図３の型識別子２５の値８〜ｄは、印刷時には印刷情報の無い領域属性を示すものであり、その領域にデータが存在しない、あるいはその領域の画像品質が保証されない可能性を想定している点で型識別子値０〜７とは異なる領域属性として扱われる。 Note that the values 8 to d of the type identifier 25 in FIG. 3 indicate an area attribute without print information at the time of printing, and there is a possibility that no data exists in the area or the image quality of the area is not guaranteed. Is treated as a region attribute different from the type identifier values 0 to 7.

図６には、本発明のシステムにおけるＯＣＲ補助情報を埋め込んだ印刷物生成の流れを示す。 FIG. 6 shows a flow of print generation in which OCR auxiliary information is embedded in the system of the present invention.

ここでは、本発明の実施形態として図９に示すような、情報処理装置１０２、光学読み取り装置１００、画像表示装置１０１、情報入力装置１０３、領域指示装置１０４、印刷装置１０５で構成されるシステムを想定する。なお、光学読み取り装置１００は、情報処理装置１０１に搭載されるＯＣＲソフトウエアとの組み合わせで、ＯＣＲシステムを構築している。 Here, as an embodiment of the present invention, a system including an information processing apparatus 102, an optical reading apparatus 100, an image display apparatus 101, an information input apparatus 103, an area instruction apparatus 104, and a printing apparatus 105 as shown in FIG. Suppose. The optical reading apparatus 100 constructs an OCR system in combination with OCR software installed in the information processing apparatus 101.

まず、ステップ６０では、情報処理装置１０２上のアプリケーションソフトウエアにより印刷するべきドキュメントデータを作成する。 First, in step 60, document data to be printed is created by application software on the information processing apparatus 102.

ステップ６１では、情報処理装置１０２上で動作するプリンタドライバにより、用紙サイズ、余白設定、印刷枚数等、各種印刷条件を設定する。 In step 61, various printing conditions such as paper size, margin setting, number of prints, and the like are set by a printer driver operating on the information processing apparatus 102.

ステップ６２では、プリンタドライバに対して、ＯＣＲ補助情報埋め込みを行うかどうかの指示を与える。ＯＣＲ補助情報埋め込み処理を行う場合には、画像表示装置１０１上に、印刷プレビュー画像１０６が表示される。 In step 62, the printer driver is instructed whether to embed OCR auxiliary information. When the OCR auxiliary information embedding process is performed, the print preview image 106 is displayed on the image display device 101.

ステップ６４でユーザは、印刷プレビュー画像１０６に対して、領域指示装置１０４を用いて、図１の各領域に相当するような個々の領域とその属性指定を行うことで、情報テーブル４０を生成する。 In step 64, the user generates an information table 40 by designating each area and its attribute corresponding to each area in FIG. 1 using the area instruction device 104 for the print preview image 106. .

次いでステップ６５の印刷命令の発効により、プリンタドライバは、ドキュメントデータを印刷画像データに展開し、ステップ６６で、先の情報テーブル４０を電子透かしとしてこの印刷画像データに付加した後、ステップ６７で印刷画像データをプリンタへと送信する。 Next, when the print command is issued in step 65, the printer driver expands the document data into print image data. In step 66, the previous information table 40 is added to the print image data as a digital watermark. Send the image data to the printer.

先のステップ６２で、ＯＣＲ補助情報の埋め込み処理を行わない指示を与える場合には、ステップ６８の印刷命令により、ステップ６９の通常の印刷画像データ生成のみを行い、ステップ６７でプリンタへ生成された印刷画像データを送信する。 If an instruction not to perform the OCR auxiliary information embedding process is given in the previous step 62, only normal print image data generation in step 69 is performed by the print command in step 68, and the print data generated in step 67 is generated in the printer. Send print image data.

図６の例は、ユーザが対話的に情報テーブル４０を生成する例であるが、文書データ７０の形態によっては、これを自動生成することも可能である。このような文書データ７０の形態としては、論理構造がレイアウト構造とは独立に記述され提供されるＴｅＸのようなマークアップ言語が適している。図７に情報テーブル４０を自動生成する場合の処理の流れを示す。 The example of FIG. 6 is an example in which the user interactively generates the information table 40. However, depending on the form of the document data 70, this can be automatically generated. As a form of such document data 70, a markup language such as TeX in which a logical structure is described and provided independently of a layout structure is suitable. FIG. 7 shows the flow of processing when the information table 40 is automatically generated.

図７では文書データ７０に対して、ステップ７２で領域情報エントリ２１を作成しつつステップ７１で文書データをビットマップに展開することで情報テーブル４０を自動的に生成する。ステップ７３で、情報テーブル４０を電子透かし情報としてビットマップデータに付加し、ステップ７４で、プリンタエンジンへデータを送信する。 In FIG. 7, the information table 40 is automatically generated by developing the region information entry 21 in step 72 for the document data 70 and expanding the document data into a bitmap in step 71. In step 73, the information table 40 is added to the bitmap data as digital watermark information, and in step 74, the data is transmitted to the printer engine.

この場合、上記ステップ７１〜７３の処理は、自動処理なので、印刷装置１０５に組み込みのコントローラでも処理は可能である。ステップ７４におけるプリンタエンジンは、印刷装置１０５における組み込みコントローラを除く部分を意味する。 In this case, since the processing in steps 71 to 73 is automatic processing, the processing can be performed by a controller incorporated in the printing apparatus 105. The printer engine in step 74 means a portion excluding the embedded controller in the printing apparatus 105.

図８には、本発明のシステムのＯＣＲ側処理の流れを示す。
ステップ８０では、情報処理装置１０２により、写真領域の解像度指定や、カラー/モノクロ指定などのスキャン条件設定を行う。
ステップ８１では、光学読み取り装置１００により原稿をスキャンし、ビットマップデータ化する。 FIG. 8 shows the flow of OCR side processing of the system of the present invention.
In step 80, the information processing apparatus 102 performs scan condition settings such as photo area resolution designation and color / monochrome designation.
In step 81, the original is scanned by the optical reading device 100 and converted into bitmap data.

ステップ８２では、スキャンされたビットマップデータに、電子透かしとして埋め込まれたＯＣＲ補助情報としての、情報テーブル４０の有無を検出する。情報テーブル４０が無い場合には、情報処理装置１０２において通常の文字認識処理９２を行う。 In step 82, the presence or absence of the information table 40 as OCR auxiliary information embedded as digital watermark in the scanned bitmap data is detected. If there is no information table 40, the information processing apparatus 102 performs normal character recognition processing 92.

情報テーブル４０がある場合には、ステップ８３で情報テーブル４０を抽出する。 If there is an information table 40, the information table 40 is extracted in step 83.

ステップ８４では、抽出された情報テーブル４０の情報と、簡易的なビットマップデータの認識情報に基づいて、原稿の平行移動、回転、拡大/縮小、ゆがみなどを検出し、これを補正する。 In step 84, based on the extracted information in the information table 40 and simple bitmap data recognition information, translation, rotation, enlargement / reduction, distortion, etc. of the document are detected and corrected.

ステップ８５では、情報テーブル４０のテキスト領域属性のエントリに基づいて、テキスト領域と、テキスト領域間の継続関係を抽出する。その後、ステップ８６で、文字認識を行う。文字認識は、文脈に対するデータベースの利用が認識率を上げる上で効果的に作用するため、継続するテキスト領域にまたがる部分の認識処理は同時に解析を行う。 In step 85, based on the text area attribute entry of the information table 40, the text area and the continuation relationship between the text areas are extracted. Thereafter, in step 86, character recognition is performed. In character recognition, the use of a database for the context effectively works to increase the recognition rate. Therefore, the recognition processing of the portion that extends over the continuous text area is simultaneously analyzed.

文字認識が完了すると、ステップ８７で、情報テーブル４０のエントリに出現する最初の図表・写真など非文字領域の抽出を行う。 When the character recognition is completed, in step 87, the first non-character region such as a chart / photo that appears in the entry of the information table 40 is extracted.

読み取り属性として、高解像度読み取りや、カラーとしての読み取りなどで、再スキャンが必要な場合には、ステップ８９で、光学読み取り装置１００のスキャン条件を再設定し、該当領域の再スキャンを行う。 If rescanning is necessary for high resolution reading or color reading as a reading attribute, the scanning condition of the optical reading device 100 is reset in step 89 and the corresponding area is rescanned.

非文字列領域処理が残っている場合には、情報テーブル４０に出現する次の非文字列領域に対して、ステップ８７〜ステップ９１の処理を反復する。
全ての処理対象のエントリの処理が終了するとステップ９３でデータを出力あるいはファイル保存する。 When the non-character string area process remains, the processes of step 87 to step 91 are repeated for the next non-character string area appearing in the information table 40.
When processing of all entries to be processed is completed, data is output or saved in a file in step 93.

以上の説明では、図９のハードウエア構成のシステムを想定したが、印刷データ生成、印刷、ＯＣＲは同時に行う物ではないので、これらの光学読み取り装置１００、情報処理装置１０２、印刷装置１０６は、複数で構成されても、どのような組み合わせで一体化された装置であっても良い。また、時間的空間的に離れたシステムであっても問題はない。 In the above description, the system having the hardware configuration shown in FIG. 9 is assumed. However, since print data generation, printing, and OCR are not performed at the same time, the optical reading device 100, the information processing device 102, and the printing device 106 are The apparatus may be composed of a plurality of devices or integrated in any combination. Moreover, there is no problem even if the system is separated in time and space.

本発明の印刷システムでは、テキスト領域要素中に電子透かしとして埋め込まれた情報テーブルから、ＯＣＲ時にテキスト領域およびテキスト領域の継続関係を正確に予測することができる。
このため、テキスト改行やテキスト領域の分断による単語・文・段落の分割に対しても、正確な語順を予測することができるため、ＯＣＲ効率を高めることが可能となり、ＯＣＲ作業者の作業負荷が軽減される。 In the printing system of the present invention, the continuation relationship between the text area and the text area can be accurately predicted from the information table embedded as a digital watermark in the text area element during OCR.
For this reason, since it is possible to predict the exact word order even for word / sentence / paragraph division due to text line breaks or text area divisions, it is possible to increase OCR efficiency and reduce the workload of OCR workers. It is reduced.

また、印刷時には存在しない印刷物の利用者によって挿入される情報をＯＣＲすべき領域として相対的に指示することも可能であるため、アンケートなどの情報整理の一括処理に効果的である。 Further, since information inserted by a user of a printed matter that does not exist at the time of printing can be relatively designated as an area to be OCRed, it is effective for collective processing of information organization such as a questionnaire.

印刷物の概念図である。It is a conceptual diagram of printed matter. 情報テーブルの構成図である。It is a block diagram of an information table. 型識別子値と内容属性の対応表である。It is a correspondence table of type identifier values and content attributes. 書き込みを想定した印刷物の概念図である。It is a conceptual diagram of the printed matter which assumed writing. 複数の印刷物の同時ＯＣＲの例である。It is an example of simultaneous OCR of a plurality of printed matter. 印刷物生成の流れ図である。It is a flowchart of printed matter generation. 印刷物自動生成の流れ図である。It is a flowchart of printed matter automatic generation. ＯＣＲの処理の流れ図である。It is a flowchart of a process of OCR. システム例である。It is a system example.

Explanation of symbols

１…座標基準点、２…日付領域、3…表題領域、４…執筆者名領域、５…領域、６…領域、１５、１６…位置決め基準マーカ、２０…ヘッダ、２１…領域情報エントリ、２３…バージョン情報、２４…付加情報オフセット、２５…型識別子、２６…補助情報、２７…左端座標値、２８…右端座標値、２９…上端座標値、３０…下端座標値、３１…付加情報、４０…情報テーブル、５０…印刷物、５１…テキスト領域、５２…領域、５３…領域、５４…領域、５５…写真領域、１００…光学読み取り装置、１０１…画像表示装置、１０２…情報処理装置、１０３…情報入力装置、１０４…領域指示装置、１０５…印刷装置、１０６…プレビュー画像。
DESCRIPTION OF SYMBOLS 1 ... Coordinate reference point, 2 ... Date area, 3 ... Title area, 4 ... Author name area, 5 ... Area, 6 ... Area, 15, 16 ... Positioning reference marker, 20 ... Header, 21 ... Area information entry, 23 ... version information, 24 ... additional information offset, 25 ... type identifier, 26 ... auxiliary information, 27 ... left end coordinate value, 28 ... right end coordinate value, 29 ... upper end coordinate value, 30 ... lower end coordinate value, 31 ... additional information, 40 ... information table, 50 ... printed matter, 51 ... text area, 52 ... area, 53 ... area, 54 ... area, 55 ... photographic area, 100 ... optical reading apparatus, 101 ... image display apparatus, 102 ... information processing apparatus, 103 ... Information input device 104... Area instruction device 105... Printing device 106.

Claims

An information processing apparatus that generates print image data based on electronic information, a printing apparatus that prints print image data, and a print that includes an OCR system that optically reads information printed on printed matter and performs recognition processing A system,
The information processing apparatus embeds text area information and text area order information as digital watermark information in print image data,
The OCR system performs character recognition processing using digital watermark information reproduced on a printed matter printed by the printing apparatus as auxiliary information.

The printing system according to claim 1,
The printing system according to claim 1, wherein the text area order information is given as an order of entries of the text area information.

The printing system according to claim 1,
A printing system characterized by embedding area attribute information and area information as electronic watermark information in a text, as well as non-text attribute areas such as charts and photo areas.

A printing system according to claim 1 or claim 3, wherein
The information processing apparatus includes an image display device and an image area instruction device,
A printing system comprising a user interface that allows a print preview image to be displayed on the image display apparatus to be instructed in an interactive environment with a user in a text area, a non-text area, and their order relationship. .

A printing system according to claim 1 or claim 3, wherein
The information processing apparatus estimates the text region, the non-text region, and their order relationship directly or indirectly from the electronic information.

The printing system according to claim 5, wherein
The text area is divided along a logical structure such as a paragraph boundary of a sentence, and connection information is electronically connected to a text area on the same logical structure divided into a plurality of text areas in the document layout. A printing system characterized by being added to watermark information.

An information processing device that generates print image data based on electronic information, an image display device, an image area instruction device, a printing device that prints print image data, and optically reads information printed on printed matter A printing system having an OCR system for performing recognition processing,
The information processing apparatus can specify text area information, non-text area information, and order information thereof to be read as OCR for a print preview image displayed on the image display apparatus in an interactive environment with a user. And embedding the text area information, non-text area information, and order information as digital watermark information in the print image data,
The OCR system reads a text area or a non-text area instructed in the digital watermark information for a printed matter in which the digital watermark information is embedded.

The printing system according to claim 7,
The OCR system switches a processing method such as color / monochrome, resolution, presence / absence of recognition processing according to attributes of a text region and a non-text region.