JP2002041497A

JP2002041497A - Document image processor and method for processing document image

Info

Publication number: JP2002041497A
Application number: JP2000228496A
Authority: JP
Inventors: Ichiro Iida; 一郎飯田; Masahiro Saito; 雅宏斎藤; Kenjiro Ikehata; 健二郎池畑
Original assignee: Toppan Printing Co Ltd
Current assignee: Toppan Inc
Priority date: 2000-07-28
Filing date: 2000-07-28
Publication date: 2002-02-08

Abstract

PROBLEM TO BE SOLVED: To provide a document image processor and a method for processing document image, by which the data of a document image in a page description language in particular is divided into areas, a tag/attribute value is allocated to data in the divided areas and a document image in a structured description language is generated, on the basis of the tag/attribute values about the processing of document image data of a magazine, a catalog, a leaflet, etc., in which information such as much merchandise and many stores are described. SOLUTION: This document image processor has at least a means for classifying unit data, constituting the document image according to colors applied to the unit data and a means for allocating the tag/attribute values to the classification.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は雑誌やカタログ、チ
ラシなどの文書画像のデータの処理に関し、特にページ
記述言語による文書画像のデータを領域に分割し、分割
した領域内のデータにタグ・属性値を割り当て、これら
に基づき、構造化記述言語による文書画像を生成する文
書画像処理装置及び文書画像処理方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to processing of document image data such as magazines, catalogs, and flyers, and more particularly, to dividing document image data in a page description language into regions, and tagging / attributing data in the divided regions. The present invention relates to a document image processing apparatus and a document image processing method for assigning values and generating a document image in a structured description language based on these values.

【０００２】[0002]

【従来の技術】多数の商品や店舗などの情報を掲載した
雑誌やカタログ、チラシなどの文書画像は、高い顧客吸
引力を必要とすることから、写真、イラストの他、解説
の文字、マーク、ロゴ、が様々なフォント、デザインで
示されている。これら文書画像のデータの多くは、高い
ページ再現力を有するページ記述言語によって作成され
ている。特にＰＤＦ形式（ＰｏｒｔａｂｌｅＤｏｃｕ
ｍｅｎｔＦｏｒｍａｔ、米国アドビ社商標）で作成さ
れるＰＤＦファイルは、高い画像再現力を有するので好
適に使用されている。2. Description of the Related Art Document images such as magazines, catalogs, and flyers that contain information on a large number of products and stores require high customer attraction, so in addition to photographs and illustrations, commentary characters, marks, The logo is shown in various fonts and designs. Most of the data of these document images are created by a page description language having high page reproducibility. In particular, PDF format (Portable Docu
Ment Format, a trademark of Adobe in the United States), is preferably used because it has high image reproducibility.

【０００３】一方、近年の情報通信技術の著しい発展か
ら、このような情報を掲載した文書画像の情報をコンピ
ューターに取り込んでデータベース化し、及び／又はそ
の情報を他の表示媒体例えばＷｅｂ上や液晶画面を有す
る携帯電話上で表示できるような構造化記述言語に情報
加工することが望まれてきている。[0003] On the other hand, due to the remarkable development of information communication technology in recent years, information of a document image carrying such information is taken into a computer and made into a database, and / or the information is stored on another display medium such as a Web or a liquid crystal screen. It has been desired to process the information into a structured description language that can be displayed on a mobile phone having the following.

【０００４】構造化記述言語としてはＳＧＭＬ（Ｓｔａ
ｎｄａｒｄＧｅｎｅｒａｌｉｚｅｄＭａｒｋｕｐ
Ｌａｎｇｕａｇｅ）、ＨＴＭＬ（ＨｙｐｅｒＴｅｘｔ
ＭａｒｋｕｐＬａｎｇｕａｇｅ）、ＸＭＬ（Ｅｘｔ
ｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）等
があるが、データベース他媒体表示には、その拡張性や
柔軟性から、ＸＭＬが好適に使用されている。As a structured description language, SGML (Sta
nd Generalized Markup
Language), HTML (Hyper Text)
Markup Language), XML (Ext
Although there is an enable markup language, etc., XML is suitably used for the display of other media in a database due to its expandability and flexibility.

【０００５】このような情報加工方法としては手作業に
よるところが大きい。一部新聞記事などでは、レイアウ
ト構造から論理構造を導き出し、文書画像と論理構造を
結び付けて構造化記述言語によるファイルとする処理を
半自動で行おうという試みがなされている。しかしなが
ら、多数の商品や店舗などの情報を掲載した雑誌やカタ
ログ、チラシなどの文書画像の場合、写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されることから、従来のような変換方法では
十分な結果は得られず、各種アプリケーションやデータ
ベースなどで利用できるようにするといった要求には答
えることができないものであった。[0005] Such an information processing method largely depends on manual operations. In some newspaper articles, attempts have been made to semi-automatically derive a logical structure from a layout structure and link a document image and the logical structure into a file in a structured description language. However, in the case of document images such as magazines, catalogs, and flyers that contain information on a large number of products and stores, in addition to photos and illustrations, explanatory characters, marks, logos, etc. are shown in various fonts and designs. However, a conventional conversion method cannot provide a sufficient result, and cannot respond to a request for use in various applications or databases.

【０００６】[0006]

【発明が解決しようとする課題】本発明はこのような問
題点を解決するためになされたものであり、その課題と
するところは、多数の商品や店舗などの情報を掲載した
雑誌やカタログ、チラシなどの文書画像のデータの処理
に関し、特にページ記述言語による文書画像のデータを
領域に分割し、分割した領域内のデータにタグ・属性値
を割り当て、これらに基づき、構造化記述言語による文
書画像を生成する文書画像処理装置及び文書画像処理方
法に関する。特にＰＤＦ形式により表現される文書から
ＸＭＬ形式により表現される文書を生成する文書処理装
置及び文書画像処理方法を提供することにある。SUMMARY OF THE INVENTION The present invention has been made in order to solve such problems, and it is an object of the present invention to provide a magazine, catalog, or the like, which contains information on a large number of products and stores. Regarding the processing of document image data such as flyers, in particular, the document image data in the page description language is divided into regions, tags and attribute values are assigned to the data in the divided regions, and based on these, documents in the structured description language are used. The present invention relates to a document image processing apparatus and a document image processing method for generating an image. In particular, it is an object of the present invention to provide a document processing apparatus and a document image processing method for generating a document represented by an XML format from a document represented by a PDF format.

【０００７】[0007]

【課題を解決するための手段】本発明はこの課題を解決
するため、すなわち請求項１記載の発明は、文書画像を
構成する単位データを、少なくとも該単位データに適用
されている色により分類する手段と、前記分類にタグ・
属性値を割り当てる手段とを有することを特徴とする文
書画像処理装置である。The present invention solves this problem, that is, according to the first aspect of the present invention, classifies unit data constituting a document image by at least a color applied to the unit data. Means and tags
Means for assigning attribute values.

【０００８】本発明はこの手段により、従来のタグ・属
性値付けが困難だった多数の商品や店舗などの情報を掲
載した雑誌やカタログ、チラシなどの写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されているものに対して、特に単位データの
色により分けることにより、タグ・属性値を容易に把握
可能とした。つまり、本発明において、色はこれら多数
の商品や店舗などの情報を掲載した雑誌やカタログ、チ
ラシの単位データの意味づけとして重要な意味があるこ
とを見出し、またその編集方針により明確に分類し、タ
グ・属性値を付けることを可能とした。According to the present invention, by means of this method, in addition to photographs and illustrations of magazines, catalogs, flyers, etc., which contain information on a large number of products and stores, for which tagging and attribute valuation were difficult in the past, explanatory characters, marks, etc. , Logo, and the like are indicated by various fonts and designs, and the tags and attribute values can be easily grasped by dividing them by the color of the unit data. In other words, in the present invention, it has been found that colors have an important meaning as meanings of unit data of magazines, catalogs, and flyers that publish information of these many products and stores, and are clearly classified according to their editing policies. , Tags and attribute values.

【０００９】また請求項２記載の発明は、ページ記述言
語による文書画像を読み込む手段と、文書画像を構成す
る単位データを、少なくとも該単位データに適用されて
いる色により分類する手段と、前記分類にタグ・属性値
を割り当てる手段と、前記単位データ間の関連限界距離
を設定する手段と、ページ記述言語内の順序及び／又は
ページ記述言語により指定された座標に基づき、単位デ
ータの順序づけを行う手段と、前記単位データの関連限
界距離、分類及びそのタグ・属性値、および順序づけに
基づき、文書画像のデータを領域に分割する手段と、前
記分割した領域内のデータと割り当てられたタグ・属性
値に基づき、構造化記述言語による文書画像を生成する
手段とを有することを特徴とする文書画像処理装置であ
る。According to a second aspect of the present invention, a means for reading a document image in a page description language, a means for classifying unit data constituting a document image by at least a color applied to the unit data, Means for assigning a tag / attribute value to the unit, means for setting a relevant limit distance between the unit data, and ordering of the unit data based on an order in the page description language and / or coordinates specified by the page description language. Means, means for dividing the data of the document image into regions based on the relevant limit distance of the unit data, classification and its tag / attribute value, and ordering, and tags / attributes assigned to the data in the divided regions Means for generating a document image in a structured description language on the basis of a value.

【００１０】本発明はこの手段により、従来、レイアウ
ト構造から論理構造を導き出し、文書画像と論理構造を
結び付けて構造化記述言語によりファイルとする処理方
法では不可能だった多数の商品や店舗などの情報を掲載
した雑誌やカタログ、チラシなどの写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されているものに対して、特に単位データの
色による分類及びそのタグ・属性値付け、単位データの
関連限界距離の設定、単位データの順序づけに基づき文
書画像のデータを領域に分割すること可能とし、これに
より構造化記述言語化することを可能とした。例えば、
単位データ関連限界距離により初期段階の領域分割を行
い、その後単位データの順序順にタグ・属性値の近接単
位データとの相違を識別しながらさらに分割、結合を行
い、最終的な領域の分割を行い、構造化記述言語化を行
う、といったことにより可能となる。According to the present invention, a logical structure is derived from a layout structure by this means, and a document image and a logical structure are linked to form a file using a structured description language. In addition to photographs and illustrations of magazines, catalogs, flyers, etc. that have posted information, texts, marks, logos, etc. of commentary that are indicated in various fonts and designs, especially classification by unit data color and its It is possible to divide document image data into regions based on tag / attribute value assignment, setting of a limit distance related to unit data, and ordering of unit data, thereby enabling a structured description language. For example,
Area division is performed at the initial stage based on the unit data-related limit distance, and then further divided and combined in the order of the unit data while discriminating the difference of the tag / attribute value from the adjacent unit data, and finally dividing the area. It is possible to perform structured description language.

【００１１】また請求項３記載の発明は、ページ記述言
語による文書画像を読み込む手段と、文書画像を構成す
るデータを、単位文字データと単位画像データに分け
て、該単位文字データを少なくとも該文字に適用されて
いる色により分類する手段と、前記分類にタグ・属性値
を割り当てる手段と、前記単位文字データ間の関連限界
行間距離、関連限界文字間距離を設定する手段と、ペー
ジ記述言語内の順序及び／又はページ記述言語により指
定された座標に基づき、単位文字データ及び単位画像デ
ータの順序づけを行う手段と、前記単位文字データの関
連限界行間距離、関連限界文字間距離、分類及びそのタ
グ・属性値、および順序づけに基づき、文書画像のデー
タを領域に分割する手段と、前記単位画像データのペー
ジ記述言語内の順序及び／又はページ記述言語により指
定された座標と、文書画像の領域及び該領域に割り当て
られたタグ・属性値に基づき、各単位画像データにタグ
・属性値を割り当てる手段と、前記分割した領域内の文
字データと画像データと割り当てられたタグ・属性値に
基づき、構造化記述言語による文書画像を生成する手段
とを有することを特徴とする文書画像処理装置である。According to a third aspect of the present invention, there is provided means for reading a document image in a page description language, dividing data constituting the document image into unit character data and unit image data, and converting the unit character data into at least the character Means for classifying according to the color applied to the group, means for assigning a tag / attribute value to the classification, means for setting the related limit line-to-line distance between the unit character data and the related limit character-to-character distance, Means for ordering the unit character data and the unit image data based on the order of the unit character data and / or the coordinates specified by the page description language, and the related limit line distance, related limit character distance, classification, and tag of the unit character data. Means for dividing the document image data into regions based on the attribute values and the ordering, and the order of the unit image data in the page description language Means for allocating a tag / attribute value to each unit image data based on coordinates specified by a page description language and / or an area of the document image and a tag / attribute value assigned to the area; Means for generating a document image in a structured description language based on the character data and the image data and the assigned tag / attribute value.

【００１２】本発明はこの手段により、従来、レイアウ
ト構造から論理構造を導き出し、文書画像と論理構造を
結び付けて構造化記述言語によりファイルとする処理方
法では不可能だった多数の商品や店舗などの情報を掲載
した雑誌やカタログ、チラシなどの写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されている単位文字データ・単位画像データ
からなるものに対して、特に単位文字データの色による
分類及びそのタグ・属性値付け、単位文字データの関連
限界行間距離、関連限界文字間距離の設定、単位データ
の順序づけに基づき文書画像のデータを領域に分割する
こと可能とし、これにより構造化記述言語化することを
可能とした。例えば、単位文字データ関連限界距離によ
り初期段階の領域分割を行い、その後単位文字データの
順序順にタグ・属性値の近接単位データとの相違を識別
しながらさらに分割、結合を行い、最終的な領域の分割
を行ない、さらにこの分割領域と単位画像データ座標の
相違を識別しながら単位画像データのタグ・属性値を割
り当て、構造化記述言語化を行う、といったことにより
可能となる。According to the present invention, a logical structure is derived from a layout structure by this means, and a document image and a logical structure are linked to form a file using a structured description language. In addition to photos and illustrations of magazines, catalogs, flyers, etc. that posted information, those with commentary characters, marks, logos, etc. consisting of unit character data and unit image data shown in various fonts and designs, In particular, it is possible to classify document image data into regions based on the classification of unit character data by color and its tag / attribute value setting, setting of the related limit line spacing and related limit character distance of unit character data, and ordering of unit data. This makes it possible to make it into a structured description language. For example, the area is divided at the initial stage by the unit character data-related limit distance, and then further divided and combined in the order of the unit character data while identifying the difference between the tag / attribute value and the adjacent unit data, thereby performing the final area. Is performed, the tags and attribute values of the unit image data are assigned while identifying the difference between the divided area and the unit image data coordinates, and the structured description language is created.

【００１３】また請求項４記載の発明は、前記単位デー
タ又は文字データの色は印刷物の色分解データに基づく
ことを特徴とする請求項１〜３のいずれかに記載の文書
画像処理装置である。The invention according to claim 4 is the document image processing apparatus according to any one of claims 1 to 3, wherein the color of the unit data or character data is based on color separation data of a printed matter. .

【００１４】本発明はこの手段により、前記単位データ
又は単位文字データを分類するための色の情報が画面表
示のデータより多い印刷物の色分解データに基づくこと
により、より分類が容易になりタグ・属性値の決定を容
易に可能とする。特に本文となる単位文字データには墨
単色が多用されるが、それ以外の単位文字データの場合
との分類がより容易になる。また同じ書体、大きさであ
っても白抜き、網掛け等をより容易に分類可能となる。According to the present invention, by this means, the color information for classifying the unit data or the unit character data is based on the color separation data of the printed matter which is larger than the data of the screen display, so that the classification becomes easier. Attribute values can be easily determined. In particular, black single color is frequently used for unit character data serving as a text, but classification with other unit character data becomes easier. In addition, even if the typeface and the size are the same, it is possible to more easily classify the outline, hatching, and the like.

【００１５】また請求項５記載の発明は、前記ページ記
述言語による文書画像はＰＤＦ形式による文書画像であ
ることを特徴とする請求項２〜４のいずれかに記載の文
書画像処理装置である。The invention according to claim 5 is the document image processing apparatus according to any one of claims 2 to 4, wherein the document image in the page description language is a document image in a PDF format.

【００１６】本発明はこの手段により、現在多くの組
版、校正で用いられているＰＤＦ形式のファイルを用い
ることが可能となる。また、特に各文字データや画像デ
ータが座標で定められており、その流用も可能である。
さらに色表示に関しては作成した色を登録する機能など
もあり、これらに対応することも可能となる。According to the present invention, it is possible to use the PDF format file currently used for many typesetting and proofreading by this means. In particular, each character data and image data are determined by coordinates, and the data can be used.
Further, there is a function of registering the created color for the color display, and it is possible to correspond to these functions.

【００１７】また請求項６記載の発明は、前記構造化記
述言語による文書画像はＸＭＬ形式による文書画像であ
ることを特徴とする請求項２〜５のいずれかに記載の文
書画像処理装置である。The invention according to claim 6 is the document image processing apparatus according to any one of claims 2 to 5, wherein the document image in the structured description language is a document image in an XML format. .

【００１８】本発明はこの手段により、従来のＨＴＭ
Ｌ、ＳＧＭＬよりもさらに進んだ形式であるＸＭＬ形式
のファイルとすることが可能となる。また特にタグ・属
性値の設定を自由に行えるので、特定の色に対して「商
品名」「価格」「店舗名」といった設定、定義を設けて
おけば、いずれも同じ編集方針である同種のデータから
は同種の構造を導き出すことが可能となり、かつ膨大な
データに対してデータベース化、検索、一部属性のみの
データの更新なども可能となる。According to the present invention, the conventional HTM
It is possible to use an XML format file which is a format that is more advanced than L and SGML. In particular, since the setting of tags and attribute values can be freely set, setting and defining such as "product name", "price", and "store name" for a specific color allows the same editing policy to be applied. It is possible to derive the same kind of structure from the data, and it is also possible to create a database, search, and update data of only a part of the huge data.

【００１９】また請求項７記載の発明は、文書画像を構
成する単位データを、少なくとも該単位データに適用さ
れている色により分類する工程と、前記分類にタグ・属
性値を割り当てる工程とを有することを特徴とする文書
画像処理方法である。The invention according to claim 7 has a step of classifying unit data constituting a document image by at least a color applied to the unit data, and a step of assigning a tag / attribute value to the classification. A document image processing method characterized in that:

【００２０】本発明はこの手段により、従来のタグ・属
性値付けが困難だった多数の商品や店舗などの情報を掲
載した雑誌やカタログ、チラシなどの写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されているものに対して、特に単位データの
色により分けることにより、タグ・属性値を容易に把握
可能とした。つまり、本発明において、色はこれら多数
の商品や店舗などの情報を掲載した雑誌やカタログ、チ
ラシの単位データの意味づけとして重要な意味があるこ
とを見出し、またその編集方針により明確に分類し、タ
グ・属性値を付けることを可能としたAccording to the present invention, by this means, in addition to photographs and illustrations of magazines, catalogs, flyers, and the like, which include information on a large number of products and stores, for which tagging and attribute valuation were difficult in the past, commentary characters and marks. , Logo, and the like are indicated by various fonts and designs, and the tags and attribute values can be easily grasped by dividing them by the color of the unit data. In other words, in the present invention, it has been found that colors have an important meaning as meanings of unit data of magazines, catalogs, and flyers that publish information of these many products and stores, and are clearly classified according to their editing policies. , Tag and attribute values

【００２１】また請求項８記載の発明は、ページ記述言
語による文書画像を読み込む工程と、前記文書画像を構
成する単位データを、少なくとも該単位データに適用さ
れている色により分類する工程と、前記分類にタグ・属
性値を割り当てる工程と、前記単位データ間の関連限界
距離を設定する工程と、前記ページ記述言語内の順序及
び／又はページ記述言語により指定された座標に基づ
き、単位データの順序づけを行う工程と、前記単位デー
タの関連限界距離、分類及びそのタグ・属性値、および
順序づけに基づき、文書画像のデータを領域に分割する
工程と、前記分割した領域内のデータと割り当てられた
タグ・属性値に基づき、構造化記述言語による文書画像
を生成する工程とからなることを特徴とする文書画像処
理方法である。The invention according to claim 8 is a step of reading a document image in a page description language, a step of classifying unit data constituting the document image by at least a color applied to the unit data, Assigning a tag / attribute value to a classification, setting an associated limit distance between the unit data, and ordering the unit data based on an order in the page description language and / or coordinates specified by the page description language. And dividing the document image data into regions based on the relevant limit distance of the unit data, the classification and its tag / attribute value, and the ordering. The data in the divided region and the assigned tags A step of generating a document image in a structured description language based on the attribute value.

【００２２】本発明はこの手段により、従来、レイアウ
ト構造から論理構造を導き出し、文書画像と論理構造を
結び付けて構造化記述言語によりファイルとする処理方
法では不可能だった多数の商品や店舗などの情報を掲載
した雑誌やカタログ、チラシなどの写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されているものに対して、特に単位データの
色による分類及びそのタグ・属性値付け、単位データの
関連限界距離の設定、単位データの順序づけに基づき文
書画像のデータを領域に分割すること可能とし、これに
より構造化記述言語化することを可能とした。例えば、
単位データ関連限界距離により初期段階の領域分割を行
い、その後単位データの順序順にタグ・属性値の近接単
位データとの相違を識別しながらさらに分割、結合を行
い、最終的な領域の分割を行い、構造化記述言語化を行
う、といったことにより可能となる。According to the present invention, by this means, a logical structure is derived from a layout structure, and a document image and a logical structure are linked to form a file using a structured description language. In addition to photographs and illustrations of magazines, catalogs, flyers, etc. that have posted information, texts, marks, logos, etc. of commentary that are indicated in various fonts and designs, especially classification by unit data color and its It is possible to divide document image data into regions based on tag / attribute value assignment, setting of a limit distance related to unit data, and ordering of unit data, thereby enabling a structured description language. For example,
Area division is performed at the initial stage based on the unit data-related limit distance, and then further divided and combined in the order of the unit data while discriminating the difference of the tag / attribute value from the adjacent unit data, and finally dividing the area. It is possible to perform structured description language.

【００２３】また請求項９記載の発明は、ページ記述言
語による文書画像を読み込む工程と、文書画像を構成す
るデータを、単位文字データと単位画像データに分け
て、該単位文字データを少なくとも該文字に適用されて
いる色により分類する工程と、前記分類にタグ・属性値
を割り当てる工程と、前記単位文字データ間の関連限界
行間距離、関連限界文字間距離を設定する工程と、ペー
ジ記述言語内の順序及び／又はページ記述言語により指
定された座標に基づき、単位文字データ及び単位画像デ
ータの順序づけを行う工程と、前記単位文字データの関
連限界行間距離、関連限界文字間距離、分類及びそのタ
グ・属性値、および順序づけに基づき、文書画像のデー
タを領域に分割する工程と、前記単位画像データのペー
ジ記述言語内の順序及び／又はページ記述言語により指
定された座標と、文書画像の領域及び該領域に割り当て
られたタグ・属性値に基づき、各単位画像データにタグ
・属性値を割り当てる工程と、前記分割した領域内の文
字データと画像データと割り当てられたタグ・属性値に
基づき、構造化記述言語による文書画像を生成する工程
を有することをとする文書画像処理方法である。According to a ninth aspect of the present invention, a step of reading a document image in a page description language, dividing data constituting the document image into unit character data and unit image data, and converting the unit character data into at least the character Classifying according to the color applied to the class, assigning a tag / attribute value to the class, setting a related limit line-to-line distance between the unit character data, and a related limit character-to-character distance. Ordering the unit character data and the unit image data based on the order of the unit character data and / or the coordinates specified by the page description language, and the related limit line distance, related limit character distance, classification, and tag of the unit character data. Dividing the document image data into regions based on the attribute values and the ordering, and the order of the unit image data in the page description language And / or assigning a tag / attribute value to each unit image data based on the coordinates specified by the document description language and / or the area of the document image and the tag / attribute value assigned to the area. And generating a document image in a structured description language based on the character data and the image data and the assigned tag / attribute value.

【００２４】本発明はこの手段により、従来、レイアウ
ト構造から論理構造を導き出し、文書画像と論理構造を
結び付けて構造化記述言語によりファイルとする処理方
法では不可能だった多数の商品や店舗などの情報を掲載
した雑誌やカタログ、チラシなどの写真、イラストの
他、解説の文字、マーク、ロゴ、が様々なフォント、デ
ザインで示されている単位文字データ・単位画像データ
からなるものに対して、特に単位文字データの色による
分類及びそのタグ・属性値付け、単位文字データの関連
限界行間距離、関連限界文字間距離の設定、単位データ
の順序づけに基づき文書画像のデータを領域に分割する
こと可能とし、これにより構造化記述言語化することを
可能とした。例えば、単位文字データ関連限界距離によ
り初期段階の領域分割を行い、その後単位文字データの
順序順にタグ・属性値の近接単位データとの相違を識別
しながらさらに分割、結合を行い、最終的な領域の分割
を行ない、さらにこの分割領域と単位画像データ座標の
相違を識別しながら単位画像データのタグ・属性値を割
り当て、構造化記述言語化を行う、といったことにより
可能となる。According to the present invention, by this means, a logical structure is derived from a layout structure, and a document image and a logical structure are linked to form a file using a structured description language. In addition to photos and illustrations of magazines, catalogs, flyers, etc. that posted information, those with commentary characters, marks, logos, etc. consisting of unit character data and unit image data shown in various fonts and designs, In particular, it is possible to classify document image data into regions based on the classification of unit character data by color and its tag / attribute value setting, setting of the related limit line spacing and related limit character distance of unit character data, and ordering of unit data. This makes it possible to make it into a structured description language. For example, the area is divided at the initial stage by the unit character data-related limit distance, and then further divided and combined in the order of the unit character data while identifying the difference between the tag / attribute value and the adjacent unit data, thereby performing the final area. Is performed, the tags and attribute values of the unit image data are assigned while identifying the difference between the divided area and the unit image data coordinates, and the structured description language is created.

【００２５】また請求項１０記載の発明は、前記単位デ
ータ又は単位文字データの色は印刷物の色分解データに
基づくことを特徴とする請求項７〜９のいずれかに記載
の文書画像処理方法である。The invention according to claim 10 is the document image processing method according to any one of claims 7 to 9, wherein the color of the unit data or unit character data is based on color separation data of a printed matter. is there.

【００２６】本発明はこの手段により、本発明はこの手
段により、前記単位データ又は単位文字データを分類す
るための色の情報が画面表示のデータより多い印刷物の
色分解データに基づくことにより、より分類が容易にな
りタグ・属性値の決定を容易に可能とする。特に本文と
なる単位文字データには墨単色が多用されるが、それ以
外の単位文字データの場合との分類がより容易になる。
また同じ書体、大きさであっても白ヌキ文字、網掛け文
字等をより容易に分類可能となる。According to the present invention, by this means, the present invention is based on the fact that the color information for classifying the unit data or the unit character data is based on the color separation data of the printed matter which is more than the screen display data. Classification is facilitated and tags and attribute values can be easily determined. In particular, black single color is frequently used for unit character data serving as a text, but classification with other unit character data becomes easier.
Also, even if the typeface and size are the same, white blank characters, shaded characters, and the like can be more easily classified.

【００２７】また請求項１１記載の発明は、前記ページ
記述言語による文書画像はＰＤＦ形式による文書画像で
あることを特徴とする請求項８〜１０のいずれかに記載
の文書画像処理方法である。The invention according to claim 11 is the document image processing method according to any one of claims 8 to 10, wherein the document image in the page description language is a document image in a PDF format.

【００２８】本発明はこの手段により、現在多くの組
版、校正で用いられているＰＤＦ形式のファイルを用い
ることが可能となる。また、特に各文字データや画像デ
ータが座標で定められており、その流用も可能である。
さらに色表示に関しては作成した色を登録する機能など
もあり、これらに対応することも可能となる。According to the present invention, it is possible to use a PDF format file which is used in many typesetting and proofreading by this means. In particular, each character data and image data are determined by coordinates, and it is possible to divert them.
Further, there is a function of registering the created color for the color display, and it is possible to correspond to these functions.

【００２９】また請求項１２記載の発明は、前記構造化
記述言語による文書画像はＸＭＬ形式による文書画像で
あることを特徴とする請求項８〜１１のいずれかに記載
の文書画像処理方法である。The invention according to claim 12 is the document image processing method according to any one of claims 8 to 11, wherein the document image in the structured description language is a document image in an XML format. .

【００３０】本発明はこの手段により、従来のＨＴＭ
Ｌ、ＳＧＭＬよりもさらに進んだ形式であるＸＭＬ形式
のファイルとすることが可能となる。また特にタグ・属
性値の設定を自由に行えるので、特定の色に対して「商
品名」「価格」「店舗名」といった設定、定義を設けて
おけば、いずれも同じ編集方針である同種のデータから
は同種の構造を導き出すことが可能となり、かつ膨大な
データに対してデータベース化、検索、一部属性のみの
データの更新なども可能となる。According to the present invention, the conventional HTM
It is possible to use an XML format file which is a format that is more advanced than L and SGML. In particular, since the setting of tags and attribute values can be freely set, setting and defining such as "product name", "price", and "store name" for a specific color allows the same editing policy to be applied. It is possible to derive the same kind of structure from the data, and it is also possible to create a database, search, and update data of only a part of the huge data.

【００３１】[0031]

【発明の実施の形態】以下、本発明を詳細に説明する。
本発明における文書画像とは、文字データと画像データ
とからなり、特には多数の商品や店舗などの情報を掲載
したカラフルな雑誌やカタログ、チラシなどの印刷物が
好適に適用可能であるが、特にこれに限定するものでは
なく、ネットワーク上の広告画像などであってもよい。BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail.
The document image in the present invention is composed of character data and image data, and in particular, colorful magazines and catalogs in which information of a large number of products and stores are posted, printed matter such as a flyer can be preferably applied, The present invention is not limited to this, and may be an advertisement image on a network.

【００３２】本発明における単位データとしては、主に
文字が挙げられるが特にこれに限定されるものではな
く、画像としてマーク、ロゴ、サイン、記号、シンボ
ル、標識、標章、紋章、商標、挿し絵、写真、外部画像
データなどであっても良い。The unit data in the present invention mainly includes characters, but is not particularly limited thereto, and includes, as an image, a mark, a logo, a sign, a sign, a symbol, a sign, a mark, an emblem, a trademark, and an illustration. , Photos, external image data, and the like.

【００３３】本発明における少なくとも単位データの色
による分類としては、色がデータとして特徴が特定され
れば良いものであり、Ｙ、Ｍ、Ｃ、Ｋの印刷物における
分解色のデータが好適に用いられる。この場合特にＫ
（墨）の組み込まれたデータも含めたものとなるので、
より分類を容易に行うことが可能となるが、特にこれに
限定されるものではなく、表示色データとしてＲＧＢの
データであっても良い。また、色以外のデータとしては
フォントの種類、大きさ、回転角（縦書き、横書き）等
もある。しかし本発明においては特に色に注目すること
により分類、タグ・属性値を割り振ることを特徴とする
ものである。これは店舗の紹介や商品のカタログなとの
紹介においては編集方針により、特定の色が特定の目的
に使用されることが定められて行われていることにあ
る。In the present invention, at least the classification of the unit data according to the color is only required if the feature of the color is specified as the data, and the data of the separated colors in the printed matter of Y, M, C and K is preferably used. . In this case, especially K
Since it also includes data with (black) embedded,
Although classification can be performed more easily, the present invention is not particularly limited to this, and RGB data may be used as display color data. The data other than the color also includes the font type, size, rotation angle (vertical writing, horizontal writing), and the like. However, the present invention is characterized in that classification, tags and attribute values are assigned by paying particular attention to colors. This is due to the fact that a specific color is used for a specific purpose according to an editing policy when introducing a store or a catalog of a product.

【００３４】本発明における分類に割り当てるタグ・属
性値としては、文書画像の文字であれば従来、本文、タ
イトル、サブタイトル、見出し、リード、キャプショ
ン、柱、ノンブル、雑誌名等があるが、本発明において
は店舗名、料理名、材料名、作り方、店舗所在地、電話
番号等が好適に可能であるがこれらに限定されるもので
はなく、画像データに対しては、完成料理図、イラス
ト、顔写真等であってもよい。As the tag / attribute value to be assigned to the classification in the present invention, the text, the title, the subtitle, the heading, the lead, the caption, the pillar, the page number, the magazine name, etc. have conventionally been used for the characters of the document image. , The store name, the name of the dish, the name of the ingredient, the method of making, the location of the store, the telephone number, and the like are preferably possible, but are not limited to these. And so on.

【００３５】本発明におけるページ記述言語による文書
画像としては、流通および閲覧を考慮した頁単位の文書
画像であり、オリジナルのレイアウト情報を有し、文字
や画像を単位ごとに座標を示して設定するもので、多種
多様な形式を統一的に記述することが可能である。さら
に色表示に関しては作成した色を登録する機能などもあ
り、これらに対応することも可能となる。The document image in the page description language in the present invention is a page-based document image in consideration of distribution and browsing, has original layout information, and sets characters and images by indicating coordinates for each unit. It is possible to uniformly describe various forms. Further, there is a function of registering the created color for the color display, and it is possible to correspond to these functions.

【００３６】また、構造化記述言語による文書画像とし
ては、オリジナルのレイアウト情報にとらわれず、文書
の内容を優先した形式であり、文字や画像が混在したデ
ータにタグ・属性値付け行う方法であり、任意のタグ・
属性値付けが行えるＸＭＬ形式による文書画像が好適に
実施可能であるが、特にこれに限定されるものはなく、
リンク機能を有するＨＴＭＬ、特定のタグを設定するＳ
ＧＭＬ等でも可能である。A document image in the structured description language is a format in which the contents of the document are prioritized without being tied to the original layout information, and is a method of assigning tags and attribute values to data in which characters and images are mixed. , Any tag
A document image in an XML format in which attribute value can be assigned can be preferably implemented, but is not particularly limited to this.
HTML with link function, S to set specific tag
GML or the like is also possible.

【００３７】本発明における単位データ間の関連限界距
離とは、各単位データが同じ領域に属するデータである
かないかを判別するための距離であり、これより遠いも
のでは別の領域、近ければ同じ領域と初期段階に判断す
る為のものであるが、とくにこれに限定されるものでは
なく、例えば言い方を逆にして領域分けをするための許
容距離としてこれより近いものは同じ領域とするという
こともできる。また特に文字データにおいては表示上で
の縦と横で関連限界行間距離、関連限界文字間距離とし
て、段落としての文字の領域を決定することが可能とな
るが、特にこの方法に限定するものではなく、例えば関
連限界行間距離、関連限界文字間距離の片方のみの設定
でも良い。The relevant limit distance between unit data in the present invention is a distance for determining whether each unit data belongs to the same area or not. If the distance is longer than this, another area is used. It is used to determine the area and the initial stage, but it is not particularly limited to this.For example, the area that is shorter than this as the permissible distance for dividing the area into the opposite is the same area Can also. In particular, in character data, it is possible to determine a character area as a paragraph as a related limit line distance in the vertical and horizontal directions on the display, and a related limit character distance, but it is not particularly limited to this method. Instead, for example, only one of the related limit line distance and the related limit character distance may be set.

【００３８】本発明における、単位データの順序づけと
しては、ページ記述言語内での単位データ指定順及び／
又は指定された縦横座標と縦書きか横書きかの設定に基
づいて行うことが可能であるが特にこれに限定されるも
のではなく、どちらか片方のみによるものであってもよ
く、縦書き／横書きの設定は無くても良い。In the present invention, the order of the unit data includes the unit data designation order in the page description language and / or
Alternatively, the setting can be performed based on the designated vertical / horizontal coordinates and the setting of vertical writing or horizontal writing, but the present invention is not particularly limited to this. May not be set.

【００３９】本発明による文書画像のデータの領域の分
割とは、ページ記述言語の文書画像を構造化記述言語の
文書画像とする為の単位データの集まりであり、最終的
にこれら領域１つにつき１つのタグ・属性値が割り当て
られるが、これに限定されるものではなく、この領域内
に他の領域を含むものであっても良く、例えば本文中に
特に色の異なる文字で注釈があり、これに他のタグ・属
性値を割り振るものとして複合化した構造をとったもの
であってもよい。The division of the document image data area according to the present invention is a group of unit data for converting a page description language document image into a structured description language document image. Although one tag / attribute value is assigned, the present invention is not limited to this, and may include another area in this area. It may have a compounded structure in which other tags / attribute values are assigned to this.

【００４０】[0040]

【実施例】以下、本発明の実施例を図面に基づき詳細に
説明する。図１に本発明の文書画像処理装置の構造の一
実施例を示す。ＣＰＵを通じて入力部としてキーボー
ド、マウスを有し、表示部としてディスプレイを有し、
恒常的及び一時的データ保持部としてハードディスクと
ＲＡＭを有し、外部とのデータ入出力部としてネット回
線やモデムなど、またＦＤ・ＭＯといった媒体読出／書
込部を有する。一方、メモリ内は、ページ記述言語文書
画像読込部、単位文字データ分類部、タグ・属性値割当
部、関連限界行間距離・関連限界文字間距離・縦書横書
設定部、単位文字データ順序づけ部、文書画像データ領
域分割部、構造化技術言語文書画像作成部、の各部から
なる。Embodiments of the present invention will be described below in detail with reference to the drawings. FIG. 1 shows an embodiment of the structure of the document image processing apparatus of the present invention. Having a keyboard and a mouse as an input unit through the CPU, a display as a display unit,
A hard disk and a RAM are provided as permanent and temporary data holding units, and a network read / write unit such as an FD / MO is provided as a data input / output unit with the outside, such as a network or a modem. On the other hand, in the memory, a page description language document image reading unit, a unit character data classifying unit, a tag / attribute value allocating unit, a related limit line spacing / related limit character distance / vertical / horizontal writing setting unit, a unit character data ordering unit , A document image data area dividing unit, and a structured technology language document image creating unit.

【００４１】図２〜図４に本発明の文書画像処理方法に
おける一実施例のフローを示す。図２にＰＤＦファイル
の読込から関連限界行間距離・関連限界文字間距離設定
までのフローを示す。ＰＤＦファイルを読み込み、ペー
ジデータを読み取る。ここで構造化に必要なＸＭＬ対応
テーブルが存在するならそれを用い、そうでないなら、
色を含む文字情報を取得して分類して一覧を作成し、構
造化に必要なＸＭＬ対応テーブルを作成する。その後、
関連限界行間距離・関連限界文字間距離設定を行って、
次の工程に進む。FIGS. 2 to 4 show the flow of one embodiment of the document image processing method of the present invention. FIG. 2 shows a flow from the reading of the PDF file to the setting of the related limit line spacing and related limit character distance. Read the PDF file and read the page data. Here, if the XML correspondence table required for structuring exists, it is used. If not,
Character information including colors is acquired and classified to create a list, and an XML correspondence table required for structuring is created. afterwards,
Set the related limit line spacing and related limit character spacing,
Proceed to the next step.

【００４２】図３に単位文字データの順序づけから全て
の単位文字データの領域（以下段落構造とする。）決定
までのフローを示す。まず、順序づけを行った後、最初
の文字に領域が設定されているかを判別する。あればさ
らに前記設定距離に段落構造があるかを判別する。あれ
ばさらに処理中の文字のタグ・属性値と前記段落構造の
タグ・属性値が同じかどうか判断する。そしてあれば、
処理中の文字は前記段落構造に所属される。それ以外の
場合、新規に段落構造を作成し、処理中の文字を所属さ
せるとともにタグ・属性値を割り当てる。ここまでを全
ての文字について繰り返し、次の工程に進む。FIG. 3 shows a flow from the ordering of the unit character data to the determination of all the unit character data areas (hereinafter referred to as paragraph structure). First, after ordering, it is determined whether or not an area is set for the first character. If so, it is further determined whether the set distance has a paragraph structure. If so, it is further determined whether the tag / attribute value of the character being processed is the same as the tag / attribute value of the paragraph structure. And if
The character being processed belongs to the paragraph structure. In other cases, a new paragraph structure is created, the characters being processed belong, and tags and attribute values are assigned. This is repeated for all characters, and the process proceeds to the next step.

【００４３】図４に前記単位画像データの順序づけから
ＸＭＬファイル作成までのフローを示す。まず、順序づ
けを行った後、これを内包する前記単位文字データで作
成した段落構造があるかないか判断する。あれば所属さ
せ、なければ独立画像として専用の段落構造を作成す
る。この後この画像に関するファイル情報はＰＤＦ内に
あるかないか判断する。あればＰＤＦ内に記述されちる
外部ファイルの実体があるかないか判断する。あれば外
部ファイルを所定ディスクにコピーし、それ以外の場合
は、ＰＤＦ内画像をＰＤＦ内から切り出し、外部ファイ
ルを生成する。ここまでを全ての画像について繰り返
し、ＸＭＬファイルを作成し、また外部画像ファイルを
作成する。FIG. 4 shows a flow from the ordering of the unit image data to the creation of the XML file. First, after the ordering is performed, it is determined whether or not there is a paragraph structure created by the unit character data including the order. If so, let it belong, otherwise create a dedicated paragraph structure as an independent image. Thereafter, it is determined whether or not file information relating to this image exists in the PDF. If there is, it is determined whether there is an entity of the external file described in the PDF. If so, the external file is copied to a predetermined disk. Otherwise, the image in the PDF is cut out from the PDF to generate an external file. This process is repeated for all the images to create an XML file and an external image file.

【００４４】図５〜図１５に本発明の画像処理装置の一
実施例による処理状態を示す。FIGS. 5 to 15 show a processing state according to an embodiment of the image processing apparatus of the present invention.

【００４５】図５はＰＤＦ文書画像の一部を指定してデ
ータを読み込んでいる状態を示す。料理の画像データと
とその作り方等に関する文字データが読み込まれてい
る。FIG. 5 shows a state where data is read by designating a part of the PDF document image. The image data of the dish and the character data on how to make it are read.

【００４６】図６は前記読み込んだ文書画像から文字デ
ータにより分類した状態を示す。この状態では５つの分
類が表示されているが、それ以外は次頁のボタンをクリ
ックすることにより表示切り換え可能となっている。確
認した分類は左にあるチェック欄をクリックすることで
入れた後、ＯＫのボタンをクリックすることで決定す
る。チェックを取り消す場合はキャンセルのボタンをク
リックする。FIG. 6 shows a state where the read document images are classified by character data. In this state, five classifications are displayed. In other cases, the display can be switched by clicking a button on the next page. The confirmed classification is entered by clicking the check box on the left, and is determined by clicking the OK button. Click the Cancel button to cancel the check.

【００４７】図７は読み込んで分類したデータから領域
を決定する状態を示す。調整項目として、ベースライン
許容量として関連限界文字間距離が設定され、段落スキ
ップ許容量として関連限界行間距離が設定されている。
また、組み方として組版が横書きになっていることを設
定している。ここでは５つの分類が表示されているが、
それ以外は次頁のボタンをクリックにより表示切り換え
可能となっている。確認したい領域には左にある白抜き
のボタン欄をクリックすることにより領域のタグ・属性
値を読み込む準備がなされる。この状態ではどのタグ・
属性値も特定されていない。意味属性（Ｒ）のボタンを
クリックして、設定可能なタグ・属性値を取り込む。次
に図８のＩＴＥＭ欄をクリックし、アイテム意味属性編
集でタグ・属性値を決定する。ここで、同じフォント
名、サイズ、色（Ｃ、Ｍ、Ｙ、Ｋの％で示される）のも
のが同じ領域てあるように読み込まれるようになる。こ
のようにして決定した後、ＯＫのボタンをクリックする
ことで決定する。チェックを取り消す場合はキャンセル
のボタンをクリックする。ＯＫボタンで決定したものに
関しては別名保存のボタンをクリックすることで、別名
で保存することも可能となっている。FIG. 7 shows a state in which an area is determined from the read and classified data. As the adjustment items, the related limit inter-character distance is set as the baseline allowable amount, and the related limit line distance is set as the paragraph skip allowable amount.
In addition, it is set that the composition is horizontal writing. Here, five categories are displayed,
Otherwise, the display can be switched by clicking a button on the next page. By clicking the white button column on the left of the region to be checked, preparations are made to read the tag / attribute value of the region. In this state,
No attribute values are specified. By clicking the button of the semantic attribute (R), a settable tag / attribute value is fetched. Next, the ITEM column in FIG. 8 is clicked, and the tag / attribute value is determined by editing the item meaning attribute. Here, the same font name, size, and color (indicated by C, M, Y, K in%) are read as if they were in the same area. After the decision is made in this way, the decision is made by clicking the OK button. Click the Cancel button to cancel the check. By clicking the save as button, those determined by the OK button can be saved as another name.

【００４８】図８は図７同様の読み込んで分類したデー
タから領域を決定する状態を示しているが、内包する文
字データのつながり（本文中の注意書き等で色などが変
化している場合など）がある場合はＩＴＥＭＳＴＲ（ア
イテムストリングの意味）欄がクリックされており、こ
の場合、アイテムＳＴＲ意味属性編集のボタンをクリッ
クすることで、内包している複数の文字データから領域
を決定する状態を示す画面となる。FIG. 8 shows a state in which the area is determined from the read and classified data as in FIG. 7, but the connection of the included character data (for example, when the color or the like is changed due to a precautionary statement in the text or the like). ) Indicates that the ITEMSTR (meaning of item string) field has been clicked. In this case, clicking the button for editing the item STR meaning attribute changes the state in which an area is determined from a plurality of included character data. The screen shown below is displayed.

【００４９】図９はこのようにして領域を一時的に決定
した状態の各データの領域分けが行われている。まだ調
整がなされていないので、実際の題名の各文字間が分か
れていたり、実際の段落が各行ごとに分かれていたりし
ている。FIG. 9 shows an area division of each data in a state where the area is temporarily determined as described above. Since no adjustments have been made yet, the actual titles are separated between the characters, or the actual paragraphs are separated by lines.

【００５０】図１０はこの後、各文字について段落構造
のフローにしたがって所属を自動的に割り当て、実際の
段落構造となった状態を示している。FIG. 10 shows a state in which the affiliation is automatically assigned to each character according to the flow of the paragraph structure, and the actual paragraph structure is obtained.

【００５１】図１１はこの後、各段落構造に意味づけ
（料理名、作り方）等が自動的に行われた状態を示す。FIG. 11 shows a state in which the meaning of each paragraph structure (the name of the dish, how to make it) is automatically performed.

【００５２】図１２は画像データの読み込みがなされた
状態を示している。FIG. 12 shows a state in which image data has been read.

【００５３】図１３はこれら自動処理の後、手作業で領
域のタグ・属性値名を変更した際の例を示している。FIG. 13 shows an example in which the tag / attribute value name of the area is manually changed after these automatic processes.

【００５４】図１４はこれらの作業により作成されたＸ
ＭＬファイルのデータの状態を示している。FIG. 14 shows the X created by these operations.
The state of the data of the ML file is shown.

【００５５】図１５は前記ＸＭＬファイルによるデータ
を表示した場合の状態を示している。FIG. 15 shows a state in which data based on the XML file is displayed.

【００５６】[0056]

【発明の効果】以上に示したように、本発明の請求項１
記載の発明により、従来のタグ・属性値付けが困難だっ
た多数の商品や店舗などの情報を掲載した雑誌やカタロ
グ、チラシなどの写真、イラストの他、解説の文字、マ
ーク、ロゴ、が様々なフォント、デザインで示されてい
るものに対して、特に単位データの色により分けること
により、タグ・属性値を容易に把握可能とした。つま
り、本発明において、色はこれら多数の商品や店舗など
の情報を掲載した雑誌やカタログ、チラシの単位データ
の意味づけとして重要な意味があることを見出し、また
その編集方針により明確に分類し、タグ・属性値を付け
ることを可能としたという作用効果を奏する。As described above, the first aspect of the present invention is as follows.
Due to the invention described, magazines, catalogs, flyers, and other photos and illustrations that contain information on a large number of products and stores that were difficult to price tags and attributes in the past, as well as various explanatory characters, marks, logos, etc. Tags and attribute values can be easily grasped by using fonts and designs that are indicated by different colors, especially by unit data colors. In other words, in the present invention, it has been found that colors have an important meaning as meanings of unit data of magazines, catalogs, and flyers that publish information of these many products and stores, and are clearly classified according to their editing policies. And tag / attribute values.

【００５７】また、請求項２記載の発明により、従来、
レイアウト構造から論理構造を導き出し、文書画像と論
理構造を結び付けて構造化記述言語によりファイルとす
る処理方法では不可能だった多数の商品や店舗などの情
報を掲載した雑誌やカタログ、チラシなどの写真、イラ
ストの他、解説の文字、マーク、ロゴ、が様々なフォン
ト、デザインで示されているものに対して、特に単位デ
ータの色による分類及びそのタグ・属性値付け、単位デ
ータの関連限界距離の設定、単位データの順序づけに基
づき文書画像のデータを領域に分割すること可能とし、
これにより構造化記述言語化することを可能とした。例
えば、単位データ関連限界距離により初期段階の領域分
割を行い、その後単位データの順序順にタグ・属性値の
近接単位データとの相違を識別しながらさらに分割、結
合を行い、最終的な領域の分割を行い、構造化記述言語
化を行う、といったことにより可能となるという作用効
果を奏する。According to the second aspect of the present invention,
Photos of magazines, catalogs, flyers, etc. that contain information on a large number of products and stores that were not possible with a processing method that derives a logical structure from the layout structure and links the document image with the logical structure to create a file using a structured description language , Illustrations, commentary characters, marks, logos, etc. are shown in various fonts and designs, especially for classification by color of unit data and its tag / attribute valuation, related limit distance of unit data Setting, the document image data can be divided into regions based on the ordering of the unit data,
This makes it possible to make it into a structured description language. For example, the area is divided at the initial stage based on the unit data-related limit distance, and then further divided and combined in the order of the unit data while identifying the difference between the tag / attribute value and the adjacent unit data, and finally dividing the area. , And a structured description language is provided, which has the effect of being made possible.

【００５８】また、請求項３記載の発明により、従来、
レイアウト構造から論理構造を導き出し、文書画像と論
理構造を結び付けて構造化記述言語によりファイルとす
る処理方法では不可能だった多数の商品や店舗などの情
報を掲載した雑誌やカタログ、チラシなどの写真、イラ
ストの他、解説の文字、マーク、ロゴ、が様々なフォン
ト、デザインで示されている単位文字データ・単位画像
データからなるものに対して、特に単位文字データの色
による分類及びそのタグ・属性値付け、単位文字データ
の関連限界行間距離、関連限界文字間距離の設定、単位
データの順序づけに基づき文書画像のデータを領域に分
割すること可能とし、これにより構造化記述言語化する
ことを可能とした。例えば、単位文字データ関連限界距
離により初期段階の領域分割を行い、その後単位文字デ
ータの順序順にタグ・属性値の近接単位データとの相違
を識別しながらさらに分割、結合を行い、最終的な領域
の分割を行ない、さらにこの分割領域と単位画像データ
座標の相違を識別しながら単位画像データのタグ・属性
値を割り当て、構造化記述言語化を行う、といったこと
により可能となるという作用効果を奏する。According to the third aspect of the present invention,
Photos of magazines, catalogs, flyers, etc. that contain information on a large number of products and stores that were not possible with a processing method that derives a logical structure from the layout structure and links the document image with the logical structure to create a file using a structured description language , Illustrations, commentary characters, marks, logos, etc. consisting of unit character data and unit image data shown in various fonts and designs, especially classification by unit character data color and its tags, It is possible to divide document image data into areas based on attribute value setting, related limit line spacing of unit character data, setting of related limit character distance, and ordering of unit data, thereby making it a structured description language. Made it possible. For example, the area is divided at the initial stage by the unit character data-related limit distance, and then further divided and combined in the order of the unit character data while identifying the difference between the tag / attribute value and the adjacent unit data, thereby performing the final area. And assigning tags / attribute values of the unit image data while identifying the difference between the divided area and the unit image data coordinates, and performing structured description linguistics. .

【００５９】また、請求項４記載の発明により、前記単
位データ又は単位文字データを分類するための色の情報
が画面表示のデータより多い印刷物の色分解データに基
づくことにより、より分類が容易になりタグ・属性値の
決定を容易に可能とする。特に本文となる単位文字デー
タには墨単色が多用されるが、それ以外の単位文字デー
タの場合との分類がより容易になる。また同じ書体、大
きさであっても白抜き、網掛け等をより容易に分類可能
となるという作用効果を奏する。According to the fourth aspect of the present invention, the color information for classifying the unit data or the unit character data is based on the color separation data of the printed matter which is larger than the data of the screen display, so that the classification can be more easily performed. The tag / attribute value can be easily determined. In particular, black single color is frequently used for unit character data serving as a text, but classification with other unit character data becomes easier. In addition, there is an effect that the outline, shading, and the like can be more easily classified even if they have the same typeface and size.

【００６０】また、請求項５記載の発明により、現在多
くの組版、校正で用いられているＰＤＦ形式のファイル
を用いることが可能となる。また、特に各文字データや
画像データが座標で定められており、その流用も可能で
ある。さらに色表示に関しては作成した色を登録する機
能などもあり、これらに対応することも可能であるとい
う作用効果を奏する。Further, according to the fifth aspect of the present invention, it is possible to use a PDF format file which is currently used for many typesetting and proofreading. In particular, each character data and image data are determined by coordinates, and it is possible to divert them. In addition, there is a function of registering the created color for the color display, and it is possible to respond to these functions.

【００６１】また、請求項６記載の発明により、従来の
ＨＴＭＬ、ＳＧＭＬよりもさらに進んだ形式であるＸＭ
Ｌ形式のファイルとすることが可能となる。また特にタ
グ・属性値の設定を自由に行えるので、特定の色に対し
て「商品名」「価格」「店舗名」といった設定、定義を
設けておけば、いずれも同じ編集方針である同種のデー
タからは同種の構造を導き出すことが可能となり、かつ
膨大なデータに対してデータベース化、検索、一部属性
のみのデータの更新なども可能となるという作用効果を
奏する。According to the sixth aspect of the present invention, XM which is a format which is more advanced than conventional HTML and SGML.
It is possible to use an L format file. In particular, since the setting of tags and attribute values can be freely set, setting and defining such as "product name", "price", and "store name" for a specific color allows the same editing policy to be applied. It is possible to derive the same kind of structure from the data, and it is also possible to produce a database for a huge amount of data, search, update data of only some attributes, and the like.

【００６２】また、請求項７記載の発明により、従来の
タグ・属性値付けが困難だった多数の商品や店舗などの
情報を掲載した雑誌やカタログ、チラシなどの写真、イ
ラストの他、解説の文字、マーク、ロゴ、が様々なフォ
ント、デザインで示されているものに対して、特に単位
データの色により分けることにより、タグ・属性値を容
易に把握可能とした。つまり、本発明において、色はこ
れら多数の商品や店舗などの情報を掲載した雑誌やカタ
ログ、チラシの単位データの意味づけとして重要な意味
があることを見出し、またその編集方針により明確に分
類し、タグ・属性値を付けることを可能としたという作
用効果を奏する。According to the invention of claim 7, in addition to photographs and illustrations of magazines, catalogs, flyers, etc., which have posted information on a large number of products and stores, for which tagging and attribute pricing were difficult in the past, illustrations and commentary are also provided. Tags and attribute values can be easily grasped by separating characters, marks, and logos with various fonts and designs, especially by the color of the unit data. In other words, in the present invention, it has been found that colors have an important meaning as meanings of unit data of magazines, catalogs, and flyers that publish information of these many products and stores, and are clearly classified according to their editing policies. And tag / attribute values.

【００６３】また、請求項８記載の発明により、従来、
レイアウト構造から論理構造を導き出し、文書画像と論
理構造を結び付けて構造化記述言語によりファイルとす
る処理方法では不可能だった多数の商品や店舗などの情
報を掲載した雑誌やカタログ、チラシなどの写真、イラ
ストの他、解説の文字、マーク、ロゴ、が様々なフォン
ト、デザインで示されているものに対して、特に単位デ
ータの色による分類及びそのタグ・属性値付け、単位デ
ータの関連限界距離の設定、単位データの順序づけに基
づき文書画像のデータを領域に分割すること可能とし、
これにより構造化記述言語化することを可能とした。例
えば、単位データ関連限界距離により初期段階の領域分
割を行い、その後単位データの順序順にタグ・属性値の
近接単位データとの相違を識別しながらさらに分割、結
合を行い、最終的な領域の分割を行い、構造化記述言語
化を行う、といったことにより可能となるという作用効
果を奏する。According to the eighth aspect of the present invention,
Photos of magazines, catalogs, flyers, etc. that contain information on a large number of products and stores that were not possible with a processing method that derives a logical structure from the layout structure and links the document image with the logical structure to create a file using a structured description language , Illustrations, commentary characters, marks, logos, etc. are shown in various fonts and designs, especially for classification by color of unit data and its tag / attribute valuation, related limit distance of unit data Setting, the document image data can be divided into regions based on the ordering of the unit data,
This makes it possible to make it into a structured description language. For example, the area is divided at the initial stage based on the unit data-related limit distance, and then further divided and combined in the order of the unit data while identifying the difference between the tag / attribute value and the adjacent unit data, and finally dividing the area. , And a structured description language is provided, which has the effect of being made possible.

【００６４】また、請求項９記載の発明により、従来、
レイアウト構造から論理構造を導き出し、文書画像と論
理構造を結び付けて構造化記述言語によりファイルとす
る処理方法では不可能だった多数の商品や店舗などの情
報を掲載した雑誌やカタログ、チラシなどの写真、イラ
ストの他、解説の文字、マーク、ロゴ、が様々なフォン
ト、デザインで示されている単位文字データ・単位画像
データからなるものに対して、特に単位文字データの色
による分類及びそのタグ・属性値付け、単位文字データ
の関連限界行間距離、関連限界文字間距離の設定、単位
データの順序づけに基づき文書画像のデータを領域に分
割すること可能とし、これにより構造化記述言語化する
ことを可能としたという作用効果を奏する。According to the ninth aspect of the present invention,
Photos of magazines, catalogs, flyers, etc. that contain information on a large number of products and stores that were not possible with a processing method that derives a logical structure from the layout structure and links the document image with the logical structure to create a file using a structured description language , Illustrations, commentary characters, marks, logos, etc. consisting of unit character data and unit image data shown in various fonts and designs, especially classification by unit character data color and its tags, It is possible to divide document image data into areas based on attribute value setting, related limit line spacing of unit character data, setting of related limit character distance, and ordering of unit data, thereby making it a structured description language. This has the effect of making it possible.

【００６５】また、請求項１０記載の発明により、前記
単位データ又は単位文字データを分類するための色の情
報が画面表示のデータより多い印刷物の色分解データに
基づくことにより、より分類が容易になりタグ・属性値
の決定を容易に可能とする。特に本文となる単位文字デ
ータには墨単色が多用されるが、それ以外の単位文字デ
ータの場合との分類がより容易になる。また同じ書体、
大きさであっても白抜き、網掛け等をより容易に分類可
能となるという作用効果を奏する。According to the tenth aspect of the present invention, the color information for classifying the unit data or the unit character data is based on the color separation data of the printed matter which is larger than the data on the screen display, so that the classification can be more easily performed. The tag / attribute value can be easily determined. In particular, black single color is frequently used for unit character data serving as a text, but classification with other unit character data becomes easier. Also the same typeface,
Even if the size is large, there is an operational effect that it is possible to more easily classify white outlines, shading, and the like.

【００６６】また、請求項１１記載の発明により、現在
多くの組版、校正で用いられているＰＤＦ形式のファイ
ルを用いることが可能となる。また、特に各文字データ
や画像データが座標で定められており、その流用も可能
である。さらに色表示に関しては作成した色を登録する
機能などもあり、これらに対応することも可能であると
いう作用効果を奏する。According to the eleventh aspect of the present invention, it is possible to use a PDF format file which is currently used in many typessetting and proofreading. In particular, each character data and image data are determined by coordinates, and it is possible to divert them. In addition, there is a function of registering the created color for the color display, and it is possible to respond to these functions.

【００６７】また、請求項１２記載の発明により、従来
のＨＴＭＬ、ＳＧＭＬよりもさらに進んだ形式であるＸ
ＭＬ形式のファイルとすることが可能となる。また特に
タグ・属性値の設定を自由に行えるので、特定の色に対
して「商品名」「価格」「店舗名」といった設定、定義
を設けておけば、いずれも同じ編集方針である同種のデ
ータからは同種の構造を導き出すことが可能となり、か
つ膨大なデータに対してデータベース化、検索、一部属
性のみのデータの更新なども可能となるという作用効果
を奏する。According to the twelfth aspect of the present invention, X, which is a more advanced format than conventional HTML and SGML,
It becomes possible to use a file in the ML format. In particular, since the setting of tags and attribute values can be freely set, setting and defining such as "product name", "price", and "store name" for a specific color allows the same editing policy to be applied. It is possible to derive the same kind of structure from the data, and it is also possible to produce a database for a huge amount of data, search, update data of only some attributes, and the like.

[Brief description of the drawings]

【図１】本発明における文書画像処理装置の構造の一実
施例を示す説明図である。FIG. 1 is an explanatory diagram showing one embodiment of the structure of a document image processing apparatus according to the present invention.

【図２】本発明における文書画像処理方法における一実
施例のフローを示す説明図である。FIG. 2 is an explanatory diagram showing a flow of one embodiment of a document image processing method according to the present invention.

【図３】本発明における文書画像処理方法における一実
施例のフローを示す説明図である。FIG. 3 is an explanatory diagram showing a flow of one embodiment of a document image processing method according to the present invention.

【図４】本発明における文書画像処理方法における一実
施例のフローを示す説明図である。FIG. 4 is an explanatory diagram showing a flow of one embodiment of a document image processing method according to the present invention.

【図５】本発明の画像処理装置の一実施例による処理状
態を示す説明図である。FIG. 5 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図６】本発明の画像処理装置の一実施例による処理状
態を示す説明図である。FIG. 6 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図７】本発明の画像処理装置の一実施例による処理状
態を示す説明図である。FIG. 7 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図８】本発明の画像処理装置の一実施例による処理状
態を示す説明図である。FIG. 8 is an explanatory diagram showing a processing state according to an embodiment of the image processing apparatus of the present invention.

【図９】本発明の画像処理装置の一実施例による処理状
態を示す説明図である。FIG. 9 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図１０】本発明の画像処理装置の一実施例による処理
状態を示す説明図である。FIG. 10 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図１１】本発明の画像処理装置の一実施例による処理
状態を示す説明図である。FIG. 11 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図１２】本発明の画像処理装置の一実施例による処理
状態を示す説明図である。FIG. 12 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図１３】本発明の画像処理装置の一実施例による処理
状態を示す説明図である。FIG. 13 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図１４】本発明の画像処理装置の一実施例による処理
状態を示す説明図である。FIG. 14 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

【図１５】本発明の画像処理装置の一実施例による処理
状態を示す説明図である。FIG. 15 is an explanatory diagram illustrating a processing state according to an embodiment of the image processing apparatus of the present invention.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ０６Ｔ 11/60 １００Ｇ０６Ｔ 11/60 １００ＡＨ０４Ｎ 1/387 Ｈ０４Ｎ 1/387 Ｆターム(参考） 5B009 NC05 NG00 QA06 5B050 BA10 BA16 EA03 EA09 EA10 EA18 5C076 AA01 AA14 AA40 BA09 ──────────────────────────────────────────────────の Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G06T 11/60 100 G06T 11/60 100A H04N 1/387 H04N 1/387 F term (reference) 5B009 NC05 NG00 QA06 5B050 BA10 BA16 EA03 EA09 EA10 EA18 5C076 AA01 AA14 AA40 BA09

Claims

[Claims]

1. A document image comprising: means for classifying unit data constituting a document image by at least a color applied to the unit data; and means for assigning tags and attribute values to the classification. Processing equipment.

2. A means for reading a document image in a page description language; a means for classifying unit data constituting a document image by at least a color applied to the unit data; and assigning a tag / attribute value to the classification. Means for setting an associated limit distance between the unit data; means for ordering the unit data based on an order in a page description language and / or coordinates specified by the page description language; Means for dividing the document image data into regions based on the relevant limit distance, classification and its tag / attribute value, and ordering; structured description based on the data within the divided region and the assigned tag / attribute value Means for generating a document image in a language.

3. A means for reading a document image in a page description language, dividing data constituting the document image into unit character data and unit image data, and dividing the unit character data by at least a color applied to the character. Means for classifying; means for assigning a tag / attribute value to the classification; means for setting a related limit line spacing and a related limit character distance between the unit character data; and an order and / or page description in a page description language. Means for ordering the unit character data and the unit image data based on the coordinates specified by the language; and, for the related limit line distance, related limit character distance, classification and its tag / attribute value, and ordering of the unit character data. Means for dividing document image data into regions based on the order and / or page description language of the unit image data in a page description language Means for assigning a tag / attribute value to each unit image data based on the coordinates designated by (1) and the document image area and the tag / attribute value assigned to the area; character data and image data within the divided area Means for generating a document image in a structured description language based on the assigned tag / attribute value.

4. The document image processing apparatus according to claim 1, wherein the color of the unit data or the unit character data is based on color separation data of a printed matter.

5. A document image in the page description language is a PD.
3. A document image in an F format.
5. The document image processing device according to any one of items 1 to 4.

6. A document image in the structured description language is an XM
3. A document image in an L format.
6. The document image processing apparatus according to any one of claims 1 to 5.

7. A document image comprising: a step of classifying unit data constituting a document image by at least a color applied to the unit data; and a step of assigning tags and attribute values to the classification. Processing method.

8. A step of reading a document image in a page description language; a step of classifying unit data constituting the document image by at least a color applied to the unit data; Allocating; setting a relevant limit distance between the unit data; ordering the unit data based on an order in the page description language and / or coordinates specified by the page description language; Dividing the document image data into regions based on the relevant limit distance of the data, classification and its tag / attribute value, and ordering; and constructing the structure based on the data within the divided region and the assigned tag / attribute values. Generating a document image in a structured description language.

9. A step of reading a document image in a page description language, dividing data constituting the document image into unit character data and unit image data, and dividing the unit character data by at least a color applied to the character. Classifying; assigning a tag / attribute value to the classification; setting an associated limit line spacing and an associated limit character distance between the unit character data; and an order and / or page description in a page description language. A step of ordering the unit character data and the unit image data based on the coordinates specified by the language; and a related limit line distance, a related limit character distance, a classification and its tag / attribute value, and an ordering of the unit character data. Dividing the document image data into regions based on the order and / or the page description language of the unit image data in a page description language. Assigning a tag / attribute value to each unit image data based on the coordinates specified by the following and the document image area and the tag / attribute value assigned to the area; and character data and image data within the divided area. And generating a document image in a structured description language based on the assigned tag / attribute value.

10. The document image processing method according to claim 7, wherein a color of said unit data or unit character data is based on color separation data of a printed matter.

11. A document image in the page description language is P
The document image processing method according to claim 8, wherein the document image is a document image in a DF format.

12. A document image in the structured description language is X
The document image processing method according to any one of claims 8 to 11, wherein the document image is a document image in an ML format.