JP2005157905A

JP2005157905A - Image processing device and method and program

Info

Publication number: JP2005157905A
Application number: JP2003397990A
Authority: JP
Inventors: Kenichi Ota; 健一太田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2003-11-27
Filing date: 2003-11-27
Publication date: 2005-06-16
Anticipated expiration: 2023-11-27
Also published as: JP4310176B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image processing device where paper documents or difficult-to-reuse image information can be treated as reusable electronic files. <P>SOLUTION: A composite machine 100 reads image data from a document, and, if pointer information is added to the image data read, obtains an original electronic file according to the pointer information; if no pointer information is added to the image data read, it converts the image data into vector data. During this conversion to the vector data, the content to be converted is varied depending on a file format designated by the user, e.g., word processor format, tabular format, presentation format, and the like. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、イメージデータを入力する画像処理装置、画像処理方法およびプログラムに関する。 The present invention relates to an image processing apparatus, an image processing method, and a program for inputting image data.

近年、環境問題が叫ばれる中、オフィスにおけるペーパーレス化が急速に進んでいる。従来、バインダ等に蓄積された紙文書をスキャナで読み取り、データベースとして画像記憶装置に蓄積することで、文書管理システムを構築することが行われている（例えば、特許文献１参照）。 In recent years, paperless offices are rapidly becoming paperless as environmental issues are screamed. Conventionally, a document management system has been constructed by reading a paper document stored in a binder or the like with a scanner and storing it in an image storage device as a database (see, for example, Patent Document 1).

また従来、機能が拡張された複写機などの複合機では、予め画像を紙文書として記録する際、画像記憶装置内で、この画像のオリジナル電子ファイルが格納された場所を示す注釈情報を、文書の表紙あるいは文書中の記載情報に付加情報として記録しておき、再度、この文書を複写などに再利用する際、この注釈情報からオリジナル電子ファイルの格納場所を検出し、この電子ファイルの情報を編集や印刷に用いることで、紙文書全体の保存を削減することが行われている（例えば、特許文献２参照）。
特開平１１−２３２１５５号公報特開２００１−６７３４９号公報 Conventionally, in a multifunction machine such as a copying machine with an extended function, when an image is recorded in advance as a paper document, annotation information indicating the location where the original electronic file of the image is stored is stored in the image storage device. This information is recorded as additional information on the front cover of the document or in the document, and when the document is reused for copying again, the storage location of the original electronic file is detected from the annotation information, and the information on the electronic file is stored. By using it for editing and printing, saving of the entire paper document is reduced (for example, refer to Patent Document 2).
Japanese Patent Laid-Open No. 11-232155 JP 2001-67349 A

しかしながら、上記従来の画像処理装置では、以下に掲げる問題があり、その改善が要望されていた。即ち、前者の文書管理システムでは、紙文書を保存するファイルがイメージ情報であるので、この文書の一部のオブジェクトを再利用することはできない。したがって、再利用する場合、図、表等を、新たにアプリケーションソフトウェアを用いて再度作成しなければならなかった。 However, the above-described conventional image processing apparatus has the following problems, and improvements have been demanded. That is, in the former document management system, a file for storing a paper document is image information, and therefore some objects of this document cannot be reused. Therefore, when reusing, a figure, a table, and the like have to be newly created using application software.

また、後者の複合機では、出力された紙文書に対応するオリジナル電子ファイルに直接アクセスできる場合、容易に再利用することができるが、例えば、オリジナル電子ファイルにアクセスできない場合や、そもそもオリジナル電子ファイルへの注釈情報を持たない文書の場合、それらを再利用することができなかった。 In the latter multifunction device, if the original electronic file corresponding to the output paper document can be directly accessed, it can be easily reused. For example, when the original electronic file cannot be accessed, or to the original electronic file in the first place. Documents that do not have the annotation information cannot be reused.

そこで、本発明は、紙文書もしくは再利用が困難なイメージ情報を再利用可能な電子ファイルとして扱えるようにする画像処理装置、画像処理方法およびプログラムを提供することを目的とする。 SUMMARY An advantage of some aspects of the invention is that it provides an image processing apparatus, an image processing method, and a program that enable a paper document or image information that is difficult to be reused to be handled as a reusable electronic file.

上記目的を達成するために、本発明の画像処理装置は、イメージデータを入力する入力手段から入力されたイメージデータからベクトルデータを生成するベクトル化手段と、該生成されたベクトルデータを所定のファイル形式で出力する出力手段とを備える。 In order to achieve the above object, an image processing apparatus according to the present invention includes vectorization means for generating vector data from image data input from input means for inputting image data, and the generated vector data as a predetermined file. Output means for outputting in a format.

本発明の画像処理方法は、イメージデータを入力する入力ステップと、該入力されたイメージデータからベクトルデータを生成するベクトル化ステップと、該生成されたベクトルデータを所定のファイル形式で出力する出力ステップとを有する。 The image processing method of the present invention includes an input step for inputting image data, a vectorization step for generating vector data from the input image data, and an output step for outputting the generated vector data in a predetermined file format. And have.

本発明のプログラムは、画像処理装置内のＣＰＵによって実行されるプログラムであって、イメージデータを入力する入力ステップと、該入力されたイメージデータからベクトルデータを生成するベクトル化ステップと、該生成されたベクトルデータを所定のファイル形式で出力する出力ステップとを含むことを特徴とする。 The program of the present invention is a program executed by a CPU in an image processing apparatus, and includes an input step for inputting image data, a vectorization step for generating vector data from the input image data, and the generation step. And outputting the vector data in a predetermined file format.

本発明によれば、入力されたイメージデータからベクトルデータを生成し、所定のファイル形式で出力するので、紙文書もしくは再利用が困難なイメージ情報を、再利用可能な電子ファイルとして扱えるようにすることができる。これにより、あたかもオリジナルな電子ファイルを入手したのと同等になる。また、ユーザ所望のアプリケーションに適したファイル形式でベクトルデータを生成することができる。 According to the present invention, vector data is generated from input image data and output in a predetermined file format, so that a paper document or image information that is difficult to reuse can be handled as a reusable electronic file. be able to. This is equivalent to getting the original electronic file. Moreover, vector data can be generated in a file format suitable for a user-desired application.

本発明の画像処理装置、画像処理方法およびプログラムの実施の形態について図面を参照しながら説明する。本実施形態の画像処理装置は画像処理システムに適用される。 Embodiments of an image processing apparatus, an image processing method, and a program according to the present invention will be described with reference to the drawings. The image processing apparatus of this embodiment is applied to an image processing system.

［第１の実施形態］
図１は第１の実施形態における画像処理システムの構成を示す図である。本実施形態の画像処理システムは、オフィス１０内のネットワーク（ＬＡＮ）１０７と、オフィス２０内のネットワーク（ＬＡＮ）１０８とが、インターネット１０４を介して接続された環境で実現される。 [First Embodiment]
FIG. 1 is a diagram illustrating a configuration of an image processing system according to the first embodiment. The image processing system according to this embodiment is realized in an environment in which a network (LAN) 107 in the office 10 and a network (LAN) 108 in the office 20 are connected via the Internet 104.

オフィス１０内に構築されたＬＡＮ１０７には、複合機１００、この複合機１００を制御するマネージメントＰＣ１０１、クライアントＰＣ１０２、文書管理サーバ１０６、データベース１０５およびプロキシサーバ１０３が接続されている。一方、オフィス２０内に構築されたＬＡＮ１０８には、文書管理サーバ１１６、データベース１１５およびプロキシサーバ１１３が接続されている。また、ＬＡＮ１０７およびＬＡＮ１０８は、プロキシサーバ１０３、１１３を介してインターネット１０４に接続される。 A LAN 107 constructed in the office 10 is connected to a multifunction device 100, a management PC 101 that controls the multifunction device 100, a client PC 102, a document management server 106, a database 105, and a proxy server 103. On the other hand, a document management server 116, a database 115, and a proxy server 113 are connected to the LAN 108 built in the office 20. The LAN 107 and the LAN 108 are connected to the Internet 104 via proxy servers 103 and 113.

複合機１００は、紙文書の画像を読み取り、読み取った画像信号に対する画像処理を行い（前処理）、ＬＡＮ１０９を通じて画像データをマネージメントＰＣ１０１に伝送する。マネージメントＰＣ１０１は、通常のＰＣからなり、画像記憶部、画像処理部、表示部および入力部を有する。マネージメントＰＣ１０１の一部は、複合機１００と一体に構成されている。 The multi-function device 100 reads an image of a paper document, performs image processing on the read image signal (preprocessing), and transmits the image data to the management PC 101 via the LAN 109. The management PC 101 is a normal PC and includes an image storage unit, an image processing unit, a display unit, and an input unit. A part of the management PC 101 is configured integrally with the multifunction device 100.

図２は複合機１００の電気的構成を示す図である。この複合機１００は、画像読み取り部２１０、記憶部２１１、記録部２１２、入力部２１３、表示部２１６、データ処理部２１５およびネットワークインタフェース（Ｉ／Ｆ）２１４、２１７を有する。入力部２１３には、キー操作部が設けられている。 FIG. 2 is a diagram illustrating an electrical configuration of the multifunction peripheral 100. The multifunction device 100 includes an image reading unit 210, a storage unit 211, a recording unit 212, an input unit 213, a display unit 216, a data processing unit 215, and network interfaces (I / F) 214 and 217. The input unit 213 is provided with a key operation unit.

画像読み取り部２１０は、オートドキュメントフィーダ（ＡＤＦ）を有し、搬送された原稿を光源で照射し、原稿からの反射光をレンズで固体撮像素子上に結像し、固体撮像素子からラスタ画像信号を密度６００ｄｐｉのイメージ情報として取得する。通常の複写を行う場合、データ処理部２１５は、ラスタ画像信号の画像処理を行い、記録信号（記録データ）とする。そして、複数枚の複写である場合、一旦、１ページ分の記録データを記憶部２１１に記憶・保持した後、記録部２１２に順次出力し、用紙上に画像を形成する。 The image reading unit 210 includes an auto document feeder (ADF), irradiates a conveyed document with a light source, forms an image of reflected light from the document on a solid-state image sensor with a lens, and outputs a raster image signal from the solid-state image sensor. Is acquired as image information having a density of 600 dpi. When normal copying is performed, the data processing unit 215 performs image processing of the raster image signal to obtain a recording signal (recording data). In the case of a plurality of copies, the recording data for one page is temporarily stored and held in the storage unit 211 and then sequentially output to the recording unit 212 to form an image on a sheet.

また、クライアントＰＣ１０２から出力されるプリントデータは、ＬＡＮ１０７からネットワークＩＦ２１４を経由して、データ処理部２１５に読み込まれると、データ処理部２１５で記録（印刷）可能なラスタデータに変換された後、記録部２１２によって用紙上に画像として形成される。また、複合機１００に対する操作者の指示は、複合機１００に設けられたキー操作部、マネージメントＰＣ１０１に設けられたキーボードおよびマウスから行われる。また、入力状態の表示および処理中の画像データの表示は、表示部２１６で行われる。これら一連の動作は、データ処理部２１５内の制御部（ＣＰＵ）によって制御される。 Also, when print data output from the client PC 102 is read into the data processing unit 215 from the LAN 107 via the network IF 214, the print data is converted into raster data that can be recorded (printed) by the data processing unit 215, and then recorded. An image is formed on the paper by the unit 212. An operator's instruction to the multifunction device 100 is issued from a key operation unit provided in the multifunction device 100, a keyboard and a mouse provided in the management PC 101. Further, the display of the input state and the image data being processed are performed on the display unit 216. These series of operations are controlled by a control unit (CPU) in the data processing unit 215.

また、記憶部２１１は、マネージメントＰＣ１０１からも制御される。複合機１００およびマネージメントＰＣ１０１間のデータの授受および制御は、ネットワークＩＦ２１７および直結したＬＡＮ１０９を用いて行われる。 The storage unit 211 is also controlled from the management PC 101. Data exchange and control between the MFP 100 and the management PC 101 are performed using a network IF 217 and a directly connected LAN 109.

上記構成を有する画像処理システムの動作を示す。図３および図４は複合機１００の動作処理手順を示すフローチャートである。この処理プログラムは、複合機１００およびマネージメントＰＣ１０１内の記憶媒体に格納されており、データ処理部２１５内のＣＰＵおよびマネージメントＰＣ１０１内のＣＰＵによって実行される。まず、複合機１００の画像読み取り部２１０を動作させ、１枚の原稿をラスタ方式で走査し、分解能６００ｄｐｉ、８ビットからなる画像データを取得する処理（イメージ情報入力処理）を行う（ステップＳ１）。このとき、データ処理部２１５は、取得した画像データの前処理を行い、１ページ分の画像データを記憶部２１１に保存する。 The operation of the image processing system having the above configuration will be described. 3 and 4 are flowcharts showing the operation processing procedure of the multifunction peripheral 100. FIG. This processing program is stored in a storage medium in the multifunction peripheral 100 and the management PC 101, and is executed by the CPU in the data processing unit 215 and the CPU in the management PC 101. First, the image reading unit 210 of the multi-function device 100 is operated to scan one original by the raster method, and perform processing (image information input processing) for obtaining image data having a resolution of 600 dpi and 8 bits (step S1). . At this time, the data processing unit 215 performs preprocessing of the acquired image data, and stores image data for one page in the storage unit 211.

マネージメントＰＣ１０１のＣＰＵは、ブロック選択（ＢＳ）処理を行い（ステップＳ２）、記憶部２１１に保存された画像データに対し、文字／線画部分とハーフトーンの画像部分とに領域を分離する。さらに、文字／線画部分のうち、文字部を段落毎に塊として纏まっているブロック毎に分離し、、線画部を線で構成された表、図形に分離して、それぞれをセグメント化する。また、ハーフトーンで表現される画像部分を、矩形に分離されたブロックの画像部、背景部など、いわゆるブロック毎に独立したオブジェクトに分割する。 The CPU of the management PC 101 performs block selection (BS) processing (step S2), and separates the area into character / line image portions and halftone image portions for the image data stored in the storage unit 211. Further, among the character / line drawing parts, the character part is separated into blocks each grouped as a lump for each paragraph, the line drawing part is separated into a table and a figure composed of lines, and each is segmented. In addition, the image portion expressed in halftone is divided into so-called independent objects for each block, such as an image portion and a background portion of a block separated into rectangles.

このとき、原稿画像中に付加情報として記録された２次元バーコード、あるいはＵＲＬに相当するオブジェクトを検出し、２次元バーコードの場合、マークを解読し、ＵＲＬの場合、ＯＣＲで文字を認識する処理（ＯＣＲ／ＯＭＲ処理）を行う（ステップＳ３）。そして、原稿のオリジナル電子ファイルが格納されている記憶装置内のポインタ情報を検出する（ステップＳ４）。尚ポインタ情報を付加する方法としては、この他、文字と文字の間隔に情報を埋め込む方法、ハーフトーン画像に埋め込む方法など、直接可視化されない、いわゆる電子透かしによる方法であってもよい。 At this time, a two-dimensional barcode recorded as additional information in the document image or an object corresponding to the URL is detected. In the case of the two-dimensional barcode, the mark is decoded, and in the case of the URL, the character is recognized by OCR. Processing (OCR / OMR processing) is performed (step S3). Then, the pointer information in the storage device storing the original electronic file of the document is detected (step S4). In addition, as a method of adding pointer information, there may be a so-called digital watermark method that is not directly visualized, such as a method of embedding information between characters and a method of embedding in a halftone image.

ポインタ情報が検出されたか否かを判別する（ステップＳ５）。ポインタ情報が検出された場合、ポインタ情報に含まれるアドレスから元の電子ファイルを検索し（ステップＳ６Ａ）、電子ファイルの有無を確認する（ステップＳ６）。ここで、電子ファイルは、クライアントＰＣ１０２のハードディスク内、あるいはオフィス１０、２０の各ＬＡＮに接続された文書管理サーバ１０６、１１６によって管理されるデータベース１０５、１１５内、あるいは複合機１００の記憶部２１１内のいずれかに格納されており、ステップＳ３で得られたポインタ情報を基に検索される。 It is determined whether pointer information is detected (step S5). When the pointer information is detected, the original electronic file is searched from the address included in the pointer information (step S6A), and the presence or absence of the electronic file is confirmed (step S6). Here, the electronic file is stored in the hard disk of the client PC 102, in the databases 105 and 115 managed by the document management servers 106 and 116 connected to the LANs of the offices 10 and 20, or in the storage unit 211 of the MFP 100. And is searched based on the pointer information obtained in step S3.

ステップＳ６で電子ファイルが見つからなかった場合、あるいは電子ファィルは見つかったが、ＰＤＦ形式あるいはｔｉｆｆ形式に代表されるいわゆるイメージファイルであった場合、ステップＳ７の処理に移行する。また、ステップＳ５でポインタ情報が存在しなかった場合、ステップＳ７に移行する。 If the electronic file is not found in step S6, or if the electronic file is found but is a so-called image file represented by the PDF format or the tiff format, the process proceeds to step S7. If no pointer information exists in step S5, the process proceeds to step S7.

そして、文書検索処理を行う（ステップＳ７）。この文書検索処理では、各文字ブロック毎に行われたＯＣＲの結果を基に単語を抽出する全文検索が行われたり、あるいは各オブジェクトの配列と各オブジェクトの属性を基にレイアウト検索が行われる。 Then, a document search process is performed (step S7). In this document search process, a full-text search for extracting words based on the result of OCR performed for each character block is performed, or a layout search is performed based on the array of each object and the attributes of each object.

全文あるいはレイアウト検索の結果類似度の高い電子ファイルが見つかった場合、サムネイル画像等でファィル候補を表示し、複数のファィル候補の中から操作者の選択が必要である場合、操作者の入力操作によってファイルを特定する（ステップＳ８）。尚、ファイル候補が１つのファイルだけであっても、本実施形態では、そのままステップＳ８の処理に移行していたが、このような場合、ステップＳ８の処理を行わず、ステップＳ９、Ｓ１０に移行し、ファイル候補のアドレスをユーザに通知するようにしてもよい。 As a result of full text or layout search When an electronic file with a high degree of similarity is found, a file candidate is displayed as a thumbnail image, etc., and when an operator needs to be selected from a plurality of file candidates, A file is specified (step S8). Even if the file candidate is only one file, in the present embodiment, the process directly proceeds to step S8. In such a case, the process proceeds to step S9 and S10 without performing step S8. Alternatively, the file candidate address may be notified to the user.

そして、電子ファイルが見つかったか否かを判別し（ステップＳ９）、見つかった場合、ステップＳ８で特定された電子ファイルのアドレスをユーザに通知する（ステップＳ１０）。また、ステップＳ６でポインタ情報を基に電子ファイルが見つかった場合も、その電子ファイルが格納されているアドレスをユーザに通知する。一方、ステップＳ９で電子ファイルが見つからなかった場合、ステップＳ１３の処理に移行する。 Then, it is determined whether or not an electronic file is found (step S9). If found, the address of the electronic file specified in step S8 is notified to the user (step S10). If an electronic file is found based on the pointer information in step S6, the user is notified of the address where the electronic file is stored. On the other hand, if the electronic file is not found in step S9, the process proceeds to step S13.

アドレスを基に見つかった電子ファイルを取得（入手）する（ステップＳ１１）。取得した電子ファイルがＰＤＦ形式あるいはｔｉｆｆ形式に代表されるいわゆるイメージファイルであるか否かを判別する（ステップＳ１２）。イメージファイルであると判別された場合、ステップＳ１で入力されたイメージ情報と同等であるので、ベクトル化処理を行う（ステップＳ１３）。このベクトル化処理については後述する。ベクトル化処理を行った後、一般のアプリケーションで編集可能なデータに変換する（ステップＳ１４）。 An electronic file found based on the address is obtained (obtained) (step S11). It is determined whether or not the acquired electronic file is a so-called image file represented by a PDF format or a tiff format (step S12). If it is determined that the file is an image file, since it is equivalent to the image information input in step S1, vectorization processing is performed (step S13). This vectorization process will be described later. After the vectorization process, the data is converted into data that can be edited by a general application (step S14).

編集可能なデータに変換した後、電子ファイルの格納場所を新たなポインタ情報として電子ファイルに付加し（ステップＳ１５）、所定の格納領域に格納する（ステップＳ１６）。また、ステップＳ１２でイメージ情報でないと判別された場合も同様、ステップＳ１５で電子ファイルの格納場所を新たなポインタ情報として電子ファイルに付加し、ステップＳ１６で所定の格納領域に格納する。このとき、ステップＳ１５では、電子ファイルの格納場所をポインタ情報として保存される電子ファイルに付加するようにする。これにより、保存された文書が次回以降の検索対象となり、電子ファイルの印刷時にポインタ情報を２次元バーコードなどで印字することができ、印刷文書から保存された文書に容易にアクセスすることができる。この後、本処理を終了する。 After conversion to editable data, the storage location of the electronic file is added to the electronic file as new pointer information (step S15) and stored in a predetermined storage area (step S16). Similarly, when it is determined that the image information is not image information in step S12, the storage location of the electronic file is added to the electronic file as new pointer information in step S15, and stored in a predetermined storage area in step S16. At this time, in step S15, the storage location of the electronic file is added to the electronic file stored as pointer information. As a result, the stored document becomes a search target from the next time, pointer information can be printed with a two-dimensional barcode or the like when printing an electronic file, and the stored document can be easily accessed from the printed document. . Then, this process is complete | finished.

ステップＳ１３のベクトル化処理では、イメージデータからベクトルデータへの変換処理が行われる。具体的に、ステップＳ２で所定のブロック毎に分割されたオブジェクトに対し、ベクトル化処理が行われるが、このベクトル処理には、以下の処理が含まれる。 In the vectorization processing in step S13, conversion processing from image data to vector data is performed. Specifically, the vectorization process is performed on the object divided for each predetermined block in step S2. This vector process includes the following processes.

・ステップＳ３でＯＣＲによりコード情報に変換された文字ブロックに対し、文字のサイズ、スタイル、字体を認識し、原稿を走査して得られた文字に近くなるように可視的に忠実なフォントデータに変換する。 -Recognize the character size, style, and font for the character block converted to code information by OCR in step S3, and make the font data visually and faithfully close to the character obtained by scanning the document. Convert.

・ステップＳ３で処理されていない文字オブジェクトを１文字単位で切り出し、ＯＣＲによりコード情報に変換する。 A character object that has not been processed in step S3 is cut out in units of one character and converted into code information by OCR.

・文字オブジェクトであるが、ＯＣＲにより認識不可能な文字の輪郭を追跡（トレース）し、輪郭情報（アウトライン情報）を線分のつながりとして表現する形式に変換する。 -The outline of a character object that is unrecognizable by OCR is traced (traced), and the outline information (outline information) is converted into a format that represents the connection of line segments.

・図形オブジェクトの場合、輪郭情報を追跡（トレース）し、線分のつながりとして表現する形式に変換する。 In the case of a graphic object, the outline information is tracked (traced) and converted into a format that is expressed as a connection of line segments.

・上記線分形式のアウトライン情報をベジエ関数などの関数によりフィッティングした関数情報に変換する。 -The outline information in the above line segment format is converted into function information fitted by a function such as a Bezier function.

・図形オブジェクトの輪郭情報から図形の形状を認識し、円、矩形、多角形などの図形定義情報に変換する。 -Recognize the shape of the figure from the outline information of the figure object, and convert it into figure definition information such as a circle, rectangle, or polygon.

・表形式のオブジェクトの場合、罫線や枠線を認識し、所定のフォーマットの帳票フォーマット情報に変換する。 In the case of a tabular object, it recognizes ruled lines and frame lines and converts them into form format information in a predetermined format.

これらの処理の他、ラスタ情報を所定のコマンドやコード情報に置き換えるような各種ベクトル化処理であってもよい。そして、これらのベクトル化処理は、各オブジェクト毎に行われ、各オブジェクトのレイアウト情報が保存されると、前述したように、ステップＳ１４で一般のアプリケーションで編集可能なデータに変換された後、ステップＳ１６で電子ファイルとして記憶部２１１に格納される。 In addition to these processes, various vectorization processes such as replacing raster information with predetermined command or code information may be used. These vectorization processes are performed for each object, and when the layout information of each object is stored, as described above, after being converted into data that can be edited by a general application in step S14, In S16, it is stored in the storage unit 211 as an electronic file.

ここで、編集可能なアプリケーションのデータ形式を示す。通常、データ形式は、使用するアプリケーションに依存しており、一般的に知られているものとしては、米国マイクロソフト社のワードプロセッサであるＷｏｒｄ（登録商標）や表計算アプリケーションであるＥｘｃｅｌ（登録商標）などがある。これらのアプリケーションは、それぞれ使用目的が異なっており、目的に応じたファイル形式が定義され、そのファイル形式でファイル（データ）を保存するようにしている。 Here, the data format of an editable application is shown. Usually, the data format depends on the application to be used. Commonly known data formats include Word (registered trademark), a word processor of Microsoft Corp., and Excel (registered trademark), a spreadsheet application. There is. Each of these applications has a different purpose of use, and a file format corresponding to the purpose is defined, and a file (data) is stored in the file format.

また、ある程度、汎用的に使用可能なファイル形式としては、米国マイクロソフト社のＲＴＦ（ＲｉｃｈＴｅｘｔＦｏｒｍａｔ）形式、近年ではＳＶＧ（ＳｃａｌａｂｌｅＶｅｃｔｏｒＧｒａｐｈｉｃｓ）形式、あるいは単純にテキストデータのみを扱うプレーンテキスト形式などが知られており、これらは、対応するアプリケーションである限り、共通に使用可能なファイル形式となっている。 In addition, as a file format that can be used to some extent, there is a Microsoft Text Corp. RTF (Rich Text Format) format, in recent years an SVG (Scalable Vector Graphics) format, or a plain text format that simply handles only text data. As long as it is a corresponding application, these are file formats that can be used in common.

本実施形態では、ベクトル化された情報をどのようなファイル形式で生成するかを選択可能とし、かつ選択されたファイル形式に応じて最適なベクトル化処理を施すことが可能である。また、このベクトル化処理は、単純にイメージデータを直接扱う場合に比べ、ベクトルデータとして編集可能になるだけでなく、情報量を削減でき、蓄積効率を向上でき、伝送時間を短縮できる。また、記録・表示する際、高品位なデータとして扱うことができる。 In the present embodiment, it is possible to select in what file format the vectorized information is generated, and it is possible to perform an optimal vectorization process according to the selected file format. In addition, this vectorization processing not only enables editing as vector data, but also reduces the amount of information, improves the storage efficiency, and shortens the transmission time, compared to a case where image data is directly handled. Moreover, when recording and displaying, it can be handled as high-quality data.

［ブロック選択（ＢＳ）処理］
図５はブロック選択処理によって記憶部２１１に保存された画像データを、文字／線画部分とハーフトーンの画像部分とに領域を分離する例を示す図である。同図（Ａ）はブロック選択処理を行う前の画像データを示し、同図（Ｂ）はブロック選択処理を行って分離された領域を示す。このブロック選択（セレクション）処理は、読み取った１頁分のイメージデータ（同図（Ａ）参照）を、各オブジェクト毎の塊として認識し、文字（ＴＥＸＴ）、写真（Ｐｈｏｔｏ）、線（Ｌｉｎｅ）、表（Ｔａｂｌｅ）など、各ブロックの属性を判定し、異なる属性を持つ領域に分割する処理である。 [Block selection (BS) processing]
FIG. 5 is a diagram showing an example in which the image data stored in the storage unit 211 by the block selection process is divided into a character / line image part and a halftone image part. FIG. 4A shows image data before the block selection process is performed, and FIG. 4B shows an area separated by the block selection process. In this block selection (selection) process, the read image data for one page (see FIG. 1A) is recognized as a block for each object, and is displayed as text (TEXT), photograph (Photo), line (Line). This is a process of determining the attribute of each block, such as a table, and dividing it into areas having different attributes.

具体的に、まず入力した画像を白黒画像に二値化し、輪郭線追跡を行い、黒画素の輪郭で囲まれる画素の塊を抽出する。面積の大きい黒画素の塊については、内部にある白画素に対しても輪郭線追跡を行って白画素の塊を抽出し、さらに一定面積以上の白画素の塊の内部から再帰的に黒画素の塊を抽出する。 Specifically, first, the input image is binarized into a black and white image, contour tracing is performed, and a block of pixels surrounded by a black pixel contour is extracted. For a black pixel block with a large area, the white pixel block is extracted by tracing the contours of the white pixel inside, and then the black pixel is recursively extracted from the inside of the white pixel block having a certain area or more. Extract the lump.

このようにして得られた黒画素の塊を、大きさおよび形状で分類し、異なる属性を持つ領域に分類する。例えば、縦横比が値１に近く、大きさが一定の範囲のものを文字相当の画素の塊とし、さらに近接する文字が整列良くグループ化可能な部分を文字領域とする。また、扁平な画素の塊を線領域とする。また、一定の大きさ以上でかつ四角形の白画素の塊を整列よく内包する黒画素の塊が占める範囲を表領域とする。また、不定形の画素の塊が散在している領域を写真領域とする。それ以外の任意形状の画素の塊を図画領域とする。 The black pixel blocks obtained in this way are classified by size and shape, and are classified into areas having different attributes. For example, an area having an aspect ratio close to a value 1 and having a constant size is defined as a block of pixels corresponding to a character, and a portion where adjacent characters can be grouped in a well-aligned manner is defined as a character region. Further, a flat pixel block is defined as a line region. Further, a range occupied by a block of black pixels that are not less than a certain size and includes a block of square white pixels in a well-aligned manner is defined as a surface area. Further, an area where irregularly shaped clusters of pixels are scattered is defined as a photographic area. A block of pixels having any other shape is used as a drawing area.

図６はブロック選択処理で得られた各ブロックのブロック情報を示す図である。各ブロック情報は、ベクトル化あるいは検索のための情報として用いられる。ここでは、各ブロック毎に、属性、座標Ｘ、座標Ｙ、幅Ｗ、高さＨ、ＯＣＲ情報の有無がブロック情報として用いられる。属性の値１、２、３、４、５はそれぞれ文字、図形、表、線、写真に相当する。 FIG. 6 is a diagram showing block information of each block obtained by the block selection process. Each block information is used as information for vectorization or search. Here, for each block, the attribute, coordinate X, coordinate Y, width W, height H, and presence / absence of OCR information are used as block information. The attribute values 1, 2, 3, 4, and 5 correspond to characters, figures, tables, lines, and photographs, respectively.

［ポインタ情報の検出］
つぎに、ステップＳ３における電子ファイルの格納位置をイメージ情報から抽出するＯＣＲ／ＯＭＲ処理を示す。図７は原稿画像中に付加された２次元バーコードシンボル（ＱＲコード）を復号してデータ文字列を出力する処理手順を示すフローチャートである。図８は２次元バーコードが付加された原稿を示す図である。 [Detection of pointer information]
Next, OCR / OMR processing for extracting the storage location of the electronic file from the image information in step S3 will be described. FIG. 7 is a flowchart showing a processing procedure for decoding a two-dimensional barcode symbol (QR code) added to a document image and outputting a data character string. FIG. 8 is a diagram showing a document to which a two-dimensional barcode is added.

まず、データ処理部２１５内のページメモリに格納された原稿３１０のイメージ画像をＣＰＵによって走査し、前述したブロック選択処理の結果を基に、２次元バーコードシンボル３１１の位置を検出する（ステップＳ２１）。ここで、ＱＲコードの位置検出パターンは、シンボル３１１の４隅のうち、３隅に配置された同一の位置検出要素パターン３１１ａから構成される。 First, the image of the original 310 stored in the page memory in the data processing unit 215 is scanned by the CPU, and the position of the two-dimensional barcode symbol 311 is detected based on the result of the block selection process described above (step S21). ). Here, the QR code position detection pattern is composed of the same position detection element patterns 311 a arranged at three corners of the four corners of the symbol 311.

ＱＲコードの位置検出パターンに隣接する形式情報を復元し、シンボル３１１に適用されている誤り訂正レベルおよびマスクパターンを取得する（ステップＳ２２）。シンボル３１１の型番を決定し（ステップＳ２３）、形式情報から得られたマスクパターンを使って符号化領域ビットパターンをＸＯＲ演算することによって、マスク処理を解除する（ステップＳ２４）。モデルに対応する配置規則に従い、シンボルキャラクタを読み取り、メッセージのデータおよび誤り訂正コード語を復元する（ステップＳ２５）。 The format information adjacent to the QR code position detection pattern is restored, and the error correction level and mask pattern applied to the symbol 311 are acquired (step S22). The model number of the symbol 311 is determined (step S23), and the mask process is canceled by performing an XOR operation on the encoded area bit pattern using the mask pattern obtained from the format information (step S24). In accordance with the arrangement rule corresponding to the model, the symbol character is read to restore the message data and the error correction code word (step S25).

復元されたコード上に誤りがあるか否かを検出し（ステップＳ２６）、誤りが検出された場合、これを訂正する（ステップＳ２７）。一方、誤りが検出されない場合、ステップＳ２８の処理に移行する。誤り訂正されたデータから、モード指示子および文字数指示子に基づき、データコード語をセグメントに分割する（ステップＳ２８）。最後に、仕様モードに基づいてデータ文字列を復号化し、その結果を出力する（ステップＳ２９）。この後、本処理を終了する。 It is detected whether or not there is an error in the restored code (step S26). If an error is detected, it is corrected (step S27). On the other hand, if no error is detected, the process proceeds to step S28. Based on the mode indicator and the character number indicator, the data code word is divided into segments from the error-corrected data (step S28). Finally, the data character string is decoded based on the specification mode, and the result is output (step S29). Then, this process is complete | finished.

尚、２次元バーコード内に組み込まれたデータは、対応する電子ファイルのアドレス情報を表しており、例えば、ファイルサーバ名およびファイル名からなるパス情報から構成される。または、対応する電子ファイルへのＵＲＬから構成される。 The data incorporated in the two-dimensional barcode represents the address information of the corresponding electronic file, and is composed of path information including a file server name and a file name, for example. Alternatively, it consists of a URL to the corresponding electronic file.

また、本実施形態では、２次元バーコードを用いてポインタ情報が付与された原稿３１０を示したが、ポインタ情報が文字列で直接記録される場合、所定のルールに則った文字列のブロックを先のブロック選択処理で検出し、ポインタ情報を示す文字列の各文字を文字認識することで、元の電子ファイルのアドレス情報を直接得ることも可能である。 Further, in the present embodiment, the document 310 provided with pointer information using a two-dimensional barcode is shown. However, when pointer information is directly recorded as a character string, a block of character strings in accordance with a predetermined rule is displayed. It is also possible to directly obtain the address information of the original electronic file by detecting each character of the character string indicating the pointer information detected by the previous block selection process.

また、原稿３１０の文字ブロックあるいは文字列に対し、隣接する文字と文字の間隔などに、視認し難い程度の変調を加え、この文字間隔に情報を埋め込むことでもポインタ情報を付与できる。この透かし情報を基に後述する文字認識処理を行う際、各文字の間隔を検出することにより、ポインタ情報が得られる。また、自然画の中に、電子透かしとしてポインタ情報を付加することも可能である。 In addition, pointer information can be given to the character block or character string of the original 310 by adding modulation that is difficult to visually recognize to the interval between adjacent characters and embedding information in the character interval. When character recognition processing, which will be described later, is performed based on this watermark information, pointer information is obtained by detecting the interval between characters. It is also possible to add pointer information as a digital watermark in a natural image.

［ポインタ情報によるファイル検索］
図９はステップＳ６Ａにおけるポインタ情報を基に電子ファイルを検索する処理手順を示すフローチャートである。まず、ポインタ情報に含まれるアドレスに基づき、ファイルサーバを特定する（ステップＳ３１）。ここでは、ファイルサーバとして、クライアントＰＣ１０２、データベース１０５を管理する文書管理サーバ１０６、あるいは記憶部２１１を有する複合機１００が特定される。また、アドレスは、ＵＲＬ、あるいはサーバ名およびファイル名からなるパス情報である。 [File search by pointer information]
FIG. 9 is a flowchart showing a processing procedure for searching for an electronic file based on the pointer information in step S6A. First, the file server is specified based on the address included in the pointer information (step S31). Here, the client PC 102, the document management server 106 that manages the database 105, or the multifunction peripheral 100 having the storage unit 211 is specified as the file server. The address is URL or path information including a server name and a file name.

ファイルサーバを特定すると、ファイルサーバにアドレスを転送する（ステップＳ３２）。ファイルサーバはアドレスを受け取ると、該当するファイルを検索する。そして、ファイルサーバからその検索結果を受信する（ステップＳ３３）。ファイルがあるか否かを判別する（ステップＳ３４）。ファイルが存在しない場合、その旨を複合機１００に通知し、本処理を終了し、メイン処理に復帰する。一方、ファイルが存在する場合、ファイルのアドレスを通知するとともに、ファイルサーバに対し、該当するファイルの転送を指示する（ステップＳ３５）。この後、本処理を終了し、メインの処理に復帰する。 When the file server is specified, the address is transferred to the file server (step S32). When the file server receives the address, it searches for the corresponding file. Then, the search result is received from the file server (step S33). It is determined whether or not there is a file (step S34). If the file does not exist, this is notified to the multi-function device 100, the process is terminated, and the process returns to the main process. On the other hand, if the file exists, the file address is notified and the file server is instructed to transfer the file (step S35). Thereafter, the present process is terminated and the process returns to the main process.

図３および図４に示すメイン処理では、ステップＳ３３の検索結果を基に、前述したように、ステップＳ６で該当する電子ファイルが存在するか否かを判別し、該当する電子ファイルが存在しない場合、その旨を通知し、ステップＳ７の処理に移行する。一方、該当する電子ファイルが存在する場合、ステップＳ１０で電子ファイルのアドレスを通知し、ステップＳ１１でファイルサーバから転送される電子ファイルを受信する。 In the main process shown in FIGS. 3 and 4, based on the search result of step S33, as described above, it is determined whether or not the corresponding electronic file exists in step S6, and the corresponding electronic file does not exist. , To that effect, the process proceeds to step S7. On the other hand, if the corresponding electronic file exists, the electronic file address is notified in step S10, and the electronic file transferred from the file server is received in step S11.

［ファイル検索処理］
図１０および図１１はステップＳ７におけるファイル検索処理手順を示すフローチャートである。ステップＳ７の処理は、前述したように、ステップＳ５で入力された原稿のイメージ情報（入力ファイル）にポインタ情報が存在しなかった場合、あるいはステップＳ６でポインタ情報は存在しても、電子ファイルが見つからなかった場合に実行される。 [File search processing]
10 and 11 are flowcharts showing the file search processing procedure in step S7. As described above, the processing in step S7 is performed when the pointer information does not exist in the image information (input file) of the document input in step S5, or even if the pointer information exists in step S6. It is executed when not found.

ここでは、ステップＳ３の処理結果、抽出された各ブロックおよび入力ファイルが図６に示す情報（ブロック情報、入力ファイル情報）を有する場合を示す。前述したように、図６では、ブロック情報の内容として、属性、座標位置、幅と高さのサイズ、ＯＣＲ情報の有無が挙げられている。 Here, a case is shown where each block and input file extracted as a result of the processing in step S3 have the information (block information and input file information) shown in FIG. As described above, in FIG. 6, the contents of the block information include attributes, coordinate positions, width and height sizes, and presence / absence of OCR information.

属性は、文字、線、写真、絵、表その他に分類される。また、ここでは、説明を簡単にするため、各ブロックには、その座標Ｘの小さい順に（Ｘ１＜Ｘ２＜Ｘ３＜Ｘ４＜Ｘ５＜Ｘ６）、ブロック１、ブロック２、ブロック３、ブロック４、ブロック５、ブロック６と番号が付けられている。また、ブロック総数Ｎは、入力ファイル中の全ブロック数である。ここでは、ブロック総数Ｎは値６である。 Attributes are classified into characters, lines, photographs, pictures, tables and others. Also, here, in order to simplify the description, each block has a block 1, a block 2, a block 3, a block 4, a block, in order of increasing coordinates X (X1 <X2 <X3 <X4 <X5 <X6). 5 and block 6 are numbered. The total number N of blocks is the total number of blocks in the input file. Here, the total number N of blocks is the value 6.

そして、これらの情報（ブロック情報、入力ファイル情報）を使用し、データベースの中から、入力ファイルに類似したファイルのレイアウト検索を行う。図１０の処理では、入力ファイルとデータベース中のファイルが順次比較される。まず、最初のデータベース内のファイルに対し、類似率などの初期化を行う（ステップＳ４１）。そして、ブロック総数の比較を行う（ステップＳ４２）。このブロック数の比較は、数式（１）にしたがって行われる。ここで、ｎはデータベース内のファイルのブロック総数である。ΔＮは入力ファイルのブロック総数の誤差である。 Then, using these pieces of information (block information and input file information), a layout search for a file similar to the input file is performed from the database. In the process of FIG. 10, the input file and the file in the database are sequentially compared. First, initialization such as a similarity ratio is performed on a file in the first database (step S41). Then, the total number of blocks is compared (step S42). The comparison of the number of blocks is performed according to the mathematical formula (1). Here, n is the total number of blocks in the file in the database. ΔN is an error in the total number of blocks in the input file.

Ｎ−ΔＮ＜ｎ＜Ｎ＋ΔＮ …… （１）
ステップＳ４２で数式（１）を満足する場合、さらにファイル内のブロック情報を順次比較する。このブロック情報の比較では、属性類似率、サイズ類似率、ＯＣＲ類似率をそれぞれ算出し、これらの値を基に、総合類似率を算出する。すなわち、入力ファイルとデータベース内のファイルのブロック属性を比較し（ステップＳ４３）、一致する場合、データベース内のファイルの属性類似率を更新する（ステップＳ４４）。さらに、入力ファイルとデータベース内のファイルのサイズが数式（２）を満足するか否かの判別、つまりサイズ比較処理を行う（ステップＳ４５）。ここで、Ｗは入力ファイルのブロック幅である。Ｈは入力ファイルのブロック高さである。ΔＷは入力ファイルのブロック幅の誤差である。ΔＨは入力ファイルのブロック高さである。ｗはデータベース内のファイルのブロック幅である。ｈはデータベース内のファイルのブロック高さである。 N−ΔN <n <N + ΔN (1)
If the formula (1) is satisfied in step S42, the block information in the file is further compared sequentially. In this block information comparison, an attribute similarity rate, a size similarity rate, and an OCR similarity rate are calculated, and an overall similarity rate is calculated based on these values. That is, the block attributes of the input file and the file in the database are compared (step S43), and if they match, the attribute similarity of the file in the database is updated (step S44). Further, it is determined whether or not the sizes of the input file and the file in the database satisfy the formula (2), that is, a size comparison process is performed (step S45). Here, W is the block width of the input file. H is the block height of the input file. ΔW is an error in the block width of the input file. ΔH is the block height of the input file. w is the block width of the file in the database. h is the block height of the file in the database.

Ｗ−ΔＷ＜ｗ＜Ｗ＋ΔＷかつＨ−ΔＨ＜ｈ＜Ｈ＋ΔＨ …… （２）
数式（２）を満足する場合、データベース内のファイルのサイズ類似率を更新する（ステップＳ４６）。さらに、入力ファイルとデータベース内のファイルが共にＯＣＲ情報を有するか否かを判別する（ステップＳ４７）。ＯＣＲ情報を有する場合、ＯＣＲ情報を比較し（ステップＳ４８）、ＯＣＲ類似率を更新する（ステップＳ４９）。尚、属性、サイズ、ＯＣＲ情報の各類似率の算出には、周知の技術が用いられるので、その説明を省略する。 W−ΔW <w <W + ΔW and H−ΔH <h <H + ΔH (2)
If the mathematical formula (2) is satisfied, the file size similarity in the database is updated (step S46). Further, it is determined whether or not both the input file and the file in the database have OCR information (step S47). If it has OCR information, the OCR information is compared (step S48), and the OCR similarity is updated (step S49). In addition, since a well-known technique is used for calculation of each similarity rate of an attribute, size, and OCR information, the description is abbreviate | omitted.

そして、入力ファイルの全てのブロックについて、類似率の比較を行ったか否かを判別する（ステップＳ５０）。全てのブロックについて終わっていない場合、ステップＳ５５の処理を行った後、ステップＳ４３の処理に戻る。ステップＳ５５では、Ｎ≦ｎである場合、つまりデータベース内のファイルのブロック総数ｎが入力ファイルのブロック総数Ｎ以上である場合、入力ファイル内の次のブロックを処理対象とし、一方、Ｎ＞ｎである場合、つまりデータベース内のファイルのブロック総数ｎが入力ファイルのブロック総数Ｎより少ない場合、データベース内のファイルの次のブロックを処理対象とする。また、ステップＳ４３で入力ファイルとデータベース内のファイルのブロック属性とが不一致である場合、ステップＳ４５で入力ファイルとデータベース内のファイルのサイズが数式（２）を満足しない場合、およびステップＳ４７で入力ファイルとデータベース内のファイルが共にＯＣＲ情報を有しない場合、同様にステップＳ５５の処理を行った後、ステップＳ４３の処理に戻る。 And it is discriminate | determined whether the similarity rate comparison was performed about all the blocks of an input file (step S50). If all the blocks are not finished, the process returns to the process of step S43 after performing the process of step S55. In step S55, if N ≦ n, that is, if the total block number n of the file in the database is greater than or equal to the total block number N of the input file, the next block in the input file is processed, while N> n. In some cases, that is, when the total block number n of the file in the database is smaller than the total block number N of the input file, the next block of the file in the database is processed. If the input file and the block attribute of the file in the database do not match in step S43, the size of the input file and the file in the database does not satisfy Expression (2) in step S45, and the input file in step S47. If the file in the database does not have OCR information, the process of step S55 is similarly performed, and then the process returns to step S43.

そして、ステップＳ５０で入力ファイルの全てのブロックについて、類似率の比較を終えた場合、総合類似率を算出する（ステップＳ５１）。算出された総合類似率が予め設定された閾値Ｔｈより高いか否かを判別し（ステップＳ５２）、予め設定された閾値Ｔｈより高い場合、そのファイルを類似候補として保存する（ステップＳ５３）。この後、データベース内の全ファイルを終了したか否かを判別する（ステップＳ５４）。データベース内の全ファイルを終了していない場合、次のファイルを処理対象とし（ステップＳ５６）、ステップＳ４１の処理に戻る。また、ステップＳ５２で、総合類似率が予め設定された閾値Ｔｈ以下である場合、次のファイルを処理対象とし（ステップＳ５６）、ステップＳ４１の処理に戻る。一方、ステップＳ５４でデータベース内の全ファイルを終了した場合、本処理を終了する。尚、ステップＳ４５のサイズ比較処理では、座標Ｘ、Ｙを基に位置情報の比較を行うようにしてもよい。 When the comparison of the similarity ratios is completed for all the blocks of the input file in step S50, the total similarity ratio is calculated (step S51). It is determined whether or not the calculated total similarity is higher than a preset threshold Th (step S52). If the calculated total similarity is higher than a preset threshold Th, the file is stored as a similarity candidate (step S53). Thereafter, it is determined whether or not all files in the database have been completed (step S54). If all the files in the database have not been terminated, the next file is set as the processing target (step S56), and the process returns to step S41. If the total similarity is equal to or smaller than the preset threshold value Th in step S52, the next file is set as a processing target (step S56), and the process returns to step S41. On the other hand, when all the files in the database are finished in step S54, this process is finished. In the size comparison process in step S45, the position information may be compared based on the coordinates X and Y.

こうして検索を行った結果総合類似率（類似度）が閾値Ｔｈより高く、ステップＳ５３で類似候補として保存されたデータベース内のファイルは、前述したステップＳ８でサムネイル画像として表示される。そして、複数の類似候補の中から、必要に応じて操作者の入力操作よりファイルか特定される。 As a result of the search, the total similarity (similarity) is higher than the threshold value Th, and the files in the database stored as similar candidates in step S53 are displayed as thumbnail images in step S8 described above. Then, a file is specified from among a plurality of similar candidates as required by an operator input operation.

［ベクトル化処理］
つぎに、ステップＳ１３におけるベクトル化処理を示す。この処理は、ステップＳ９で電子ファイルを特定できなかった場合、イメージ全体をベクトル化するものである。まず、文字ブロック内の各文字に対する文字認識処理を行う。 [Vectorization processing]
Next, vectorization processing in step S13 will be described. In this process, when the electronic file cannot be specified in step S9, the entire image is vectorized. First, character recognition processing is performed for each character in the character block.

（文字認識）
文字認識処理では、文字単位で切り出された画像に対し、パターンマッチングの手法を用いて認識処理を行い、対応する文字コードを取得する。この認識処理は、文字画像から得られる特徴を数十次元の数値列に変換した観測特徴ベクトルと、予め字種毎に求められている辞書特徴ベクトルとを比較し、最も距離の近い字種を認識結果とする処理である。特徴ベクトルの抽出には、種々の方法が知られており、例えば、文字をメッシュ状に分割し、各メッシュ内の文字線を方向別に線素としてカウントする、メッシュ数次元ベクトルを特徴とする方法が知られている。 (Character recognition)
In the character recognition processing, recognition processing is performed on an image cut out in character units using a pattern matching method, and a corresponding character code is acquired. This recognition process compares an observed feature vector obtained by converting a feature obtained from a character image into a numerical sequence of several tens of dimensions with a dictionary feature vector obtained in advance for each character type, and determines the character type with the closest distance. This is a process for obtaining a recognition result. Various methods are known for extracting feature vectors. For example, a method that features a mesh number-dimensional vector that divides characters into meshes and counts the character lines in each mesh as line elements by direction. It has been known.

ステップＳ２のブロック選択処理で抽出された文字領域に対して文字認識を行う場合、まず、該当する領域に対して横書き／縦書きの判定を行い、各々対応する方向に行を切り出し、その後、文字を切り出して文字画像を得る。ここで、横書き／縦書きの判定は、該当する領域内で画素値に対する水平／垂直の射影を取り、水平射影の分散が大きい場合、横書き領域と判断し、垂直射影の分散が大きい場合、縦書き領域と判断することで行われる。また、文字列および文字への分解は、横書きである場合、水平方向の射影を利用して行を切り出し、さらに切り出された行に対する垂直方向の射影から、文字を切り出すことで行われる。一方、縦書きの文字領域に対する文字列および文字への分解は、水平と垂直を逆にして、横書きの場合と同様に行えばよい。また、このとき、文字のサイズが検出される。 When character recognition is performed on the character area extracted by the block selection process in step S2, first, horizontal / vertical writing is determined for the corresponding area, and a line is cut out in each corresponding direction. To obtain a character image. Here, the horizontal / vertical writing is determined by taking a horizontal / vertical projection of the pixel value in the corresponding area, and determining that the horizontal projection area is large when the horizontal projection dispersion is large. This is done by determining that it is a writing area. In the case of horizontal writing, the character string and the character are decomposed by cutting out a line using a horizontal projection, and further cutting out a character from a vertical projection with respect to the cut out line. On the other hand, the character string and the character for the vertically written character area may be decomposed into characters and characters in the same manner as in the horizontal writing with the horizontal and vertical reversed. At this time, the character size is detected.

（フォント認識）
文字認識の際に用いられる字種数分の辞書特徴ベクトルを、文字形状種（フォント種）に対して複数用意し、マッチングの際、文字コードとともにフォント種を出力することで、文字のフォントが認識可能である。 (Font recognition)
By preparing multiple dictionary feature vectors for the number of character types used for character recognition for the character shape type (font type), and outputting the font type together with the character code at the time of matching, the font of the character Recognizable.

（文字のベクトル化）
文字認識およびフォント認識によって得られた、文字コードおよびフォント情報を基に、予め用意されたアウトラインデータを用いて、文字部分の情報をベクトルデータに変換する。元の原稿がカラーである場合、カラー画像から各文字の色を抽出してベクトルデータとともに記録する。これにより、文字ブロックに属するイメージ情報を、形状、大きさおよび色がほぼ忠実に再現されるベクトルデータに変換できる。 (Character vectorization)
Based on the character code and font information obtained by character recognition and font recognition, the character portion information is converted into vector data using outline data prepared in advance. When the original document is in color, the color of each character is extracted from the color image and recorded together with vector data. Thereby, the image information belonging to the character block can be converted into vector data in which the shape, size and color are reproduced almost faithfully.

（文字以外の部分のベクトル化）
ブロック選択処理において、図形、画像、線、表領域として分離された領域を対象とし、その中で抽出された画素塊の輪郭をベクトルデータに変換する。具体的に、輪郭をなす画素の点列を角とみなされる点で区切り、各区間を部分的な直線あるいは曲線で近似する。図１２は角を曲線で近似する様子を示す図である。ここで、角とは、曲率が極大となる点である。曲率が極大となる点は、任意点Ｐｉに対し、左右ｋ個の離れた点Ｐｉ−ｋ、Ｐｉ＋ｋの間に弦を引いた場合、この弦と点Ｐｉとの距離が極大となる点として求められる。さらに、点Ｐｉ−ｋ、点Ｐｉ＋ｋ間の弦の長さ／弧の長さをＲとし、Ｒの値が閾値以下である点を角とみなすことができる。角によって分割された後の各区間では、直線を点列に対する最小二乗法などを用いてベクトル化し、曲線を３次スプライン関数などを用いてベクトル化する。また、対象が内輪郭を持つ場合、ブロック選択処理で抽出した白画素輪郭の点列を用いて、同様に部分的に直線あるいは曲線で内輪郭を近似する。 (Vectorization of parts other than characters)
In the block selection process, a region separated as a figure, an image, a line, and a table region is targeted, and the contour of the pixel block extracted therein is converted into vector data. Specifically, a point sequence of pixels forming an outline is divided by points regarded as corners, and each section is approximated by a partial straight line or curve. FIG. 12 is a diagram illustrating a state in which corners are approximated by curves. Here, the corner is a point where the curvature becomes a maximum. The point where the curvature is maximum is obtained as the point where the distance between the string and the point Pi becomes maximum when a string is drawn between the left and right points Pi-k and Pi + k with respect to the arbitrary point Pi. It is done. Furthermore, let R be the chord length / arc length between the points Pi-k and Pi + k, and the point where the value of R is equal to or less than the threshold value can be regarded as a corner. In each section after being divided by the corners, a straight line is vectorized using a least square method for a point sequence, and a curve is vectorized using a cubic spline function or the like. When the target has an inner contour, the inner contour is similarly partially approximated by a straight line or a curve using the point sequence of the white pixel contour extracted by the block selection process.

このように、輪郭の区分線近似を用いることにより、任意形状の図形のアウトラインをベクトル化することができる。元の原稿がカラーである場合、カラー画像から図形の色を抽出してベクトルデータとともに記録する。 In this way, the outline of a figure having an arbitrary shape can be vectorized by using the contour lane marking approximation. When the original document is in color, the color of the figure is extracted from the color image and recorded together with the vector data.

図１３は太さを持った線として表現する様子を示す図である。ある区間で外輪郭と、内輪郭あるいは別の外輪郭とが近接している場合、２つの輪郭線をひとまとめにし、太さを持った線として表現することができる。具体的に、ある輪郭の各点Ｐｉから別の輪郭上で最短距離となる点Ｑｉまで線を引き、各距離ＰＱｉが平均して一定の長さ以下である場合、注目区間を、線分ＰＱｉの中点を点列とする直線あるいは曲線で近似し、その太さを線分ＰＱｉの平均値とする。線や線の集合体である表罫線は、このような太さを持つ線の集合として、ベクトルで効率良く表現することができる。 FIG. 13 is a diagram illustrating a state of expressing as a line having a thickness. When an outer contour and an inner contour or another outer contour are close to each other in a certain section, two contour lines can be combined and expressed as a line having a thickness. Specifically, when a line is drawn from each point Pi of a certain contour to a point Qi that is the shortest distance on another contour, and each distance PQi is equal to or less than a certain length on average, the section of interest is defined as a line segment PQi. Is approximated by a straight line or a curve having a middle point as a point sequence, and its thickness is taken as the average value of the line segments PQi. A table ruled line, which is a line or a set of lines, can be efficiently expressed as a set of lines having such a thickness by a vector.

尚、前述したように、文字ブロックにおける文字認識処理を用いてベクトル化する場合、この文字認識処理の結果、辞書からの距離が最も近い文字を認識結果として用いる。この距離が所定値以上である場合、必ずしも本来の文字と一致せず、形状が類似する文字に誤認識されることが多い。そこで、本実施形態では、このような誤認識され易い文字については、一般的な線画と同じように扱って、この文字をアウトライン化する。これにより、従来、文字認識処理で誤認識を起こし易い文字に対しても誤った文字にベクトル化されず、可視的にイメージデータに忠実なアウトラインによるベクトル化を行うことができる。また、写真と判定されたブロックについては、ベクトル化することができないので、イメージデータのままとする。 As described above, when vectorization is performed using the character recognition process in the character block, the character having the closest distance from the dictionary as a result of the character recognition process is used as the recognition result. When this distance is greater than or equal to a predetermined value, it is not always the same as the original character, and is often erroneously recognized as a character having a similar shape. Therefore, in the present embodiment, such a character that is easily misrecognized is handled in the same manner as a general line drawing, and the character is outlined. Thereby, conventionally, even a character that is likely to be erroneously recognized in the character recognition process is not vectorized into an erroneous character, and can be vectorized by an outline that is visually faithful to image data. In addition, a block determined to be a photograph cannot be vectorized, and thus remains as image data.

（図形認識）
つぎに、任意形状の図形の輪郭（アウトライン）をベクトル化した後、これらベクトル化された区分線を図形オブジェクト毎にグループ化する処理を示す。図１４はベクトルデータを図形オブジェクト毎にグループ化する処理手順を示すフローチャートである。まず、各ベクトルデータの始点および終点を算出する（ステップＳ６１）。算出された各ベクトルデータの始点および終点情報を用いて、図形要素を検出する（ステップＳ６２）。ここで、図形要素の検出とは、区分線が構成する閉図形を検出することである。この閉図形を検出する際、閉じた形状（閉図形）を構成する各ベクトルは、その両端にそれぞれ連結されるベクトルを有するという原理を応用する。そして、図形要素内に存在する他の図形要素あるいは区分線をグループ化し、１つの図形オブジェクトとし、一方、図形要素内に他の図形要素あいは区分線が存在しない場合、図形要素を図形オブジェクトとする（ステップＳ６３）。この後、本処理を終了する。 (Figure recognition)
Next, processing for vectorizing the outline (outline) of a figure having an arbitrary shape and then grouping the vectorized dividing lines for each graphic object will be described. FIG. 14 is a flowchart showing a processing procedure for grouping vector data for each graphic object. First, the start point and end point of each vector data are calculated (step S61). A graphic element is detected using the calculated start point and end point information of each vector data (step S62). Here, the detection of the graphic element means detecting a closed graphic formed by the dividing line. When detecting this closed figure, the principle that each vector which comprises a closed shape (closed figure) has the vector respectively connected with the both ends is applied. Then, other graphic elements or dividing lines existing in the graphic element are grouped into one graphic object. On the other hand, if there is no dividing line between other graphic elements in the graphic element, the graphic element is regarded as a graphic object. (Step S63). Then, this process is complete | finished.

図１５は図形要素を検出する処理手順を示すフローチャートである。まず、ベクトルデータを基に、両端に連結されていない不要なベクトルを除去し、閉図形構成ベクトルを抽出する（ステップＳ７１）。閉図形構成ベクトルの中から、このベクトルの始点を開始点とし、時計回りにベクトルを順次追っていく。再び開始点に戻るまで追跡を行い、通過したベクトルを全て１つの図形要素を構成する閉図形としてグループ化する（ステップＳ７２）。このステップＳ７２のグループ処理では、閉図形内部にある閉図形構成ベクトルも全てグループ化する。さらに、まだグループ化されていないベクトルの始点を開始点とし、同様の処理を繰り返す。この後、ステップＳ７１で除去された不要ベクトルのうち、ステップＳ７２で閉図形としてグループ化されたベクトルに接合されているものを検出し、１つの図形要素としてグループ化する（ステップＳ７３）。この後、本処理を終了する。これにより、図形ブロックを個別に再利用可能な個別の図形オブジェクトとして扱うことが可能である。 FIG. 15 is a flowchart showing a processing procedure for detecting a graphic element. First, based on the vector data, unnecessary vectors not connected to both ends are removed, and a closed graphic component vector is extracted (step S71). From the closed figure constituent vector, the starting point of this vector is set as the starting point, and the vectors are sequentially followed clockwise. Tracking is performed until the start point is returned again, and all the passed vectors are grouped as closed figures constituting one figure element (step S72). In the group processing in step S72, all the closed graphic constituent vectors inside the closed graphic are also grouped. Further, the same processing is repeated with the starting point of the vectors not yet grouped as the starting point. Thereafter, among the unnecessary vectors removed in step S71, those joined to the vector grouped as a closed figure in step S72 are detected and grouped as one figure element (step S73). Then, this process is complete | finished. Thereby, it is possible to handle the graphic block as an individual graphic object that can be reused individually.

（アプリデータへの変換処理）
前述したブロック選択処理（ステップＳ２参照）で、１頁分のイメージデータをブロック化し、ベクトル化処理（ステップＳ１３参照）を行った結果、イメージデータは、中間データ形式のファイルに変換されている。この中間データ形式は、ドキュメント・アナリシス・アウトプット・フォーマット（ＤＡＯＦ）形式と呼ばれる。図１６はＤＡＯＦ形式のデータ構造を示す図である。ＤＡＯＦ形式のデータ構造は、ヘッダ（ｈｅａｄｅｒ）７９１、レイアウト記述データ部７９２、文字認識記述データ部７９３、表記述データ部７９４および画像記述データ部７９５からなる。 (Conversion to application data)
In the block selection process (see step S2), the image data for one page is blocked and vectorized (see step S13). As a result, the image data is converted into an intermediate data format file. This intermediate data format is referred to as a document analysis output format (DAOF) format. FIG. 16 is a diagram showing a data structure in the DAOF format. The data structure of the DAOF format includes a header 791, a layout description data portion 792, a character recognition description data portion 793, a table description data portion 794, and an image description data portion 795.

Ｈｅａｄｅｒ７９１には、処理対象の文書画像データに関する情報が保持される。レイアウト記述データ部７９２は、文書画像データ中のＴＥＸＴ（文字）、ＴＩＴＬＥ（タイトル）、ＣＡＰＴＩＯＮ（キャプション）、ＬＩＮＥＡＲＴ（線画）、ＰＩＣＴＵＲＥ（自然画）、ＦＲＡＭＥ（枠）、ＴＡＢＬＥ（表）等の属性毎に認識された各ブロックの属性情報およびその矩形アドレス情報を保持する。文字認識記述データ部７９３は、ＴＥＸＴ、ＴＩＴＬＥ、ＣＡＰＴＩＯＮ等のＴＥＸＴブロックを文字認識して得られる文字認識結果を保持する。表記述データ部７９４は、ＴＡＢＬＥブロックの構造の詳細を格納する。画像記述データ部７９５は、ＰＩＣＴＵＲＥやＬＩＮＥＡＲＴ等のブロックのイメージデータを文書画像データから切り出して保持する。 A header 791 holds information regarding document image data to be processed. The layout description data portion 792 includes attributes such as TEXT (character), TITLE (title), CAPTION (caption), LINEART (line drawing), PICTURE (natural image), FRAME (frame), and TABLE (table) in the document image data. The attribute information of each block recognized every time and its rectangular address information are held. The character recognition description data portion 793 holds a character recognition result obtained by character recognition of a TEXT block such as TEXT, TITLE, and CAPTION. The table description data portion 794 stores details of the structure of the TABLE block. The image description data portion 795 cuts out image data of blocks such as PICTURE and LINEART from the document image data and holds them.

このようなＤＡＯＦは、中間データとしてのみならず、それ自体がファイル化されて保存される場合もあるが、このファイルの状態では、いわゆる一般の文書作成アプリケーションによって個々のオブジェクトを再利用することはできない。そこで、このＤＡＯＦ形式のデータからアプリケーションデータ（単に、アプリデータという）に変換する処理（ステップＳ１４参照）について示す。 Such a DAOF is not only used as intermediate data but may be stored as a file itself. In this file state, it is not possible to reuse individual objects by a so-called general document creation application. Can not. Therefore, a process (see step S14) for converting the data in the DAOF format into application data (simply referred to as application data) will be described.

図１７はステップＳ１４におけるアプリケーションデータの生成処理手順を示すフローチャートである。まず、ＤＡＯＦを入力する（ステップＳ８１）。アプリデータの元となる文書構造ツリーを生成する（ステップＳ８２）。文書構造ツリーを基に、ＤＡＯＦに実データを挿入し、実際のアプリデータを生成する（ステップＳ８３）。この後、本処理を終了する。 FIG. 17 is a flowchart showing the application data generation processing procedure in step S14. First, DAOF is input (step S81). A document structure tree that is the source of application data is generated (step S82). Based on the document structure tree, actual data is inserted into the DAOF to generate actual application data (step S83). Then, this process is complete | finished.

図１８は文書構造ツリー生成処理手順を示すフローチャートである。図１９は文書構造ツリーを示す図である。全体制御の基本ルールとして、ミクロブロック（単一ブロック）からマクロブロック（ブロックの集合体）に移行するように処理が行われる。ここでは、ブロックとは、ミクロブロックおよびマクロブロック全体を指す。 FIG. 18 is a flowchart showing a document structure tree generation processing procedure. FIG. 19 shows a document structure tree. As a basic rule of overall control, processing is performed so as to shift from a micro block (a single block) to a macro block (an aggregate of blocks). Here, the block refers to the micro block and the entire macro block.

まず、ブロック単位で縦方向の関連性を基に再グループ化する（ステップＳ９１）。スタート直後、ミクロブロック単位で判定が行われる。ここで、関連性とは、距離が近いこと、ブロック幅（横方向の場合、ブロック高さ）がほぼ同一であること等によって、定義される。また、距離、幅、高さなどの情報は、ＤＡＯＦを参照して抽出される。 First, regrouping is performed on a block-by-block basis based on the vertical relationship (step S91). Immediately after the start, a determination is made in units of micro blocks. Here, the relevance is defined by the fact that the distance is close and the block width (the block height in the case of the horizontal direction) is substantially the same. Information such as distance, width, and height is extracted with reference to DAOF.

図１９（Ａ）は実際のページ構成を示し、図１９（Ｂ）はその文書構造ツリーを示す。ステップＳ９１のグルーピングの結果、ブロックＴ３、Ｔ４、Ｔ５が１つのグループとなるブロックＶ１、およびブロックＴ６、Ｔ７が１つのグループとなるブロックＶ２が同じ階層のグループとして生成される。 FIG. 19A shows the actual page configuration, and FIG. 19B shows the document structure tree. As a result of the grouping in step S91, the block V1 in which the blocks T3, T4, and T5 form one group and the block V2 in which the blocks T6 and T7 form one group are generated as a group in the same hierarchy.

縦方向のセパレータの有無をチェックする（ステップＳ９２）。ここで、セパレータとは、例えば、物理的にＤＡＯＦ中でライン属性を持つオブジェクトである。また論理的な意味としては、アプリケーション中、明示的にブロックを分割する要素である。セパレータを検出した場合、同じ階層で再分割する。 The presence / absence of a vertical separator is checked (step S92). Here, the separator is, for example, an object physically having a line attribute in the DAOF. Also, logically, it is an element that explicitly divides a block in an application. If a separator is detected, it is subdivided at the same level.

縦方向のグループ長を利用し、縦方向のグルーピング長がページ高さであるか否か、つまり分割がこれ以上存在し得ないか否かを判定する（ステップＳ９３）。縦方向のグループ長がページ高さとなっている場合、文書構造ツリー生成処理を終了する。 Using the group length in the vertical direction, it is determined whether or not the grouping length in the vertical direction is the page height, that is, whether there are no more divisions (step S93). When the group length in the vertical direction is the page height, the document structure tree generation process ends.

一方、縦方向のグループ長がページ高さとなっていない場合、ステップＳ９４の処理に移行する。図１９では、セパレータもなく、グループ高さはページ高さとなっていないので、ステップＳ９４の処理に進む。 On the other hand, if the group length in the vertical direction is not the page height, the process proceeds to step S94. In FIG. 19, since there is no separator and the group height is not the page height, the process proceeds to step S94.

そして、ブロック単位で横方向の関連性を基に再グループ化する（ステップＳ９４）。この場合も、スタート直後の第１回目では、ミクロブロック単位で判定を行うことになる。また、関連性およびその判定情報の定義は、縦方向の場合と同じである。図１９では、ブロックＴ１、Ｔ２がブロックＴ１、Ｔ２の１つ上の同じ階層のＨ１グループとして、ブロックＶ１、Ｖ２がブロックＶ１、Ｖ２の１つ上の同じ階層のＨ２グループとして生成される。 Then, regrouping is performed on a block basis based on the relevance in the horizontal direction (step S94). Also in this case, in the first time immediately after the start, the determination is performed in units of micro blocks. The definition of the relevance and the determination information is the same as in the vertical direction. In FIG. 19, blocks T1 and T2 are generated as H1 groups in the same hierarchy one above blocks T1 and T2, and blocks V1 and V2 are generated as H2 groups in the same hierarchy one above blocks V1 and V2.

横方向セパレータの有無をチェックする（ステップＳ９５）。図１９では、セパレータＳ１が存在するので、これをツリーに登録し、ブロックＨ１、Ｓ１、Ｈ２という階層が生成される。 The presence / absence of a horizontal separator is checked (step S95). In FIG. 19, since the separator S1 exists, it is registered in the tree, and a hierarchy of blocks H1, S1, and H2 is generated.

横方向のグループ長を利用し、横方向のグルーピング長がページ幅であるか否か、つまり分割がこれ以上存在し得ないか否かを判定する（ステップＳ９６）。横方向のグループ長がページ幅となっている場合、文書構造ツリー生成処理を終了する。 Using the group length in the horizontal direction, it is determined whether or not the grouping length in the horizontal direction is the page width, that is, whether there are no more divisions (step S96). If the horizontal group length is the page width, the document structure tree generation process is terminated.

一方、横方向のグループ長がページ幅となっていない場合、ステップＳ９１に戻り、再びもう一段上の階層に対し、縦方向の関連性チェックから、同様の処理を繰り返す。図１９では、分割幅がページ幅になっているので、ここで処理を終了し、最後にページ全体を表す最上位階層のブロックＶ０が文書構造ツリーに付加される。 On the other hand, if the group length in the horizontal direction is not the page width, the process returns to step S91, and the same process is repeated from the relevance check in the vertical direction for the next higher layer again. In FIG. 19, since the division width is the page width, the processing ends here, and finally the highest-level block V0 representing the entire page is added to the document structure tree.

文書構造ツリーが完成した後、その情報を基に、ステップＳ８３でアプリデータの生成が行われる。図１９の場合、ブロックＨ１には、横方向に２つのブロックＴ１、Ｔ２があるので、２カラムとして出力し、ブロックＴ１の内部情報（ＤＡＯＦの参照、文字認識結果の文章、画像など）を出力した後、カラムを変え、ブロックＴ２の内部情報を出力した後、セパレータＳ１の出力となる。また、ブロックＨ２には、横方向に２つのブロックＶ１、Ｖ２があるので、２カラムとして出力し、ブロックＶ１では、ブロックＴ３、Ｔ４、Ｔ５の順にその内部情報を出力した後、カラムを変え、ブロックＶ２のブロックＴ６、Ｔ７の内部情報を出力する。こうしてアプリデータへの変換処理が行われる。 After the document structure tree is completed, application data is generated in step S83 based on the information. In the case of FIG. 19, since the block H1 has two blocks T1 and T2 in the horizontal direction, it is output as two columns, and internal information of the block T1 (DAOF reference, text of character recognition result, image, etc.) is output. After that, the column is changed, the internal information of the block T2 is output, and then the separator S1 is output. In addition, since there are two blocks V1 and V2 in the horizontal direction in the block H2, the data is output as two columns. In the block V1, the internal information is output in the order of the blocks T3, T4, and T5, and then the columns are changed. The internal information of the blocks T6 and T7 of the block V2 is output. In this way, conversion processing to application data is performed.

［ポインタ情報の付加］
つぎに、ステップＳ１５におけるポインタ情報の付加処理を示す。処理すべき文書が検索処理で特定された場合、あるいはベクトル化によって元のファイルが再生できた場合、この文書を記録処理する（紙への記録）際、ポインタ情報を付与することで、この印刷文書を用いて再度、各種処理を行う際、簡単に元のファイルデータを取得できる。 [Add pointer information]
Next, pointer information addition processing in step S15 will be described. When the document to be processed is specified by the search process, or when the original file can be reproduced by vectorization, this printing is performed by adding pointer information when recording the document (recording on paper). When performing various processes again using a document, the original file data can be easily obtained.

図２０はステップＳ１５におけるポインタ情報の付加処理手順を示すフローチャートである。このポインタ情報の付加処理では、ポインタ情報としてのデータ文字列を２次元バーコードシンボル（ＱＲコード：ＪＩＳＸ０５１０）３１１に符号化し、画像中に付加する。２次元バーコードに組み込まれるデータは、対応する電子ファイルのアドレス情報を表しており、例えばファイルサーバ名およびファイル名からなるパス情報で構成される。あるいは、対応する電子ファイルへのＵＲＬ、対応する電子ファイルが格納されているデータベース１０５、１１５あるいは複合機１００内の記憶部２１１で管理されるファイルＩＤ等で構成される。 FIG. 20 is a flowchart showing the pointer information addition processing procedure in step S15. In this pointer information addition processing, a data character string as pointer information is encoded into a two-dimensional barcode symbol (QR code: JIS X0510) 311 and added to the image. The data incorporated in the two-dimensional barcode represents the address information of the corresponding electronic file, and is composed of, for example, path information including a file server name and a file name. Alternatively, the URL includes a URL to the corresponding electronic file, the database 105 or 115 in which the corresponding electronic file is stored, or a file ID managed by the storage unit 211 in the multifunction peripheral 100.

まず、符号化する種々の異なる文字を識別するため、入力データ列を分析する（ステップＳ１０１）。また、誤り検出および誤り訂正レベルを選択し、入力データが収容可能な最小型番を選択する。 First, in order to identify various different characters to be encoded, the input data string is analyzed (step S101). Further, an error detection and error correction level is selected, and a minimum model number that can accommodate input data is selected.

入力データ列を所定のビット列に変換し、必要に応じてデータのモード（数字、英数字、８ビットバイト、漢字等）を表す指示子や終端パターンを付加する。さらに所定のビットコード語に変換する（ステップＳ１０２）。誤り訂正を行うため、コード語列を型番および誤り訂正レベルに応じて所定のブロック数に分割し、各ブロック毎に誤り訂正コード語を生成し、データコード語列の後に付加する（ステップＳ１０３）。 An input data string is converted into a predetermined bit string, and an indicator or termination pattern indicating a data mode (numeric, alphanumeric, 8-bit byte, kanji, etc.) is added as necessary. Further, it is converted into a predetermined bit code word (step S102). In order to perform error correction, the code word string is divided into a predetermined number of blocks according to the model number and the error correction level, an error correction code word is generated for each block, and is added after the data code word string (step S103). .

ステップＳ１０３で得られた各ブロックのデータコード語を接続し、各ブロックの誤り訂正コード語、必要に応じて剰余コード語を後続させる（ステップＳ１０４）。マトリクスに、位置検出パターン、分離パターン、タイミングパターンおよび位置合わせパターン等とともにコード語モジュールを配置する（ステップＳ１０５）。 The data code words of the respective blocks obtained in step S103 are connected, and the error correction code word of each block and, if necessary, the remainder code word are followed (step S104). A code word module is arranged in the matrix together with a position detection pattern, a separation pattern, a timing pattern, an alignment pattern, and the like (step S105).

シンボルの符号化領域に対して最適なマスクパターンを選択し、マスク処理パターンをステップＳ１０５で得られたモジュールにＸＯＲ演算により変換する（ステップＳ１０６）。ステップＳ１０６で得られたモジュールに形式情報および型番情報を生成し、２次元コードシンボルを完成する（ステップＳ１０７）。この後、本処理を終了する。 An optimal mask pattern is selected for the symbol coding region, and the mask processing pattern is converted into a module obtained in step S105 by an XOR operation (step S106). Format information and model number information are generated in the module obtained in step S106 to complete a two-dimensional code symbol (step S107). Then, this process is complete | finished.

こうしてアドレス情報が組み込まれた２次元バーコードは、例えばクライアントＰＣ１０２により、電子ファイルをプリントデータとして、記録部２１２の用紙上に記録画像として形成される場合、データ処理部２１５によって記録可能なラスタデータに変換された後、ラスタデータ上の所定の個所に付加され、画像が形成される。ここで、画像が形成された用紙が配布されたユーザは、画像読み取り部２１０によって画像を読み取ることにより、ステップＳ４でポインタ情報からオリジナル電子ファイルの格納場所を検出することができる。 The two-dimensional barcode in which the address information is incorporated in this way is raster data that can be recorded by the data processing unit 215 when the client PC 102 forms an electronic file as print data and as a recorded image on a sheet of the recording unit 212, for example. Then, the image is added to a predetermined location on the raster data to form an image. Here, the user to whom the paper on which the image is formed is distributed can detect the storage location of the original electronic file from the pointer information in step S4 by reading the image with the image reading unit 210.

尚、付加情報を付与する方法としては、本実施形態で示した２次元バーコードの他、例えば、ポインタ情報を直接文字列で文書に付加する方法、文書内の文字列、特に文字と文字の間隔を変調して情報を埋め込む方法、文書中の中間調画像中に埋め込む方法など、一般に、電子透かしと呼ばれる方法を適用することができる。 In addition to the two-dimensional bar code shown in the present embodiment, for example, a method of adding pointer information directly to a document as a character string, a character string in a document, particularly a character and character In general, a method called digital watermark can be applied, such as a method of embedding information by modulating an interval, or a method of embedding in a halftone image in a document.

［ファイル形式の選択と指定］
上記処理により、オリジナル電子ファイルを特定できなかった場合であっても、あたかも元のアプリケーションで作成したのと同等な電子ファイルを得ることができる。ただし、ここで生成されるベクトル化された電子ファイルのファイル形式は、あくまで予め決められたフォーマットに限られる。実際に想定されるアプリケーションは種々様々であり、どのような形式で電子化するかによって、ユーザの利便性は大きく変わってくることになる。 [Select and specify file format]
Even if the original electronic file cannot be specified by the above processing, an electronic file equivalent to that created by the original application can be obtained. However, the file format of the vectorized electronic file generated here is limited to a predetermined format. There are a wide variety of applications that are actually assumed, and the convenience of the user will vary greatly depending on the type of computerization.

そこで、本実施形態では、生成されるファイル形式をユーザが選択可能となるような、ユーザインターフェースを提供する。このユーザによる選択処理は、ステップＳ１３のベクトル化処理の開始前や、ステップＳ１のイメージ情報入力処理の開始前などで行われる。図２１は出力ファイル形式を選択する操作画面を示す図である。図２１の操作画面には、複合機１００もしくはマネージメントＰＣ１０１のユーザ操作画面に表示されるメッセージが示されている。ここでは、「出力ファイル形式選択画面」となっており、ベクトル化処理された電子データをいずれの形式で書き出すかをユーザが指定できるようになっている。複数のファイル形式の中から選択可能になっており、ＯＮまたはＯＦＦのチェックボックスにより、指定した形式で書き出すように動作が行われる。ここでは、「プレーンテキスト形式」、「ワープロ形式」、「表形式」、「プレゼンテーション形式」、「ＳＶＧ」あるいは「ＲＴＦ」の形式に選択可能である。本実施形態では、ワープロ形式としてＷＯＲＤ（登録商標）形式、表形式としてＥｘｃｅｌ（登録商標）形式、プレゼンテーション形式としてＰｏｗｅｒＰｏｉｎｔ（登録商標）形式を例示するが、その他の形式であってもよいことは勿論である。また、複数の形式を指定した場合、それぞれの形式に則った複数の電子データが出力されることになる。 Therefore, in this embodiment, a user interface is provided so that the user can select a file format to be generated. This selection process by the user is performed before the start of the vectorization process in step S13 or before the start of the image information input process in step S1. FIG. 21 is a diagram showing an operation screen for selecting an output file format. The operation screen of FIG. 21 shows a message displayed on the user operation screen of the multifunction device 100 or the management PC 101. Here, an “output file format selection screen” is displayed, and the user can specify in which format the electronic data subjected to vectorization processing is to be written. It is possible to select from a plurality of file formats, and an operation is performed so as to write in a specified format by an ON or OFF check box. Here, a “plain text format”, “word processor format”, “table format”, “presentation format”, “SVG”, or “RTF” format can be selected. In this embodiment, the WORD (registered trademark) format is exemplified as the word processor format, the Excel (registered trademark) format is exemplified as the table format, and the PowerPoint (registered trademark) format is exemplified as the presentation format, but other formats may be used. It is. When a plurality of formats are specified, a plurality of electronic data according to each format is output.

ここでは、ＷＯＲＤ（登録商標）形式とＳＶＧ形式を選択しているので、生成されたベクトル化データは、これら２つの形式でデータ格納領域に出力されることになる。また、本実施形態では、選択された各形式毎に最適なベクトル化処理が適用される。 Here, since the WORD (registered trademark) format and the SVG format are selected, the generated vectorized data is output to the data storage area in these two formats. In the present embodiment, an optimal vectorization process is applied to each selected format.

例えば、プレーンテキスト形式だけを選択した場合、この形式で扱えるデータは文字コード情報だけであるので、ベクトル化処理は、最低限のレベルしか適用されず、ＯＣＲによる文字画像のコード化処理だけを行うことで、十分である。 For example, when only the plain text format is selected, the only data that can be handled in this format is character code information. Therefore, the vectorization processing is applied only to the minimum level, and only the character image encoding processing by OCR is performed. That is enough.

また、ＷＯＲＤ（登録商標）形式やＲＴＦ形式を選択した場合、文字コード情報、表形式の枠／罫線情報、ビットマップ画像情報を混在して扱うことが可能であるので、ＯＣＲによる文字コード化、表データの枠と罫線の直線ベクトル化、およびレイアウトされたビットマップ画像の切り出し処理などが実行される。逆に、図形アウトラインの関数近似処理などを行わないようにする。 In addition, when the WORD (registered trademark) format or the RTF format is selected, character code information, tabular frame / ruled line information, and bitmap image information can be mixedly handled. Table data frame and ruled line vectorization, layout bitmap image cutout processing, and the like are executed. On the other hand, the function of the figure outline is not approximated.

また、ＳＶＧ形式を選択した場合、ほとんど全てのベクトル化情報を定義することが可能であるので、前述したベクトル化処理における全ての処理を実行し、ベクトル化したデータをＳＶＧ言語に変換して出力するようにする。 When the SVG format is selected, almost all vectorization information can be defined. Therefore, all the vectorization processes described above are executed, and the vectorized data is converted into the SVG language and output. To do.

また、処理の実行／非実行を制御するだけでなく、処理の優先度を変える場合も、想定することができる。例えば、Ｅｘｃｅｌ（登録商標）形式を選択した場合、入力画像が縦の罫線、横の罫線で表現された表形式である場合が多いと予想されるので、ベクトル化処理において、ＢＳ処理で分割されたオブジェクト毎に、縦の直線および横の直線を抽出する処理を優先させるようにする。このように処理した結果、表以外の図形部でも、縦／横の直線部が抽出されやすくなり、図形の塊としての抽出精度が若干下がることになるが、表の抽出精度が高くなるので、結果的には好ましい出力が得られることになる。 In addition to controlling execution / non-execution of processing, it is possible to assume a case where the priority of processing is changed. For example, when the Excel (registered trademark) format is selected, it is expected that the input image is often a table format expressed by vertical ruled lines and horizontal ruled lines. For each object, priority is given to the process of extracting vertical and horizontal straight lines. As a result of processing in this manner, even in graphic parts other than the table, vertical / horizontal straight line parts are easily extracted and the extraction accuracy as a block of figures is slightly lowered, but the extraction accuracy of the table is increased. As a result, a preferable output can be obtained.

また、ＰｏｗｅｒＰｏｉｎｔ（登録商標）形式を選択した場合、矩形や円などの図形情報が多く含まれていると予想されるので、そのような図形抽出処理を優先的に行うようにすることも可能である。この場合、円や矩形などの図形情報は、それらを定義するコマンド形式のデータ（Ｃｉｒｃｌｅ、Ｓｑｕａｒｅなど）として出力されることになる。 In addition, when the PowerPoint (registered trademark) format is selected, it is expected that a lot of graphic information such as rectangles and circles is included. Therefore, it is possible to preferentially perform such graphic extraction processing. is there. In this case, graphic information such as circles and rectangles is output as command format data (Circle, Square, etc.) that defines them.

図２２はベクトル化処理の変更方式を示す図である。この例では、ページ８０１上に、実線のラインで番号付けされた複数のオブジェクトを含むような文書画像が示されている。ページ８０１全体は、画像読み取り部２１０からデジタル的なラスタ画像データとして入力され、ＢＳ処理（ステップＳ２参照）で各オブジェクトに分割された後、各オブジェクトがベクトル化処理対象となるものとする。これらの各オブジェクトが、出力ファイル形式によって、どのようなベクトル化処理を受けるかを示す。
（ａ）プレーンテキスト形式が指定された場合
この場合、ベクトル化の対象となるのは文字列だけである。図中、符号８０３、８０６、８０９、８１３で示される各文字列のオブジェクトだけがベクトル化の対象となり、ＯＣＲ処理により文字コード化される。それ以外のオブジェクトは無視される。
（ｂ）ＰｏｗｅｒＰｏｉｎｔ、ＷＯＲＤもしくはＲＴＦ形式が指定された場合
文字のコード化と基本図形への置き換え処理を行う。文字列は、上記（ａ）と同様、ＯＣＲ処理により文字コードに変換される。楕円形のオブジェクト８０２、８１２は、図形のアウトライン情報に基づき、最適な長径／短径を持つ楕円図形に置き換えられる。長方形オブジェクト８０５およびひし形オブジェクト８０８も、同様に変換される。矢印８０４、８０７、８１１は、始点および終点の座標に基づき、直線ベクトルに置き換えられ、かつ先端に矢印が付加されていることから、終点に矢印を持つ直線図形として定義される。曲線８１０は、始点から終点までのアウトラインに基づき、短い直線の集合としてベクトル化され、かつ同様に終点に矢印マークが付与される。 FIG. 22 is a diagram showing a changing method of vectorization processing. In this example, a document image including a plurality of objects numbered by solid lines is shown on a page 801. The entire page 801 is input as digital raster image data from the image reading unit 210 and divided into objects by BS processing (see step S2), and then each object becomes a vectorization processing target. It shows what kind of vectorization processing each of these objects is subjected to depending on the output file format.
(A) When plain text format is specified In this case, only a character string is subject to vectorization. In the figure, only the objects of the respective character strings indicated by reference numerals 803, 806, 809, and 813 are subject to vectorization, and are character-coded by OCR processing. Other objects are ignored.
(B) When PowerPoint, WORD, or RTF format is specified Character encoding and replacement with basic graphics are performed. The character string is converted into a character code by OCR processing as in (a) above. The elliptical objects 802 and 812 are replaced with an elliptical figure having an optimum major axis / minor axis based on the outline information of the figure. The rectangular object 805 and the diamond object 808 are similarly converted. The arrows 804, 807, and 811 are defined as straight lines having an arrow at the end point because they are replaced with straight line vectors based on the coordinates of the start point and end point, and an arrow is added at the end. The curve 810 is vectorized as a set of short straight lines based on the outline from the start point to the end point, and similarly, an arrow mark is given to the end point.

（ｃ）ＳＶＧ形式が選択された場合
文字列の処理は、上記（ａ）、（ｂ）と同様であるが、ここではＳＶＧ特有の処理としてフォント形状のベクトル化処理が合わせて行われる。すなわち、文字列を１文字単位に分解し、各文字のアウトライン情報に基づき、ＳＶＧフォント定義を生成する。このフォント情報がＯＣＲによる文字コード情報と一体として定義されるようにする。このような処理を行うと、文字列に特殊なフォント形式が使われているような文書であっても、その文字形状を保存したまま、ベクトル化処理することが可能になる。 (C) When SVG format is selected The character string processing is the same as (a) and (b) above, but here, font shape vectorization processing is also performed as processing unique to SVG. That is, the character string is decomposed into character units, and the SVG font definition is generated based on the outline information of each character. This font information is defined integrally with the character code information by OCR. By performing such processing, even a document in which a special font format is used for a character string can be vectorized while the character shape is preserved.

また、各図形オブジェクト８０２、８０５、８０８、８１２に対しては、図形データに変換せず、アウトラインデータのベジエ関数近似を行うようにすることも可能である。同様に、曲線８１０に対しても、アウトラインをベジエ関数に近似することで、滑らかな曲線をそのままベクトル化することが可能となる。矢印の処理は、上記（ｂ）と同様である。 Further, it is possible to perform Bezier function approximation of outline data for each graphic object 802, 805, 808, and 812 without converting to graphic data. Similarly, a smooth curve can be vectorized as it is by approximating the outline to the curve 810 with a Bezier function. The processing of the arrow is the same as (b) above.

このように、指定された出力ファイル形式に応じて、各種ベクトル化処理を適宜施すことが可能となる。また、生成されるベクトルデータに対する、アプリケーションの使用による再利用性がさらに高まることとなる。上記処理方法はあくまで一例であり、その他様々な処理方法が考えられることは勿論である。また、それぞれ選択されたファイル形式は、それぞれ独自の文書構造ツリー情報、ヘッダ情報、データ格納形式を有しているが、選択された形式に応じた形式のアプリデータが生成されることは勿論である。 As described above, various vectorization processes can be appropriately performed according to the designated output file format. In addition, the reusability of the generated vector data by using the application is further enhanced. The above processing method is merely an example, and various other processing methods can of course be considered. Each selected file format has its own document structure tree information, header information, and data storage format. Of course, application data in a format corresponding to the selected format is generated. is there.

第１の実施形態の画像処理システムによれば、原稿を読み取り走査することによって得られたイメージ情報から、この原稿の電子ファイルを特定する際、電子ファイルを特定できなかった場合、イメージ情報からベクトルデータを生成してアプリケーションファイルに変換することができる。したがって、あたかもオリジナルの電子ファイルを入手した場合と同様の効果を得ることができる。また、ユーザがアプリケーションファイルの形式を指定することにより、ユーザが希望する任意のアプリケーションファイルを生成できる。しかも、このとき行われるベクトル化処理は、指定されたアプリケーションで最適となるように設定されるので、ユーザにとっては望ましい変換結果を得ることができる。 According to the image processing system of the first embodiment, when specifying an electronic file of an original from image information obtained by reading and scanning the original, if the electronic file cannot be specified, a vector is calculated from the image information. Data can be generated and converted to application files. Therefore, it is possible to obtain the same effect as if the original electronic file was obtained. In addition, an application file desired by the user can be generated by designating the format of the application file by the user. In addition, the vectorization process performed at this time is set so as to be optimal for the designated application, so that a conversion result desirable for the user can be obtained.

［第２の実施形態］
扱われる文書ファイルの中には、第三者による再利用を制限すべきものがある。前記第１の実施形態では、ファイルサーバに蓄積されたファイルは、全て自由にアクセス可能であり、ファイル全体あるいはその一部のオブジェクトが全て再利用可能であることを前提とした。そこで、第２の実施形態では、ポインタ情報からファイルを検索した際、検索の結果、特定したファイルにアクセス権の制限が有る場合を示す。図２３は第２の実施形態のステップＳ６Ａにおけるポインタ情報を基に電子ファイルを検索する処理手順を示すフローチャートである。ステップＳ１２１〜Ｓ１２４までの処理は、前記第１の実施形態の図９におけるステップＳ３１〜Ｓ３４までの処理と同様であるので、その説明を省略する。 [Second Embodiment]
Some document files to be handled should limit reuse by third parties. In the first embodiment, it is assumed that all the files stored in the file server can be freely accessed, and the entire file or a part of its objects can be reused. Therefore, in the second embodiment, when a file is searched from pointer information, a case is shown in which the access right is restricted for the specified file as a result of the search. FIG. 23 is a flowchart showing a processing procedure for searching for an electronic file based on the pointer information in step S6A of the second embodiment. Since the processing from step S121 to S124 is the same as the processing from step S31 to S34 in FIG. 9 of the first embodiment, the description thereof is omitted.

ステップＳ１２４でファイルが特定された場合、ファイルサーバに、そのファイルのアクセス権情報の有無を調べさせる（ステップＳ１２５）。アクセス制限がある場合、パスワードの送信要求を受けると（ステップＳ１２６）、複合機は、操作者に対してパスワードの入力を促し、入力されたパスワードをファイルサーバに送信する（ステップＳ１２７）。 If the file is specified in step S124, the file server is checked for the presence of access right information for the file (step S125). If there is an access restriction, upon receiving a password transmission request (step S126), the multifunction device prompts the operator to input the password, and transmits the input password to the file server (step S127).

ファイルサーバによる、送信されたパスワードの照合結果を判別する（ステップＳ１２８）。照合の結果、パスワード不一致で認証失敗である場合、ステップＳ１２６の処理に戻る。一方、パスワード一致で認証成功である場合、ファイルのアドレスを通知すると共に、ユーザ希望の処理が画像ファイルデータの取得である場合、ファイルサーバに対し、該当するファイルの転送を指示する（ステップＳ１２９）。この後、本処理を終了し、メインの処理に復帰する。 The verification result of the transmitted password by the file server is determined (step S128). As a result of the verification, if the password does not match and the authentication fails, the process returns to step S126. On the other hand, if the password matches and the authentication is successful, the file address is notified, and if the user-desired process is acquisition of image file data, the file server is instructed to transfer the corresponding file (step S129). . Thereafter, this process is terminated and the process returns to the main process.

尚、アクセス権を制御する際の認証の方法は、ステップＳ１２６、Ｓ１２７で示したパスワードによる方法に限定されず、例えば指紋認証など、一般に広く用いられている生体認証、カードによる認証など、全ての認証手段を用いることが可能である。また、第２の実施形態では、紙文書に付与されたポインタ情報により、ファイルを特定した場合の実施形態を示したが、ステップＳ７、Ｓ８に示すように、ファイル検索処理でファイルを特定した場合においても、同様に適用可能である。 The authentication method for controlling the access right is not limited to the password method shown in steps S126 and S127. For example, all authentication methods such as fingerprint authentication and widely used biometric authentication and card authentication are available. Authentication means can be used. In the second embodiment, the embodiment is shown in which the file is specified by the pointer information given to the paper document. However, as shown in steps S7 and S8, the file is specified by the file search process. The same applies to the above.

一方ファイルサーバによるファイルの特定ができなかった場合、ステップＳ１３のベクトル化処理に対しても、制限を加えることが可能である。すなわち、紙文書を走査して得られたイメージ情報から、この紙文書に対するアクセス権の制限の存在を検出した場合、認証確認が取れた場合だけ、ベクトル化処理を行うことで、機密性の高い文書の使用に制限を加えることができる。 On the other hand, when the file cannot be specified by the file server, it is possible to add a restriction to the vectorization process in step S13. In other words, when the presence of a restriction on the access right to this paper document is detected from the image information obtained by scanning the paper document, the vectorization process is performed only when authentication is confirmed, thereby providing high confidentiality. Limit the use of documents.

［第３の実施形態］
前記第１の実施形態では、ファイル検索処理で、元の電子ファイルを特定できなかった場合、イメージ画像全体に対してベクトル化処理を行うが、例えば、一般の文書の場合、文書中のオブジェクト全てが新規に作成されたものでなく、一部のオブジェクトは他のファイルから流用して作成されたものである場合がある。 [Third Embodiment]
In the first embodiment, when the original electronic file cannot be specified by the file search process, the vectorization process is performed on the entire image image. For example, in the case of a general document, all objects in the document are processed. Are not newly created, and some objects may be created by diverting from other files.

例えば、背景オブジェクト（壁紙）として、文書作成アプリケーションで、いくつかのパターンが予め用意されており、その中から選択して用いることが通常である。このようなオブジェクトは、文書ファイルデータベースの中の他の文書ファイル中に存在している可能性が高く、また再利用可能なベクトルデータとして存在する可能性が高い。 For example, as a background object (wallpaper), several patterns are prepared in advance by a document creation application, and it is normal to select and use them. Such an object is likely to exist in other document files in the document file database, and is likely to exist as reusable vector data.

したがってこのような背景から、ステップＳ１３におけるベクトル化処理として、ブロック選択処理で、個別のオブジェクトに分割された各オブジェクトに対し、このオブジェクト単位でデータベースの中から、一致するオブジェクトを一部に含むファイルを検索し、一致したオブジェクトに対し、個別にファイルからオブジェクト単位でベクトルデータを取得する。これにより、文書全体をベクトル化する必要が無くなり、より高速にベクトル化処理を行うことができ、さらにベクトル化による画質劣化を防止できる。 Therefore, from such a background, as a vectorization process in step S13, for each object divided into individual objects in the block selection process, a file partially including a matching object from the database in this object unit Are retrieved, and vector data is obtained for each matching object individually from the file in units of objects. As a result, it is not necessary to vectorize the entire document, vectorization processing can be performed at a higher speed, and image quality deterioration due to vectorization can be prevented.

また、第３の実施形態では、出力ファイル形式を図２１に示すように指定した場合、検索処理の対象を選択した形式のファイルに限定することも可能である。例えば、ＰｏｗｅｒＰｏｉｎ（登録商標）形式を選択した場合、ファイルサーバ上のＰｏｗｅｒＰｏｉｎｔ（登録商標）ファイルだけを対象として、オブジェクト単位のベクトルデータの検索および取得を行うようにすることで、処理を簡略化することができる。 Further, in the third embodiment, when the output file format is specified as shown in FIG. 21, it is possible to limit the search processing target to the selected format file. For example, when the PowerPoint (registered trademark) format is selected, the processing is simplified by searching and acquiring the vector data in units of objects for only the PowerPoint (registered trademark) file on the file server. be able to.

一方、ステップＳ７におけるファイル検索処理では、元のファイルがＰＤＦファイルとして特定された場合、このＰＤＦファイルがその文書の文字オブジェクトに対して、既に文字認識された文字コードを付加ファイルとして有している場合がある。このようなＰＤＦファイルをベクトル化する場合、この文字コードファイルを用いることにより、ステップＳ１３以降のベクトル化処理の中の文字認識処理を省くことができる。すなわち、ベクトル化処理をより高速に処理することが可能となる。 On the other hand, in the file search process in step S7, when the original file is specified as a PDF file, this PDF file has a character code that has already been recognized for the character object of the document as an additional file. There is a case. When vectorizing such a PDF file, the character recognition process in the vectorization process after step S13 can be omitted by using this character code file. That is, the vectorization process can be performed at a higher speed.

以上が本発明の実施の形態の説明であるが、本発明は、これら実施の形態の構成に限られるものではなく、特許請求の範囲で示した機能、または実施の形態の構成が持つ機能が達成できる構成であればどのようなものであっても適用可能である。 The above is the description of the embodiments of the present invention. However, the present invention is not limited to the configurations of these embodiments, and the functions shown in the claims or the functions of the configurations of the embodiments are included. Any configuration that can be achieved is applicable.

また、本発明の目的は、実施の形態の機能を実現するソフトウェアのプログラムコードを記録した記憶媒体を、システム或いは装置に供給し、そのシステム或いは装置のコンピュータ（またはＣＰＵやＭＰＵ等）が記憶媒体に格納されたプログラムコードを読み出して実行することによっても達成される。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施の形態の機能を実現することになり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。 Another object of the present invention is to supply a storage medium storing software program codes for realizing the functions of the embodiments to a system or apparatus, and a computer (or CPU, MPU, etc.) of the system or apparatus stores the storage medium. It is also achieved by reading out and executing the program code stored in. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

また、プログラムコードを供給するための記憶媒体としては、例えば、ＲＯＭ、フロッピー（登録商標）ディスク、ＰＣＭＣＩＡカードやコンパクトフラッシュ（登録商標）等のメモリカード、ハードディスク、マイクロＤＡＴ、光磁気ディスク、ＣＤ−ＲやＣＤ−ＲＷ等の光ディスク、ＤＶＤ等の相変化型光ディスク等で構成されてもよい。 Examples of storage media for supplying the program code include ROM, floppy (registered trademark) disk, memory card such as PCMCIA card and compact flash (registered trademark), hard disk, micro DAT, magneto-optical disk, CD- It may be composed of an optical disk such as R or CD-RW, a phase change optical disk such as DVD, or the like.

また、コンピュータが読み出したプログラムコードを実行することにより、上記実施の形態の機能が実現されるだけでなく、そのプログラムコードの指示に基づき、コンピュータ上で稼動しているＯＳ（オペレーティングシステム）等が実際の処理の一部または全部を行い、その処理によって前述した実施の形態の機能が実現される場合も含まれる。 Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also an OS (operating system) running on the computer based on the instruction of the program code. A case where part or all of the actual processing is performed and the functions of the above-described embodiments are realized by the processing is also included.

更に、記憶媒体から読み出されたプログラムコードが、コンピュータに挿入された機能拡張ボードやコンピュータに接続された機能拡張ユニットに備わるメモリに書き込まれた後、そのプログラムコードの指示に基づき、その機能拡張ボードや機能拡張ユニットに備わるＣＰＵ等が実際の処理の一部または全部を行い、その処理によって前述した実施の形態の機能が実現される場合も含まれる。 Further, after the program code read from the storage medium is written in a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, the function expansion is performed based on the instruction of the program code. This includes the case where the CPU or the like provided in the board or the function expansion unit performs part or all of the actual processing and the functions of the above-described embodiments are realized by the processing.

第１の実施形態における画像処理システムの構成を示す図である。It is a figure which shows the structure of the image processing system in 1st Embodiment. 複合機１００の電気的構成を示す図である。2 is a diagram illustrating an electrical configuration of the multifunction peripheral 100. FIG. 複合機１００の動作処理手順を示すフローチャートである。3 is a flowchart illustrating an operation processing procedure of the multifunction peripheral 100. 図３につづく複合機１００の動作処理手順を示すフローチャートである。FIG. 4 is a flowchart illustrating an operation processing procedure of the multifunction peripheral 100 continued from FIG. ブロック選択処理によって記憶部２１１に保存された画像データを、文字／線画部分とハーフトーンの画像部分とに領域を分離する例を示す図である。It is a figure which shows the example which isolate | separates the area | region of the image data preserve | saved at the memory | storage part 211 by the block selection process into the character / line drawing part and the halftone image part. ブロック選択処理で得られた各ブロックのブロック情報を示す図である。It is a figure which shows the block information of each block obtained by the block selection process. 原稿画像中に付加された２次元バーコードシンボル（ＱＲコード）を復号してデータ文字列を出力する処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which decodes the two-dimensional barcode symbol (QR code) added in the original image, and outputs a data character string. ２次元バーコードが付加された原稿を示す図である。It is a figure which shows the original document to which the two-dimensional barcode was added. ステップＳ６Ａにおけるポインタ情報を基に電子ファイルを検索する処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which searches an electronic file based on the pointer information in step S6A. ステップＳ７におけるファイル検索処理手順を示すフローチャートである。It is a flowchart which shows the file search processing procedure in step S7. 図１０につづくステップＳ７におけるファイル検索処理手順を示すフローチャートである。It is a flowchart which shows the file search process procedure in step S7 following FIG. 角を曲線で近似する様子を示す図である。It is a figure which shows a mode that an angle is approximated with a curve. 太さを持った線として表現する様子を示す図である。It is a figure which shows a mode that it expresses as a line with thickness. ベクトルデータを図形オブジェクト毎にグループ化する処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which groups vector data for every graphic object. 図形要素を検出する処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which detects a graphic element. ＤＡＯＦ形式のデータ構造を示す図である。It is a figure which shows the data structure of DAOF format. ステップＳ１４におけるアプリケーションデータの生成処理手順を示すフローチャートである。It is a flowchart which shows the production | generation process procedure of the application data in step S14. 文書構造ツリー生成処理手順を示すフローチャートである。It is a flowchart which shows a document structure tree production | generation process procedure. 文書構造ツリーを示す図である。It is a figure which shows a document structure tree. ステップＳ１５におけるポインタ情報の付加処理手順を示すフローチャートである。It is a flowchart which shows the addition process procedure of the pointer information in step S15. 出力ファイル形式を選択する操作画面を示す図である。It is a figure which shows the operation screen which selects an output file format. ベクトル化処理の変更方式を示す図である。It is a figure which shows the change system of a vectorization process. 第２の実施形態のステップＳ６Ａにおけるポインタ情報を基に電子ファイルを検索する処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which searches an electronic file based on the pointer information in step S6A of 2nd Embodiment.

Explanation of symbols

１００複合機
１０１マネジメントＰＣ
１０２クライアントＰＣ
１０５、１１５データベース
１０６、１１６文書管理サーバ
２１０画像読み取り部
２１１記憶部
２１２記録部
２１３入力部
２１５データ処理部
２１６表示部
３１１二次元バーコードシンボル（ＱＲコード） 100 MFP 101 Management PC
102 Client PC
105, 115 Database 106, 116 Document management server 210 Image reading unit 211 Storage unit 212 Recording unit 213 Input unit 215 Data processing unit 216 Display unit 311 Two-dimensional barcode symbol (QR code)

Claims

Vectorization means for generating vector data from image data input from input means for inputting image data;
An image processing apparatus comprising: output means for outputting the generated vector data in a predetermined file format.

Selecting means for selecting the predetermined file format;
The image processing apparatus according to claim 1, wherein the vectorization unit changes content of vectorization processing when generating the vector data based on the selection result.

The image processing apparatus according to claim 1, wherein the input unit optically reads an image of a document to generate an image signal, and converts the generated image signal into image data.

Read means described in the converted image data and reading pointer information indicating a storage location of the original electronic file,
4. The image according to claim 3, wherein when the electronic file acquired based on the read pointer information is image data, the vectorization means generates the vector data from the converted image data. Processing equipment.

The image processing apparatus according to claim 1, wherein the input unit inputs an array of two-dimensional digital pixel data as the image data.

The image processing apparatus according to claim 1, wherein the vectorization unit converts the input raster format image data into command definition format data such as code information, graphic information, and function information.

2. The image processing apparatus according to claim 1, wherein the vectorization means includes format conversion means for converting an object of the image data converted into the vector data into a predetermined format that can be handled by predetermined application software. .

8. The image processing apparatus according to claim 7, wherein the predetermined format is a designated file format or a document structure format defined by software that generates the file format.

An input step for inputting image data;
A vectorization step of generating vector data from the input image data;
And an output step of outputting the generated vector data in a predetermined file format.

A program executed by a CPU in the image processing apparatus,
An input step for inputting image data;
A vectorization step of generating vector data from the input image data;
An output step of outputting the generated vector data in a predetermined file format.