JP2021043500A

JP2021043500A - Information processing apparatus and program

Info

Publication number: JP2021043500A
Application number: JP2019162765A
Authority: JP
Inventors: 久保　周作; Shusaku Kubo; 周作久保; 上野　邦和; Kunikazu Ueno; 邦和上野; 邦彦小林; Kunihiko Kobayashi
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2019-09-06
Filing date: 2019-09-06
Publication date: 2021-03-18

Abstract

To cause validity of an imprint that exists at a specific stamped part in a document to be confirmed without opening a document file.SOLUTION: An image formation apparatus 10 includes a document type identification unit 13 that identifies a type of a document from a read image of the document, a key acquisition unit 14 that acquires a key corresponding to the type of the identified document from a key information storage unit 22, an image extraction unit 15 that extract an image linked to each key as an imprint, a reference character acquisition unit 16 that acquires a character included in a stamp to be stamped at a specified stamped portion in the document as the reference character, a verification character acquisition unit 171 that acquires a character included in an image of the imprint extracted from the image extraction unit 15 as a verification character, a determination unit 17 that determines the validity of the imprint existing at the specified stamped portion in the document to be processed by comparing the verification character and the reference character, and a determination result output unit 18 that presents a determination result of the validity of the imprint for each key by the determination unit 16.SELECTED DRAWING: Figure 1

Description

本発明は、情報処理装置及びプログラムに関する。 The present invention relates to an information processing device and a program.

申込書、契約書、あるいは稟議書などの文書は、所定の押印箇所に押印されたものを原本とし、正式な文書として取り扱われる。また、電子化され文書ファイルにて保存される場合もある。 Documents such as application forms, contracts, and approval documents are treated as official documents, with the original stamped at the designated stamped location. It may also be digitized and saved as a document file.

文書の中には、正しい印章の押印が義務づけられているものもあるが、正しい印章で所定の押印箇所に押印されているかどうかを文書ファイルで確認するには、文書ファイルを開き画面に表示させて正しい印章の印影であるかどうかを確認する必要があった。 Some documents are required to be stamped with the correct seal, but to check in the document file whether the stamp is stamped with the correct stamp, open the document file and display it on the screen. It was necessary to confirm whether it was the correct imprint of the seal.

特開平８−２１２３４３号公報Japanese Unexamined Patent Publication No. 8-212343

しかしながら、文書内の特定の押印箇所に正しい印章の印影があるかどうかを確認するために文書ファイルを開き画面に表示させることは、手間がかかり面倒であった。 However, it is troublesome and troublesome to open the document file and display it on the screen in order to confirm whether or not the imprint of the correct seal is present at a specific stamped part in the document.

本発明は、文書ファイルを開くことなく文書内の特定の押印箇所に存在する印影の正当性を確認させることを目的とする。 An object of the present invention is to confirm the validity of an imprint existing at a specific imprinted portion in a document without opening the document file.

本発明に係る情報処理装置は、プロセッサを備え、前記プロセッサは、処理対象の文書の読取画像から前記文書内の特定の押印箇所に存在する印影の画像を抽出し、前記印影の画像に含まれている文字を検証文字として取得し、前記特定の押印箇所に押印されるべき印章に含まれる文字を基準文字として取得し、前記検証文字と前記基準文字との対比によって前記特定の押印箇所に存在する印影の正当性を判定し、前記特定の押印箇所に存在する印影の正当性の判定結果を提示することを特徴とする。 The information processing apparatus according to the present invention includes a processor, which extracts an image of an imprint existing at a specific imprinted portion in the document from a scanned image of the document to be processed and includes the image of the imprint. Characters are acquired as verification characters, characters included in the seal to be stamped on the specific stamped portion are acquired as reference characters, and are present at the specific stamped portion by comparing the verification character with the reference character. It is characterized in that the validity of the imprint to be imprinted is determined, and the determination result of the validity of the imprint existing at the specific imprinted portion is presented.

また、前記プロセッサは、前記印影の画像に含まれている文字が文字認識機能を利用して取得できない場合、前記印影の画像に含まれている直線成分を抽出すると共に、前記文書に押印されるべき印章の向きに対する抽出した各直線成分の傾き角度を検出し、抽出した直線成分の数量を傾き角度毎に集計した結果を参照して前記印影の画像の傾き角度を推定し、前記印影の画像に対して傾き補正をした後に文字認識機能を実行することで前記印影の画像に含まれている文字を取得することを特徴とする。 Further, when the character included in the image of the imprint cannot be acquired by using the character recognition function, the processor extracts the linear component included in the image of the imprint and stamps the document. The tilt angle of each extracted linear component with respect to the orientation of the power stamp is detected, the tilt angle of the image of the imprint is estimated by referring to the result of totaling the quantity of the extracted linear components for each tilt angle, and the image of the imprint is estimated. It is characterized in that the characters included in the image of the imprint are acquired by executing the character recognition function after correcting the inclination of the imprint.

また、前記プロセッサは、検出した傾き角度の中に直交する傾き角度の組が存在する場合、当該傾き角度の組の中で前記集計した結果の和が最大となる傾き角度の組に含まれる直線成分のうち前記集計した結果の大きい直線成分の傾き角度を前記印影の画像の傾き角度と推定することを特徴とする。 Further, when the detected tilt angle has a set of orthogonal tilt angles, the processor includes a straight line included in the set of tilt angles that maximizes the sum of the aggregated results in the set of tilt angles. It is characterized in that the tilt angle of the linear component having a large aggregated result among the components is estimated as the tilt angle of the image of the imprint.

また、前記プロセッサは、検出した傾き角度の中に直交する傾き角度の組が存在しない場合、前記集計した結果が最大となる直線成分の傾き角度を前記印影の画像の傾き角度と推定することを特徴とする。 Further, the processor estimates that the tilt angle of the linear component that maximizes the aggregated result is the tilt angle of the imprint image when there is no set of orthogonal tilt angles in the detected tilt angles. It is a feature.

また、前記直線成分の数量を傾き角度毎に集計した結果は、当該傾き角度の直線成分の数又は当該傾き角度の直線成分の長さの総和であることを特徴とする。 Further, the result of totaling the quantity of the linear components for each tilt angle is the sum of the number of linear components of the tilt angle or the length of the linear components of the tilt angle.

また、前記プロセッサは、推定した傾き角度による傾き補正をした後の前記印影の画像を９０度ずつ回転させて得た４種類の画像に対して文字認識機能を実行し、４種類の画像の中から文字認識精度が最大となる画像が印章と正立する前記印影の画像として抽出することを特徴とする。 Further, the processor executes a character recognition function on four types of images obtained by rotating the image of the imprint after tilt correction based on the estimated tilt angle by 90 degrees, and among the four types of images. It is characterized in that the image having the maximum character recognition accuracy is extracted as an image of the imprint that stands upright with the seal.

また、前記プロセッサは、前記処理対象の文書の読取画像を解析することによって前記処理対象の文書の種類を特定し、文書内に存在すべき特定の画像に紐付く特定の文字が文書の種類毎に設定されている特定文字情報を参照することによって、前記処理対象の文書の種類において当該文書内に存在すべき特定の画像に紐付く特定の文字を取得し、取得した前記特定の文字に対応して予め設定されている探索条件に合致する画像を、前記印影の画像として抽出することを特徴とする。 Further, the processor identifies the type of the document to be processed by analyzing the scanned image of the document to be processed, and the specific characters associated with the specific image that should exist in the document are for each type of document. By referring to the specific character information set in, the specific character associated with the specific image that should exist in the document in the type of the document to be processed is acquired, and the acquired specific character is supported. An image that matches the preset search conditions is extracted as an image of the imprint.

また、前記プロセッサは、取得した前記特定の文字と、当該特定の文字に紐付く前記印影の正当性の判定結果と、を組にして提示することを特徴とする。 Further, the processor is characterized in that the acquired specific character and the determination result of the validity of the imprint associated with the specific character are presented as a set.

また、前記プロセッサは、取得した前記特定の文字と、当該特定の文字に紐付く前記印影の正当性の判定結果と、を組にして前記処理対象の文書のファイル名に含めることを特徴とする。 Further, the processor is characterized in that the acquired specific character and the determination result of the validity of the imprint associated with the specific character are combined and included in the file name of the document to be processed. ..

また、前記プロセッサは、取得した前記特定の文字と、当該特定の文字に紐付く前記印影の正当性の判定結果と、の組を含むファイルを生成することを特徴とする。 Further, the processor is characterized in that it generates a file including a set of the acquired specific character and the determination result of the validity of the imprint associated with the specific character.

本発明に係る情報処理装置は、プロセッサを備え、前記プロセッサは、処理対象の文書の読取画像から前記文書内の特定の押印箇所に存在する印影の画像を抽出し、前記印影の画像に含まれている文字が文字認識機能を利用して取得できない場合、前記印影の画像に含まれている直線成分を抽出すると共に、前記文書に押印されるべき印章の向きに対する抽出した各直線成分の傾き角度を検出し、抽出した直線成分の数量を傾き角度毎に集計した結果を参照して前記印影の画像の傾き角度を推定し、前記印影の画像に対して傾き補正をした後に文字認識機能を実行することで前記印影の画像に含まれている文字を取得することを特徴とする。 The information processing apparatus according to the present invention includes a processor, and the processor extracts an image of an imprint existing at a specific imprinted portion in the document from a scanned image of the document to be processed and includes the image of the imprint. When the character is not acquired by using the character recognition function, the linear component contained in the image of the imprint is extracted, and the inclination angle of each extracted linear component with respect to the direction of the stamp to be stamped on the document. Is detected, the tilt angle of the imprint image is estimated by referring to the result of totaling the quantity of the extracted linear components for each tilt angle, and the character recognition function is executed after the tilt correction is performed on the imprint image. By doing so, it is characterized in that the characters included in the image of the imprint are acquired.

本発明に係るプログラムは、コンピュータに、処理対象の文書の読取画像から前記文書内の特定の押印箇所に存在する印影の画像を抽出する機能、前記印影の画像に含まれている文字が文字認識機能を利用して取得できない場合、前記印影の画像に含まれている直線成分を抽出すると共に、前記文書に押印されるべき印章の向きに対する抽出した各直線成分の傾き角度を検出する機能、抽出した直線成分の数量を傾き角度毎に集計した結果を参照して前記印影の画像の傾き角度を推定する機能、前記印影の画像に対して傾き補正をした後に文字認識機能を実行することで前記印影の画像に含まれている文字を取得する機能、を実現させる。 The program according to the present invention has a function of extracting an image of an imprint existing at a specific stamped portion in the document from a scanned image of the document to be processed by a computer, and character recognition of characters included in the image of the imprint. If it cannot be obtained by using the function, the linear component contained in the image of the imprint is extracted, and the tilt angle of each extracted linear component with respect to the orientation of the stamp to be stamped on the document is detected. The function of estimating the tilt angle of the image of the imprint by referring to the result of totaling the quantity of the linear components for each tilt angle, and the character recognition function after correcting the tilt of the image of the imprint. The function of acquiring the characters contained in the image of the imprint is realized.

請求項１に記載の発明によれば、文書ファイルを開くことなく文書内の特定の押印箇所に存在する印影の正当性を確認させることができる。 According to the invention of claim 1, it is possible to confirm the validity of the imprint existing at a specific stamped portion in the document without opening the document file.

請求項２に記載の発明によれば、文字を構成する直線成分の多くは、文書に押印されるべき印章の向きと関係性があると想定して印影の画像を傾き補正することができる。 According to the invention of claim 2, most of the linear components constituting the character can be tilt-corrected on the assumption that the image of the imprint is related to the orientation of the seal to be imprinted on the document.

請求項３に記載の発明によれば、文字を構成する直線成分の多くは、文書に押印されるべき印章の向きと平行する方向又は直交する方向に向いていると想定して印影の画像を傾き補正することができる。 According to the invention of claim 3, most of the linear components constituting the character are oriented in a direction parallel to or orthogonal to the direction of the seal to be stamped on the document, and the image of the imprint is formed. Tilt correction can be performed.

請求項４に記載の発明によれば、文字を構成する直線成分の多くは、文書に押印されるべき印章の向きと平行していると想定して印影の画像を傾き補正することができる。 According to the invention of claim 4, most of the linear components constituting the character can be tilt-corrected on the assumption that the image of the imprint is parallel to the direction of the seal to be imprinted on the document.

請求項５に記載の発明によれば、直線成分の数量の多い方向と基準方向とのずれが印影の画像の傾き角度と推定することができる。 According to the fifth aspect of the present invention, it can be estimated that the deviation between the direction in which the number of linear components is large and the reference direction is the inclination angle of the image of the imprint.

請求項６に記載の発明によれば、印影の画像に含まれている文字を精度良く取得することができる。 According to the invention of claim 6, the characters included in the image of the imprint can be acquired with high accuracy.

請求項７に記載の発明によれば、文書に含まれている印影の画像を自動的に抽出することができる。 According to the invention of claim 7, the image of the imprint contained in the document can be automatically extracted.

請求項８に記載の発明によれば、特定の文字に紐付く印影の正当性の判定結果を確認させることができる。 According to the invention of claim 8, it is possible to confirm the determination result of the validity of the imprint associated with a specific character.

請求項９に記載の発明によれば、特定の文字と当該特定の文字に紐付く印影の正当性の判定結果を、文書ファイルを開くことなく確認させることができる。 According to the invention of claim 9, it is possible to confirm the determination result of the validity of a specific character and the imprint associated with the specific character without opening the document file.

請求項１０に記載の発明によれば、文書の種類に対応する特定の文字が複数存在する場合、あるいは処理対象とする文書が複数存在する場合に、印影の正当性の判定結果をまとめて確認させることができる。 According to the invention of claim 10, when there are a plurality of specific characters corresponding to the type of the document, or when there are a plurality of documents to be processed, the determination result of the validity of the imprint is collectively confirmed. Can be made to.

請求項１１に記載の発明によれば、文書ファイルを開くことなく文書内の特定の押印箇所に存在する印影の正当性を確認させることができる。 According to the invention of claim 11, it is possible to confirm the validity of the imprint existing at a specific imprinted portion in the document without opening the document file.

請求項１２に記載の発明によれば、文書ファイルを開くことなく文書内の特定の押印箇所に存在する印影の正当性を確認させることができる。 According to the invention of claim 12, it is possible to confirm the validity of the imprint existing at a specific imprinted portion in the document without opening the document file.

本発明に係る情報処理装置の一実施の形態を示すブロック構成図である。It is a block block diagram which shows one Embodiment of the information processing apparatus which concerns on this invention. 本実施の形態における画像形成装置のハードウェア構成図である。It is a hardware block diagram of the image forming apparatus in this embodiment. 本実施の形態におけるキー情報記憶部に記憶されるキー情報のデータ構成例を示す図である。It is a figure which shows the data structure example of the key information stored in the key information storage part in this embodiment. 本実施の形態における探索条件情報記憶部に記憶される探索条件情報のデータ構成例を示す図である。It is a figure which shows the data structure example of the search condition information stored in the search condition information storage part in this embodiment. 本実施の形態における印影正当性判定処理を示すフローチャートである。It is a flowchart which shows the imprint justification determination processing in this embodiment. 本実施の形態において文書の種類が図面に分類される文書の概略的なレイアウトを示す図である。It is a figure which shows the schematic layout of the document which the document type is classified into a drawing in this embodiment. 傾いて押印された印影の例を示す図である。It is a figure which shows the example of the imprint which was tilted and imprinted. 図７Ａに示す印影から抽出した直線成分のみを示す図である。It is a figure which shows only the linear component extracted from the imprint shown in FIG. 7A. 図７Ｂにおいて直線成分の傾き角度を示す図である。It is a figure which shows the inclination angle of the linear component in FIG. 7B. 図７Ｂに示す直線成分の数及び長さの傾き角度毎の集計結果を示す図である。It is a figure which shows the aggregation result for each inclination angle of the number of linear components and the length shown in FIG. 7B. 図７Ａに示す印影の傾き補正後の印影を９０度ずつ回転させた印影及び当該印影に対する文字認識の結果を示す図である。It is a figure which shows the imprint which rotated the imprint after the inclination correction of the imprint shown in FIG. 7A by 90 degrees, and the result of character recognition with respect to the imprint.

以下、図面に基づいて、本発明の好適な実施の形態について説明する。 Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings.

図１は、本発明に係る情報処理装置の一実施の形態を示すブロック構成図である。本実施の形態では、情報処理装置であるコンピュータを内蔵する画像形成装置を例にして説明する。 FIG. 1 is a block configuration diagram showing an embodiment of an information processing device according to the present invention. In the present embodiment, an image forming apparatus having a built-in computer, which is an information processing apparatus, will be described as an example.

図２は、本実施の形態における画像形成装置１０のハードウェア構成図である。画像形成装置１０は、コピー機能、スキャナ機能等各種機能を搭載した複合機で形成可能である。図２において、ＣＰＵ３１は、ＲＯＭ３９に格納されたプログラムに従ってスキャナ３４やプリンタエンジン３６等本装置に搭載された各種機構の動作制御を行う。アドレスデータバス３２は、ＣＰＵ３１の制御対象となる各種機構と接続してデータの通信を行う。操作パネル３３は、ユーザからの指示の受け付け、情報の表示を行う。スキャナ３４は、ユーザがセットした原稿を読み取る。ＨＤＤ３５は、スキャナ３４を使用して読み取った電子文書などを格納する。プリンタエンジン３６は、ＣＰＵ３１で実行される制御プログラムからの指示に従い出力用紙上に画像を印字する。ネットワークインタフェース（Ｉ／Ｆ）３７は、ネットワーク１を接続し、本装置が生成した電子データの送信、本装置宛に送信されてきた電子メールの受信、またブラウザ経由による本装置へのアクセスなどに利用される。本実施の形態におけるネットワークインタフェース３７は、外部のシステムからネットワーク１を介して印影の正当性の判断に利用される基準文字を受信する。ＲＡＭ３８は、プログラム実行時のワークメモリや電子データ送受信時の通信バッファとして利用される。ＲＯＭ３９は、本装置の制御や電子データの送受信に関する各種プログラムが格納されている。各種プログラムが実行されることで後述する各構成要素が所定の処理機能を発揮する。外部メディアインタフェース（Ｉ／Ｆ）４０は、ＵＳＢメモリ、フラッシュメモリ等の外部メモリ機器とのインタフェースである。本実施の形態おける画像形成装置１０のハードウェア構成は、従前からある構成と同様でよい。 FIG. 2 is a hardware configuration diagram of the image forming apparatus 10 according to the present embodiment. The image forming apparatus 10 can be formed by a multifunction device equipped with various functions such as a copy function and a scanner function. In FIG. 2, the CPU 31 controls the operation of various mechanisms mounted on the present device, such as the scanner 34 and the printer engine 36, according to the program stored in the ROM 39. The address data bus 32 communicates data by connecting to various mechanisms controlled by the CPU 31. The operation panel 33 accepts instructions from the user and displays information. The scanner 34 reads a document set by the user. The HDD 35 stores an electronic document or the like read by using the scanner 34. The printer engine 36 prints an image on the output paper according to an instruction from the control program executed by the CPU 31. The network interface (I / F) 37 connects the network 1 to transmit electronic data generated by the device, receive e-mails sent to the device, access the device via a browser, and the like. It will be used. The network interface 37 in the present embodiment receives a reference character used for determining the validity of the imprint from an external system via the network 1. The RAM 38 is used as a work memory when executing a program and as a communication buffer when transmitting and receiving electronic data. The ROM 39 stores various programs related to the control of the present device and the transmission / reception of electronic data. When various programs are executed, each component described later exerts a predetermined processing function. The external media interface (I / F) 40 is an interface with an external memory device such as a USB memory or a flash memory. The hardware configuration of the image forming apparatus 10 in the present embodiment may be the same as the conventional configuration.

図１に戻り、本実施の形態における画像形成装置１０は、スキャンデータ取得部１１、文字認識処理部１２、文書種類特定部１３、キー取得部１４、画像抽出部１５、基準文字取得部１６、判定部１７、判定結果出力部１８、文書種類情報記憶部２１、キー情報記憶部２２、探索条件情報記憶部２３及び出力形式情報記憶部２４を有している。なお、本実施の形態において説明に用いない構成要素については、図から省略している。 Returning to FIG. 1, the image forming apparatus 10 in the present embodiment includes a scan data acquisition unit 11, a character recognition processing unit 12, a document type identification unit 13, a key acquisition unit 14, an image extraction unit 15, and a reference character acquisition unit 16. It has a determination unit 17, a determination result output unit 18, a document type information storage unit 21, a key information storage unit 22, a search condition information storage unit 23, and an output format information storage unit 24. The components not used in the description in the present embodiment are omitted from the drawings.

スキャンデータ取得部１１は、スキャナ３４を使用して読み取ったスキャンデータ（以下、「読取画像」ともいう）を取得する。本実施の形態において取り扱う文書は、特定の文字を含むテキスト文字と特定の画像とが混在する文書である。「特定の文字」というのは、文書の種類毎に設定されており、文書内に存在すべき特定の画像に紐づくテキスト文字である。本実施の形態では、特定の文字を「キー」と称している。また、本実施の形態における「特定の画像」というのは、印影の画像である。なお、厳密には、紙文書にあるのは印影であり、紙文書の読取画像に含まれているのは印影の画像であるが、説明の便宜上、印影と印影の画像を同義に用いて説明する場合もある。 The scan data acquisition unit 11 acquires scan data (hereinafter, also referred to as “scanned image”) read by using the scanner 34. The document handled in the present embodiment is a document in which text characters including specific characters and specific images are mixed. The "specific character" is a text character that is set for each type of document and is associated with a specific image that should exist in the document. In this embodiment, a specific character is referred to as a "key". Further, the "specific image" in the present embodiment is an image of an imprint. Strictly speaking, what is in the paper document is the imprint, and what is included in the scanned image of the paper document is the image of the imprint. In some cases.

文字認識処理部１２は、ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）機能（「文字認識機能」ともいう）を利用して文字認識を行うことによって、読み取った文書に記載されている文字列（すなわち、テキスト文字）を抽出する。文書種類特定部１３は、読み取った文書の読取画像から当該文書の種類を特定する。キー取得部１４は、文書内に存在する印影に紐づく特定の文字（すなわち、キー）に関する情報である特定文字情報を参照することによって、当該文書の種類において当該文書内に存在する印影に紐付くキーを取得する。特定文字情報は、キー情報としてキー情報記憶部２２に記憶される。 The character recognition processing unit 12 performs character recognition using an OCR (Optical Character Reader) function (also referred to as a “character recognition function”) to perform character recognition, and thus a character string (that is, a text character) described in a read document. Is extracted. The document type specifying unit 13 specifies the type of the document from the scanned image of the read document. The key acquisition unit 14 links to the imprint existing in the document in the type of the document by referring to the specific character information which is the information about the specific character (that is, the key) associated with the imprint existing in the document. Get the attached key. The specific character information is stored in the key information storage unit 22 as key information.

画像抽出部１５は、キー取得部１４が取得したキーに対応して探索条件に合致する画像、すなわち印影の画像を読取画像から抽出する。探索条件に関する探索条件情報は、探索条件情報記憶部２３に予め設定されている。基準文字取得部１６は、処理対象の文書の特定の押印箇所に押印されるべき印章に含まれる文字を基準文字として取得する。 The image extraction unit 15 extracts an image that matches the search conditions, that is, an image of the imprint, corresponding to the key acquired by the key acquisition unit 14 from the scanned image. The search condition information regarding the search condition is preset in the search condition information storage unit 23. The reference character acquisition unit 16 acquires characters included in the seal to be stamped at a specific stamped portion of the document to be processed as reference characters.

判定部１７は、検証文字と基準文字との対比によって処理対象の文書の特定の押印箇所に存在する印影の正当性を判定する。判定部１７に含まれる検証文字取得部１７１は、画像抽出部１５が抽出した印影の画像に含まれている文字を検証文字として取得する。判定結果出力部１８は、指定された出力形式に従い判定結果を出力する。出力形式に関する出力形式情報は、出力形式情報記憶部２４に予め設定されている。 The determination unit 17 determines the validity of the imprint existing at a specific stamped portion of the document to be processed by comparing the verification character with the reference character. The verification character acquisition unit 171 included in the determination unit 17 acquires the characters included in the imprint image extracted by the image extraction unit 15 as verification characters. The determination result output unit 18 outputs a determination result according to a designated output format. The output format information regarding the output format is preset in the output format information storage unit 24.

図３は、本実施の形態におけるキー情報記憶部２２に記憶されるキー情報のデータ構成例を示す図である。キー情報は、文書の種類毎に、当該文書の種類に対応する１又は複数のキーを含む。 FIG. 3 is a diagram showing a data configuration example of key information stored in the key information storage unit 22 in the present embodiment. The key information includes one or more keys corresponding to the document type for each document type.

図４は、本実施の形態における探索条件情報記憶部２３に記憶される探索条件情報のデータ構成例を示す図である。探索条件情報は、文書の種類毎に設定される。図４には、文書の種類が図面の場合の探索条件情報の設定例が示されている。探索条件情報には、当該文書の種類に対応するキー毎に対応付けして探索条件が設定される。文書の種類が図面の場合のキーは、図３に例示したように承認者、製図、構造、設計及び検図であるが、図４に例示するように、文書の種類が図面の場合、キー毎に探索条件が設定される。各キーに対応する探索条件には、処理対象の文書内に、取得したキーに紐付いて存在する印影を特定するための条件である「画像に対する条件」と、当該キーと処理対象の文書内に当該機キーに紐付いて存在する印影との位置関係を示す条件である「位置関係」とが含まれている。「位置関係」への設定によって当該印影に対応する特定の押印箇所が特定される。図４に示す設定例では、各キーとも同じ項目値を設定しているが、文書のレイアウトに従って異なる探索条件が指定される場合もあり得る。「画像に対する条件」には、当該印影に含まれる色又は形状の少なくとも一方に関する条件が含めてよい。図４に示す設定例では、「赤丸」と赤色及び丸形状の双方が含まれている。 FIG. 4 is a diagram showing a data configuration example of the search condition information stored in the search condition information storage unit 23 in the present embodiment. The search condition information is set for each type of document. FIG. 4 shows an example of setting search condition information when the document type is a drawing. The search condition is set in association with each key corresponding to the type of the document in the search condition information. The key when the document type is drawing is the approver, drafting, structure, design and inspection as illustrated in FIG. 3, but as illustrated in FIG. 4, the key is when the document type is drawing. Search conditions are set for each. The search conditions corresponding to each key include "conditions for images", which are conditions for identifying the imprint that exists associated with the acquired key in the document to be processed, and the key and the document to be processed. The "positional relationship", which is a condition indicating the positional relationship with the imprint existing associated with the machine key, is included. By setting to "positional relationship", a specific imprint location corresponding to the imprint is specified. In the setting example shown in FIG. 4, the same item value is set for each key, but different search conditions may be specified according to the layout of the document. The "condition for an image" may include a condition relating to at least one of the colors or shapes contained in the imprint. In the setting example shown in FIG. 4, both the "red circle" and the red and round shapes are included.

なお、文書種類情報記憶部２１及び出力形式情報記憶部２４については、動作の説明と合わせて説明する。 The document type information storage unit 21 and the output format information storage unit 24 will be described together with the description of the operation.

画像形成装置１０における各構成要素１１〜１８は、画像形成装置１０に内蔵されるコンピュータと、コンピュータに搭載されたＣＰＵ３１で動作するプログラムとの協調動作により実現される。また、各記憶部２１〜２４は、画像形成装置１０に搭載されたＨＤＤ３５にて実現される。あるいは、ＲＡＭ３８又は外部にある記憶手段をネットワーク経由で利用してもよい。 Each component 11 to 18 in the image forming apparatus 10 is realized by a cooperative operation of a computer built in the image forming apparatus 10 and a program operated by a CPU 31 mounted on the computer. Further, each of the storage units 21 to 24 is realized by the HDD 35 mounted on the image forming apparatus 10. Alternatively, the RAM 38 or an external storage means may be used via the network.

また、本実施の形態で用いるプログラムは、通信手段により提供することはもちろん、ＣＤ−ＲＯＭやＵＳＢメモリ等のコンピュータ読み取り可能な記録媒体に格納して提供することも可能である。通信手段や記録媒体から提供されたプログラムはコンピュータにインストールされ、コンピュータのＣＰＵがプログラムを順次実行することで各種処理が実現される。 Further, the program used in the present embodiment can be provided not only by communication means but also by storing it in a computer-readable recording medium such as a CD-ROM or a USB memory. The programs provided by the communication means and the recording medium are installed in the computer, and various processes are realized by sequentially executing the programs by the CPU of the computer.

文書の種類や文書のフォームによって押印される位置は異なってくるかもしれないが、本実施の形態において取り扱う文書には、特定の押印箇所に特定の印章によって押印されることで印影が形成される。しかしながら、場合によっては、ユーザが正しくない押印箇所に誤って押印してしまう場合がある。あるいは、ユーザが誤った印章で特定の押印箇所に押印してしまう場合がある。換言すると、特定の押印箇所に誤った印章で押印される場合があり得る。そこで、本実施の形態においては、所定の押印箇所に所定の印章で正しく押印されて印影が形成されたのか、その印影の正当性を、文書ファイルを個々に開いて確認しなくてもすむようにしたことを特徴としている。 The position to be stamped may differ depending on the type of document and the form of the document, but in the document handled in this embodiment, a stamp is formed by stamping a specific stamped portion with a specific stamp. .. However, in some cases, the user may mistakenly stamp an incorrect stamped portion. Alternatively, the user may stamp a specific stamped portion with an erroneous stamp. In other words, a specific stamp may be stamped with the wrong stamp. Therefore, in the present embodiment, it is not necessary to individually open the document file to confirm whether the imprint is formed by correctly imprinting the imprint on the predetermined imprinted portion with the predetermined seal. It is characterized by having done it.

なお、本実施の形態において「正しい印章」というのは、特定の押印箇所に押印されるべき印章のことをいい、上記「特定の印章」と同義である。印章が正しい押印箇所に押印されて始めて「正しい印章」となる。換言すると、印影と押印されている箇所とが正しい組合せとなって始めてその印影は正当となる。「誤った印章」というのは、正しい印章以外の印章のことをいう。 In the present embodiment, the "correct seal" means a seal to be stamped on a specific stamped portion, and is synonymous with the above-mentioned "specific stamp". Only when the seal is stamped on the correct stamped part will it become the "correct stamp". In other words, the imprint is valid only when the imprint and the imprinted part are in the correct combination. "Incorrect seal" means a seal other than the correct seal.

次に、本実施の形態における動作について説明するが、以下、本実施の形態において特徴的な印影の正当性の判定を行う処理について図５に示すフローチャートを用いて説明する。 Next, the operation in the present embodiment will be described. Hereinafter, the process for determining the validity of the characteristic imprint in the present embodiment will be described with reference to the flowchart shown in FIG.

まず、ユーザが処理対象とする文書をスキャナ３４に読み取らせると、スキャンデータ取得部１１は、その文書の読取画像を取得する（ステップ１０１）。続いて、文字認識処理部１２は、ＯＣＲ機能を利用して文字認識を行うことによって、読み取った文書に記載されている文字列を抽出する（ステップ１０２）。ＯＣＲを実行する際、読取画像の正立や背景色を除去するクレンジング等の前処理を行うようにしてもよい。 First, when the user causes the scanner 34 to read the document to be processed, the scan data acquisition unit 11 acquires the scanned image of the document (step 101). Subsequently, the character recognition processing unit 12 extracts the character string described in the read document by performing character recognition using the OCR function (step 102). When executing OCR, preprocessing such as cleansing for removing the upright image and the background color of the scanned image may be performed.

次に、文書種類特定部１３は、文書の種類を特定する（ステップ１０３）。具体的には、以下に例示するいずれかの手法を用いて文書の種類を特定する。 Next, the document type specifying unit 13 specifies the type of the document (step 103). Specifically, the type of document is specified by using one of the methods exemplified below.

第１に、例えば文書のフォーム若しくは文書の識別情報を特定するＱＲコード（登録商標）等のデータコードが文書に付されている場合、そのデータコードを読み取る。この場合、文書種類情報記憶部２１には、データコードに、当該データコードに対応する文書の種類が対応付けして文書種類情報として記憶されており、文書種類特定部１３は、読取画像から得たデータコードを文書種類情報に含まれるデータコードと照合することで文書の種類を特定する。 First, when a data code such as a QR code (registered trademark) that identifies a document form or document identification information is attached to the document, the data code is read. In this case, the document type information storage unit 21 associates the data code with the type of the document corresponding to the data code and stores it as the document type information, and the document type identification unit 13 obtains from the read image. The type of document is specified by collating the data code with the data code included in the document type information.

第２に、文書の読取画像、特にレイアウトを解析することによって文書の種類を推定により特定する。例えば、文書上の罫線の位置を検出し、その罫線のレイアウトを取得する。この罫線のレイアウトから文書の種類を推定する。あるいは、罫線が表を形成している場合、表に含まれる項目の名称を抽出する。この場合、文書種類情報記憶部２１には、文書の種類に、表の項目名が対応付けして文書種類情報として記憶されており、文書種類特定部１３は、読取画像から得た表の項目名のリストを、文書種類情報に含まれる項目名のリストと照合することで、項目名の合致率等を参照に文書の種類を推定により特定する。 Second, the type of document is estimated and identified by analyzing the scanned image of the document, especially the layout. For example, the position of a ruled line on a document is detected, and the layout of the ruled line is acquired. The document type is estimated from the layout of this ruled line. Alternatively, when the ruled lines form a table, the names of the items included in the table are extracted. In this case, the document type information storage unit 21 stores the item names of the table in association with the document type as the document type information, and the document type identification unit 13 stores the table items obtained from the scanned image. By collating the list of names with the list of item names included in the document type information, the document type is identified by estimation with reference to the matching rate of the item names and the like.

第３に、文書の読取画像を解析することによって文書における書類名を抽出する。一般に、文書には、書類名が記載されており、その記載位置は、文書の最上段や上方の中央である。また括弧が付されていたり、文字サイズが大きかったりする。そこで、そのような特徴のある文字列を書類名と推定して抽出する。この場合、文書種類情報記憶部２１には、文書の種類に、書類名が対応付けして文書種類情報として記憶されており、文書種類特定部１３は、読取画像から得た書類名を、文書種類情報に含まれる書類名と照合することで、文書の種類を推定により特定する。 Third, the document name in the document is extracted by analyzing the scanned image of the document. Generally, a document has a document name, and the position of the document is at the top of the document or in the upper center of the document. In addition, parentheses are attached and the character size is large. Therefore, a character string having such a characteristic is estimated as a document name and extracted. In this case, the document type information storage unit 21 associates the document type with the document type and stores the document type information, and the document type identification unit 13 stores the document name obtained from the scanned image as the document. The type of document is identified by estimation by collating it with the document name included in the type information.

第４に、文書をスキャナ３４に読み取らせる際に、文書の種類をユーザに指定させる。より具体的には、スキャンデータ取得部１１が文書を読み取ると、文書種類特定部１３は、文書の種類の入力画面を操作パネル３３に表示させ、その入力画面から文書の種類をユーザに入力させる。あるいは、文書種類情報記憶部２１には、文書の種類の選択候補が記憶されており、文書種類特定部１３は、スキャンデータ取得部１１が文書を読み取ると、文書の種類の選択画面を操作パネル３３に表示させる。選択画面には、文書種類情報記憶部２１から読み出した文書の種類がリスト表示される。そして、文書種類特定部１３は、ユーザに選択より選択された文書の種類に特定する。 Fourth, when the scanner 34 reads the document, the user is asked to specify the type of the document. More specifically, when the scan data acquisition unit 11 reads the document, the document type identification unit 13 displays the document type input screen on the operation panel 33, and causes the user to input the document type from the input screen. .. Alternatively, the document type information storage unit 21 stores the document type selection candidates, and the document type identification unit 13 displays the document type selection screen on the operation panel when the scan data acquisition unit 11 reads the document. Display on 33. On the selection screen, the types of documents read from the document type information storage unit 21 are displayed in a list. Then, the document type specifying unit 13 specifies the type of the document selected by the user.

文書種類特定部１３がいずれかの手法にて文書の種類を特定すると、キー取得部１４は、特定された文書の種類に対応して設定されているキーをキー情報記憶部２２から取得する（ステップ１０４）。図３に示す設定例によると、文書種類特定部１３が特定した文書の種類が図面の場合、キー取得部１４は、キー情報記憶部２２から承認者、製図、構造、設計及び検図を取得することになる。 When the document type specifying unit 13 specifies the document type by any method, the key acquisition unit 14 acquires the key set corresponding to the specified document type from the key information storage unit 22 ( Step 104). According to the setting example shown in FIG. 3, when the document type specified by the document type specifying unit 13 is a drawing, the key acquisition unit 14 acquires the approver, drafting, structure, design, and drawing from the key information storage unit 22. Will be done.

次に、画像抽出部１５は、取得されたキー毎に以下の処理を繰り返し実行する。まず、以下の処理を実施していない未処理のキーを１つ選出する（ステップ１０５）。選出するキーの順番は、特に限定する必要はない。続いて、画像抽出部１５は、選出したキーを、ステップ１０２で実施した文字認識処理で得た文字列と照合して、文書上におけるキーの位置を特定する。なお、ステップ１０２で実施した文字認識処理の結果を利用せずに、この時点で改めて文字認識処理を実施してもよい。続いて、画像抽出部１５は、文書の種類が図面であって処理対象のキーに対応する探索条件を探索条件情報記憶部２３から取得する。例えば、キーが“承認者”の場合、画像抽出部１５は、“承認者”と印字されている文書の位置を特定する。そして、図４に示す探索条件の設定例によると、その文書のキー“承認者”の印字位置から下側に３ｃｍ以内にある画像であって、色が赤色で形状が丸（つまり、円形状）の画像を抽出する（ステップ１０６）。 Next, the image extraction unit 15 repeatedly executes the following processing for each acquired key. First, one unprocessed key that has not been subjected to the following processing is selected (step 105). The order of the keys to be selected does not have to be limited. Subsequently, the image extraction unit 15 collates the selected key with the character string obtained in the character recognition process performed in step 102, and identifies the position of the key on the document. The character recognition process may be performed again at this point without using the result of the character recognition process performed in step 102. Subsequently, the image extraction unit 15 acquires the search condition corresponding to the key to be processed when the document type is a drawing from the search condition information storage unit 23. For example, when the key is "Approver", the image extraction unit 15 specifies the position of the document in which "Approver" is printed. Then, according to the setting example of the search condition shown in FIG. 4, the image is within 3 cm below the print position of the key "approver" of the document, and the color is red and the shape is a circle (that is, a circular shape). ) Is extracted (step 106).

続いて、基準文字取得部１６は、処理対象のキーに対応して特定の押印箇所に押印されるべき印章に含まれる文字を基準文字として取得する（ステップ１０７）。基準文字の取得は、次のように実施する。 Subsequently, the reference character acquisition unit 16 acquires, as a reference character, a character included in the seal to be stamped at a specific stamped portion corresponding to the key to be processed (step 107). Acquisition of reference characters is carried out as follows.

ここでは、人名が少なくとも印影に含まれているものとして説明すると、例えば、処理対象の文書（図６に示す例では「設計図面」）において、各キーに対応して押印する者は誰（若しくはどの部署の人）であるのか、一意若しくはおおよそ特定されている場合が少なくない。従って、各キーに対応して押印する者（複数人でもよい）が予めデータベース化されている場合、そのデータベースにアクセスして処理対象のキーに対応する人名を取得する。あるいは、企業のデータベースをアクセスして文書の作成者及び所属部署を特定し、その部署のメンバ若しくは部署の上司等の人名を取得する。また、文書によっては、押印する者の人名が押印箇所に対応させて予め印字されている場合がある。この場合は、文書に印字されている人名を基準文字として取得する。なお、説明の便宜上、ここでは１人の人名が基準文字として取得されるものとして説明する。 Here, if the person's name is at least included in the imprint, for example, in the document to be processed (“design drawing” in the example shown in FIG. 6), who (or or) is imprinted corresponding to each key. In many cases, the person in which department) is unique or roughly specified. Therefore, when a person (or a plurality of people) who stamps each key is stored in a database in advance, the database is accessed and the name of the person corresponding to the key to be processed is acquired. Alternatively, the database of the company is accessed to identify the creator of the document and the department to which the document belongs, and the names of members of the department or the boss of the department are obtained. Further, depending on the document, the person's name of the person who stamps may be pre-printed corresponding to the stamped portion. In this case, the person's name printed on the document is acquired as the reference character. For convenience of explanation, it is assumed here that one person's name is acquired as a reference character.

基準文字取得部１６が基準文字を取得すると、続いて、判定部１７は、画像抽出部１５により抽出された印影の画像を解析することで文字を検証文字として取得する（ステップ１０８）。具体的には、判定部１７における検証文字取得部１７１は、ＯＣＲ機能を利用して文字認識を行うことによって印影の画像に含まれている文字を抽出する。そして、判定部１７は、基準文字と検証文字とを対比し、一致していれば（ステップ１１０でＹ）、文書から得られた印影は、正しい印章によって押印されて形成された、すなわち正当であると判定する（ステップ１１１）。一方、判定部１７は、基準文字と検証文字とが一致していなければ（ステップ１１０でＮ）、文書から得られた印影は、正しい印章によって押印されていない、すなわち不当であると判定する（ステップ１１２）。 When the reference character acquisition unit 16 acquires the reference character, the determination unit 17 subsequently acquires the character as a verification character by analyzing the image of the imprint extracted by the image extraction unit 15 (step 108). Specifically, the verification character acquisition unit 171 in the determination unit 17 extracts characters included in the imprint image by performing character recognition using the OCR function. Then, the determination unit 17 compares the reference character and the verification character, and if they match (Y in step 110), the imprint obtained from the document is imprinted and formed by the correct seal, that is, it is legitimate. It is determined that there is (step 111). On the other hand, if the reference character and the verification character do not match (N in step 110), the determination unit 17 determines that the imprint obtained from the document is not stamped with the correct seal, that is, is unreasonable (N). Step 112).

図６は、文書の種類が図面に分類される文書の概略的なレイアウトを示す図である。図６には、文書上の右下に所定の押印箇所として設けられている押印欄が設けられており、押印すべき者は、文書の特定の押印欄に押印することになる。押印欄の中の上方には、キーが印字されているので、ユーザは、キーを参照することによって押印する押印欄を確認できる。図６に示す押印欄２は、キー“承認者”に紐付く押印欄であるが、押印欄２には、ユーザ“久保”の印章が押印されている例が示されている。すなわち、検証文字は“久保”である。ここで、基準文字取得部１６がキー“承認者”に対応する基準文字として“久保”を取得していれば、押印欄２の印影は正当であると判定される。基準文字取得部１６がキー“承認者”に対応する基準文字として“久保”以外の文字を取得していれば、押印欄２の印影は不当であると判定される。 FIG. 6 is a diagram showing a schematic layout of documents in which the types of documents are classified into drawings. In FIG. 6, a stamping column provided as a predetermined stamping place is provided at the lower right of the document, and a person who should stamp the document will stamp a specific stamping column of the document. Since the key is printed on the upper part of the stamp field, the user can confirm the stamp field to be stamped by referring to the key. The stamp column 2 shown in FIG. 6 is a stamp column associated with the key “approver”, and the stamp column 2 shows an example in which the seal of the user “Kubo” is stamped. That is, the verification character is "Kubo". Here, if the reference character acquisition unit 16 has acquired "Kubo" as the reference character corresponding to the key "approver", it is determined that the imprint of the imprint column 2 is valid. If the reference character acquisition unit 16 has acquired a character other than "Kubo" as the reference character corresponding to the key "approver", it is determined that the imprint of the imprint column 2 is unreasonable.

以上説明した処理を実施していないキーに対しても同様に処理を行い（ステップ１１３でＮ，ステップ１０５〜１１２）、キー取得部１４が取得した全てのキーに対して処理を実施すると（ステップ１１３でＹ）、判定結果出力部１８は、判定部１７による印影の正当性の判定結果を提示する（ステップ１１４）。 When the same processing is performed on the keys that have not been processed as described above (N in step 113, steps 105 to 112), and the processing is performed on all the keys acquired by the key acquisition unit 14 (step 113). Y) in 113, the determination result output unit 18 presents the determination result of the validity of the imprint by the determination unit 17 (step 114).

ところで、印影は、必ずしも正しい向きに押印されていないことからＯＣＲ機能を利用して文字認識を行っても印影の画像に含まれている文字列を正確に抽出できない場合がある。図６に示す押印欄３には、正しい向きに押印されなかった印影の例が示されている。ここでいう「正しい向き」というのは、押印欄２に例示したように「文書に押印されるべき印章の向き」のことをいい、通常は、印章の正立状態である。「正立」とは、印章と印影との上下が同じ場合であることをいう。図６に示す押印欄２における印影は、図６に示す文書のｘ軸方向（つまり、水平方向）に沿って平行に横書きの文字列が押印されていることから、印章が正立した状態で押印されていることになる。なお、縦書きの文字列の場合、図６に示す文書のｙ軸方向（つまり、垂直方向）に沿って平行に押印されている場合が正しい向きである。つまり、文書に押印されるべき印章の向きで押印されている場合、判定部１７は、ＯＣＲ機能を実行することによって印影から文字列を正確に抽出することが可能となる。 By the way, since the imprint is not necessarily imprinted in the correct direction, it may not be possible to accurately extract the character string included in the imprint image even if character recognition is performed using the OCR function. In the stamp column 3 shown in FIG. 6, an example of a stamp imprint that was not stamped in the correct direction is shown. The "correct orientation" here means the "orientation of the stamp to be stamped on the document" as illustrated in the stamp column 2, and is usually the upright state of the stamp. "Upright" means that the upper and lower parts of the seal and the imprint are the same. The imprint in the stamp column 2 shown in FIG. 6 is a state in which the stamp is upright because the character string written horizontally is stamped in parallel along the x-axis direction (that is, the horizontal direction) of the document shown in FIG. It will be stamped. In the case of a vertically written character string, the correct orientation is when the documents shown in FIG. 6 are stamped in parallel along the y-axis direction (that is, the vertical direction). That is, when the stamp is stamped in the direction of the stamp to be stamped on the document, the determination unit 17 can accurately extract the character string from the stamp impression by executing the OCR function.

これに対し、図６に示す押印欄３には、図６に示す文書のｘ軸方向に沿って人名が表れるように印章が押印されていない、すなわち文書に押印されるべき印章の向きではなく傾いて押印されている例が示されている。この印影だと、ＯＣＲ機能を利用して人名を正確に抽出できないことが想定できる。本実施の形態では、このように印影の画像をそのまま利用しても、印影に含まれている文字がＯＣＲ機能を利用して取得できない場合にも対応できるようにしたことを特徴としている。この場合、検証文字取得部１７１は、次のように処理する。 On the other hand, in the stamp column 3 shown in FIG. 6, the stamp is not stamped so that the person's name appears along the x-axis direction of the document shown in FIG. 6, that is, the direction of the stamp to be stamped on the document is not. An example of tilting and imprinting is shown. With this imprint, it can be assumed that the person's name cannot be accurately extracted using the OCR function. The present embodiment is characterized in that even if the image of the imprint is used as it is, it is possible to deal with the case where the characters included in the imprint cannot be acquired by using the OCR function. In this case, the verification character acquisition unit 171 processes as follows.

図７Ａは、図６に示す押印欄３における印影のみを抽出した図である。検証文字取得部１７１は、まず印影から線のベクトル情報を抽出し、直線成分を抽出する。印影から直線成分のみを抽出した画像の例を図７Ｂに示す。そして、例えば“田”のように直線成分が接している場合には、直線成分毎に処理できるように各直線線分を切り離す。続いて、検証文字取得部１７１は、抽出した直線成分毎に、基準となる水平方向（ｘ軸方向に沿った方向）からの傾き角度及び線分の長さを検出する。図7Ｃには、直線成分４の傾き角度θを検出する場合が示されている。各直線成分につき傾き角度と線分の長さを検出すると、検証文字取得部１７１は、検出した傾き角度毎に直線成分の数を集計する。また検出した角度毎に直線成分の長さを集計する。図８は、角度毎に得た直線成分の数及び長さを集計した結果の例を示す図である。 FIG. 7A is a diagram in which only the imprint in the imprint column 3 shown in FIG. 6 is extracted. The verification character acquisition unit 171 first extracts the vector information of the line from the imprint and extracts the linear component. FIG. 7B shows an example of an image in which only the linear component is extracted from the imprint. Then, when the linear components are in contact with each other, for example, "field", each linear line segment is separated so that each linear component can be processed. Subsequently, the verification character acquisition unit 171 detects the inclination angle and the length of the line segment from the reference horizontal direction (direction along the x-axis direction) for each of the extracted linear components. FIG. 7C shows a case where the inclination angle θ of the linear component 4 is detected. When the inclination angle and the length of the line segment are detected for each linear component, the verification character acquisition unit 171 totals the number of linear components for each detected inclination angle. In addition, the length of the linear component is totaled for each detected angle. FIG. 8 is a diagram showing an example of the result of totaling the number and length of the linear components obtained for each angle.

続いて、検証文字取得部１７１は、印影の傾きを補正するために印影の傾き角度を次の手順にて従って推定する。 Subsequently, the verification character acquisition unit 171 estimates the inclination angle of the imprint according to the following procedure in order to correct the inclination of the imprint.

まず、図８に示す表の中から直線成分の数の多い２つの傾き角度、図８に示す数値例だとａ１とａ２を抽出する。そして、それらの傾き角度ａ１，ａ２が直交していたら数の多い傾き角度ａ１を印影の傾き角度と推定する。 First, from the table shown in FIG. 8, two tilt angles having a large number of linear components, a1 and a2 in the numerical example shown in FIG. 8, are extracted. Then, if the inclination angles a1 and a2 are orthogonal to each other, the inclination angle a1 having a large number is estimated as the inclination angle of the imprint.

該当する傾き角度の組が存在しない場合、数の多い傾き角度から順に数の多い傾き角度同士を順次直交しているかを判定する。例えば、傾き角度ａ１＞ａ２＞ａ３＞ａ４＞ａ５の場合、ａ１とａ２、ａ１とａ３、ａ１とａ４、ａ１とａ５、続いてａ２とａ３、ａ２とａ４、ａ２とａ５という順番の組合せで判定する。そして、直交している組が複数存在する場合、当該傾き角度の組の中で集計した結果の和、すなわち直線成分の数の和が最大となる傾き角度の組を特定し、その特定した組に含まれる傾き角度のうち数の多い傾き角度を印影の傾き角度と推定する。 When the corresponding set of tilt angles does not exist, it is determined whether the tilt angles having the largest number are orthogonal to each other in order from the tilt angle having the largest number. For example, when the tilt angle a1> a2> a3> a4> a5, the combination of a1 and a2, a1 and a3, a1 and a4, a1 and a5, then a2 and a3, a2 and a4, and a2 and a5. judge. Then, when there are a plurality of orthogonal pairs, the sum of the results aggregated in the tilt angle pairs, that is, the tilt angle pair that maximizes the sum of the number of linear components is specified, and the specified pair is specified. Of the tilt angles included in, the tilt angle with the largest number is estimated as the tilt angle of the imprint.

それでも該当する組が存在しない場合、直線成分の数が最大となる傾き角度（上記例でいうａ１）を印影の傾き角度と推定する。 If the corresponding set still does not exist, the tilt angle at which the number of linear components is maximized (a1 in the above example) is estimated as the tilt angle of the imprint.

以上のようにして、印影の傾き角度を推定すると、検証文字取得部１７１は、推定した傾き角度に従って印影の画像の傾きを補正する。実際には、この補正により印影の画像は、正立の状態になると考えられるが、印章が倒立の状態で押印されている場合（上下が正反対に押印されている場合）、印影の画像が倒立状態に補正される場合も考えられるので、本実施の形態においては、この場合にも対応可能なように、次のように処理する。 When the inclination angle of the imprint is estimated as described above, the verification character acquisition unit 171 corrects the inclination of the image of the imprint according to the estimated inclination angle. Actually, it is considered that the image of the imprint is upright due to this correction, but when the seal is stamped in the inverted state (when the stamp is stamped upside down), the image of the imprint is inverted. Since it is possible that the state is corrected, in the present embodiment, the following processing is performed so that this case can also be dealt with.

図９に示すように、傾き補正後の印影の画像を９０度ずつ回転させて４種類の画像、すなわち、上下左右の４方向の向きの印影の画像を形成する。そして、各画像についてＯＣＲ機能を実行して文字列を抽出する。図９は、４方向の向きの印影の画像と、各画像に対する文字認識の結果、すなわち、抽出した文字列を各画像に対応させて示す図である。この結果からも明らかなように、太線５で囲む正立した状態の印影の画像の文字認識精度が最大になると考えられる。検証文字取得部１７１は、以上のようにして傾き補正を行ってからＯＣＲ機能を実行することで、印影に含まれている文字、すなわち検証文字を取得する。 As shown in FIG. 9, the image of the imprint after tilt correction is rotated by 90 degrees to form four types of images, that is, images of imprints in four directions of up, down, left, and right. Then, the OCR function is executed for each image to extract the character string. FIG. 9 is a diagram showing an image of imprints in four directions and a result of character recognition for each image, that is, an extracted character string corresponding to each image. As is clear from this result, it is considered that the character recognition accuracy of the image of the imprint in the upright state surrounded by the thick line 5 is maximized. The verification character acquisition unit 171 acquires the characters included in the imprint, that is, the verification characters, by executing the OCR function after performing the tilt correction as described above.

なお、押印欄３に押印されているように、印章がデータ印の場合、部署名と人名と日付との組を基準文字とする。日付の場合、基準文字と検証文字とを比較する際に、日付が示す年月日の一致まで求めるようにしてもよいし、検証文字が示す日付が、基準文字にて指定される所定の範囲に含まれていれば一致とみなすようにしてもよい。あるいは、日付という情報が検証文字に含まれていれば一致とみなすようにしてもよい。 When the seal is a data seal as stamped in the seal column 3, the set of the department name, the person's name, and the date is used as the reference character. In the case of a date, when comparing the reference character and the verification character, the match of the date indicated by the date may be obtained, or the date indicated by the verification character is in a predetermined range specified by the reference character. If it is included in, it may be regarded as a match. Alternatively, if the information of the date is included in the verification character, it may be regarded as a match.

一般に、漢字は、直線成分が曲線成分より多く含まれ、また直線成分は、水平方向又は垂直方向に向いていることが多いと考えられる。本実施の形態において例示している“田中”は、まさに直線成分のみで構成され、更に各直線成分は、水平方向若しくは垂直方向のみを向いている。従って、ＯＣＲ機能を実行して抽出した文字列の直線成分の多くは、向きが水平方向又は垂直方向なので直交すると考えられる。そこで、本実施の形態においては、この点に着目して印影の傾きを補正し、補正後の印影の画像から文字列を正確に抽出できるようにした。 In general, it is considered that a kanji contains a linear component more than a curved component, and the linear component is often oriented in the horizontal direction or the vertical direction. The "Tanaka" illustrated in the present embodiment is composed of only linear components, and each linear component is oriented only in the horizontal direction or the vertical direction. Therefore, most of the linear components of the character string extracted by executing the OCR function are considered to be orthogonal because their orientations are horizontal or vertical. Therefore, in the present embodiment, the inclination of the imprint is corrected by paying attention to this point so that the character string can be accurately extracted from the image of the imprint after the correction.

なお、上記説明では、直線成分の数量として直線成分の数を用い、直線成分の数の集計結果を用いて傾き角度の順位を決定したが、直線成分の数の代わりに傾き角度毎の直線成分の長さの総和を用いるようにしてもよい。あるいは、直線成分の数と長さの総和それぞれに重みなどの係数を乗算することによって傾き角度の順位付けを行うようにしてもよい。 In the above description, the number of linear components is used as the number of linear components, and the order of the tilt angles is determined using the total result of the number of linear components. However, instead of the number of linear components, the linear components for each tilt angle are determined. You may use the sum of the lengths of. Alternatively, the tilt angle may be ranked by multiplying each of the number of linear components and the sum of the lengths by a coefficient such as a weight.

以上説明したように、判定部１７は、必要により印影の画像の傾き補正を行って印影の正当性を判定する。以上説明した処理を実施していないキーに対しても同様に処理を行い（ステップ１１３でＮ，ステップ１０５〜１１２）、キー取得部１４が取得した全てのキーに対して処理を実施すると（ステップ１１３でＹ）、判定結果出力部１８は、キーと、判定部１７による当該キーに紐付く印影の正当性の判定結果を提示する（ステップ１１４）。具体的には、以下に例示するいずれかの手法を用いて判定結果を出力する。 As described above, the determination unit 17 corrects the inclination of the image of the imprint as necessary to determine the validity of the imprint. When the same processing is performed on the keys that have not been subjected to the processing described above (N in step 113, steps 105 to 112), and the processing is performed on all the keys acquired by the key acquisition unit 14 (step 113). Y) in 113, the determination result output unit 18 presents the key and the determination result of the validity of the imprint associated with the key by the determination unit 17 (step 114). Specifically, the determination result is output using any of the methods illustrated below.

まず、第１に、判定結果を文書のファイル名に含める。出力形式情報記憶部２４に記憶される出力形式情報には、文書ファイルの命名規則が定義されている。例えば、“元ファイル名＋キー＋判定結果”というように、キーと当該キーに紐付いて存在すべき印影の正当性の判定結果と、を組にしてファイル名に含めるという命名規則が出力形式情報に設定されていた場合、元ファイル名が“ＡＢＣ”であり、キーが“承認者”であり、印影が正当の場合、命名規則に従って“ＡＢＣ＿承認者＿正当”と文書ファイルが命名される。一方、元ファイル名が“ＡＢＣ”であり、キーが“製図”であり、印影が不当の場合、“ＡＢＣ＿製図＿不当”と文書ファイルが命名される。なお、“＿”は各項目値を区切る区切り文字であるが、区切り文字はこれに限る必要はない。また、区切り文字をファイル名に必ずしも含めなくてもよい。また、キーが複数ある場合は、“ＡＢＣ＿承認者＿正当＿製図＿不当＿・・・”と、キーと当該キーに紐付いて存在する印影の正当性の判定結果との複数の組をファイル名に含める。 First, the determination result is included in the file name of the document. A naming convention for a document file is defined in the output format information stored in the output format information storage unit 24. For example, a naming convention such as "original file name + key + judgment result" that includes the key and the judgment result of the validity of the imprint that should exist associated with the key in the file name as a set is output format information. If the original file name is "ABC", the key is "Approver", and the imprint is valid, the document file is named "ABC_Approver_valid" according to the naming convention. On the other hand, when the original file name is "ABC", the key is "drafting", and the imprint is improper, the document file is named "ABC_drafting_illegal". Note that "_" is a delimiter that separates each item value, but the delimiter does not have to be limited to this. Also, the delimiter does not necessarily have to be included in the file name. If there are multiple keys, the file name is a plurality of pairs of "ABC_approver_legitimate_drafting_illegal_..." and the result of determining the validity of the imprint associated with the key. Include in.

このように、文書のファイル名に判定結果を含めると、文書を開いて文書の内容を参照するまでもなく各キーに紐付く印影の正当性の判定結果を確認させることができる。 In this way, if the determination result is included in the file name of the document, the determination result of the validity of the imprint associated with each key can be confirmed without opening the document and referring to the contents of the document.

第２に、判定結果を含むファイルを生成する。出力形式情報記憶部２４に記憶される出力形式情報には、キーと当該キーに紐付いて存在する印影の正当性の判定結果と、の組を含むファイル（以下、「判定結果ファイル」と称する）を生成することが定義されている。前述したように、キーと判定結果との組をファイル名に含める場合、キーの数が多いとファイル名が非常に長くなる可能性がある。そこで、この第２の出力形式を採用することでファイル名が長くなりすぎてしまうことを回避できる。判定結果を確認するためには、判定結果ファイルを開く必要が生じてくるかもしれない。しかしながら、判定結果ファイルに、複数の文書に対する判定結果を含めると、複数の文書に対する判定結果を確認する際にはただ１つの判定結果ファイルを開けばよく、複数の文書ファイルを個々に開く必要はない。また、判定結果ファイルを文書ファイルとは別に生成しておくと、印影の正当性の管理上、便利である。判定結果ファイルは、例えばＣＳＶ（ＣｏｍｍｍａＳｅｐａｒａｔｅｄＶａｌｕｅｓ）ファイルで作成する。なお、複数の文書に共通した判定結果ファイルを生成する場合、キー及び判定結果に文書名を対応付けて判定結果ファイルに登録することが好ましい。 Second, a file containing the determination result is generated. The output format information stored in the output format information storage unit 24 is a file containing a set of a key and a judgment result of the validity of the imprint existing associated with the key (hereinafter, referred to as a "judgment result file"). Is defined to generate. As described above, when the pair of the key and the judgment result is included in the file name, the file name may become very long if the number of keys is large. Therefore, by adopting this second output format, it is possible to prevent the file name from becoming too long. In order to check the judgment result, it may be necessary to open the judgment result file. However, if the judgment result file includes the judgment results for a plurality of documents, only one judgment result file needs to be opened when confirming the judgment results for a plurality of documents, and it is not necessary to open the plurality of document files individually. Absent. In addition, it is convenient for managing the validity of the imprint if the judgment result file is generated separately from the document file. The determination result file is created, for example, as a CSV (Comma Separated Values) file. When generating a judgment result file common to a plurality of documents, it is preferable to associate the document name with the key and the judgment result and register the judgment result file in the judgment result file.

第３の出力形式は、第２の出力形式とほぼ同様である。ただ、第２の出力形式では、判定結果として“正当”又は“不当”を含めるのに対し、第３の出力形式では、更に印影の画像そのものをファイルに含めるようにする。このようにすれば、印影の正当性だけでなく印影そのものを確認させることができる。 The third output format is almost the same as the second output format. However, in the second output format, "legitimate" or "illegal" is included as the determination result, whereas in the third output format, the imprint image itself is further included in the file. In this way, not only the validity of the imprint but also the imprint itself can be confirmed.

以上説明したように、本実施の形態においては、文書の読取画像から文書の種類を特定し、その特定した文書の種類に対応するキーを取得することによって、キーの近傍に存在する印影を抽出できるようにした。そして、本実施の形態においては、正しい印章に含まれる文字を基準文字として取得し、文書の読取画像から抽出した印影から得た文字を検証文字として取得し、基準文字と検証文字とが一致するかどうかによって印影の正当性を判定し、その判定結果を提示できるようにした。これにより、文書を開かなくても印影の正当性を確認することができる。このように、本実施の形態においては、文書の種類が特定でき、そして印影に紐付くキーが特定できれば、文書の読取画像から印影を抽出することができるので、どのようなフォームの文書にも適応することが可能となる。つまり、文書のフォームの影響を受けないので、多種類に及ぶ文書のフォームに個別に対処することなく印影の正当性の判定結果を得ることができる。 As described above, in the present embodiment, the type of the document is specified from the scanned image of the document, and the key corresponding to the specified type of the document is acquired to extract the imprint existing in the vicinity of the key. I made it possible. Then, in the present embodiment, the characters included in the correct seal are acquired as the reference characters, the characters obtained from the imprint extracted from the scanned image of the document are acquired as the verification characters, and the reference characters and the verification characters match. The validity of the imprint was judged depending on whether or not it was used, and the judgment result could be presented. As a result, the validity of the imprint can be confirmed without opening the document. As described above, in the present embodiment, if the type of the document can be specified and the key associated with the imprint can be specified, the imprint can be extracted from the scanned image of the document. It becomes possible to adapt. That is, since it is not affected by the form of the document, it is possible to obtain the judgment result of the validity of the imprint without individually dealing with various types of document forms.

なお、キーと同じ文字列が文書に複数存在する場合も想定できる。この場合、複数の文字列の中から印影と紐付くキーを特定するための条件を予め設定して自動的に選出できるようにしたり、ユーザに選択させたりして対応してもよい。 It can be assumed that the document has a plurality of character strings that are the same as the key. In this case, a condition for specifying a key associated with the imprint from a plurality of character strings may be set in advance so that the key can be automatically selected, or the user may be allowed to select the key.

また、本実施の形態では、文書の読取画像を利用するので、情報処理装置として画像形成装置１０を例にして説明したが、文書の読取画像を受け取って処理する汎用的なＰＣ等のコンピュータでもよい。 Further, in the present embodiment, since the scanned image of the document is used, the image forming apparatus 10 has been described as an example of the information processing device, but a computer such as a general-purpose PC that receives and processes the scanned image of the document may also be used. Good.

上記実施の形態において、プロセッサとは広義的なプロセッサを指し、汎用的なプロセッサ（例えばＣＰＵ：ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ等）や、専用のプロセッサ（例えばＧＰＵ：ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ、ＡＳＩＣ：ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ、ＦＰＧＡ：ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ、プログラマブル論理デバイス等）を含むものである。 In the above embodiment, the processor refers to a processor in a broad sense, and refers to a general-purpose processor (for example, CPU: Central Processing Unit, etc.) or a dedicated processor (for example, GPU: Graphics Processing Unit, ASIC: Application Special Integrated Circuit, FPGA). : Field Programgable Gate Array, programmable logic device, etc.).

また上記実施の形態におけるプロセッサの動作は、１つのプロセッサによって成すのみでなく、物理的に離れた位置に存在する複数のプロセッサが協働して成すものであってもよい。また、プロセッサの各動作の順序は上記実施の形態において記載した順序のみに限定されるものではなく、適宜変更してもよい。 Further, the operation of the processor in the above embodiment may be performed not only by one processor but also by a plurality of processors existing at physically separated positions in cooperation with each other. Further, the order of each operation of the processor is not limited to the order described in the above-described embodiment, and may be changed as appropriate.

１ネットワーク、１０画像形成装置、１１スキャンデータ取得部、１２文字認識処理部、１３文書種類特定部、１４キー取得部、１５画像抽出部、１６基準文字取得部、１７判定部、１８判定結果出力部、２１文書種類情報記憶部、２２キー情報記憶部、２３探索条件情報記憶部、２４出力形式情報記憶部、３１ＣＰＵ、３２アドレスデータバス、３３操作パネル、３４スキャナ、３５ハードディスクドライブ（ＨＤＤ）、３６プリンタエンジン、３７ネットワークインタフェース（Ｉ／Ｆ）、３８ＲＡＭ、３９ＲＯＭ、４０外部メディアインタフェース（Ｉ／Ｆ）、１７１検証文字取得部。
1 network, 10 image forming device, 11 scan data acquisition unit, 12 character recognition processing unit, 13 document type identification unit, 14 key acquisition unit, 15 image extraction unit, 16 reference character acquisition unit, 17 judgment unit, 18 judgment result output Unit, 21 Document type information storage unit, 22 Key information storage unit, 23 Search condition information storage unit, 24 Output format information storage unit, 31 CPU, 32 Address data bus, 33 Operation panel, 34 Scanner, 35 Hard disk drive (HDD) , 36 Printer engine, 37 Network interface (I / F), 38 RAM, 39 ROM, 40 External media interface (I / F), 171 Verification character acquisition unit.

Claims

Equipped with a processor
The processor
An image of the imprint existing at a specific imprinted portion in the document is extracted from the scanned image of the document to be processed.
The characters included in the image of the imprint are acquired as verification characters, and the characters are obtained.
The characters included in the seal to be stamped on the specific stamped part are acquired as reference characters.
The validity of the imprint existing at the specific stamped portion is determined by comparing the verification character with the reference character.
The result of determining the validity of the imprint existing at the specific imprint location is presented.
An information processing device characterized by this.

When the processor cannot acquire the characters contained in the image of the imprint by using the character recognition function, the processor
The linear components included in the image of the imprint are extracted, and the inclination angle of each extracted linear component with respect to the orientation of the seal to be stamped on the document is detected.
The tilt angle of the imprint image is estimated by referring to the result of totaling the quantity of the extracted linear components for each tilt angle.
It is characterized in that the characters included in the image of the imprint are acquired by executing the character recognition function after correcting the inclination of the image of the imprint.
The information processing device according to claim 1.

The processor
When there is a set of orthogonal tilt angles in the detected tilt angles, the totalized linear components included in the set of tilt angles that maximizes the sum of the aggregated results in the set of tilt angles. It is characterized in that the tilt angle of the linear component having a large result is estimated as the tilt angle of the image of the imprint.
The information processing device according to claim 2.

The processor
When there is no set of orthogonal tilt angles among the detected tilt angles, the tilt angle of the linear component that maximizes the aggregated result is estimated as the tilt angle of the imprint image.
The information processing device according to claim 2.

The result of totaling the quantity of the linear components for each tilt angle is the sum of the number of linear components of the tilt angle or the lengths of the linear components of the tilt angle.
The information processing device according to claim 2.

The processor
A character recognition function is executed on four types of images obtained by rotating the image of the imprint after tilt correction based on the estimated tilt angle by 90 degrees.
The image having the maximum character recognition accuracy is extracted from the four types of images as an image of the imprint that stands upright with the seal.
The information processing device according to claim 2.

The processor
By analyzing the scanned image of the document to be processed, the type of the document to be processed is specified.
Specific characters associated with a specific image that should exist in the document can be specified to exist in the document in the type of the document to be processed by referring to the specific character information set for each document type. Get the specific characters associated with the image of
An image that matches the acquired search conditions set in advance corresponding to the specific character is extracted as an image of the imprint.
The information processing device according to claim 1.

The processor
It is characterized in that the acquired specific character and the determination result of the validity of the imprint associated with the specific character are presented as a set.
The information processing device according to claim 7.

The processor
It is characterized in that the acquired specific character and the determination result of the validity of the imprint associated with the specific character are combined and included in the file name of the document to be processed.
The information processing device according to claim 8.

The processor
It is characterized in that a file including a set of the acquired specific character and the determination result of the validity of the imprint associated with the specific character is generated.
The information processing device according to claim 8.

Equipped with a processor
The processor
An image of the imprint existing at a specific imprinted portion in the document is extracted from the scanned image of the document to be processed.
When the character contained in the image of the imprint cannot be acquired by using the character recognition function, the linear component contained in the image of the imprint is extracted and the direction of the seal to be stamped on the document is extracted. Detects the tilt angle of each linear component
The tilt angle of the imprint image is estimated by referring to the result of totaling the quantity of the extracted linear components for each tilt angle.
By executing the character recognition function after correcting the inclination of the imprint image, the characters included in the imprint image are acquired.
An information processing device characterized by this.

On the computer
A function of extracting an image of an imprint existing at a specific imprinted portion in the document from the scanned image of the document to be processed.
When the character contained in the image of the imprint cannot be acquired by using the character recognition function, the linear component contained in the image of the imprint is extracted and the direction of the seal to be stamped on the document is extracted. A function to detect the tilt angle of each linear component,
A function to estimate the tilt angle of the imprint image by referring to the result of totaling the number of extracted linear components for each tilt angle.
A function of acquiring characters included in the imprint image by executing a character recognition function after correcting the inclination of the imprint image.
A program to realize.