JP2015204015A5

JP2015204015A5 -

Info

Publication number: JP2015204015A5
Application number: JP2014083637A
Authority: JP
Filing date: 2014-04-15
Publication date: 2017-04-20
Anticipated expiration: 2034-04-15

Description

上記課題を解決すべく、請求項１の発明は、電子文書生成システムであって、原稿のスキャン画像を生成する画像形成装置と、第１のアプリケーションを実行することによって、前記画像形成装置から受信した前記スキャン画像に基づき電子文書を生成する外部端末と、を備え、前記外部端末は、前記画像形成装置で生成された前記スキャン画像を取得する取得手段と、前記スキャン画像の文字領域から複数の部分領域を抽出する抽出手段と、前記スキャン画像内における前記複数の部分領域の位置をそれぞれ検出する検出手段と、電子文書生成に関連する少なくとも１つの処理を前記複数の部分領域のそれぞれに関して行うべき旨の指示と前記複数の部分領域のそれぞれに関する画像データとを、前記第１のアプリケーションから第２のアプリケーションに送出する指示送出手段と、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果を前記第２のアプリケーションから受け取る受取手段と、前記各処理結果に係るテキスト情報を、前記各処理結果にそれぞれ対応する部分領域の検出位置に配置して、前記電子文書を生成する生成手段と、を有し、前記抽出手段は、前記文字領域における複数行のうちの注目行内において複数の小領域を抽出し、当該複数の小領域を互いに異なる部分領域として区別して前記複数の部分領域を抽出することを特徴とする。 In order to solve the above-mentioned problem, the invention of claim 1 is an electronic document generation system, which receives an image forming apparatus that generates a scanned image of a document and the image forming apparatus by executing a first application. An external terminal that generates an electronic document based on the scanned image, and the external terminal includes a plurality of acquisition means for acquiring the scanned image generated by the image forming apparatus, and a plurality of character areas from the character area of the scanned image. Extraction means for extracting partial areas, detection means for detecting the positions of the plurality of partial areas in the scanned image, and at least one process related to electronic document generation should be performed for each of the plurality of partial areas. An instruction to that effect and image data relating to each of the plurality of partial areas are sent from the first application to the second application. Instruction sending means for sending to the application, receiving means for receiving each processing result of the at least one processing relating to each of the plurality of partial areas from the second application, and text information relating to each processing result, each processing result by arranging the detection position of the corresponding partial area, have a, a generating means for generating the electronic document, the extraction means, a plurality of small in attention row among the plurality of rows in the character region A region is extracted, and the plurality of small regions are distinguished from each other as different partial regions, and the plurality of partial regions are extracted.

請求項２の発明は、第１のアプリケーションを実行することによって、画像形成装置から受信したスキャン画像に基づき電子文書を生成する通信端末であって、前記画像形成装置で生成された前記スキャン画像を取得する取得手段と、前記スキャン画像の文字領域から複数の部分領域を抽出する抽出手段と、前記スキャン画像内における前記複数の部分領域の位置をそれぞれ検出する検出手段と、電子文書生成に関連する少なくとも１つの処理を前記複数の部分領域のそれぞれに関して行うべき旨の指示と前記複数の部分領域のそれぞれに関する画像データとを、前記第１のアプリケーションから第２のアプリケーションに送出する指示送出手段と、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果を前記第２のアプリケーションから受け取る受取手段と、前記各処理結果に係るテキスト情報を、前記各処理結果にそれぞれ対応する部分領域の検出位置に配置して、前記電子文書を生成する生成手段と、を備え、前記抽出手段は、前記文字領域における複数行のうちの注目行内において複数の小領域を抽出し、当該複数の小領域を互いに異なる部分領域として区別して前記複数の部分領域を抽出することを特徴とする。 The invention of claim 2 is a communication terminal that generates an electronic document based on a scan image received from the image forming apparatus by executing the first application, and the scan image generated by the image forming apparatus is stored in the communication terminal. Obtaining means for obtaining; extracting means for extracting a plurality of partial areas from the character area of the scanned image; detecting means for respectively detecting positions of the plurality of partial areas in the scanned image; and relating to electronic document generation An instruction sending means for sending an instruction to perform at least one process for each of the plurality of partial areas and image data for each of the plurality of partial areas from the first application to the second application; Each processing result of the at least one processing relating to each of the plurality of partial regions is expressed as the second Comprising a receiving means for receiving from an application, text information relating to the respective processing results, the arranged at the detecting position of the processing result to the corresponding partial area, a generating means for generating the electronic document, wherein the extraction The means is characterized in that a plurality of small areas are extracted in a target line among a plurality of lines in the character area, and the plurality of partial areas are extracted by distinguishing the plurality of small areas as different partial areas.

請求項３の発明は、請求項２の発明に係る通信端末において、前記抽出手段は、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字と前記注目文字の直後の文字との間隔である第１の間隔が、前記注目行に隣接する隣接行と前記注目行との間隔である第２の間隔よりも所定程度を超えて大きい場合には、前記注目文字を含む小領域と前記注目文字の直後の文字を含む小領域とを互いに異なる部分領域として抽出することを特徴とする。 According to a third aspect of the present invention, in the communication terminal according to the second aspect of the present invention, the extracting means relates to a target character in the target line and a character immediately after the target character with respect to the target line of the plurality of lines in the character region. Is larger than a second interval, which is an interval between an adjacent line adjacent to the target line and the target line, by a first interval that is an interval between the target line and the small character including the target character. A region and a small region including a character immediately after the character of interest are extracted as different partial regions.

請求項４の発明は、請求項２の発明に係る通信端末において、前記抽出手段は、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字と前記注目文字の直後の文字との間隔である第１の間隔が、前記注目文字と前記注目文字の直前の文字との間隔である第２の間隔よりも所定程度を超えて大きい場合には、前記注目文字と前記注目文字の直前の文字とを含む小領域と、前記注目文字の直後の文字を含む小領域とを、互いに異なる部分領域として抽出することを特徴とする。 According to a fourth aspect of the present invention, in the communication terminal according to the second aspect of the invention, the extracting means relates to a target character in the target line and a character immediately after the target character with respect to the target line of the plurality of lines in the character area. Is larger than a second interval, which is the interval between the character of interest and the character immediately before the character of interest, by more than a predetermined amount, the character of interest and the character of interest A small region including a character immediately before and a small region including a character immediately after the attention character are extracted as different partial regions.

請求項５の発明は、請求項２の発明に係る通信端末において、前記抽出手段は、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字のサイズと前記注目文字の直後の文字のサイズとが所定程度を超えて異なる場合には、前記注目文字を含む小領域と前記注目文字の直後の文字を含む小領域とを互いに異なる部分領域として抽出することを特徴とする。 According to a fifth aspect of the present invention, in the communication terminal according to the second aspect of the invention, the extracting means relates to a target line of the plurality of lines in the character area, and the size of the target character in the target line and immediately after the target character. When the size of the character differs from the predetermined size by more than a predetermined level, the small region including the character of interest and the small region including the character immediately after the character of interest are extracted as different partial regions.

請求項６の発明は、請求項２の発明に係る通信端末において、前記抽出手段は、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字の行方向に垂直な第１の方向における位置と前記注目文字の直後の文字の前記第１の方向における位置とが所定程度を超えてずれている場合には、前記注目文字を含む小領域と前記注目文字の直後の文字を含む小領域とを互いに異なる部分領域として抽出することを特徴とする。 According to a sixth aspect of the present invention, in the communication terminal according to the second aspect of the present invention, the extracting means relates to a target line of the plurality of lines in the character area, the first being perpendicular to the line direction of the target character in the target line. If the position in the first direction and the position in the first direction of the character immediately after the target character are shifted by more than a predetermined level, the small area including the target character and the character immediately after the target character are It is characterized by extracting the contained small areas as different partial areas.

請求項７の発明は、請求項２ないし請求項６のいずれかの発明に係る通信端末において、前記指示送出手段は、前記複数の部分領域のそれぞれの画像データである複数の部分画像データのそれぞれに前記複数の部分領域の各識別情報を画像化して付加した複数の拡張部分画像データを生成し、当該複数の拡張部分画像データのそれぞれを、前記複数の部分領域のそれぞれに関する画像データとして、前記第１のアプリケーションから前記第２のアプリケーションに送出し、前記受取手段は、前記複数の拡張部分画像データのそれぞれに関する前記少なくとも１つの処理の各処理結果を前記第２のアプリケーションから受け取り、前記生成手段は、前記各処理結果からそれぞれ前記各識別情報を抽出し、前記各処理結果に対応する各部分領域を前記各識別情報に基づいてそれぞれ識別し、前記各処理結果から前記各識別情報をそれぞれ除いたテキスト情報を、識別された前記各部分領域の検出位置に配置して、前記電子文書を生成することを特徴とする。 According to a seventh aspect of the present invention, in the communication terminal according to any one of the second to sixth aspects of the invention, the instruction sending means includes a plurality of pieces of partial image data that are respective pieces of image data of the plurality of partial areas. Generating a plurality of extended partial image data obtained by imaging and adding each identification information of the plurality of partial areas, and each of the plurality of extended partial image data as image data relating to each of the plurality of partial areas, Sending from the first application to the second application, the receiving means receives each processing result of the at least one processing relating to each of the plurality of extended partial image data from the second application, and the generating means Extracts each identification information from each processing result, and extracts each partial area corresponding to each processing result. The electronic document is generated by identifying text information based on each identification information and placing text information obtained by removing the identification information from the processing results at detection positions of the identified partial areas. It is characterized by.

請求項８の発明は、請求項２ないし請求項６のいずれかの発明に係る通信端末において、前記指示送出手段は、前記複数の部分領域のそれぞれに関する画像データである複数のデータファイルのそれぞれに、互いに異なる識別情報を含む固有のファイル名を付し、当該複数のデータファイルを前記第１のアプリケーションから前記第２のアプリケーションに送出し、前記受取手段は、前記複数のデータファイルのそれぞれに関する前記少なくとも１つの処理の各処理結果を複数の新たなデータファイルとして、前記第２のアプリケーションから受け取り、前記複数の新たなデータファイルのファイル名は、前記複数のデータファイルのうち対応するデータファイルのファイル名に付された前記識別情報をその一部に含み、前記生成手段は、前記複数の新たなデータファイルのファイル名に含まれる前記識別情報に基づいて、前記各処理結果に対応する各部分領域を識別し、前記各処理結果に係るテキスト情報を、前記各処理結果に対応する前記各部分領域の検出位置に配置して、前記電子文書を生成することを特徴とする。 According to an eighth aspect of the present invention, in the communication terminal according to any one of the second to sixth aspects of the invention, the instruction sending means applies to each of a plurality of data files which are image data relating to each of the plurality of partial areas. A unique file name including identification information different from each other, the plurality of data files are sent from the first application to the second application, and the receiving unit is configured to transmit the plurality of data files with respect to each of the plurality of data files. Each processing result of at least one process is received as a plurality of new data files from the second application, and the file names of the plurality of new data files are files of corresponding data files among the plurality of data files A part of the identification information attached to the name, and the generation means Based on the identification information included in the file names of a plurality of new data files, each partial area corresponding to each processing result is identified, and text information related to each processing result corresponds to each processing result. The electronic document is generated by being arranged at a detection position of each partial area.

請求項９の発明は、請求項２ないし請求項８のいずれかの発明に係る通信端末において、前記生成手段は、前記複数の部分領域のそれぞれについての前記処理結果に係るテキスト情報を、前記複数の部分領域のそれぞれの検出位置に非表示状態で埋め込んで、前記電子文書を生成することを特徴とする。 According to a ninth aspect of the present invention, in the communication terminal according to any of the second to eighth aspects of the invention, the generation unit generates text information related to the processing result for each of the plurality of partial areas. The electronic document is generated by embedding in a non-display state at each detection position of the partial area.

請求項１０の発明は、請求項２ないし請求項８のいずれかの発明に係る通信端末において、前記生成手段は、前記スキャン画像における文字領域の画像を削除するとともに、前記複数の部分領域のそれぞれについての前記処理結果に係るテキスト情報を、前記複数の部分領域のそれぞれの検出位置に表示状態で埋め込んで、前記電子文書を生成することを特徴とする。 According to a tenth aspect of the present invention, in the communication terminal according to any one of the second to eighth aspects, the generation unit deletes the image of the character area in the scanned image and each of the plurality of partial areas. The electronic document is generated by embedding the text information related to the processing result for each of the plurality of partial areas in a display state at each detection position.

請求項１１の発明は、請求項２ないし請求項８のいずれかの発明に係る通信端末において、前記少なくとも１つの処理は、光学文字認識処理と当該光学文字認識処理の結果に対する翻訳処理とを含む複数の処理であり、前記生成手段は、前記スキャン画像における文字領域の画像を削除するとともに、前記複数の部分領域のそれぞれについての前記処理結果である翻訳結果に係るテキスト情報を、前記複数の部分領域のそれぞれの検出位置に表示状態で埋め込んで、前記電子文書を生成することを特徴とする。 The invention of claim 11 is a communication terminal according to any one of the claims 2 to 8, wherein the at least one process, and a translation process with respect to the optical science character recognition process and the result of the optical character recognition process A plurality of processes including: deleting the image of the character area in the scanned image; and generating text information related to a translation result as the process result for each of the plurality of partial areas. The electronic document is generated by embedding in a display state at each detection position of the partial area.

請求項１２の発明は、請求項２ないし請求項１１のいずれかの発明に係る通信端末において、前記第２のアプリケーションは、前記画像形成装置とも前記通信端末とも異なる外部装置にインストールされていることを特徴とする。 According to a twelfth aspect of the present invention, in the communication terminal according to any one of the second to eleventh aspects, the second application is installed in an external device different from the image forming apparatus and the communication terminal . It is characterized by.

請求項１３の発明は、請求項２ないし請求項１１のいずれかの発明に係る通信端末において、前記第２のアプリケーションは、前記第１のアプリケーションとともに前記通信端末にインストールされていることを特徴とする。 A thirteenth aspect of the present invention is the communication terminal according to any one of the second to eleventh aspects, wherein the second application is installed in the communication terminal together with the first application. To do.

請求項１４の発明は、画像形成装置と通信可能な通信端末に内蔵されたコンピュータにおいて、前記画像形成装置から受信したスキャン画像に基づく電子文書の生成処理を実行させるためのプログラムであって、前記プログラムは、前記コンピュータに、ａ）前記画像形成装置で生成された前記スキャン画像を取得するステップと、ｂ）前記スキャン画像の文字領域から部分領域を抽出するステップと、ｃ）前記スキャン画像内における前記部分領域の位置を検出するステップと、ｄ）前記部分領域に関して電子文書生成に関連する少なくとも１つの処理を行うべき旨の指示と前記部分領域に関する画像データとを、前記プログラムである第１のアプリケーションから前記第１のアプリケーションとは別の第２のアプリケーションに送出するステップと、ｅ）前記少なくとも１つの処理の処理結果を前記第２のアプリケーションから受け取るステップと、ｆ）前記部分領域についての前記処理結果に係るテキスト情報を前記部分領域の検出位置に配置するステップと、ｇ）前記スキャン画像の前記文字領域内の複数の部分領域に関して前記ステップｂ）〜ｆ）を実行することによって、前記電子文書を生成するステップと、を実行させ、前記ステップｇ）においては、前記文字領域における複数行のうちの注目行内において複数の小領域が抽出されるとともに、当該複数の小領域を互いに異なる部分領域として区別して前記複数の部分領域が抽出されることを特徴とする。 According to a fourteenth aspect of the present invention, there is provided a program for causing a computer built in a communication terminal capable of communicating with an image forming apparatus to execute an electronic document generation process based on a scanned image received from the image forming apparatus. program, the computer, a) acquiring the scan image generated by the image forming apparatus, b) extracting a character area or al section partial region of the scanned image, c) the scanned image Detecting the position of the partial area in the computer; d) an instruction to perform at least one process related to electronic document generation with respect to the partial area, and image data relating to the partial area. Sending from one application to a second application different from the first application T) e) receiving a processing result of the at least one process from the second application; f) placing text information on the processing result for the partial area at a detection position of the partial area; G) generating the electronic document by executing steps b) to f) for a plurality of partial areas in the character area of the scanned image, and in step g) A plurality of small areas are extracted in a target line among a plurality of lines in the character area, and the plurality of partial areas are extracted by distinguishing the plurality of small areas as different partial areas.

請求項１５の発明は、請求項１４の発明に係るプログラムにおいて、前記ステップｂ）においては、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字と前記注目文字の直後の文字との間隔である第１の間隔が、前記注目行に隣接する隣接行と前記注目行との間隔である第２の間隔よりも所定程度を超えて大きい場合には、前記注目文字を含む小領域と前記注目文字の直後の文字を含む小領域とが互いに異なる部分領域として抽出されることを特徴とする。 According to a fifteenth aspect of the present invention, in the program according to the fourteenth aspect , in the step b), the attention character in the attention line and the character immediately after the attention character are related to the attention line of the plurality of lines in the character area. If the first interval, which is an interval between characters, is larger than the second interval, which is the interval between the adjacent row adjacent to the target row and the target row, by more than a predetermined degree, the target character is included. A small area and a small area including a character immediately after the target character are extracted as different partial areas.

請求項１６の発明は、請求項１４の発明に係るプログラムにおいて、前記ステップｂ）においては、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字と前記注目文字の直後の文字との間隔である第１の間隔が、前記注目文字と前記注目文字の直前の文字との間隔である第２の間隔よりも所定程度を超えて大きい場合には、前記注目文字と前記注目文字の直前の文字とを含む小領域と、前記注目文字の直後の文字を含む小領域とが、互いに異なる部分領域として抽出されることを特徴とする。 According to a sixteenth aspect of the present invention, in the program according to the fourteenth aspect of the invention, in the step b), the attention character in the attention line and the character immediately after the attention character are related to the attention line of the plurality of lines in the character area. When the first interval, which is an interval between characters, is larger than a second interval, which is the interval between the character of interest and the character immediately before the character of interest, by a predetermined degree or more, the attention character and the attention A small region including a character immediately before the character and a small region including a character immediately after the character of interest are extracted as different partial regions.

請求項１７の発明は、請求項１４の発明に係るプログラムにおいて、前記ステップｂ）においては、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字のサイズと前記注目文字の直後の文字のサイズとが所定程度を超えて異なる場合には、前記注目文字を含む小領域と前記注目文字の直後の文字を含む小領域とが互いに異なる部分領域として抽出されることを特徴とする。 According to a seventeenth aspect of the present invention, in the program according to the fourteenth aspect of the invention, in the step b), the size of the attention character in the attention line and the size of the attention character are related to the attention line of the plurality of lines in the character area. When the size of the character immediately after the character is different from a predetermined level, the small region including the character of interest and the small region including the character immediately after the character of interest are extracted as different partial regions. To do.

請求項１８の発明は、請求項１４の発明に係るプログラムにおいて、前記ステップｂ）においては、前記文字領域における複数行のうちの注目行に関して、前記注目行内における注目文字の行方向に垂直な第１の方向における位置と前記注目文字の直後の文字の前記第１の方向における位置とが所定程度を超えてずれている場合には、前記注目文字を含む小領域と前記注目文字の直後の文字を含む小領域とが互いに異なる部分領域として抽出されることを特徴とする。 According to an eighteenth aspect of the present invention, in the program according to the fourteenth aspect of the invention, in the step b), with respect to the target line of the plurality of lines in the character area, the first line perpendicular to the line direction of the target character in the target line. If the position in the direction of 1 and the position in the first direction of the character immediately after the character of interest are shifted by more than a predetermined level, the small area including the character of interest and the character immediately after the character of interest Are extracted as partial areas different from each other.

請求項１９の発明は、請求項１４ないし請求項１８のいずれかの発明に係るプログラムにおいて、前記ステップｄ）は、ｄ−１）前記部分領域の画像データである部分画像データに前記部分領域の識別情報を画像化して付加した拡張部分画像データを生成するステップと、ｄ−２）前記拡張部分画像データを、前記部分領域に関する画像データとして、前記第１のアプリケーションから前記第２のアプリケーションに送出するステップと、を有し、前記ステップｅ）は、ｅ−１）前記拡張部分画像データに関する前記少なくとも１つの処理の処理結果を前記第２のアプリケーションから受け取るステップ、を有し、前記ステップｆ）は、ｆ−１）前記処理結果から前記識別情報を抽出するステップと、ｆ−２）前記処理結果に対応する部分領域である対応部分領域を前記識別情報に基づいて識別するステップと、ｆ−３）前記処理結果から前記識別情報を除いたテキスト情報を、前記対応部分領域の検出位置に配置するステップと、を有することを特徴とする。 According to a nineteenth aspect of the present invention, in the program according to any one of the fourteenth to eighteenth aspects, the step d) includes: d-1) partial image data that is image data of the partial region; Generating extended partial image data in which identification information is imaged and added; and d-2) sending the extended partial image data as image data relating to the partial area from the first application to the second application. The step e) comprises: e-1) receiving a processing result of the at least one processing relating to the extended partial image data from the second application, and the step f) F-1) extracting the identification information from the processing result; and f-2) a partial area corresponding to the processing result. Identifying a corresponding partial area based on the identification information; and f-3) placing text information obtained by removing the identification information from the processing result at a detection position of the corresponding partial area. It is characterized by that.

請求項２０の発明は、請求項１４ないし請求項１８のいずれかの発明に係るプログラムにおいて、前記ステップｄ）は、ｄ−１）前記部分領域に関する画像データのデータファイルに、互いに異なる識別情報を含む固有のファイル名を付するステップと、ｄ−２）前記データファイルを前記第１のアプリケーションから前記第２のアプリケーションに送出するステップと、を有し、前記ステップｅ）は、ｅ−１）前記データファイルに関する前記少なくとも１つの処理の処理結果を新たなデータファイルとして、前記第２のアプリケーションから受け取るステップ、を有し、前記新たなデータファイルのファイル名は、前記新たなデータファイルに対応する前記データファイルのファイル名に付された前記識別情報をその一部に含み、前記ステップｆ）は、ｆ−１）前記新たなデータファイルのファイル名から前記識別情報を抽出するステップと、ｆ−２）前記ステップｆ−１）で抽出された前記識別情報に基づいて、前記処理結果に対応する部分領域である対応部分領域を識別するステップと、ｆ−３）前記処理結果に係るテキスト情報を、前記対応部分領域の検出位置に配置するステップと、を有することを特徴とする。 According to a twentieth aspect of the present invention, in the program according to any one of the fourteenth to eighteenth aspects of the invention, the step d) includes d-1) identification information different from each other in a data file of image data relating to the partial area. Including a step of assigning a unique file name including d-2) sending the data file from the first application to the second application, wherein step e) includes e-1). Receiving a processing result of the at least one processing relating to the data file as a new data file from the second application, and a file name of the new data file corresponds to the new data file The identification information attached to the file name of the data file is included in a part thereof, and F-1) f-1) extracting the identification information from the file name of the new data file; and f-2) based on the identification information extracted in step f-1). A step of identifying a corresponding partial region that is a partial region corresponding to a processing result, and f-3) a step of arranging text information related to the processing result at a detection position of the corresponding partial region. To do.

請求項２１の発明は、請求項１４ないし請求項２０のいずれかの発明に係るプログラムにおいて、前記ステップｇ）においては、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果に係るテキスト情報が、前記複数の部分領域のそれぞれの検出位置に非表示状態で埋め込まれて前記電子文書が生成されることを特徴とする。 According to a twenty-first aspect of the present invention, in the program according to any of the fourteenth to twentieth aspects, the step g) relates to each processing result of the at least one processing relating to each of the plurality of partial regions. The electronic document is generated by embedding text information in a non-display state at each detection position of the plurality of partial areas.

請求項２２の発明は、請求項１４ないし請求項２０のいずれかの発明に係るプログラムにおいて、前記ステップｇ）においては、前記スキャン画像における前記文字領域の画像が削除されるとともに、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果に係るテキスト情報が、前記複数の部分領域のそれぞれの検出位置に表示状態で埋め込まれて、前記電子文書が生成されることを特徴とする。 According to a twenty-second aspect of the present invention, in the program according to any one of the fourteenth to twentieth aspects, in the step g), the image of the character area in the scanned image is deleted and the plurality of portions Text information related to each processing result of the at least one processing relating to each of the regions is embedded in a display state at each detection position of the plurality of partial regions, and the electronic document is generated.

請求項２３の発明は、請求項１４ないし請求項２０のいずれかの発明に係るプログラムにおいて、前記少なくとも１つの処理は、光学文字認識処理と当該光学文字認識処理の結果に対する翻訳処理とを含む複数の処理であり、前記ステップｇ）においては、前記スキャン画像における文字領域の画像が削除されるとともに、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果である翻訳結果に係るテキスト情報が、前記複数の部分領域のそれぞれの検出位置に表示状態で埋め込まれて、前記電子文書が生成されることを特徴とする。 The invention of claim 23 is the program according to the invention of any one of claims 14 to 20, wherein the at least one process includes a translation process with respect to the optical science character recognition process and the result of the optical character recognition process In step g), the image of the character area in the scanned image is deleted, and the translation result is the result of each of the at least one process for each of the plurality of partial areas. The electronic document is generated by embedding text information in a display state at each detection position of the plurality of partial areas.

請求項２４の発明は、請求項１４ないし請求項２３のいずれかの発明に係るプログラムにおいて、前記第２のアプリケーションは、前記画像形成装置とも前記通信端末とも異なる外部装置にインストールされていることを特徴とする。 According to a twenty-fourth aspect of the present invention, in the program according to any one of the fourteenth to twenty- third aspects, the second application is installed in an external device different from the image forming apparatus and the communication terminal. Features.

請求項２５の発明は、請求項１４ないし請求項２３のいずれかの発明に係るプログラムにおいて、前記第２のアプリケーションは、前記第１のアプリケーションとともに前記通信端末にインストールされていることを特徴とする。 The invention of claim 25 is the program according to any one of claims 14 to 23 , wherein the second application is installed in the communication terminal together with the first application. .

請求項２６の発明は、電子文書生成システムであって、原稿のスキャン画像を生成するとともに、第１のアプリケーションを実行することによって、前記スキャン画像に基づき電子文書を生成する画像形成装置と、第２のアプリケーションによるサービスを提供する外部サーバと、を備え、前記画像形成装置は、前記スキャン画像を生成する生成手段と、前記スキャン画像の文字領域から複数の部分領域を抽出する抽出手段と、前記スキャン画像内における前記複数の部分領域の位置をそれぞれ検出する検出手段と、電子文書生成に関連する少なくとも１つの処理を前記複数の部分領域のそれぞれに関して行うべき旨の指示と前記複数の部分領域のそれぞれに関する画像データとを、前記第１のアプリケーションから第２のアプリケーションに送出する指示送出手段と、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果を前記第２のアプリケーションから受け取る受取手段と、前記各処理結果に係るテキスト情報を、前記各処理結果にそれぞれ対応する部分領域の検出位置に配置して、前記電子文書を生成する生成手段と、を有し、前記抽出手段は、前記文字領域における複数行のうちの注目行内において複数の小領域を抽出し、当該複数の小領域を互いに異なる部分領域として区別して前記複数の部分領域を抽出することを特徴とする。 According to a twenty-sixth aspect of the present invention, there is provided an electronic document generation system, an image forming apparatus that generates a scanned image of an original and generates an electronic document based on the scanned image by executing a first application; And an external server that provides a service by the application, wherein the image forming apparatus generates the scan image, an extraction unit extracts a plurality of partial areas from the character area of the scan image, and Detection means for detecting the positions of the plurality of partial areas in the scanned image, an instruction that at least one process related to electronic document generation should be performed for each of the plurality of partial areas, and the plurality of partial areas The image data relating to each is transferred from the first application to the second application. Instruction sending means for sending, receiving means for receiving each processing result of the at least one processing relating to each of the plurality of partial areas from the second application, text information relating to each processing result, and each processing result each arranged at the detecting position of the corresponding portion region, have a, a generating means for generating the electronic document, the extraction means, a plurality of small areas in the interest row among the plurality of rows in the character region The plurality of small areas are extracted and distinguished from each other as different partial areas, and the plurality of partial areas are extracted.

請求項２７の発明は、原稿のスキャン画像を生成するとともに、第１のアプリケーションを実行することによって、前記スキャン画像に基づき電子文書を生成する画像形成装置であって、前記スキャン画像を生成する生成手段と、前記スキャン画像の文字領域から複数の部分領域を抽出する抽出手段と、前記スキャン画像内における前記複数の部分領域の位置をそれぞれ検出する検出手段と、電子文書生成に関連する少なくとも１つの処理を前記複数の部分領域のそれぞれに関して行うべき旨の指示と前記複数の部分領域のそれぞれに関する画像データとを、前記第１のアプリケーションから、外部サーバにて実行される第２のアプリケーションに送出する指示送出手段と、前記複数の部分領域のそれぞれに関する前記少なくとも１つの処理の各処理結果を前記第２のアプリケーションから受け取る受取手段と、前記各処理結果に係るテキスト情報を、前記各処理結果にそれぞれ対応する部分領域の検出位置に配置して、前記電子文書を生成する生成手段と、を備え、前記抽出手段は、前記文字領域における複数行のうちの注目行内において複数の小領域を抽出し、当該複数の小領域を互いに異なる部分領域として区別して前記複数の部分領域を抽出することを特徴とする。 According to a twenty-seventh aspect of the present invention, there is provided an image forming apparatus for generating an electronic document based on the scanned image by generating a scanned image of a document and executing the first application, and generating the scanned image. Means for extracting a plurality of partial areas from the character area of the scanned image; detecting means for detecting the positions of the plurality of partial areas in the scanned image; and at least one associated with electronic document generation An instruction indicating that processing should be performed for each of the plurality of partial areas and image data regarding each of the plurality of partial areas are transmitted from the first application to a second application executed on an external server. Instruction sending means and the at least one process for each of the plurality of partial areas Receiving means for receiving each processing result from the second application, and generating the electronic document by arranging text information related to each processing result at a detection position of a partial area corresponding to each processing result Means for extracting a plurality of small areas in a target line of the plurality of lines in the character area, distinguishing the plurality of small areas as different partial areas, and extracting the plurality of partial areas. It is characterized by extracting.

請求項１ないし請求項２７に記載の発明によれば、電子文書生成に関連する少なくとも１つの処理を第１のアプリケーションから第２のアプリケーションに対して依頼する場合において、第２のアプリケーションから取得した処理結果に係るテキスト情報を元のスキャン画像内の適切な位置に配置することが可能である。
According to the invention of claim 1 to claim 27 , when at least one process related to electronic document generation is requested from the first application to the second application, the processing is acquired from the second application. It is possible to arrange the text information related to the processing result at an appropriate position in the original scan image.

Claims

An electronic document generation system,
An image forming apparatus for generating a scanned image of a document;
An external terminal that generates an electronic document based on the scanned image received from the image forming apparatus by executing a first application;
With
The external terminal is
An acquisition means for acquiring the scanned image generated by the image forming apparatus;
Extracting means for extracting a plurality of partial regions from the character region of the scanned image;
Detection means for detecting positions of the plurality of partial regions in the scanned image,
An instruction to perform at least one process related to electronic document generation for each of the plurality of partial areas and image data for each of the plurality of partial areas are sent from the first application to the second application. Instruction sending means for
Receiving means for receiving each processing result of the at least one processing for each of the plurality of partial areas from the second application;
Generating means for generating the electronic document by arranging text information related to each processing result at a detection position of a partial area corresponding to each processing result;
I have a,
The extraction means extracts a plurality of small areas in a target line among a plurality of lines in the character area, distinguishes the plurality of small areas as different partial areas, and extracts the plurality of partial areas. Electronic document generation system.

A communication terminal that generates an electronic document based on a scanned image received from an image forming apparatus by executing a first application,
An acquisition means for acquiring the scanned image generated by the image forming apparatus;
Extracting means for extracting a plurality of partial regions from the character region of the scanned image;
Detection means for detecting positions of the plurality of partial regions in the scanned image,
An instruction to perform at least one process related to electronic document generation for each of the plurality of partial areas and image data for each of the plurality of partial areas are sent from the first application to the second application. Instruction sending means for
Receiving means for receiving each processing result of the at least one processing for each of the plurality of partial areas from the second application;
Generating means for generating the electronic document by arranging text information related to each processing result at a detection position of a partial area corresponding to each processing result;
Equipped with a,
The extraction means extracts a plurality of small areas in a target line among a plurality of lines in the character area, distinguishes the plurality of small areas as different partial areas, and extracts the plurality of partial areas. Communication terminal.

The communication terminal according to claim 2 ,
In the extraction unit, with respect to the target line of the plurality of lines in the character area, a first interval that is an interval between the target character in the target line and a character immediately after the target character is adjacent to the target line. A portion in which a small region including the character of interest and a small region including a character immediately after the character of interest are different from each other when the second interval, which is an interval between the line and the line of interest, is greater than a predetermined distance. A communication terminal that is extracted as a region.

The communication terminal according to claim 2 ,
In the extraction unit, the first interval that is the interval between the attention character in the attention line and the character immediately after the attention character with respect to the attention line of the plurality of lines in the character region is the attention character and the attention character. A small area including the character of interest and the character immediately before the character of interest, and a character immediately after the character of interest. A communication terminal characterized by extracting a small area including a partial area different from each other.

The communication terminal according to claim 2 ,
The extraction means, when the size of the attention character in the attention line and the size of the character immediately after the attention character differ by more than a predetermined level with respect to the attention row of the plurality of lines in the character area, A communication terminal, wherein a small area including a character and a small area including a character immediately after the character of interest are extracted as different partial areas.

The communication terminal according to claim 2 ,
The extraction means relates to a target line of the plurality of lines in the character area, the position in the first direction perpendicular to the line direction of the target character in the target line, and the first direction of the character immediately after the target character. A small area including the character of interest and a small area including a character immediately after the character of interest are extracted as different partial areas. Terminal.

The communication terminal according to any one of claims 2 to 6 ,
The instruction sending means generates a plurality of extended partial image data obtained by imaging each identification information of the plurality of partial areas and adding each of the plurality of partial image data, which is the image data of each of the plurality of partial areas. And sending each of the plurality of extended partial image data as image data relating to each of the plurality of partial areas from the first application to the second application,
The receiving means receives each processing result of the at least one processing related to each of the plurality of extended partial image data from the second application,
The generating means extracts each identification information from each processing result, identifies each partial area corresponding to each processing result based on each identification information, and identifies each identification information from each processing result. A communication terminal, wherein the electronic document is generated by arranging text information excluding each of the text information at the detected position of each identified partial area.

The communication terminal according to any one of claims 2 to 6 ,
The instruction sending means attaches a unique file name including identification information different from each other to each of the plurality of data files that are image data relating to each of the plurality of partial areas, and the plurality of data files are assigned to the first data file. Send from the application to the second application,
The receiving means receives each processing result of the at least one processing relating to each of the plurality of data files as a plurality of new data files from the second application,
The file names of the plurality of new data files include the identification information attached to the file name of the corresponding data file among the plurality of data files as a part thereof,
The generation means identifies each partial area corresponding to each processing result based on the identification information included in the file names of the plurality of new data files, and the text information related to each processing result is A communication terminal, wherein the electronic document is generated by being arranged at a detection position of each partial area corresponding to each processing result.

The communication terminal according to any one of claims 2 to 8 ,
The generation unit generates the electronic document by embedding text information related to the processing result for each of the plurality of partial areas in a non-display state at each detection position of the plurality of partial areas. Communication terminal.

The communication terminal according to any one of claims 2 to 8 ,
The generation unit deletes the image of the character area in the scanned image and displays text information related to the processing result for each of the plurality of partial areas in a display state at each detection position of the plurality of partial areas. A communication terminal embedded to generate the electronic document.

The communication terminal according to any one of claims 2 to 8 ,
Wherein the at least one process is a plurality of processes including the translation process with respect to the optical science character recognition process and the result of the optical character recognition process,
The generation means deletes the image of the character area in the scanned image, and the text information related to the translation result that is the processing result for each of the plurality of partial areas is detected at each detection position of the plurality of partial areas. Embedded in a display state to generate the electronic document.

The communication terminal according to any one of claims 2 to 11 ,
The communication terminal , wherein the second application is installed in an external device different from the image forming apparatus and the communication terminal.

The communication terminal according to any one of claims 2 to 11 ,
The communication terminal , wherein the second application is installed in the communication terminal together with the first application.

In a computer built in a communication terminal capable of communicating with an image forming apparatus, a program for executing generation processing of an electronic document based on a scanned image received from the image forming apparatus,
The program is stored in the computer.
a) obtaining the scanned image generated by the image forming apparatus;
b) extracting a character area or al section partial region of the scanned image,
c) detecting the position of the partial region in the scanned image;
d) An instruction to perform at least one process related to electronic document generation regarding the partial area and image data regarding the partial area are sent from the first application, which is the program, to the first application different from the first application. Sending to a second application;
e) receiving a processing result of the at least one process from the second application;
f) placing text information related to the processing result for the partial area at a detection position of the partial area;
g) generating the electronic document by performing the steps b) to f) for a plurality of partial areas in the character area of the scanned image;
Was executed,
In the step g), a plurality of small areas are extracted in a target line among a plurality of lines in the character area, and the plurality of partial areas are extracted by distinguishing the plurality of small areas as different partial areas. a program characterized by that.

The program according to claim 14 , wherein
In step b), with respect to the target line of the plurality of lines in the character area, a first interval that is an interval between the target character in the target line and the character immediately after the target character is adjacent to the target line. A small region including the character of interest and a small region including a character immediately after the character of interest are mutually greater than a second interval that is an interval between the adjacent line and the attention row A program characterized by being extracted as different partial areas.

The program according to claim 14 , wherein
In the step b), with respect to the target line of the plurality of lines in the character area, a first interval that is an interval between the target character in the target line and the character immediately after the target character is the target character and the A small area including the attention character and the character immediately before the attention character, and a space immediately after the attention character when the second interval, which is an interval between the character immediately before the attention character and the second interval, is greater than a predetermined amount. A program characterized in that a small area including a character is extracted as different partial areas.

The program according to claim 14 , wherein
In the step b), when the size of the target character in the target line and the size of the character immediately after the target character are different from each other with respect to the target line among the plurality of lines in the character region, A program characterized in that a small area including the attention character and a small area including a character immediately after the attention character are extracted as different partial areas.

The program according to claim 14 , wherein
In step b), with respect to the target line of the plurality of lines in the character area, the position in the first direction perpendicular to the line direction of the target character in the target line and the first character immediately after the target character. When the position in the direction is shifted beyond a predetermined level, the small region including the target character and the small region including the character immediately after the target character are extracted as different partial regions. Program.

The program according to any one of claims 14 to 18 ,
Said step d)
d-1) generating expanded partial image data obtained by imaging the partial region identification information to the partial image data which is image data of the partial region;
d-2) sending the expanded partial image data as image data relating to the partial area from the first application to the second application;
Have
Said step e)
e-1) receiving a processing result of the at least one processing related to the extended partial image data from the second application;
Have
Said step f)
f-1) extracting the identification information from the processing result;
f-2) identifying a corresponding partial area that is a partial area corresponding to the processing result based on the identification information;
f-3) placing text information obtained by removing the identification information from the processing result at a detection position of the corresponding partial area;
The program characterized by having.

The program according to any one of claims 14 to 18 ,
Said step d)
d-1) attaching a unique file name including different identification information to a data file of image data relating to the partial area;
d-2) sending the data file from the first application to the second application;
Have
Said step e)
e-1) receiving a processing result of the at least one processing relating to the data file as a new data file from the second application;
Have
The file name of the new data file includes the identification information attached to the file name of the data file corresponding to the new data file as a part thereof,
Said step f)
f-1) extracting the identification information from the file name of the new data file;
f-2) identifying a corresponding partial region that is a partial region corresponding to the processing result based on the identification information extracted in step f-1);
f-3) placing the text information related to the processing result at the detection position of the corresponding partial region;
The program characterized by having.

The program according to any one of claims 14 to 20 ,
In the step g), text information related to each processing result of the at least one processing relating to each of the plurality of partial areas is embedded in a non-display state at each detection position of the plurality of partial areas, and the electrons A program characterized in that a document is generated.

The program according to any one of claims 14 to 20 ,
In the step g), the image of the character area in the scanned image is deleted, and text information relating to each processing result of the at least one process regarding each of the plurality of partial areas is the plurality of partial areas The electronic document is generated by being embedded in the respective detection positions in a display state.

The program according to any one of claims 14 to 20 ,
Wherein the at least one process is a plurality of processes including the translation process with respect to the optical science character recognition process and the result of the optical character recognition process,
In the step g), the image of the character area in the scanned image is deleted, and the text information related to the translation result that is each processing result of the at least one process for each of the plurality of partial areas is The electronic document is generated by being embedded in a display state at each detection position of the partial area.

24. The program according to claim 14, wherein:
The second application is installed in an external device different from the image forming apparatus and the communication terminal.

24. The program according to claim 14, wherein:
The second application is installed in the communication terminal together with the first application.

An electronic document generation system,
An image forming apparatus that generates a scanned image of a document and generates an electronic document based on the scanned image by executing a first application;
An external server providing a service by the second application;
With
The image forming apparatus includes:
Generating means for generating the scanned image;
Extracting means for extracting a plurality of partial regions from the character region of the scanned image;
Detection means for detecting positions of the plurality of partial regions in the scanned image,
An instruction to perform at least one process related to electronic document generation for each of the plurality of partial areas and image data for each of the plurality of partial areas are sent from the first application to the second application. Instruction sending means for
Receiving means for receiving each processing result of the at least one processing for each of the plurality of partial areas from the second application;
Generating means for generating the electronic document by arranging text information related to each processing result at a detection position of a partial area corresponding to each processing result;
I have a,
The extraction means extracts a plurality of small areas in a target line among a plurality of lines in the character area, distinguishes the plurality of small areas as different partial areas, and extracts the plurality of partial areas. Electronic document generation system.

An image forming apparatus that generates a scanned image of a document and generates an electronic document based on the scanned image by executing a first application,
Generating means for generating the scanned image;
Extracting means for extracting a plurality of partial regions from the character region of the scanned image;
Detection means for detecting positions of the plurality of partial regions in the scanned image,
An instruction to execute at least one process related to electronic document generation for each of the plurality of partial areas and image data for each of the plurality of partial areas are executed from the first application on an external server. Instruction sending means for sending to the second application to be sent;
Receiving means for receiving each processing result of the at least one processing for each of the plurality of partial areas from the second application;
Generating means for generating the electronic document by arranging text information related to each processing result at a detection position of a partial area corresponding to each processing result;
Equipped with a,
The extraction means extracts a plurality of small areas in a target line among a plurality of lines in the character area, distinguishes the plurality of small areas as different partial areas, and extracts the plurality of partial areas. Image forming apparatus.