JPH10240650A

JPH10240650A - Document recognition device and recognition processing mehtod

Info

Publication number: JPH10240650A
Application number: JP9364321A
Authority: JP
Inventors: Noboru Shimizu; 昇清水; Katsuhiko Itonori; 勝彦糸乗; Norio Yamamoto; 紀夫山本
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1997-12-19
Filing date: 1997-12-19
Publication date: 1998-09-11

Abstract

PROBLEM TO BE SOLVED: To execute a processing for recognizing picture information on a character, a graphic or the like on an appropriate medium such as paper with a simple operation by providing an information processing unit storing a picture, an information processing unit reading the stored picture to convert it into code information and a network. SOLUTION: Character/picture information inputted from a scanner 2 provided for a document generation work station 1 is transferred to a mass storage device provided in a file station 3 through the network 5 to which the document generation work station 1 is connected. In a recognition work station 4, character/picture information which is thus transferred is recognizer. A recognized result is transferred/stored to/in a file work station 3. The stored recognized result is taken out by the document generation work station 1 one by one as needed. Thus, even a person who is not skilled in a recognition processing operation can easily execute the operation.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は、紙の文書に印刷され
ている文字・図形等の画像情報を認識するときの、ネッ
トワーク構成を利用した文書認識装置およびその認識処
理方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document recognition apparatus using a network configuration for recognizing image information such as characters and graphics printed on a paper document, and a recognition processing method therefor.

【０００２】[0002]

【従来の技術】適当な紙の上に印刷された文字や図形を
認識して、これらに対応する情報をワープロ等からなる
文書編集装置等に入力するための、様々な文書認識装置
に関する研究や提案が従来からなされている。このよう
な従来装置としては、下記に開示されているようなもの
を例として挙げることができる。即ち、特開昭６３−１
５５３８５号公報，特開平１−１８３７９５号公報，特
開平１−１８００８１号公報，特願平３−８１３７４
号，特願平２−３３３１７０号，特願平３−９８０１４
号等に開示されているものを挙げることができる。これ
らに開示されている文書認識装置は、文書を画像情報と
して入力するスキャナー、前記の画像情報から対応の文
字を認識する文字認識装置、この認識の結果に即して所
要の表示・編集をするための表示・編集装置等が一体と
なって構成されているものである。また、”画像処理ハ
ンドブック：ｐ．４８２”［編著者：画像処理ハンドブ
ック編集委員会編（編集委員長：尾上守夫）；発行
所：（株）昭晃堂；発行年月日：昭和６３年２月２８日
（初版２刷）］で開示された文字認識装置は、所要の通
信制御部が付加して設けられた構成のものであるが、文
字認識装置という観点からは、前記された各種の従来例
と基本的には同様な構成のものである。2. Description of the Related Art Research on various document recognition devices for recognizing characters and figures printed on appropriate paper and inputting information corresponding to the characters and graphics to a document editing device or the like comprising a word processor or the like. Proposals have been made in the past. As such a conventional device, the one disclosed below can be cited as an example. That is, JP-A-63-1
No. 55385, Japanese Patent Application Laid-Open No. 1-18395, Japanese Patent Application Laid-Open No. 1-180081, Japanese Patent Application No. 3-81374.
No., Japanese Patent Application No. 2-333170, Japanese Patent Application No. 3-98014
And the like. The document recognizing devices disclosed therein are a scanner for inputting a document as image information, a character recognizing device for recognizing a corresponding character from the image information, and perform a required display / edit according to a result of the recognition. Display / editing device and the like are integrally configured. Also, "Image Processing Handbook: p. 482" [Editor: Image Processing Handbook Editing Committee (Editorial Chair: Morio Onoe); Publisher: Shokodo Co., Ltd .; The character recognition device disclosed on March 28 (first edition, second printing)] has a configuration in which a required communication control unit is additionally provided. From the viewpoint of the character recognition device, the above-described various It has basically the same configuration as the conventional example.

【０００３】[0003]

【発明が解決しようとする課題】ところで、従来のこの
種の装置は上記された構成のものであることから、例え
ば、次のような幾つかの問題点が提起されている。即
ち、（１）：スキャナーから入力される文書情報が、対応す
る画像情報として取り込まれることから、このような画
像情報を格納しておくための記憶装置の記憶容量が大き
くなってしまう。即ち、ある１個の従来装置において多
くの文書情報を入力させるためには、記憶容量の大きい
記憶装置を用意せねばならない。（２）：スキャナーをワークステーション毎に対応して
設けなければならず、スキャナーとワークステーション
からなる従来装置が高価になってしまう。（３）：一般的には、文字認識のために要する処理量は
極めて膨大なものであり、このために、文字の認識処理
がソフトウエアで実現されているようなときには、この
文字の認識処理が行われているときに、他の処理の中断
やその優先順位の低下をさせねばならず、このような従
来装置における操作者の業務効率が著しく低下してしま
う。（４）：文字の認識処理がハードウエアで実現されてい
るとしても、そのための装置自体が高価なものであるこ
とから、このような装置を組み込んだ従来装置も極めて
高価なものになる。（５）：また、従来装置の操作者は、上記のような認識
装置を用いるときには、認識処理のための知識を新しく
身につけることが必要であって、いわゆる初心者による
使用は極めて困難である。However, since this type of conventional device has the above-described configuration, there are the following problems, for example. (1): Since the document information input from the scanner is taken in as the corresponding image information, the storage capacity of the storage device for storing such image information increases. That is, in order to input a large amount of document information in one conventional device, a storage device having a large storage capacity must be prepared. (2): A scanner must be provided for each workstation, and the conventional apparatus including the scanner and the workstation becomes expensive. (3): Generally, the amount of processing required for character recognition is extremely enormous. For this reason, when character recognition processing is realized by software, this character recognition processing is performed. When the process is being performed, other processes must be interrupted or their priorities must be lowered, and the work efficiency of the operator in such a conventional apparatus is significantly reduced. (4): Even if the character recognition processing is realized by hardware, the device itself for that purpose is expensive, so that the conventional device incorporating such a device becomes extremely expensive. (5) Also, when using the above-described recognition apparatus, the operator of the conventional apparatus needs to acquire new knowledge for recognition processing, and it is extremely difficult for a so-called beginner to use the apparatus. .

【０００４】この発明は、前述されたような各種の問題
点を解決するためになされたものである。即ち、複数個
のワークステーションをネットワーク状に接続して構成
した文書認識装置において、これに含まれている大容量
の記憶機能を備えたファイル用ステーションに文書画像
を格納するようにされており、このために、個々のワー
クステーション毎に大容量の記憶装置を備えることが不
要になる。そして、前記複数個のワークステーションの
中のある１個にスキャナーを設けておき、このワークス
テーションを入力専用のもの（即ち、操作者によって実
際に操作されるもの）とすれば、高価なスキャナーをワ
ークステーション毎に備えることが不要になり、個々の
ワークステーションが高価になることを抑制できる。ま
た、前記複数個のワークステーションの中のある１個の
ものに文字認識の処理機能を備えるようにされており、
このために、操作者によって実際に操作されるワークス
テーションへの負荷がそれに応じて軽減されるととも
に、個々のワークステーションに文字認識のための処理
装置を設けることが不要になる。更に、操作者によって
実際に操作されているときにも、文字認識のための処理
装置を用いているという意識が排除されて、初心者であ
っても容易に操作することができる。The present invention has been made to solve the various problems described above. That is, in a document recognition device configured by connecting a plurality of workstations in a network, a document image is stored in a file station provided with a large-capacity storage function included therein, Therefore, it is not necessary to provide a large-capacity storage device for each workstation. If a scanner is provided in one of the plurality of workstations, and this workstation is dedicated to input (that is, one that is actually operated by an operator), an expensive scanner can be used. It is not necessary to provide for each workstation, and it is possible to suppress the cost of each workstation. Further, one of the plurality of workstations is provided with a character recognition processing function,
Therefore, the load on the workstation actually operated by the operator is correspondingly reduced, and it is not necessary to provide a processing device for character recognition in each workstation. Further, even when the operator is actually operating, the consciousness of using the processing device for character recognition is eliminated, and even a novice user can easily operate.

【０００５】[0005]

【課題を解決するための手段】この発明に係る認識シス
テムは、画像を入力する画像入力手段が接続され、この
画像入力手段によって読み取られた画像を記憶する記憶
手段を具備した少なくとも一つの第１の情報処理機器
（ワークステーション）と、前記記憶手段に記憶されて
いる画像を読み出してコード情報に変換する変換手段を
具備した第２の情報処理機器（ワークステーション）
と、前記第１の情報処理機器と前記第２の情報処理機器
とを相互接続するネットワークと、から構成されること
を特徴とするものである。また、この発明に係る文書認
識装置は、スキャナーを接続している文書作成用ワーク
ステーションと、画像情報を蓄積するファイル用ステー
ションと、文書画像を認識する認識用ワークステーショ
ンとをネットワークをもって接続して構成されているも
のであって：文書作成用ワークステーションにおける、
スキャナーを用いて文書画像情報を読み込むための画像
入力部と；文書作成用ワークステーションにおける、文
書画像情報をファイルとしてファイル用ステーションに
蓄積するための画像転送部と；認識用ワークステーショ
ンにおける、ファイル用ステーションから文書画像情報
を取り出すための画像取り出し部と；認識用ワークステ
ーションにおける、文書画像情報を認識して対応のコー
ド情報に変換するための認識・変換部と；認識用ワーク
ステーションにおける、前記認識・変換の結果に基づく
前記コード情報をファイル用ステーションに転送するた
めの認識結果転送部と；を備えてなることを特徴とする
ものである。更に、この発明に係る文書認識装置のため
の文書認識処理方法は：認識用ワークステーション内の
認識・変換部によって所定の認識処理を開始する際に、
現に処理中の文書画像ファイルの有無を確認し、この確
認の結果に依存して前記認識処理を実行すること；認識
済みの文書画像ファイルの名称およびその作成日時と、
認識対象候補としての文書画像ファイルの名称およびそ
の作成日時とをそれぞれに比較し、これらに相違がある
認識対象候補としての文書画像ファイルを実際の認識対
象文書画像ファイルとして選択すること；および、認識
を所望する文書画像ファイルについては、これををファ
イル用ステーションの適所に転記し、その後に所要の認
識処理を実行すること；を特徴とするものである。この
発明による別の文書認識装置は、電子メイル送信用ワー
クステーションを更に付加して構成されたものである。A recognition system according to the present invention is connected to image input means for inputting an image, and includes at least one first storage means for storing an image read by the image input means. Second information processing apparatus (workstation) comprising: an information processing apparatus (workstation); and a conversion unit that reads an image stored in the storage unit and converts the image into code information.
And a network interconnecting the first information processing device and the second information processing device. Further, the document recognition apparatus according to the present invention connects a document creation workstation to which a scanner is connected, a file station for storing image information, and a recognition workstation for recognizing a document image by a network. Comprising: at a document creation workstation,
An image input unit for reading document image information by using a scanner; an image transfer unit for storing document image information as a file in a file station in a document creation workstation; and a file in a recognition workstation. An image extraction unit for extracting document image information from a station; a recognition / conversion unit for recognizing document image information and converting it into corresponding code information in a recognition workstation; and the recognition in a recognition workstation. A recognition result transfer unit for transferring the code information based on the result of the conversion to a file station. Further, a document recognition processing method for a document recognition apparatus according to the present invention includes the steps of:
Confirming the presence or absence of a document image file that is currently being processed, and executing the recognition process depending on the result of this confirmation; the name of the recognized document image file and the date and time of its creation;
Comparing the name of a document image file as a recognition target candidate and the date and time of its creation, and selecting a document image file as a recognition target candidate having a difference between them as an actual recognition target document image file; Is transferred to an appropriate location of the file station, and then a required recognition process is executed. Another document recognition device according to the present invention is configured by further adding an electronic mail transmission workstation.

【０００６】[0006]

【作用】の発明によれば、文書作成用ワークステーショ
ンに備えられているスキャナーから入力された文字画像
情報が、当該文書作成用ワークステーションが接続され
ているネットワークを介して、ファイル用ステーション
内に備えられている大容量の記憶装置に転記される。認
識用ワークステーションにおいては、このようにして転
記された文字画像情報の認識がなされ、この認識の結果
が前記ファイル用ステーション側に転送・格納される。
そして、このようにして格納された認識結果が、必要に
応じて、文書作成用ワークステーションによって個々に
取り出される。このために、スキャナー、大容量の記憶
装置、および、認識用のソフトウエア（またはハードウ
エア）を、個々の文書作成用ワークステーションに設け
ることが不要になる。かくして、装置の全体としてのコ
ストが低減されるとともに、操作者が直接使用する文書
作成用ワークステーションに特別なハードウエアやソフ
トウエアを備えることが不要であることから、認識処理
操作に不慣れな者でも容易に操作をすることが可能にさ
れる。また、この発明によれば、認識対象としての文書
情報に対する認識結果が電子メイルとして操作者に送信
するようにされているために、操作者は、前記の認識結
果が送信されるのを待機しながら、別の作業をしている
ことができる。更に、この発明によれば、認識を所望す
る領域の指定が、文書作成用ワークステーションの操作
者によって実行できるようにされているために、例え
ば、表・図形混在の文書や段組のある文書であっても認
識対象として扱うことが可能である。According to the invention, the character image information input from the scanner provided in the document creation workstation is stored in the file station via the network to which the document creation workstation is connected. The data is transcribed to the provided large-capacity storage device. The recognition workstation recognizes the character image information transcribed in this way, and the result of the recognition is transferred and stored to the file station.
The recognition results stored in this way are individually retrieved by the document creation workstation as needed. This eliminates the need for a scanner, a large-capacity storage device, and recognition software (or hardware) at each document creation workstation. Thus, the overall cost of the apparatus is reduced, and there is no need to provide special hardware and software at the document creation workstation directly used by the operator. However, the operation can be easily performed. Further, according to the present invention, since the recognition result for the document information as the recognition target is transmitted to the operator as an electronic mail, the operator waits for the transmission of the recognition result. While you can do other work. Further, according to the present invention, since the designation of the area desired to be recognized can be executed by the operator of the document creation workstation, for example, a document having a mixture of tables and figures or a document having columns is used. Can be treated as a recognition target.

【０００７】[0007]

【実施例】図１は、この発明の第１の実施例に係るネッ
トワーク構成の文書認識装置を示す概略的な構成図であ
る。この図１の第１の実施例におけるネットワーク構成
の文書認識装置に含まれるものは、操作者によって直接
的に操作される文書作成用ワークステーション１、この
文書作成用ワークステーション１に専属的に接続されて
いるスキャナー２、大容量の記憶装置（図示されない）
が備えられているファイル用ステーション３、前記ファ
イル用ステーション３内の記憶装置に格納されている文
書画像情報を認識するための認識用ワークステーション
４であり、これらの手段はネットワーク５を介して互い
に接続されている。また、図２は、上記第１の実施例に
おける認識処理を機能的に説明するための、概略的な機
能的構成図である。図３は、上記第１の実施例における
ファイル用ステーション３内の記憶装置の機能的な説明
図である。図４は、上記第１の実施例における認識済み
登録ファイルの例示図である。図５は、上記第１の実施
例における認識処理の態様を示すフローチャートであ
る。以下、図１ないし図５を適宜に参照しながら、この
発明の第１の実施例について説明する。FIG. 1 is a schematic configuration diagram showing a network-structured document recognition apparatus according to a first embodiment of the present invention. The network-structured document recognition apparatus of the first embodiment shown in FIG. 1 includes a document creation workstation 1 directly operated by an operator, and is exclusively connected to the document creation workstation 1. Scanner 2, large-capacity storage device (not shown)
And a recognition workstation 4 for recognizing document image information stored in a storage device in the file station 3. These means are mutually connected via a network 5. It is connected. FIG. 2 is a schematic functional configuration diagram for functionally explaining the recognition processing in the first embodiment. FIG. 3 is a functional explanatory diagram of the storage device in the file station 3 in the first embodiment. FIG. 4 is a view showing an example of a recognized registration file in the first embodiment. FIG. 5 is a flowchart showing the mode of the recognition processing in the first embodiment. Hereinafter, a first embodiment of the present invention will be described with reference to FIGS.

【０００８】操作者は文書作成用ワークステーション１
を操作しており、その画像入力部１１の処理を施し、専
属のスキャナー２を働かせ、適当な紙等の媒体に記載さ
れている所要の文書情報を対応の画像情報としてデジタ
ル的に入力させる。このときに、先の文書情報に対応す
る原画像情報が文書作成用ワークステーション１内の所
定の記憶装置（図示されない）に一旦格納されるが、次
のような作業をすることもできる。即ち、文書作成用ワ
ークステーション１内の制御／操作部１４により対象の
画像を適当な表示装置１４１に表示して、この画像が正
規の入力画像であるか否か、適当な画質を有するもので
あるか否か等の確認を操作者にさせてから、これを再入
力させることもできる。[0008] The operator is a workstation 1 for creating a document.
Is processed by the image input unit 11 and the dedicated scanner 2 is operated to digitally input required document information described on a suitable medium such as paper as corresponding image information. At this time, the original image information corresponding to the previous document information is temporarily stored in a predetermined storage device (not shown) in the document creation workstation 1, but the following operation can be performed. That is, a target image is displayed on an appropriate display device 141 by the control / operation unit 14 in the document creation workstation 1, and whether or not this image is a regular input image has an appropriate image quality. It is also possible to make the operator confirm whether or not there is, and then re-enter the confirmation.

【０００９】文書作成用ワークステーション１内の画像
転送部１２においては、スキャナー２から入力された認
識対象としての文書画像を、ネットワーク５を介して、
ファイル用ステーション３内の画像蓄積部３１に転記す
る。このファイル用ステーション３内の所要の記憶容量
を有する記憶装置は、図３に示されているように、認識
対象としての文書画像を格納するための前記画像蓄積部
３１と、認識の結果を格納するための認識結果蓄積部３
２とに分割されており、目的に応じて適宜に使い分けら
れている。ここで、文書作成用ワークステーション１の
操作者によってなされる具体的な操作は、例えば次の通
りである。いま、認識対象としての文書画像に関する情
報が、表示装置１４１の表示画面に、対応する適当なア
イコンとして表示されているとすると、操作者はキーボ
ードのような適当な入力装置１４２、または、マウスの
ような適当なポインティング・デバイス１４３を用いて
その選択を行う。そして、選択されている文書画像のア
イコンを、これも対応のアイコンとして表示装置１４１
の表示画面に表示されている画像蓄積部３１へ転記を行
う。このようにすることで、ある所望の認識対象として
の文書画像を、ファイル用ステーション３における記憶
装置内の画像蓄積部３１に転記することができる。な
お、このような操作者による操作指示は、制御／操作部
１４を介して画像転送部１２に伝えられるものである。In an image transfer section 12 in the document creation workstation 1, a document image as a recognition target input from the scanner 2 is transmitted via a network 5.
The data is transferred to the image storage unit 31 in the file station 3. As shown in FIG. 3, a storage device having a required storage capacity in the file station 3 stores the image storage unit 31 for storing a document image as a recognition target, and stores the recognition result. Result storage unit 3 for performing
And is appropriately used depending on the purpose. Here, a specific operation performed by the operator of the document creation workstation 1 is, for example, as follows. Assuming that information on a document image to be recognized is displayed on the display screen of the display device 141 as a corresponding appropriate icon, the operator can use a suitable input device 142 such as a keyboard or a mouse. The selection is made using an appropriate pointing device 143 as described above. Then, the icon of the selected document image is displayed on the display device 141 as a corresponding icon.
Is transcribed to the image storage unit 31 displayed on the display screen. In this way, a document image as a desired recognition target can be transferred to the image storage unit 31 in the storage device in the file station 3. Such an operation instruction from the operator is transmitted to the image transfer unit 12 via the control / operation unit 14.

【００１０】次に、ファイル用ステーション３における
記憶装置内の画像蓄積部３１に転記されて格納されてい
る認識対象としての文書画像は、認識用ワークステーシ
ョン４によって認識される。図５には、ここでの認識処
理の態様がフローチャートとして示されている。なお、
ここで示されている認識処理は、例えば毎時０分に処理
が開始されるような、ある一定の時刻に開始される処理
であるが、いうまでもなく、その開始時刻や処理の操作
回数は任意に選択・設定することができる。いま、ある
所定の認識処理開始の指示が出されたとすると、まず、
認識処理中のファイルとしての認識処理ファイルが存在
するか否かが確認される（ステップ４２０）。このステ
ップ４２０が必要な理由は次の通りである。即ち、前述
されたように、ここで例示されている処理は、ある一定
の時刻に開始される（例えば、毎時０分に開始される）
処理であるとされているために、ある１個の対象に対す
る認識処理に多くの時間（例えば、１時間よりも長い時
間）がかかっている場合には、その認識処理の終了に先
だって次回の認識処理が始まってしまうことがある。こ
のような場合には、同じ対象に対する認識処理が平行し
て二重に行われて、それだけ余分の処理がなされること
になる。このような理由のために、前述のようなステッ
プ４２０の確認ステップを設けておき、認識処理中ファ
イルの存在が確認されたとき（確認の結果がＹであった
とき）には、認識処理の終了が指示されて、既に認識処
理が開始されている文書画像に対する二重の認識処理の
開始が防止される。前記のステップ４２０において、認
識処理中ファイルの不存在が確認されたとき（確認の結
果がＮであったとき）には、ステップ４２１において処
理中ファイルの作成がなされる。これに続けて、認識用
ワークステーション４（図２）における画像取り出し部
４１の働きにより、ファイル用ステーション３における
記憶装置内の画像蓄積部３１から、ネットワーク５を介
して、全ての認識対象としての文書画像に対応する情報
を取り出し、認識用ワークステーション４内の図示され
ない記憶装置に取り込む（ステップ４２２）ようにされ
る。Next, the document image to be recognized, which is transcribed and stored in the image storage unit 31 in the storage device in the file station 3, is recognized by the recognition workstation 4. FIG. 5 shows a form of the recognition processing here as a flowchart. In addition,
The recognition process shown here is a process that is started at a certain time, for example, the process is started at 0 minutes every hour. However, it is needless to say that the start time and the number of operations of the process are It can be arbitrarily selected and set. Now, if an instruction to start a predetermined recognition process is issued, first,
It is confirmed whether or not there is a recognition processing file as a file undergoing the recognition processing (step 420). The reason why this step 420 is necessary is as follows. That is, as described above, the processing illustrated here is started at a certain time (for example, started at 0 minute every hour).
When the recognition processing for a certain object takes a long time (for example, a time longer than one hour) because the processing is regarded as the processing, the next recognition processing is performed prior to the end of the recognition processing. Processing may start. In such a case, the recognition processing for the same object is performed in parallel and doubly, and extra processing is performed accordingly. For such a reason, a confirmation step of step 420 as described above is provided, and when the existence of the file being recognized is confirmed (when the confirmation result is Y), the recognition processing is performed. The end is instructed, and the start of double recognition processing for a document image for which recognition processing has already been started is prevented. When it is confirmed in step 420 that the file under recognition processing does not exist (when the result of the confirmation is N), a file under processing is created in step 421. Subsequently, by the operation of the image extracting unit 41 in the recognition workstation 4 (FIG. 2), all the images to be recognized are recognized from the image storage unit 31 in the storage device in the file station 3 via the network 5. The information corresponding to the document image is taken out and taken into a storage device (not shown) in the recognition workstation 4 (step 422).

【００１１】ここでは、転送される文書画像について、
順次に１枚ずつ認識処理の対象とされるが、まず、転送
される文書画像が存在するか否かの確認がなされる（ス
テップ４２３）。この転送される文書画像の存在が確認
されたとき（確認の結果がＹであったとき）には、次に
続くステップ４２４において、該当の転送される文書画
像が既に認識済みであるか否かの確認がなされる。ここ
で、認識済みであるか否かの確認は次のようにしてなさ
れる。即ち、後述される登録のステップ４２７における
ように、ある所定の対象に対する認識処理が終了した時
点においては、対応の認識済み登録ファイル（これは、
図４に例示されている）に対して、転送される文書画像
のファイル名とその作成日時（この作成日時は、スキャ
ナーで読み込まれた日時である）とを登録するようにさ
れている。ここで図４を参照すると、Ｉｔｏｎｏｒｉ
（ＮＹａｍａｍｏｔｏ，ＮＳｈｉｍｉｚｕ）が文書画
像のファイル名として登録されており、また「Ｄｅｃ
２６１９：５０（Ｊａｎ１０１９：５０，Ｊａｎ
１１０９：３６）」が作成日時として対応してい
る。そこで、前回までに前記認識済み登録ファイル内で
登録された認識済みのファイル名とその作成日時とが同
一のものであるときには、これは既に認識済みである
（確認の結果がＹであったとき）として、前段のステッ
プ４２３に戻るようにされている。ここで、登録された
認識済みのファイル名およびその作成日時を比較の対象
にするのは次の理由による。即ち、例えば、２個の互い
に異なる内容の文書画像のファイルがあるときに、両者
に対して全く同じファイル名をつけられることがある
が、登録された認識済みのファイル名だけでこれらを比
較したときには、認識処理の対象から排除されてしまう
ことになる。このような事態が生じることを防止するた
めには、登録された認識済みのファイル名に加えて、そ
の作成日時をも比較の対象にすることにより、両者の同
一性のいかんを確認するようにされている。いま、ステ
ップ４２４における確認の結果として、該当の転送され
た文書画像が未だ認識されていないものであるとされた
ときには、これに続くステップ４２５において、認識用
ワークステーション４における認識・変換部４２の働き
により所要の認識処理が行われて、その認識の結果に対
応する所定のコード情報の変換・生成がなされる。この
認識のステップ４２５においては、認識対象の中の文字
領域を抽出して、そこでの文字認識がなされる。ここで
の文字の認識のやり方として特定されるものはないが、
例えば、特願平３−１９５１００号に開示されたやり方
等を用いることができる。これに次いで、前記された文
字認識の結果である所定のコード情報が、認識用ワーク
ステーション４における認識結果転送部４３の働きによ
り、ネットワーク５を介して、ファイル用ステーション
３における記憶装置内の認識結果蓄積部３２に対して転
送される（ステップ４２６）。これに続くステップ４２
７においては、前述されたように、認識対象としての文
書画像情報ファイルのファイル名およびその作成日時
が、認識済み登録ファイルに登録される。そして、この
ようにして登録がなされた認識対象としての文書画像情
報が消去される（ステップ４２８）。Here, regarding the transferred document image,
The documents are sequentially subjected to the recognition processing one by one. First, it is confirmed whether or not there is a document image to be transferred (step 423). When the existence of the transferred document image is confirmed (when the result of the confirmation is Y), in the following step 424, it is determined whether or not the corresponding transferred document image has already been recognized. Is confirmed. Here, whether or not the recognition has been completed is performed as follows. That is, as in step 427 of registration, which will be described later, when the recognition process for a given target is completed, the corresponding recognized registration file (which is
4, the file name of the transferred document image and the creation date and time (the creation date and time is the date and time read by the scanner) are registered. Referring now to FIG.
(NYamamoto, NShimizu) is registered as the file name of the document image, and "Dec.
26 19:50 (Jan 10 19:50, Jan
11 09:36) "corresponds to the creation date and time. Therefore, when the recognized file name registered in the recognized registration file and the creation date and time are the same before the last time, this is already recognized (when the confirmation result is Y, ), The process returns to the previous step 423. Here, the registered recognized file names and their creation dates and times are compared for the following reasons. That is, for example, when there are two document image files having different contents, they may be given exactly the same file name, but these are compared only with the registered recognized file names. At times, it is excluded from the target of the recognition processing. In order to prevent such a situation from happening, in addition to the registered recognized file name, the created date and time are also compared, so that it is necessary to confirm the identity of the two. Have been. Now, as a result of the confirmation in step 424, when it is determined that the transferred document image has not been recognized yet, in step 425 following this, the recognition / conversion unit 42 of the recognition workstation 4 A required recognition process is performed by the operation, and predetermined code information corresponding to the result of the recognition is converted and generated. In the recognition step 425, a character area in the recognition target is extracted, and character recognition is performed there. There is no specific method of character recognition here,
For example, the method disclosed in Japanese Patent Application No. 3-195100 can be used. Subsequently, the predetermined code information, which is the result of the above-described character recognition, is transferred to the file station 3 in the storage device via the network 5 by the operation of the recognition result transfer unit 43 in the recognition workstation 4. The data is transferred to the result storage unit 32 (step 426). Subsequent step 42
At 7, the file name of the document image information file to be recognized and its creation date and time are registered in the recognized registration file, as described above. Then, the document image information registered as a recognition target as described above is deleted (step 428).

【００１２】認識対象としての転送画像が存在する限り
は、前記と同じ処理が繰り返される。そして、全ての転
送画像に対する処理が終了したとき（ステップ４２３に
おける処理の結果がＮであったとき）には、処理中ファ
イルの削除を行って（ステップ４２９）、新しい認識処
理に対処できるようにされる。As long as there is a transfer image to be recognized, the same processing as described above is repeated. Then, when the processing for all the transfer images is completed (when the result of the processing in step 423 is N), the file being processed is deleted (step 429) so that new recognition processing can be dealt with. Is done.

【００１３】認識対象画像がファイル用ステーション３
における記憶装置に転記されてから適当な時間が経過し
たときに、操作者は、前記ファイル用ステーション３に
おける記憶装置内の認識結果蓄積部３２の点検をするこ
とにより、所望の認識結果が格納されているか否かの確
認をする。所望の認識結果の格納が確認されたときに
は、文書作成用ワークステーション１における認識結果
取り出し部１３の働きにより、前記所望の認識結果を取
り出して、文書作成のために用いるようにされる。The image to be recognized is a file station 3
When an appropriate time has elapsed since the data was transcribed to the storage device, the operator checks the recognition result storage unit 32 in the storage device in the file station 3 so that the desired recognition result is stored. Check if it is. When the storage of the desired recognition result is confirmed, the desired recognition result is extracted and used for document creation by the operation of the recognition result extracting unit 13 in the document creation workstation 1.

【００１４】この発明の第１の実施例について詳細に説
明してきたけれども、以下のような各種の変更ないし修
正をすることができる。（１）：上記第１の実施例においては、スキャナーを備
えた文書作成用ワークステーション、大容量の記憶装置
を備えたファイル用ステーション、および、認識用ワー
クステーションが１個ずつネットワークに接続して構成
されているけれども、このような構成に限られるもので
はなく、必要であれば複数個のものを接続して構成させ
ることができる。また、複数個の機能をある１個のワー
クステーションに持たせることもできる。（２）：上記第１の実施例においては、画像情報の読み
込みとその認識結果の取り込みとの双方が、１個の文書
作成用ワークステーションだけで行われているけれど
も、これらの作業を別のワークステーションで行うこと
ができる。例えば、画像情報の入力専用のワークステー
ションを１個ネットワークに接続しておいて、この専用
ワークステーションを関係の操作者全員が使用し、認識
結果を各操作者自身のワークステーションにおいて用い
ることができる。（３）：上記第１の実施例においては、ファイル用ステ
ーション３内の記憶装置を画像蓄積部３１と認識結果蓄
積部３２とに分割するようにされているが、これに限ら
れるものではなく、これを単一にまとめて、画像とその
認識の結果とを同一の蓄積部に蓄積させることができ
る。なお、この場合には、認識処理の終了した画像を消
去させることもできる。（４）：上記第１の実施例においては、認識対象として
の文書画像に関する情報をスキャナーから直接入力する
ようにされているけれども、これに限らず、例えば、光
ディスクファイルのような記録媒体に一旦記録されてい
るものの中から、再利用したい文書画像を検索して認識
させることもできる。（５）：更に、上記第１の実施例においては、文字の認
識をすることを認識処理の例として挙げているけれど
も、これに限られるものではなく、対象としての図形領
域について、対応のベクトル情報に変換するラスター／
ベクトル変換機能、基本的な図形（例えば、三角形、四
角形等の）として認識する図形認識機能、または、文字
／図形が混在する文書を対象とする文書認識機能を持た
せるようにすることもできる。Although the first embodiment of the present invention has been described in detail, various changes and modifications as described below can be made. (1) In the first embodiment, one document creation workstation with a scanner, one file station with a large-capacity storage device, and one recognition workstation are connected to the network. Although it is configured, it is not limited to such a configuration, and a plurality of components can be connected and configured if necessary. Further, a single workstation having a plurality of functions can be provided. (2) In the first embodiment, both reading of image information and taking in of the recognition result are performed by only one document creation workstation. Can be done at the workstation. For example, one workstation dedicated to inputting image information may be connected to the network, and this dedicated workstation may be used by all operators involved, and the recognition result may be used at each operator's own workstation. . (3) In the first embodiment, the storage device in the file station 3 is divided into the image storage unit 31 and the recognition result storage unit 32. However, the present invention is not limited to this. , And the image and the result of its recognition can be stored in the same storage unit. In this case, the image for which the recognition processing has been completed can be deleted. (4) In the first embodiment, the information about the document image to be recognized is directly input from the scanner. However, the present invention is not limited to this. For example, the information is temporarily stored in a recording medium such as an optical disk file. It is also possible to search and recognize a document image to be reused from recorded ones. (5) Further, in the first embodiment, the recognition of characters is described as an example of the recognition processing. However, the present invention is not limited to this. Raster to convert to information /
A vector conversion function, a graphic recognition function for recognizing a basic figure (for example, a triangle or a quadrangle), or a document recognition function for a document in which characters / graphics are mixed may be provided.

【００１５】図６は、この発明の第２の実施例に係るネ
ットワーク構成の文書認識装置を示す概略的な構成図で
ある。この図６の第２の実施例におけるネットワーク構
成の文書認識装置に含まれるものは、操作者によって直
接的に操作される文書作成用ワークステーション１、こ
の文書作成用ワークステーション１に専属的に接続され
ているスキャナー２、大容量の記憶装置（図示されな
い）が備えられているファイル用ステーション３、前記
ファイル用ステーション３内の記憶装置に格納されてい
る文書画像情報を認識するための認識用ワークステーシ
ョン４、および、ある所定のコマンドに応じて所要の電
子的な情報を指定の受け入れ先に転送するための電子メ
イル送信用ワークステーション６であって、これらの手
段はネットワーク５を介して互いに接続されている。ま
た、図７は、上記第２の実施例における認識処理を機能
的に説明するための、概略的な機能的構成図である。そ
して、図８は、上記第２の実施例における電子メイル送
信用ワークステーション６の動作の態様を例示するため
のフローチャートである。なお、上記第２の実施例にお
けるファイル用ステーション３内の記憶装置は、その機
能的な面からは、前述された第１の実施例のそれと同様
であるから、必要に応じて前記第１の実施例のための説
明図としての図３を参照する。また、上記第２の実施例
における認識済み登録ファイルについても、前述された
第１の実施例のそれと同様であるから、必要に応じて前
記第１の実施例のための例示図としての図４を参照す
る。更に、上記第２の実施例における認識処理の態様に
ついても、前述された第１の実施例のそれと同様である
から、必要に応じて前記第１の実施例のためのフローチ
ャートとしての図５を参照する。以下、図６ないし図８
および前述された第１の実施例に関する図３ないし図５
を適宜に参照しながら、この発明の第２の実施例につい
て説明する。FIG. 6 is a schematic block diagram showing a network-structured document recognition apparatus according to a second embodiment of the present invention. The network-structured document recognition apparatus in the second embodiment shown in FIG. 6 includes a document creation workstation 1 directly operated by an operator, and is exclusively connected to the document creation workstation 1. Scanner 2, a file station 3 provided with a large-capacity storage device (not shown), and a recognition work for recognizing document image information stored in a storage device in the file station 3. A station 4 and a workstation 6 for transmitting e-mail required electronic information to a designated destination in response to a predetermined command, these means being connected to each other via a network 5 Have been. FIG. 7 is a schematic functional configuration diagram for functionally explaining the recognition processing in the second embodiment. FIG. 8 is a flowchart for illustrating an operation mode of the electronic mail transmitting workstation 6 in the second embodiment. The storage device in the file station 3 in the second embodiment is similar in function to that of the first embodiment described above. Please refer to FIG. 3 as an explanatory diagram for the embodiment. The recognized registration file in the second embodiment is also the same as that of the first embodiment described above. Therefore, if necessary, FIG. 4 as an illustration for the first embodiment is used. See Further, the mode of the recognition processing in the second embodiment is also the same as that of the first embodiment described above. Therefore, FIG. 5 as a flowchart for the first embodiment may be used as needed. refer. Hereinafter, FIGS. 6 to 8
3 to 5 relating to the first embodiment described above.
A second embodiment of the present invention will be described with reference to FIG.

【００１６】操作者は自己の文書作成用ワークステーシ
ョン１を操作しており、その画像入力部１１の処理を施
し、専属のスキャナー２を働かせ、適当な紙等の媒体に
記載されている所要の文書情報を対応の画像情報として
デジタル的に入力させる。このときに、先の文書情報に
対応する原画像情報が文書作成用ワークステーション１
内の所定の記憶装置（図示されない）に一旦格納される
が、次のような作業をすることもできる。即ち、文書作
成用ワークステーション１内の制御／操作部１４により
対象の画像を適当な表示装置１４１に表示して、この画
像が正規の入力画像であるか否か、適当な画質を有する
ものであるか否か等の確認を操作者にさせてから、これ
を再入力させることもできる。The operator operates his or her own document creation workstation 1, processes the image input unit 11 thereof, activates the dedicated scanner 2, and operates a dedicated scanner 2 so that the required information written on a suitable medium such as paper is written. The document information is digitally input as the corresponding image information. At this time, the original image information corresponding to the previous document information is stored in the document creation workstation 1.
Is temporarily stored in a predetermined storage device (not shown), but the following operation can also be performed. That is, a target image is displayed on an appropriate display device 141 by the control / operation unit 14 in the document creation workstation 1, and whether or not this image is a regular input image has an appropriate image quality. It is also possible to make the operator confirm whether or not there is, and then re-enter the confirmation.

【００１７】文書作成用ワークステーション１内の画像
転送部１２においては、スキャナー２から入力された認
識対象としての文書画像を、ネットワーク５を介して、
ファイル用ステーション３内の画像蓄積部３１に転記す
る。このファイル用ステーション３内の所要の記憶容量
を有する記憶装置は、図３に示されているように、認識
対象としての文書画像を格納するための前記画像蓄積部
３１と、認識の結果を格納するための認識結果蓄積部３
２とに分割されており、目的に応じて適宜に使い分けら
れている。なお、ここでの認識結果蓄積部３２には、転
送処理に失敗したときの認識結果を蓄積しておく作用を
するものである。ここに、文書作成用ワークステーショ
ン１の操作者によってなされる具体的な操作は、例え
ば、次の通りである。いま、認識対象としての文書画像
に関する情報が、表示装置１４１の表示画面に、対応す
る適当なアイコンとして表示されているとすると、操作
者はキーボードのような入力装置１４２、または、マウ
スのような適当なポインティング・デバイス１４３を用
いてその選択を行う。そして、選択されている文書画像
のアイコンを、これも対応のアイコンとして表示装置１
４１の表示画面に表示されている画像蓄積部３１へ転記
を行う。このようにすることで、ある所望の認識対象と
しての文書画像を、ファイル用ステーション３における
記憶装置内の画像蓄積部３１に転記することができる。
なお、このような操作者による操作指示は、制御／操作
部１４を介して画像転送部１２に伝えられるものであ
る。In the image transfer unit 12 in the document creation workstation 1, a document image to be recognized input from the scanner 2 is transmitted via the network 5.
The data is transferred to the image storage unit 31 in the file station 3. As shown in FIG. 3, a storage device having a required storage capacity in the file station 3 stores the image storage unit 31 for storing a document image as a recognition target, and stores the recognition result. Result storage unit 3 for performing
And is appropriately used depending on the purpose. Note that the recognition result accumulation unit 32 functions to accumulate the recognition result when the transfer process has failed. Here, specific operations performed by the operator of the document creation workstation 1 are, for example, as follows. Assuming that information on a document image to be recognized is displayed on the display screen of the display device 141 as a corresponding appropriate icon, the operator can use the input device 142 such as a keyboard or the mouse such as a mouse. The selection is made using an appropriate pointing device 143. Then, the icon of the selected document image is displayed on the display device 1 as a corresponding icon.
The data is transcribed to the image storage unit 31 displayed on the display screen 41. In this way, a document image as a desired recognition target can be transferred to the image storage unit 31 in the storage device in the file station 3.
Such an operation instruction from the operator is transmitted to the image transfer unit 12 via the control / operation unit 14.

【００１８】次に、ファイル用ステーション３における
記憶装置内の画像蓄積部３１に転記されて格納されてい
る認識対象としての文書画像は、認識用ワークステーシ
ョン４によって認識される。図５には、ここでの認識処
理の態様がフローチャートとして示されている。なお、
ここで示されている処理は、例えば毎時０分に処理が開
始されるような、ある一定の時刻に開始される処理であ
るが、その開始時刻や処理の操作回数を任意に選択・設
定できることは勿論である。いま、ある所定の認識処理
開始の指示が出されたとすると、まず、認識処理中のフ
ァイルとしての認識処理中ファイルの存否のいかんが確
認される（ステップ４２０）。このステップ４２０が必
要な理由は次の通りである。即ち、前述されたように、
ここで例示されている処理は、ある一定の時刻に開始さ
れる（例えば、毎時０分に開始される）処理であるとさ
れているために、ある１個の対象に対する認識処理に多
くの時間（例えば、１時間よりも長い時間）がかかって
いる場合には、その認識処理の終了に先だって次回の認
識処理が始まってしまうことがある。このような場合に
は、同じ対象に対する認識処理が平行して二重に行われ
て、それだけ余分の処理がなされることになる。このよ
うな理由のために、前述のステップ４２０のような確認
ステップを設けておき、認識処理中ファイルの存在が確
認されたとき（確認の結果がＹであったとき）には、認
識処理の終了が指示されて、既に認識処理が開始されて
いる文書画像に対する二重の認識処理の開始が防止され
る。前記のステップ４２０において、認識処理中ファイ
ルの不存在が確認されたとき（確認の結果がＮであった
とき）には、ステップ４２１において処理中ファイルの
作成がなされる。これに続けて、認識用ワークステーシ
ョン４における画像取り出し部４１の働きにより、ファ
イル用ステーション３における記憶装置内の画像蓄積部
３１から、ネットワーク５を介して、全ての認識対象と
しての文書画像に対応する情報が取り出されて、認識用
ワークステーション４内の図示されない記憶装置に取り
込まれる（ステップ４２２）。Next, the document image to be recognized, which is transcribed and stored in the image storage unit 31 in the storage device in the file station 3, is recognized by the recognition workstation 4. FIG. 5 shows a form of the recognition processing here as a flowchart. In addition,
The process shown here is a process that is started at a certain time, for example, the process is started every hour at 0 minutes. Of course. If an instruction to start a predetermined recognition process is issued, first, it is confirmed whether or not there is a file under the recognition process as a file under the recognition process (step 420). The reason why this step 420 is necessary is as follows. That is, as described above,
Since the process illustrated here is assumed to be started at a certain time (for example, started at 0 minute every hour), it takes a lot of time for the recognition process for one object. If it takes longer than one hour, for example, the next recognition process may start before the recognition process ends. In such a case, the recognition processing for the same object is performed in parallel and doubly, and extra processing is performed accordingly. For such a reason, a confirmation step such as the above-described step 420 is provided, and when the presence of the file being recognized is confirmed (when the confirmation result is Y), the recognition processing is performed. The end is instructed, and the start of double recognition processing for a document image for which recognition processing has already been started is prevented. When it is confirmed in step 420 that the file under recognition processing does not exist (when the result of the confirmation is N), a file under processing is created in step 421. Subsequently, by the operation of the image extracting unit 41 in the recognition workstation 4, the image storage unit 31 in the storage device in the file station 3 supports all document images as recognition targets via the network 5. The information to be retrieved is taken out and stored in a storage device (not shown) in the recognition workstation 4 (step 422).

【００１９】ここでは、転送される文書画像について、
順次に１枚ずつ認識処理の対象とされるが、まず、転送
される文書画像が存在するか否かの確認がなされる（ス
テップ４２３）。この転送される文書画像の存在が確認
されたとき（確認の結果がＹであったとき）には、次に
続くステップ４２４において、該当の転送される文書画
像が既に認識済みであるか否かの確認がなされる。ここ
で、認識済みであるか否かの確認は次のようにしてなさ
れる。、即ち、後述される登録のステップ４２７におけ
るように、ある所定の対象に対する認識処理が終了した
時点においては、対応の認識済み登録ファイル（これ
は、図４に例示されている）に対して、転送される文書
画像のファイル名とその作成日時（この作成日時は、ス
キャナーで読み込まれた日時である）とを登録するよう
にされている。そこで、前回までに前記認識済み登録フ
ァイル内で登録された認識済みのファイル名とその作成
日時とが同一のものであるときには、これは既に認識済
みである（確認の結果がＹであったとき）として、前段
のステップ４２３に戻るようにされている。ここで、登
録された認識済みのファイル名およびその作成日時を比
較の対象にするのは、前記の第１の実施例において既に
説明された理由によるが、ここで再び説明することは省
略する。いま、ステップ４２４における確認の結果とし
て、該当の転送された文書画像が未だ認識されていない
ものであるとされたときには、これに続くステップ４２
５において、認識用ワークステーション４における認識
・変換部４２の働きにより、所要の認識処理が行われ
て、その認識の結果に対応する所定のコード情報の生成
がなされる。この認識のステップ４２５においては、認
識対象の中の文字領域を抽出して、そこでの文字認識が
なされる。ここでの文字の認識のやり方として特定され
るものはないが、例えば、特願平３−１９５１００号に
開示されたやり方等を用いることができる。これに次い
で、前記された文字認識の結果である所定のコード情報
が、認識用ワークステーション４と電子メイル送信用ワ
ークステーション６との双方に関連する認識結果転送部
４３Ａの以下の働きにより、ネットワーク５を介して、
所定の操作者に対応する文書作成用ワークステーション
１に対して転送される（ステップ４２６）。ここで図８
のフローチャートを参照することにより、認識結果転送
の操作について説明しておく。まず、何等かの操作トリ
ガによって認識結果転送の処理が開始されると、認識対
象としてのファイル名に対応する操作者（ユーザ）名が
取り出される（ステップ８１）。次のステップ８２にお
いては、この操作者（ユーザ）名をあて名として確認す
る。これに次いで、あて名があるか否かの判定がなされ
る（ステップ８３）。この判定の結果が肯定（Ｙ）であ
ったときには、該当のあて名先に向けて電子メイルの送
信をする（ステップ８４）が、前記の判定の結果が否定
（Ｎ）であったときには、例えばファイル用ステーショ
ン３における認識結果蓄積部３２に向けて、前記の電子
メイルを転送しておく（ステップ８５）。このような処
理が終ってから、認識結果の転送処理は全体として終了
することになる。このような認識結果の転送処理は、次
のようにして説明することもできる。即ち、認識結果転
送部４３Ａにおいては、認識対象としてのファイル名に
対応する操作者としてのユーザ名を取り出し、このユー
ザ名をあて名として、認識結果のファイルを発信すべき
コマンドを電子メイル送信用ワークステーション６に対
して転送する。電子メイル送信用ワークステーション６
においては、前述のような電子メイル発信のコマンドを
認識用ワークステーション４側から受け取ると、ネット
ワーク５に接続されている文書作成用ワークステーショ
ン１の中の特定のものに対して、前記認識結果のファイ
ルを転送する。このとき、ユーザ名のミス、あて先（転
送すべきユーザやワークステーション）不明等のトラブ
ルが生じたりして、所要の電子メイルの発信に失敗した
ときには、前記された認識結果のファイルの転送先は、
ファイル用ステーション３における記憶装置内の認識結
果蓄積部３２にされる。次に、図５の）ステップ４２７
においては、前述されたように、認識対象としての文書
画像情報ファイルのファイル名およびその作成日時が、
認識済み登録ファイルに登録される。そして、このよう
にして登録がなされた認識対象としての文書画像情報は
消去される（ステップ４２８）ことになる。Here, regarding the transferred document image,
The documents are sequentially subjected to the recognition processing one by one. First, it is confirmed whether or not there is a document image to be transferred (step 423). When the existence of the transferred document image is confirmed (when the result of the confirmation is Y), in the following step 424, it is determined whether or not the corresponding transferred document image has already been recognized. Is confirmed. Here, whether or not the recognition has been completed is performed as follows. That is, as in step 427 of registration, which will be described later, when the recognition process for a given target is completed, the corresponding recognized registration file (this is illustrated in FIG. 4) is The file name of the transferred document image and the creation date and time (the creation date and time is the date and time read by the scanner) are registered. Therefore, when the recognized file name registered in the recognized registration file and the creation date and time are the same before the last time, this is already recognized (when the confirmation result is Y, ), The process returns to the previous step 423. Here, the registered recognized file name and its creation date and time are compared with each other for the reason already described in the first embodiment, but will not be described again here. If it is determined in step 424 that the transferred document image has not been recognized yet, the process proceeds to step 42.
At 5, the recognition / conversion unit 42 in the recognition workstation 4 performs a required recognition process, and generates predetermined code information corresponding to the recognition result. In the recognition step 425, a character area in the recognition target is extracted, and character recognition is performed there. Although there is no specific method for character recognition here, for example, the method disclosed in Japanese Patent Application No. 3-195100 can be used. Subsequently, predetermined code information as a result of the character recognition described above is transmitted to the network by the following operation of the recognition result transfer unit 43A associated with both the recognition workstation 4 and the electronic mail transmission workstation 6. Via 5
The document is transferred to the document creation workstation 1 corresponding to the predetermined operator (step 426). Here, FIG.
The operation of the recognition result transfer will be described with reference to the flowchart of FIG. First, when the recognition result transfer process is started by any operation trigger, an operator (user) name corresponding to a file name to be recognized is extracted (step 81). In the next step 82, the operator (user) name is confirmed as the address. Following this, it is determined whether there is an address (step 83). If the result of this determination is affirmative (Y), an electronic mail is transmitted to the corresponding destination (step 84). If the result of the above determination is negative (N), for example, the file The electronic mail is transferred to the recognition result accumulation unit 32 in the use station 3 (step 85). After such processing is completed, the transfer processing of the recognition result ends as a whole. Such a transfer process of the recognition result can also be described as follows. That is, the recognition result transfer unit 43A extracts the user name as the operator corresponding to the file name as the recognition target, and uses this user name as the address to send a command to transmit the recognition result file to the electronic mail transmission work. Transfer to station 6. Workstation 6 for electronic mail transmission
When receiving the above-mentioned electronic mail transmission command from the recognition workstation 4, the specified mail transmission command is sent to a specific one of the document creation workstations 1 connected to the network 5. Transfer files. At this time, if the transmission of the required e-mail fails due to troubles such as an incorrect user name or an unknown destination (user or workstation to be transferred), the transfer destination of the above-described recognition result file is ,
The result is stored in the recognition result storage unit 32 in the storage device in the file station 3. Next, step 427 (of FIG. 5)
In, as described above, the file name of the document image information file to be recognized and its creation date and time are
Registered in the recognized registration file. Then, the document image information registered as a recognition target as described above is deleted (step 428).

【００２０】認識対象としての転送画像が存在する限り
は、前記と同じ処理が繰り返される。そして、全ての転
送画像に対する処理が終了したとき（ステップ４２３に
おける処理の結果がＮであったとき）には、処理中ファ
イルの削除を行って（ステップ４２９）、新しい認識処
理に対処できるようにされる。As long as there is a transfer image to be recognized, the same processing as described above is repeated. Then, when the processing for all the transfer images is completed (when the result of the processing in step 423 is N), the file being processed is deleted (step 429) so that new recognition processing can be dealt with. Is done.

【００２１】認識対象画像がファイル用ステーション３
における記憶装置に転記されたときには、操作者は、他
の業務に従事しながら、前記の認識対象画像に関する認
識結果が電子メイルとして送信されてくるのを待機する
ことになる。いま、所望の電子メイルの送信があったと
すると、操作者が使用している文書作成用ワークステー
ション１における表示部の表示画面上に、対応の認識結
果が送信されたことを示す変化が生じる。この表示画面
上での変化としては、例えば、電子メイルに対応するア
イコンの点滅やその形状の変化を挙げることができる。
このようにして、所望の認識結果が送信されたことが視
覚を通して明示的に表示されるために、操作者は、所定
の認識処理が終了したかどうかを気にかける必要がなく
なって、前記所望の認識結果が送信されるまでの時間を
他の業務のために十分に活用することができる。そし
て、所望の認識結果の送信が確認されたときには、文書
作成用ワークステーション１における認識結果取り出し
部１３の働きにより、電子メイルを表すアイコンから前
記所望の認識結果を取り出して、文書作成のために用い
るようにされる。The image to be recognized is a file station 3
Is transferred to the storage device, the operator waits for the recognition result regarding the recognition target image to be transmitted as an electronic mail while engaging in other tasks. Assuming that a desired electronic mail has been transmitted, a change occurs on the display screen of the display unit of the document creation workstation 1 used by the operator, indicating that the corresponding recognition result has been transmitted. Examples of the change on the display screen include a blink of an icon corresponding to the electronic mail and a change in its shape.
In this way, the fact that the desired recognition result has been transmitted is clearly displayed through the visual sense, so that the operator does not need to worry about whether or not the predetermined recognition processing has been completed, and The time until the recognition result is transmitted can be fully utilized for other tasks. Then, when the transmission of the desired recognition result is confirmed, the desired recognition result is extracted from the icon representing the electronic mail by the operation of the recognition result extracting unit 13 in the workstation 1 for document creation, and is used for document creation. To be used.

【００２２】この発明の第２の実施例について詳細に説
明してきたけれども、また、前述された第１の実施例の
場合に加えて、以下のような変更ないし修正をすること
ができる。（６）：上記第２の実施例においては、認識対象として
の文書情報に対する認識結果が電子メイルとして操作者
に送信されているけれども、これに限らず、ネットワー
ク上に設けられたファイルの転送機能を利用して、ある
特定のワークステーションにおける所定の記憶領域に直
接転送することもできる。Although the second embodiment of the present invention has been described in detail, the following changes or modifications can be made in addition to the case of the above-described first embodiment. (6): In the second embodiment, the recognition result for the document information to be recognized is transmitted to the operator as an electronic mail. However, the present invention is not limited to this, and the file transfer function provided on the network is not limited to this. And can be directly transferred to a predetermined storage area in a specific workstation.

【００２３】図９は、この発明の第３の実施例に係るネ
ットワーク構成の文書認識装置を示す概略的な構成図で
ある。この図９の第３の実施例におけるネットワーク構
成の文書認識装置に含まれるものは、操作者によって直
接的に操作される画像入力部を含む文書作成用ワークス
テーション１、この文書作成用ワークステーション１に
専属的に接続されているスキャナー２、前記文書作成用
ワークステーション１からの文書画像情報を認識（して
対応のコード情報に変換）する認識・変換部を含む認識
用ワークステーション４であり、これらの手段はネット
ワーク５を介して互いに接続されている。この第３の実
施例に係る装置は、前述された第１の実施例に係る装置
について、次のような改良が施されたものである。即
ち、文書作成用ワークステーション側には文書画像認識
用の特別な機能・装置を付加して設けることなく、文字
と図形とが混在する文書画像に対する認識対象領域が容
易に指定できるような改良が施されたものである。ま
た、図１０は、上記第３の実施例における認識処理を機
能的に説明するための、概略的な機能的構成図である。
以下、図９および図１０を適宜参照しながら、この発明
の第３の実施例について説明していく。FIG. 9 is a schematic configuration diagram showing a network-structured document recognition apparatus according to a third embodiment of the present invention. The network-structured document recognition apparatus of the third embodiment shown in FIG. 9 includes a document creation workstation 1 including an image input unit directly operated by an operator, and a document creation workstation 1. A scanner 2, which is exclusively connected to a scanner 2, and a recognition workstation 4 including a recognition / conversion unit for recognizing (and converting to corresponding code information) the document image information from the document creation workstation 1, These means are connected to each other via a network 5. The device according to the third embodiment is the same as the device according to the first embodiment described above, except for the following improvements. That is, an improvement has been made so that a recognition target area for a document image in which characters and graphics are mixed can be easily specified without providing a special function or device for document image recognition on the document creation workstation side. It was done. FIG. 10 is a schematic functional configuration diagram for functionally explaining the recognition processing in the third embodiment.
Hereinafter, a third embodiment of the present invention will be described with reference to FIGS. 9 and 10 as appropriate.

【００２４】操作者は文書作成用ワークステーション１
を操作しており、スキャナー２を働かせることにより、
その画像入力部１１Ａによる処理を施して、適当な紙等
の媒体に記載されている文字や図形の混在する所要の文
書情報を、対応の画像情報としてデジタル的に入力させ
る。このようにして入力された画像情報は、文書作成用
ワークステーション１内の制御／操作部１４Ｂにより対
象の画像として適当な表示装置１４１に表示される。The operator is a workstation 1 for creating a document.
Is operated, and by operating the scanner 2,
The image input unit 11A performs a process to digitally input required document information including characters and graphics described on a suitable medium such as paper as corresponding image information. The image information input in this manner is displayed on a suitable display device 141 as a target image by the control / operation unit 14B in the document creation workstation 1.

【００２５】文書作成用ワークステーション１内の画像
編集部１２Ａにおいては、キーボード等の入力装置１４
２やマウス等のポインティング・デバイス１４３を用い
て画像の編集を対話的に実行することにより、対象とし
ての認識領域を指定するようにされる。ここでの認識領
域を指定する方法としては種々考えられるけれども、重
要であることは、該領域を抽出する段階において元から
ある画像要素と、その後から加えられる画像要素との区
別が可能にされることである。ここでは、一例として、
認識を所望する領域を適切な矩形枠をもって囲むように
する方法を示すことにする。一般的に、ある所定の文書
中にしかるべき矩形枠が存在することはよくあることで
あるが、この矩形枠をスキャナー等でデジタル化すると
きには、これにノイズが乗ったり、不所望の傾斜が生成
したり、または、その太さが一定でなくなったりすると
いう、幾つかの不都合が生じることがある。また、画像
編集部１２Ａにおいて描画される表示画面上の矩形枠に
ついてみると、例えば１ドット幅のような、厳密な線幅
をもつ矩形枠をもって認識対象領域の指定をすることが
できる。また、画像編集部１２Ａにおいて果たされる矩
形枠の描画機能は、文書作成用ワークステーション１に
おいて一般的に利用できるものであって、文書画像の認
識のために改めて付加されるものではない。ここで図１
１を参照すると、このような文書画像における矩形枠の
指定の態様が例示されている。この図１１においては、
これに限定されるものではないが、ある所定の文書画像
表示面１１０には、＃１矩形枠１１１、＃２矩形枠１１
２、および、＃３矩形枠１１３が存在するものとして例
示されている。ここでの矩形枠の指定は、文書作成用ワ
ークステーション１において、操作者が、表示装置１４
１の表示画面上で対応の文書画像を見ながら、ポインテ
ィング・デバイス１４３を用いて、所望する矩形枠の対
角（例えば、該当の矩形枠の左上隅と右下隅）の２頂点
を指定することによってなされる。なお、ここでの認識
領域の指定のためには、実際に矩形枠を画定することの
外に、前記の２頂点をカギ型記号（「，」）や十字型記
号（＋）をもって指定することもできる。さて、このよ
うにして認識対象領域が指定された画像ファイルは、文
書作成用ワークステーション１における画像出力部１３
Ａによって、認識用ワークステーション４における画像
入力部４Ａに転送される。前述されたように、文書作成
用ワークステーション１と認識用ワークステーション４
とはネットワーク５を介して接続されているために、こ
こでの転送の態様としては、電子メイルの機能を用いて
直接転送すること、または前記第１の実施例で示したよ
うに、ある特定のファイル用ステーション（図示されな
い）を介して転送すること、等を考えることができる。The image editing unit 12A in the document creation workstation 1 has an input device 14 such as a keyboard.
By interactively editing an image using a pointing device 143 such as a mouse 2 or a mouse, a recognition area as a target is specified. Although there are various methods for specifying the recognition area here, what is important is that, at the stage of extracting the area, it is possible to distinguish between an original image element and an image element added later. That is. Here, as an example,
A method of surrounding a region desired to be recognized with an appropriate rectangular frame will be described. In general, it is common that an appropriate rectangular frame exists in a given document.However, when this rectangular frame is digitized by a scanner or the like, noise or unwanted inclination occurs. Some inconveniences may occur, such as generation or non-constant thickness. Further, regarding the rectangular frame on the display screen drawn by the image editing unit 12A, it is possible to specify the recognition target area using a rectangular frame having a strict line width such as, for example, one dot width. The drawing function of the rectangular frame performed by the image editing unit 12A can be generally used in the document creation workstation 1, and is not added again for recognizing a document image. Here, FIG.
With reference to FIG. 1, an example of the specification of a rectangular frame in such a document image is illustrated. In FIG. 11,
Although not limited to this, a predetermined document image display surface 110 includes a # 1 rectangular frame 111 and a # 2 rectangular frame 11
2 and # 3 rectangular frame 113 are illustrated as being present. The designation of the rectangular frame here is performed by the operator at the document creation workstation 1 using the display device 14.
Using the pointing device 143 to specify two vertices of a diagonal of a desired rectangular frame (for example, an upper left corner and a lower right corner of the rectangular frame) while viewing the corresponding document image on the display screen of (1). Done by Note that, in order to specify the recognition area here, in addition to actually defining a rectangular frame, the two vertices must be specified using a key symbol (",") or a cross symbol (+). Can also. By the way, the image file in which the recognition target area is designated in this way is sent to the image output unit 13 in the document creation workstation 1.
A transfers the image data to the image input unit 4A of the recognition workstation 4. As described above, the document creation workstation 1 and the recognition workstation 4
Is connected via the network 5, the transfer mode here is direct transfer using an electronic mail function, or as described in the first embodiment, Via a file station (not shown).

【００２６】前述の画像入力部４Ａで受けとられた画像
ファイルは、次段の認識領域抽出部４Ｂに送られる。そ
して、この認識領域抽出部４Ｂにおいては、文書作成用
ワークステーション１の画像編集部１２Ａで明示された
認識対象領域の抽出をする。ここで、図１２を参照する
と、これには１ドットの厳密な線幅を有する矩形枠をも
って囲まれた領域を認識する態様が例示されている。こ
の図１２においては、前述された図１１の場合と同様に
所定の文書画像表示面１１０が例示されているが、その
中の＃１矩形枠１１１だけが取り上げられている。この
ような文書画像表示面１１０において、その左上隅から
いわゆるＴＶ式に順次走査がなされていく。そして、矢
印１１１Ａで概略的に示されるように、＃１矩形枠１１
１の最初の黒画素が発見されたときには、まずこの点で
の座標値を記憶する。これに次いで、図の右方向に（即
ち、＃１矩形枠１１１の上辺に沿って右方向に）黒画素
の追跡をする。次の画素が白画素である点（即ち、矢印
１１１Ｂの折曲点）からは、図の下方向に（＃１矩形枠
１１１の右辺に沿って下方向に）黒画素の追跡をする。
このときには、追跡の方向別の黒画素数（ここでは、右
方向での黒画素数および下方向での黒画素数）の記憶を
するとともに、次の画素が白画素である点（矢印１１１
Ｃの折曲点）からは、図の左方向に（＃１矩形枠１１１
の下辺に沿って左方向に）黒画素の追跡をしていく。な
お、この左方向に追跡するときには、右方向への先の追
跡で知られている黒画素の個数だけの追跡をすることが
できる。そして、最後の矢印１１１Ｄの折曲点からは、
図の上方向に（＃１矩形枠１１１の左辺に沿って上方向
に）黒画素の追跡をする。なお、このときにも、下方向
への先の追跡で知られている黒画素の個数だけの追跡を
することができる。そして、＃１矩形枠１１１の下辺お
よび左辺に沿って、それぞれ既知の個数の黒画素を追跡
していったときに、白画素が発見されたときには、現に
追跡している矩形枠は正規のものではないとみなされる
ことになる。また、前述されたように、矩形枠の線幅は
１ドットとして厳密に規定されていることから、黒画素
の追跡をしているときに、その追跡の方向に対して左右
側に接している１ドット以上の黒画素があるような場合
には、これも正規の矩形ではないものとみなして、追跡
の中止をするとともに次に続く矩形枠の抽出に移行する
ことになる。The image file received by the image input unit 4A is sent to a recognition area extracting unit 4B at the next stage. The recognition area extraction unit 4B extracts a recognition target area specified by the image editing unit 12A of the document creation workstation 1. Here, referring to FIG. 12, there is exemplified a mode of recognizing a region surrounded by a rectangular frame having a strict line width of 1 dot. 12, a predetermined document image display surface 110 is illustrated as in the case of FIG. 11 described above, but only the # 1 rectangular frame 111 therein is taken up. On such a document image display surface 110, scanning is sequentially performed from the upper left corner in a so-called TV system. Then, as schematically indicated by the arrow 111A, the # 1 rectangular frame 11
When the first black pixel is found, the coordinate value at this point is first stored. Following this, black pixels are tracked rightward in the figure (ie, rightward along the top side of the # 1 rectangular frame 111). From the point where the next pixel is a white pixel (that is, the bending point of the arrow 111B), the black pixel is tracked downward (downward along the right side of the # 1 rectangular frame 111).
At this time, the number of black pixels for each tracking direction (here, the number of black pixels in the right direction and the number of black pixels in the downward direction) is stored, and the point where the next pixel is a white pixel (arrow 111).
C (the bending point of C) to the left of the figure (# 1 rectangular frame 111).
(To the left along the lower side of). When tracking in the left direction, tracking can be performed by the number of black pixels known in the previous tracking in the right direction. And from the bending point of the last arrow 111D,
The black pixel is tracked upward (upward along the left side of the # 1 rectangular frame 111). At this time, it is possible to track only the number of black pixels known in the downward tracking. When a known number of black pixels are tracked along the lower side and left side of the # 1 rectangular frame 111, and a white pixel is found, the currently tracked rectangular frame is a regular one. Is not considered. Further, as described above, since the line width of the rectangular frame is strictly defined as one dot, when the black pixel is tracked, it is in contact with the left and right sides in the tracking direction. If there is a black pixel of one dot or more, this is also regarded as a non-regular rectangle, the tracking is stopped, and the process proceeds to the extraction of the next rectangular frame.

【００２７】このような黒画素の追跡においては、矩形
枠が元の画像のノイズ成分の部位を、いわばまたいでし
まって、厳密な１ドットの線幅をもたなくなることがあ
る。そこで、このような場合が生起することを考慮し
て、矩形枠を追跡する進行方向の左右側に黒画素が存在
する場合であっても、そのような状態がある一定の長さ
の範囲にあるものであれば、その存在を許容するように
しても良い。また、ここでの矩形枠で囲まれる矩形領域
が極端に小さいことはあり得ないところから矩形枠に対
する最初の追跡としての上辺の追跡において、その長さ
がある一定の値よりも小さいときには、正規の矩形枠で
はないとみなすことにより、以降の追跡を停止すること
もできる。なお、このようなやり方は、線幅が２ドット
以上の矩形枠についても同様に適用することができる。In such tracking of black pixels, the rectangular frame sometimes crosses the noise component portion of the original image, so to speak, and may not have a strict one-dot line width. Therefore, considering that such a case may occur, even if there are black pixels on the left and right sides in the traveling direction of tracking the rectangular frame, such a state is within a certain length range. If there is a certain thing, its existence may be allowed. Also, since the rectangular area surrounded by the rectangular frame cannot be extremely small, when the length of the upper side is smaller than a certain value in the first tracking of the rectangular frame, the normal The following tracking can be stopped by assuming that the frame is not a rectangular frame. Note that such a method can be similarly applied to a rectangular frame having a line width of 2 dots or more.

【００２８】ある１個の文書画像表示面内に複数個の矩
形領域が抽出されたときには、認識用ワークステーショ
ン４内の認識・変換部４Ｃに対して、それらの矩形領域
をどのような順序で転送するかが問題となることがあ
る。ここで図１３を参照すると、ある所定の文書画像表
示面１１０内に３個の矩形領域が含まれたものが例示さ
れている。図１３においては、＃１矩形枠で囲まれた＃
１領域１１１Ｒ、＃２矩形枠で囲まれた＃２領域１１２
Ｒ、および、＃３矩形枠で囲まれた＃３領域１１３Ｒが
存在するものとして示されている。なお、この図１３に
おいては、文書画像表示面１１０の左肩部の座標（０，
０）点は追跡の出発点、矢印ｘは図における横の追跡方
向、そして、矢印ｙは図における縦の追跡方向とされて
いる。この図１３において複数個の矩形領域を順次抽出
していくと、＃１領域１１１Ｒの下辺からみて、＃３領
域１１３Ｒの上辺の方が＃２領域１１２Ｒの上辺よりも
近接しているために、＃１領域１１１Ｒ，＃３領域１１
３Ｒ，＃２領域１１２Ｒの順で抽出されることになる。
しかしながら、これらの認識結果を受け入れる操作者と
しての人間からみれば、＃１領域１１１Ｒ，＃２領域１
１２Ｒ，＃３領域１１３Ｒの順で認識結果が返される方
が好適である。そのために、前述された矩形領域抽出の
後処理として、＃２領域１１２Ｒおよび＃３領域１１３
Ｒのように、図において左右に並んでいる（即ち、両者
のｙ座標に重なりがある）ような矩形領域の場合には、
より左側にある（即ち、ｘ座標がより小さい）矩形領域
の方を、認識・変換部４Ｃに対して先に転送するように
される。この認識・変換部４Ｃにおいては、認識領域抽
出部４Ｂで得られた矩形領域から所要の文字領域を抽出
して、文字認識を行い、この認識結果に対応する文字コ
ードに変換する。ここでの文字認識の仕方は特に限定さ
れるものではないが、例えば、前出した特願平３−１９
５１００号の方法等を適用することができる。そして、
前述された文字認識の結果としての文字コードは、ネッ
トワーク５を介して、文書作成用ワークステーション１
における認識結果入力部１４Ａに対して転送される。そ
して、この文書作成用ワークステーション１に対する操
作者は、制御／操作部１４Ｂを介して、前記の認識結果
を文書の作成に用いることになる。When a plurality of rectangular areas are extracted in one document image display surface, the rectangular areas are sent to the recognition / conversion unit 4C in the recognition workstation 4 in any order. Transferring can be a problem. Referring to FIG. 13, an example in which three rectangular regions are included in a certain document image display surface 110 is illustrated. In FIG. 13, # 1 surrounded by a rectangular frame #
One region 111R, # 2 region 112 surrounded by a # 2 rectangular frame
R and a # 3 region 113R surrounded by a # 3 rectangular frame are shown as being present. In FIG. 13, the coordinates (0, 0, 1) of the left shoulder of the document image display surface 110 are shown.
The point 0) is the starting point of tracking, the arrow x is the horizontal tracking direction in the figure, and the arrow y is the vertical tracking direction in the figure. In FIG. 13, when a plurality of rectangular areas are sequentially extracted, the upper side of the # 3 area 113R is closer to the upper side of the # 2 area 112R when viewed from the lower side of the # 1 area 111R. # 1 area 111R, # 3 area 11
3R and # 2 area 112R are extracted in this order.
However, from the viewpoint of a human operator who accepts these recognition results, the # 1 area 111R and the # 2 area 1
It is preferable that the recognition result is returned in the order of 12R and # 3 area 113R. Therefore, as the post-processing of the rectangular area extraction described above, the # 2 area 112R and the # 3 area 113
In the case of a rectangular area such as R, which is arranged side by side in the figure (that is, the y-coordinates of both overlap),
The rectangular area on the left side (that is, the x coordinate is smaller) is transferred to the recognition / conversion unit 4C first. The recognition / conversion unit 4C extracts a required character area from the rectangular area obtained by the recognition area extraction unit 4B, performs character recognition, and converts the character area into a character code corresponding to the recognition result. Although the method of character recognition here is not particularly limited, for example, Japanese Patent Application No. Hei.
No. 5100 can be applied. And
The character code as a result of the character recognition described above is sent to the document creation workstation 1 via the network 5.
Is transferred to the recognition result input unit 14A. Then, the operator of the document creation workstation 1 uses the above recognition result to create a document via the control / operation unit 14B.

【００２９】この発明の第３の実施例について詳細に説
明してきたけれども、これに限られるものではなく、こ
の発明の精神を逸脱しない範囲において、任意かつ所望
の変更ないし修正をすることができる。即ち、上記第３
の実施例においては、文書作成用ワークステーション１
と認識用ワークステーション４だけがネットワーク５に
接続されている例を示しているが、これに限らず、第１
の実施例のようにファイル用ステーションを介するよう
にすること、もしくは、第２の実施例のように電子メイ
ル送信用ワークステーションを介するようにすることは
可能である。Although the third embodiment of the present invention has been described in detail, the present invention is not limited to this embodiment, and arbitrary and desired changes and modifications can be made without departing from the spirit of the present invention. That is, the third
In the embodiment, the document creation workstation 1
Although the example in which only the recognition workstation 4 is connected to the network 5 is shown in FIG.
It is possible to use a file station as in the second embodiment, or to use an electronic mail transmission workstation as in the second embodiment.

【００３０】[0030]

【発明の効果】以上詳細に説明されたように、この発明
に係るネットワーク構成の認識システム（例えば、文書
認識装置）によれば、操作者によってスキャナーから入
力された文書画像情報が、共通のファイル用ステーショ
ンに備えられた大容量の記憶装置に格納されるように動
作しているために、操作者毎に対応して用意された文書
作成用ワークステーションに対して、個々に大容量の記
憶装置を設ける必要がなくなる。また、文書画像情報を
入力させるためのスキャナーについては、この発明のネ
ットワーク構成の文書認識装置において少なくとも１個
設けられていればよく、操作者毎の文書作成用ワークス
テーション全てに対して個別に設ける必要はない。更
に、この発明における文書画像の認識は前記ネットワー
ク構成の文書認識装置における認識用ワークステーショ
ンで行われるために、操作者毎の文書作成用ワークステ
ーションにそのための負荷がかかることはなく、操作者
の業務が認識操作のために中断してしまうことはない。
また、前記文書作成用ワークステーションに対して、認
識用のハードウエアの接続やソフトウエアのインストー
ルをする必要がない。即ち、操作者毎の文書作成用ワー
クステーションに対して、前記のような特別なハードウ
エアやソフトウエアを付加的に設けることを必要とせず
に、既存の手段ないし装置をもって文書画像認識の機能
を十分に活用することができる。そして、操作者自身に
ついてみれば、直接的に操作する手段ないし装置はいず
れも既存のものであることから、格別に新規な操作法を
習得する必要はなく、初心者であっても文書画像認識の
機能を簡単かつ確実に活用することができる。更に、こ
の発明によれば、文書作成用ワークステーションの操作
者により、認識を所望する領域を指定できるようにされ
ているために、例えば、表・図形混在の文書や段組のあ
る文書であっても認識対象として扱うことが可能であ
り、また、認識操作に必要な部分だけを指定することに
より、計算機資源の無駄な使用を回避することができ
る。また、このような場合においては、認識用のハード
ウエアやソフトウエアを文書作成用ワークステーション
に特別に設けることなく、汎用的に備えられた機能を流
用するだけでよいことから、文書認識のための特別な機
能を付加する必要がない。As described above in detail, according to the network configuration recognition system (for example, a document recognition device) according to the present invention, document image information input from a scanner by an operator is stored in a common file. Operating in a large-capacity storage device provided in the storage station, a large-capacity storage device is individually provided for a document creation workstation prepared for each operator. There is no need to provide In addition, at least one scanner for inputting document image information only needs to be provided in the network-structured document recognition apparatus of the present invention, and is provided separately for all document creation workstations for each operator. No need. Further, since the recognition of the document image in the present invention is performed by the recognition workstation in the document recognition device having the network configuration, the load for the document creation workstation for each operator is not applied, and the operator's work is not required. The work is not interrupted by the recognition operation.
Further, there is no need to connect recognition hardware or install software to the document creation workstation. That is, it is not necessary to additionally provide the above-mentioned special hardware and software for the document creation workstation for each operator, and the document image recognition function can be provided by existing means or devices. Can be fully utilized. As for the operator himself, since all the means or devices for direct operation are already existing, there is no need to learn a particularly new operation method, and even a beginner can perform document image recognition. Functions can be used easily and reliably. Furthermore, according to the present invention, since the operator of the document creation workstation can specify an area desired to be recognized, for example, a document having a mixture of tables and figures or a document having columns is used. Can be treated as a recognition target, and by designating only a part necessary for the recognition operation, it is possible to avoid useless use of computer resources. In such a case, it is only necessary to divert general-purpose functions without specially providing recognition hardware or software on the document creation workstation. There is no need to add special functions of.

[Brief description of the drawings]

【図１】この発明の第１の実施例に係るネットワーク
構成の文書認識装置を示す概略的な構成図である。FIG. 1 is a schematic configuration diagram illustrating a document recognition device having a network configuration according to a first embodiment of the present invention.

【図２】上記第１の実施例における認識処理を機能的
に説明するための概略的な機能的構成図である。FIG. 2 is a schematic functional configuration diagram for functionally explaining a recognition process in the first embodiment.

【図３】上記第１の実施例における記憶装置の機能的
な説明図である。FIG. 3 is a functional explanatory diagram of the storage device according to the first embodiment.

【図４】上記第１の実施例における認識済み登録ファ
イルの例示図である。FIG. 4 is an exemplary diagram of a recognized registration file in the first embodiment.

【図５】上記第１の実施例における認識処理の態様を
示すフローチャートである。FIG. 5 is a flowchart illustrating an aspect of a recognition process in the first embodiment.

【図６】この発明の第２の実施例に係るネットワーク
構成の文書認識装置を示す概略的な構成図である。FIG. 6 is a schematic configuration diagram illustrating a network-structured document recognition device according to a second embodiment of the present invention.

【図７】上記第２の実施例における認識処理を機能的
に説明するための概略的な機能的構成図である。FIG. 7 is a schematic functional configuration diagram for functionally explaining a recognition process in the second embodiment.

【図８】上記第２の実施例における電子メイル送信用
ワークステーション６の動作の態様を例示するためのフ
ローチャートである。FIG. 8 is a flowchart for illustrating an operation mode of the electronic mail transmitting workstation 6 in the second embodiment.

【図９】この発明の第３の実施例に係るネットワーク
構成の文書認識装置を示す概略的な構成図である。FIG. 9 is a schematic configuration diagram illustrating a document recognition device having a network configuration according to a third embodiment of the present invention.

【図１０】上記第３の実施例における認識処理を機能
的に説明するための概略的な機能的構成図である。FIG. 10 is a schematic functional configuration diagram for functionally explaining a recognition process in the third embodiment.

【図１１】上記第３の実施例における文書画像の矩形
枠を指定する態様の例示図である。FIG. 11 is a view showing an example of a mode of designating a rectangular frame of a document image in the third embodiment.

【図１２】上記第３の実施例における、ある所定の線
幅を有する矩形枠で囲まれた領域を認識する態様を例示
する図である。FIG. 12 is a diagram exemplifying a mode of recognizing a region surrounded by a rectangular frame having a certain line width in the third embodiment.

【図１３】上記第３の実施例における、ある所定の文
書画像表示面１１０内に３個の矩形領域が含まれたもの
を例示する図である。FIG. 13 is a diagram illustrating an example in which three rectangular areas are included in a certain document image display surface 110 in the third embodiment.

[Explanation of symbols]

１…文書作成用ワークステーション、１１…画像入力
部、１２…画像伝送部、１３…認識結果取り出し部、１
４…制御／操作部、２…スキャナー、３…ファイル用ス
テーション、３１…画像蓄積部、３２…認識結果蓄積
部、４…認識用ワークステーション、４１…画像取り出
し部、４２…認識・変換部、４３…認識結果転送部、５
…ネットワーク、６…電子メイル送信用ワークステーシ
ョン。DESCRIPTION OF SYMBOLS 1 ... Workstation for document preparation, 11 ... Image input part, 12 ... Image transmission part, 13 ... Recognition result extraction part, 1
4 Control / operation unit 2 Scanner 3 File station 31 Image storage unit 32 Recognition result storage unit 4 Recognition workstation 41 Image retrieval unit 42 Recognition / conversion unit 43 ... Recognition result transfer unit, 5
... Network, 6 ... E-mail sending workstation.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号ＦＩＨ０４Ｌ 12/58 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁶ Identification code FI H04L 12/58

Claims

[Claims]

1. A document image input device for inputting a document image, a recognition device for recognizing a document image input by the document image input device and converting the document image into corresponding code information, Address extracting means for extracting an address corresponding to the document image obtained, and electronic mail transmitting means for transmitting, as an electronic mail, the code information converted by the recognizing means to the address extracted by the address extracting means. A document recognition device characterized by the following.

2. A document processing information processing device connected to a scanner, a file information processing device for storing a document image, and a recognition information processing device for recognizing a document image are connected via a network. In the document recognition apparatus, the information processing device for document creation includes: an image input unit that inputs a document image by a scanner; and a document image input by the image input unit is transferred to the file information processing device as a file. A document image transfer unit, the file information processing device includes a storage unit that stores the document image transferred by the document image transfer unit in the document creation information processing device, and the recognition information processing device. A document image extracting unit for extracting a document image stored in a storage unit in the file information processing device; Recognizing means for recognizing a document image taken out by the document image taking out means and converting it into corresponding code information; destination taking out means taking out a destination corresponding to the document image taken out by the document image taking out means; For the destination retrieved by means,
A document recognizing apparatus, comprising: an electronic mail transmitting unit that transmits the code information converted by the recognizing unit as an electronic mail.

3. A document processing information processing device connected to a scanner, a file information processing device for storing a document image, and a recognition information processing device for recognizing a document image are connected via a network. In the document recognition apparatus, the document creating information processing device includes: an image input unit that inputs a document image by a scanner; and a document image input by the image input unit is transferred to the file information processing device as a file. A document image transfer unit, the file information processing device includes a storage unit that stores the document image transferred by the document image transfer unit in the document creation information processing device, and the recognition information processing device. A document image extracting unit for extracting a document image stored in a storage unit in the file information processing device; Recognizing means for recognizing a document image taken out by the document image taking out means and converting it into corresponding code information; and operator name taking out means taking out an operator name corresponding to the document image taken out by the document image taking out means. When the operator name extracted by the operator name extracting unit is set as a destination, a determining unit that determines whether there is the destination, and when the determining unit determines that there is a destination,
A document recognizing apparatus comprising: an electronic mail transmitting unit that transmits, as an electronic mail, the code information converted by the recognizing unit to the destination.

4. The information processing device for recognition, when the determination unit determines that there is no destination,
4. The document recognition apparatus according to claim 3, further comprising a recognition result transmitting unit that transfers the code information converted by the recognition unit to the file information processing device.

5. An information processing apparatus for creating a document to which a scanner is connected, an information processing apparatus for storing a document image, and an information processing apparatus for recognizing a document image are connected by a network. In the document recognizing apparatus, the information processing apparatus for creating a document inputs a document image by a scanner, transfers the input document image as a file to the information processing apparatus for a file, Storing the document image for document creation and transferred by the information processing device, the recognition information processing device retrieves the document image stored in the file information processing device, and retrieves the retrieved document image. Recognize and convert to the corresponding code information, take out the destination corresponding to the retrieved document image, and retrieve the retrieved destination And transmitting the converted code information as an electronic mail.

6. A document processing information processing device connected to a scanner, a file information processing device for storing a document image, and a recognition information processing device for recognizing a document image are connected via a network. In the document recognition apparatus, the information processing device for document creation includes: an image input unit that inputs a document image by a scanner; and a document image input by the image input unit is transferred to the file information processing device as a file. A document image transfer unit, the file information processing device includes a storage unit that stores the document image transferred by the document image transfer unit in the document creation information processing device, and the recognition information processing device. A document image extracting unit for extracting a document image stored in a storage unit in the file information processing device; Recognizing means for recognizing a document image taken out by the document image taking out means and converting it into corresponding code information; destination taking out means taking out a destination corresponding to the document image taken out by the document image taking out means; A document recognition apparatus, comprising: an electronic mail transmitting unit that transmits, as an electronic mail, an end of recognition by the recognition unit to a destination extracted by the unit.

7. A document processing information processing apparatus connected to a scanner, a file information processing apparatus for storing a document image, and a recognition information processing apparatus for recognizing a document image are connected via a network. In the document recognizing apparatus, the information processing apparatus for creating a document inputs a document image by a scanner, transfers the input document image as a file to the information processing apparatus for a file, Storing the document image transferred by the document creation information processing device, the recognition information processing device retrieves the document image stored in the file information processing device, and recognizes the retrieved document image And converts it into the corresponding code information, extracts the destination corresponding to the extracted document image, and On the other hand, a document recognition method characterized by transmitting the completion of recognition as an electronic mail.