JPH1040238A

JPH1040238A - Document processing system and method for preparing routine document applied to the same

Info

Publication number: JPH1040238A
Application number: JP8190681A
Authority: JP
Inventors: Makoto Takimoto; 誠滝本
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1996-07-19
Filing date: 1996-07-19
Publication date: 1998-02-13

Abstract

PROBLEM TO BE SOLVED: To provide a document processing system whereby a desired routine document is efficiently generated from a free-style document with high precision. SOLUTION: The system is the one where a work station 1 is combined with OCR(optical character reading device) 2 so as to optically reads the free- style document data recorded in a document, for example, by an image scanner 3 and input it in OCR 2. OCR 2 outputs text data consisting of input image data and a character code to the work station 1. The work station 1 extracts text data corresponding to an image range which is designated in input image data displayed on a screen and inputs it to the input object item of the routine document format on the screen. Thus, inputted free-style document data is generated by executing conversion into document data of the routine document format in accordance with the input object item of the routine document format.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文書上の文書デー
タをイメージデータとして読込み、このイメージデータ
を処理して定型文書を作成する機能を備えた文書処理シ
ステムに関する。[0001] 1. Field of the Invention [0002] The present invention relates to a document processing system having a function of reading document data on a document as image data and processing the image data to create a standard document.

【０００２】[0002]

【従来の技術】従来、ワークステーションやパーソナル
コンピュータ等のコンピュータシステムを利用して、各
種の文書を作成する文書処理システムが開発されてい
る。文書の中には、予め複数の項目（必要事項）が設定
されて、各項目に対応した文書内容からなる定型文書が
ある。具体例としては、特許出願用の明細書や、各企業
等において定型フォーマットを設定した発明の提案書な
どがある。2. Description of the Related Art Conventionally, a document processing system for creating various documents using a computer system such as a workstation or a personal computer has been developed. In the document, there is a fixed document in which a plurality of items (necessary items) are set in advance and the contents of the document correspond to each item. Specific examples include a specification for a patent application and a proposal for an invention in which a fixed format is set in each company.

【０００３】従来では、定型文書を作成する方式とし
て、文書作成ソフトウェアが実行されるコンピュータや
ワードプロセッサを利用して、オペレータが設定された
必要事項に従って全文をキーボードやポインティング・
デバイスを使用して入力する方式が一般的である。この
方式では、定型文書のフォーマットを入力して予め登録
しておき、作成時に画面上に呼出して、文書データを入
力する。また、予め定型文書のフォーマットを用意し
て、光学的文字読取装置（ＯＣＲ）により、その定型文
書のフォーマットを読込み、登録する方式がある。Conventionally, as a method for creating a fixed form document, a computer or a word processor on which document creation software is executed is used, and an operator uses a keyboard or a pointing / writing device to copy the entire text in accordance with the set requirements.
A method of inputting using a device is generally used. In this method, the format of a fixed form document is input and registered in advance, and is called on a screen at the time of creation to input document data. Further, there is a method in which a format of a standard document is prepared in advance, and the format of the standard document is read and registered by an optical character reader (OCR).

【０００４】[0004]

【発明が解決しようとする課題】従来の定型文書作成方
式には、以下のような問題がある。キーボードやポイン
ティング・デバイスを使用して入力する方式では、オペ
レータの操作能力に依存して作成効率や作成精度（誤っ
た項目の入力）が異なる。また、ＯＣＲを利用する方式
では、フォーマットの入力効率や精度は優れているが、
定型文書毎に定型文書のフォーマットを予め用意する必
要があるため、フォーマットの管理が必要になる。本発
明の目的は、自由形式の文書から所望の定型文書を効率
的かつ高精度に作成することができる文書処理システム
を提供することにある。The conventional fixed document creation method has the following problems. In the method of inputting using a keyboard or a pointing device, the creation efficiency and the creation accuracy (input of an incorrect item) differ depending on the operation ability of the operator. In addition, the format using OCR is excellent in format input efficiency and accuracy,
Since it is necessary to prepare a format of a standard document for each standard document in advance, it is necessary to manage the format. SUMMARY OF THE INVENTION An object of the present invention is to provide a document processing system capable of efficiently and accurately creating a desired fixed-form document from a free-form document.

【０００５】[0005]

【課題を解決するための手段】本発明は、例えばワーク
ステーションやパーソナルコンピュータなどのコンピュ
ータシステムとＯＣＲとを組合わせたシステムであっ
て、文書上に記録された例えば自由形式の文書データを
光学的に読取るイメージ入力手段と、イメージ入力手段
により入力されたイメージデータを文字コードからなる
テキストデータに変換するデータ変換手段と、入力され
たイメージデータ及びファイル情報として管理する定型
文書フォーマットのイメージデータのそれぞれを画面上
に表示する表示手段と、画面上に表示された入力イメー
ジデータの中で指定したイメージ範囲に対応するテキス
トデータを抽出するテキストデータ指定手段と、画面上
の定型文書フォーマットのイメージから入力対象の項目
を指定したときに前記抽出したテキストデータを入力対
象の項目に対応付けて定型文書フォーマットの文書デー
タとして出力する出力手段とを備えた文書処理システム
である。SUMMARY OF THE INVENTION The present invention relates to a system in which a computer system such as a workstation or a personal computer is combined with an OCR, and for example, free-form document data recorded on a document is optically converted. Image input means for reading the image data, data conversion means for converting the image data input by the image input means into text data consisting of character codes, and image data in a fixed document format managed as the input image data and file information. Means for displaying text on the screen, text data specifying means for extracting text data corresponding to a specified image range from input image data displayed on the screen, and input from an image in a fixed document format on the screen Previous when the target item was specified Associates the extracted text data item of the input object is a document processing system and an output means for outputting the document data of the standard document formats.

【０００６】このような構成のシステムにより、自由形
式で記録された文書をＯＣＲ等を利用して読込み、この
文書のイメージデータから必要事項に対応する範囲のテ
キストデータを抽出して、定型文書フォーマットの該当
項目に入力する。従って、ユーザは、定型文書フォーマ
ットに合わせて全文をキーボードなどにより入力する操
作は不要となり、ＯＣＲなどにより高精度に読込まれた
文書データを使用して効率的に定型文書を作成すること
ができる。また、各種の定型文書フォーマットを有する
定型文書フォーマット情報をコンピュータシステムによ
り管理するため、ＯＣＲの読込み用として定型文書フォ
ーマットを管理する必要はない。With the system having such a configuration, a document recorded in a free format is read using an OCR or the like, and text data in a range corresponding to necessary items is extracted from image data of the document to form a standard document format. Enter the relevant items in. Therefore, the user does not need to input the entire text using a keyboard or the like in accordance with the standard document format, and can efficiently create a standard document by using document data read with high accuracy by OCR or the like. In addition, since the fixed document format information having various fixed document formats is managed by the computer system, it is not necessary to manage the fixed document format for reading the OCR.

【０００７】[0007]

【発明の実施の形態】以下図面を参照して本発明の実施
の形態を説明する。図１は本実施形態に関係するシステ
ムの構成を示すブロック図であり、図２は本実施形態の
動作を説明するための概念図であり、図３は本実施形態
の動作を説明するためのフローチャートである。（システム構成）本実施形態のシステムは、図１に示す
ように、大別してコンピュータであるワークステーショ
ン１と、ＯＣＲ（光学的文字読取装置）２と、イメージ
スキャナ３とからなる。ワークステーション１は、ＣＰ
Ｕ４と、入出力インターフェース５と、テキストデータ
用メモリ６と、出力バッファメモリ７と、表示用メモリ
８と、入力装置としてマウス１１とを備えている。ＣＰ
Ｕ４は、予め用意されたプログラムに基づいて本実施形
態に関係する定型文書の作成処理を実行する。入出力イ
ンターフェース５は、本実施形態ではＯＣＲ２とマウス
１１とに接続し、ＯＣＲ２からのイメージデータ及び文
字コードからなるテキストデータを入力する。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a configuration of a system related to the present embodiment, FIG. 2 is a conceptual diagram for explaining the operation of the present embodiment, and FIG. 3 is a conceptual diagram for explaining the operation of the present embodiment. It is a flowchart. (System Configuration) As shown in FIG. 1, the system according to the present embodiment includes a workstation 1, which is roughly classified as a computer, an OCR (optical character reader) 2, and an image scanner 3. Workstation 1 is a CP
It includes a U4, an input / output interface 5, a text data memory 6, an output buffer memory 7, a display memory 8, and a mouse 11 as an input device. CP
U4 executes a fixed document creation process related to the present embodiment based on a program prepared in advance. In the present embodiment, the input / output interface 5 is connected to the OCR 2 and the mouse 11, and inputs image data from the OCR 2 and text data including character codes.

【０００８】テキストデータ用メモリ６は、ＯＣＲ２に
より入力されたテキストデータを格納するためのメイン
メモリの記憶領域に相当する。出力バッファメモリ７
は、定型文書データとして出力するためのテキストデー
タを格納する。表示用メモリ８は、ワークステーション
１に設けられているディスプレイ（図示せず）の画面上
に表示するイメージデータを格納するためのメモリであ
る。本実施形態では、表示用メモリ８は、定型文書フォ
ーマットのイメージデータをウィンドウ表示するための
メモリ領域（以下定型イメージ領域と称する）９、およ
びＯＣＲ２を介して入力された文書のイメージデータを
ウィンドウ表示するためのメモリ領域（以下入力イメー
ジ領域と称する）１０とからなると想定する。[0008] The text data memory 6 corresponds to a storage area of a main memory for storing text data input by the OCR 2. Output buffer memory 7
Stores text data to be output as standard document data. The display memory 8 is a memory for storing image data to be displayed on a screen of a display (not shown) provided in the workstation 1. In the present embodiment, the display memory 8 includes a memory area (hereinafter, referred to as a standard image area) 9 for displaying image data in a standard document format in a window, and a window displaying image data of a document input via the OCR 2. Memory area (hereinafter, referred to as an input image area) 10.

【０００９】ＯＣＲ２は、通常の光学的文字読取装置で
あり、イメージ入力装置であるイメージスキャナ３によ
り読込まれたイメージデータを出力すると共に、イメー
ジデータから文字認識処理を実行して文字コードからな
るテキストデータに変換する機能を有する。イメージス
キャナ３は、用紙上に印刷（手書きも含む）された文書
を例えば１頁単位に走査してイメージデータとしてＯＣ
Ｒ２に入力する。マウス１１はカーソル入力装置であ
り、本実施形態では画面上のウィンドウ内の項目やイメ
ージ範囲を指定するために使用される。（本実施形態の動作）以下図２と図３のフローチャート
を参照して本実施形態の動作を説明する。The OCR 2 is an ordinary optical character reading device which outputs image data read by an image scanner 3 which is an image input device, executes a character recognition process from the image data, and executes a text consisting of a character code. It has a function to convert to data. The image scanner 3 scans a document printed on paper (including handwriting), for example, in units of one page, and converts the document into OC data.
Input to R2. The mouse 11 is a cursor input device, and is used in this embodiment to specify an item or an image range in a window on the screen. (Operation of this Embodiment) The operation of this embodiment will be described below with reference to the flowcharts of FIGS.

【００１０】本実施形態は、図２に示すように、ワード
プロセッサなどにより自由形式で作成された印刷文書２
０から、予めワークステーション１のファイル装置（ハ
ードディスク装置等）に保存・管理されている定型文書
フォーマットに従って所望の定型文書を作成する処理を
想定する。In this embodiment, as shown in FIG. 2, a printed document 2 created in a free format by a word processor or the like is used.
From 0, a process of creating a desired fixed document according to a fixed document format stored and managed in advance in a file device (hard disk device or the like) of the workstation 1 is assumed.

【００１１】ユーザはイメージスキャナ３を操作して、
印刷文書２０に記録された文書を読込む（ステップＳ
１）。イメージスキャナ３は、印刷文書２０に記録され
た文書をイメージデータに変換してＯＣＲ２に出力す
る。ＯＣＲ２は、入出力インターフェース５を介して、
入力したイメージデータ（入力イメージデータと称す
る）をワークステーション１に出力すると共に、文字認
識処理を実行してテキストデータに変換して出力する。
テキストデータは、図２に示すように、印刷文書２０に
記録された例えば「提案書」などの文字列が文字コード
に変換されたものである。The user operates the image scanner 3 to
Read the document recorded in the print document 20 (step S
1). The image scanner 3 converts the document recorded on the print document 20 into image data and outputs the image data to the OCR 2. The OCR 2 is connected via the input / output interface 5
The input image data (referred to as input image data) is output to the workstation 1 and is also converted to text data by performing a character recognition process and output.
As shown in FIG. 2, the text data is obtained by converting a character string such as “Proposal” recorded in the print document 20 into a character code.

【００１２】ＣＰＵ４は、ＯＣＲ２から入力されたテキ
ストデータをテキストデータ用メモリ６に格納する（ス
テップＳ２）。即ち、メモリ６には、図２に示すよう
に、印刷文書２０に記録された文書データであるテキス
トデータが格納される。さらに、ＣＰＵ４は、入力され
た入力イメージデータを表示用メモリ８の入力イメージ
領域に展開して、画面上に表示する（ステップＳ３）。
即ち、図２に示すように、印刷文書２０に記録された文
書に対応する入力イメージデータが画面上のウィンドウ
２１に表示される。The CPU 4 stores the text data input from the OCR 2 in the text data memory 6 (step S2). That is, the memory 6 stores text data, which is document data recorded on the print document 20, as shown in FIG. Further, the CPU 4 develops the input image data into the input image area of the display memory 8 and displays it on the screen (step S3).
That is, as shown in FIG. 2, input image data corresponding to the document recorded in the print document 20 is displayed in the window 21 on the screen.

【００１３】ユーザは、マウス１１を操作して、ウィン
ドウ２１に表示された入力イメージの中で、所望の項目
に対応するイメージ範囲をカーソルにより指定したと想
定する。ここで、所望の項目とは、定型文書を作成する
ための必要事項に対応するものであり、後述するよう
に、ウィンドウ２２に表示された定型文書フォーマット
に含まれる項目に相当する。ここでは、例えば日付項目
の「９４−１２−１４」を指定したと想定する。It is assumed that the user operates the mouse 11 to designate an image range corresponding to a desired item in the input image displayed on the window 21 by using a cursor. Here, the desired item corresponds to a necessary item for creating a standard document, and corresponds to an item included in the standard document format displayed on the window 22 as described later. Here, it is assumed that the date item “94-12-14” is specified, for example.

【００１４】ＣＰＵ４は、画面上でカーソルにより指定
された項目に対応するイメージデータを抽出し、そのイ
メージ座標を算出する（ステップＳ４，Ｓ５）。即ち、
図２に示すように、指定した日付項目に対応するイメー
ジデータを切り出すための縦横（Ｘ，Ｙ）の座標値（Ｘ
ｉ−Ｘｎ，Ｙ１−Ｙ２）を求める。次に、ＣＰＵ４は、
算出したイメージ座標の範囲に含まれるテキストデータ
（文字列データ）をメモリ６から検索する（ステップＳ
６）。ここで、メモリ６は、１文字単位でイメージ座標
情報を保持している。従って、ユーザからは、画面上の
入力イメージデータの範囲をマウス１１で指定するだけ
で、自動的に該当する範囲のテキストデータを選択する
ことができる。なお、項目中の文字列の一部だけを選択
する場合に、画面上において１文字単位に該当／除外の
選択操作を行なうよう機能を設けてもよい。The CPU 4 extracts image data corresponding to the item specified by the cursor on the screen, and calculates the image coordinates (steps S4 and S5). That is,
As shown in FIG. 2, vertical and horizontal (X, Y) coordinate values (X, Y) for cutting out image data corresponding to the designated date item.
i-Xn, Y1-Y2). Next, the CPU 4
The text data (character string data) included in the range of the calculated image coordinates is searched from the memory 6 (step S).
6). Here, the memory 6 holds image coordinate information in units of one character. Therefore, the user can automatically select the corresponding range of text data only by specifying the range of the input image data on the screen with the mouse 11. When only a part of the character string in the item is selected, a function may be provided so that the corresponding / excluded selection operation is performed for each character on the screen.

【００１５】ＣＰＵ４は、検索したテキストデータを出
力バッファメモリ７に格納する（ステップＳ７）。さら
に、ユーザのマウス１１の操作に応じて、ＣＰＵ４は、
ワークステーション１のファイル装置に保存されている
定型文書フォーマット情報から所望の定型文書フォーマ
ットのイメージデータを定型イメージ領域９に展開し
て、画面上のウィンドウに表示する（ステップＳ８）。
即ち、図２に示すように、ユーザが所望する「提案書」
に対応する定型文書フォーマットをウィンドウ２２に表
示する。ユーザはマウス１１を操作して、カーソルによ
りウィンドウ２２の定型イメージから入力項目（ここで
は作成日とする）を選択したと想定する（ステップＳ
９）。The CPU 4 stores the searched text data in the output buffer memory 7 (step S7). Further, according to the operation of the mouse 11 by the user, the CPU 4
Image data of a desired fixed document format is developed in the fixed image area 9 from the fixed document format information stored in the file device of the workstation 1 and displayed in a window on the screen (step S8).
That is, as shown in FIG.
Is displayed in the window 22. It is assumed that the user operates the mouse 11 and selects an input item (here, the creation date) from the fixed image in the window 22 with the cursor (step S).
9).

【００１６】ＣＰＵ４は、出力バッファメモリ７に格納
されたテキストデータ（日付項目の「９４−１２−１
４」という文字列データ）を、ウィンドウ２２で指定さ
れた定型文書フォーマットの「作成日」の項目欄に入力
する（ステップＳ１０）。即ち、ＣＰＵ４は、表示用メ
モリ８の定型イメージ領域９に、出力バッファメモリ７
に格納されたテキストデータに対応するイメージデータ
を展開してウィンドウ２２に表示出力する（ステップＳ
１１）。The CPU 4 reads the text data (date item "94-12-1") stored in the output buffer memory 7.
The character string data “4”) is input into the item field of “creation date” in the standard document format specified in the window 22 (step S10). That is, the CPU 4 stores the output buffer memory 7 in the fixed image area 9 of the display memory 8.
Image data corresponding to the text data stored in the window 22 is displayed and output to the window 22 (step S
11).

【００１７】前述のステップＳ４からＳ１１間での処理
を繰り返すことにより、自由形式で作成された文書デー
タ（文字列データ２０とイメージデータ２１）から指定
した項目の文書データを定型文書フォーマットの該当す
る項目に入力することができる。これにより、自由形式
で作成された文書を定型文書フォーマットに従った文書
に変換し、結果的に所望の定型文書を作成することがで
きる。なお、ＣＰＵ４は、ユーザの要求に応じて定型イ
メージ領域９に展開した定型文書をプリンタにより印刷
出力することができる。また、作成した定型文書をファ
イル装置に保存し、必要に応じて呼出すことにより表示
出力または印刷出力を行なうことができる。なお、ステ
ップＳ８で定型文書フォーマットをウィンドウ２２に表
示する処理を実行しているが、これに限る事なく、ステ
ップＳ３において入力イメージデータを表示するとき
に、同時または直後に表示するような方式でもよい。By repeating the processing between steps S4 to S11 described above, the document data of the designated item from the document data (character string data 20 and image data 21) created in the free format corresponds to the standard document format. Items can be entered. As a result, a document created in a free format can be converted into a document according to a fixed document format, and as a result, a desired fixed document can be created. Note that the CPU 4 can print out a standard document developed in the standard image area 9 by a printer in response to a user request. Further, the created standard document is stored in a file device, and can be displayed or printed out by calling it as needed. The process of displaying the standard document format in the window 22 in step S8 is executed. However, the present invention is not limited to this. For example, when input image data is displayed in step S3, it may be displayed simultaneously or immediately. Good.

【００１８】以上のように本実施形態によれば、ユーザ
は自由形式で作成された文書を利用して、所望の定型文
書フォーマットに従った定型文書を、ワークステーショ
ン１の簡単な操作のみで、効率的に作成することができ
る。即ち、作成元となる文書をイメージスキャナ３とＯ
ＣＲ２により読取り、ワークステーション１に入力する
方式であるため、入力ミス等がほとんどなく、高精度か
つ高速に文書データを入力することが可能とである。さ
らに、定型文書フォーマットをファイル情報としてワー
クステーション１で管理する方式により、書類（ドキュ
メント）の状態で定型文書フォーマットを管理する必要
がなくなり、必要な定型文書フォーマットを高速かつ容
易に画面上に出力することが可能である。（応用形態）図４は本実施形態の応用形態を示すブロッ
ク図であり、イメージスキャナ３としてハンディ型イメ
ージスキャナ４０を使用して、作成元の文書２０を１頁
単位ではなく、例えば項目単位の部分的文書データを入
力する方式である。As described above, according to the present embodiment, a user can use a document created in a free format to create a fixed document according to a desired fixed document format by simply operating the workstation 1. It can be created efficiently. That is, the document to be created is sent to the image scanner 3 and O
Since it is a method of reading by the CR2 and inputting it to the workstation 1, it is possible to input document data with high accuracy and high speed with almost no input errors. Further, the method of managing the standard document format as file information in the workstation 1 eliminates the need to manage the standard document format in the form of a document, and outputs the required standard document format on the screen quickly and easily. It is possible. (Application) FIG. 4 is a block diagram showing an application of the present embodiment. The handy-type image scanner 40 is used as the image scanner 3, and the original document 20 is created not in units of pages but in units of items, for example. This is a method for inputting partial document data.

【００１９】この応用形態では、ユーザはハンディ型イ
メージスキャナ４０により、必要な項目（例えば日付項
目の「９４−１２−１４」という文字列）を読取り、ワ
ークステーション１に入力させる。ワークステーション
１は、画面上のウィンドウに入力イメージ領域１０に展
開された入力イメージデータを表示し、入力イメージデ
ータに対応するテキストデータ（ＯＣＲ２の出力）をメ
モリ６に格納する。In this application form, the user reads necessary items (for example, a character string “94-12-14” of a date item) using the handy image scanner 40 and causes the workstation 1 to input the items. The workstation 1 displays the input image data expanded in the input image area 10 in a window on the screen, and stores text data (output of the OCR 2) corresponding to the input image data in the memory 6.

【００２０】ユーザは画面上で走査して入力した入力イ
メージを確認すると、定型文書フォーマットを画面上の
ウィンドウに表示して、マウス１１により入力項目（作
成日）を指定する。これにより、メモリ６に格納された
テキストデータが定型文書フォーマットの「作成日」の
項目欄に入力される。ここで、前述したように、メモリ
６に格納されたテキストデータを出力バッファメモリに
格納し、この出力バッファメモリから定型文書フォーマ
ットの指定項目に入力するようにしてもよい。When the user checks the input image by scanning on the screen, the user displays the standard document format in a window on the screen, and specifies an input item (creation date) with the mouse 11. As a result, the text data stored in the memory 6 is input to the item column of “creation date” in the standard document format. Here, as described above, the text data stored in the memory 6 may be stored in the output buffer memory, and may be input from the output buffer memory to the designated item of the standard document format.

【００２１】このような応用形態によれば、ユーザは作
成元の文書から必要な項目のみをハンディ型イメージス
キャナ４０により入力すればよいので、特に作成元の文
書から定型文書用として使用する項目が少ない場合に、
前述した本実施形態の方式と比較して入力処理時間の削
減を図ることができる。なお、基本的な作用効果につい
ては本実施形態と同様である。According to such an application, the user only needs to input only necessary items from the original document by using the handy type image scanner 40. Therefore, in particular, the items to be used for the standard document from the original document are prepared. When there are few,
The input processing time can be reduced as compared with the method of the present embodiment described above. The basic operation and effect are the same as in the present embodiment.

【００２２】また、本実施形態及びその応用形態におい
て、定型文書フォーマットを選択して表示する場合に、
入力イメージデータから例えば「題名」の項目を指定す
ることにより、その「題名」（本実施形態では提案書）
に対応する定型文書フォーマットをファイルから自動的
に選択して表示するようにしてもよい。このとき、複数
候補がある場合には、ユーザがマウス１１により選択す
る。なお、入力イメージデータから「題名」の項目を指
定した場合に、出力バッファメモリにはその「題名」に
対応するテキストデータ（提案書という文字データ）が
格納される。In the present embodiment and its application, when a fixed document format is selected and displayed,
For example, by specifying an item of “title” from the input image data, the “title” (in this embodiment, a proposal)
May be automatically selected from the file and displayed. At this time, when there are a plurality of candidates, the user selects them with the mouse 11. When the item of “title” is specified from the input image data, text data (character data of a proposal) corresponding to the “title” is stored in the output buffer memory.

【００２３】[0023]

【発明の効果】以上詳述したように本発明によれば、自
由形式の文書からイメージ処理により文書データを入力
する機能と定型文書フォーマットをファイルとして管理
する機能とを有する文書処理システムにより、自由形式
の文書から定型文書に必要な項目のデータを取出して所
望の定型文書を、簡単な操作で高精度に作成することが
できる。従って、オペレータの操作能力に依存すること
なく、高速かつ確実な定型文書作成を実現することがで
きる。また、定型文書フォーマットを書類として管理す
る必要がないため、煩わしい定型文書フォーマットの管
理処理を不要にすることができる。As described above in detail, according to the present invention, a document processing system having a function of inputting document data from a free-form document by image processing and a function of managing a fixed-form document format as a file is provided. By extracting data of items necessary for a standard document from a format document, a desired standard document can be created with high accuracy by a simple operation. Therefore, it is possible to realize high-speed and reliable creation of a standard document without depending on the operation ability of the operator. Further, since there is no need to manage the standard document format as a document, it is possible to eliminate the need for complicated management of the standard document format.

[Brief description of the drawings]

【図１】本発明の実施形態に関係するシステムの構成を
示すブロック図。FIG. 1 is a block diagram showing a configuration of a system related to an embodiment of the present invention.

【図２】本実施形態の動作を説明するための概念図。FIG. 2 is a conceptual diagram for explaining the operation of the embodiment.

【図３】本実施形態の動作を説明するためのフローチャ
ート。FIG. 3 is a flowchart for explaining the operation of the embodiment.

【図４】本実施形態の応用形態に関係するブロック図。FIG. 4 is a block diagram related to an application form of the embodiment.

[Explanation of symbols]

１…ワークステーション（コンピュータシステム）２…光学的文字読取装置（ＯＣＲ）３…イメージスキャナ（イメージ入力手段）４…ＣＰＵ５…入出力インターフェース６…テキストデータ用メモリ７…出力バッファメモリ８…表示用メモリ（表示手段）９…定型イメージ領域１０…入力イメージ領域２０…印刷文書（テキストデータ）４０…ハンディ型イメージスキャナ Reference Signs List 1 workstation (computer system) 2 optical character reading device (OCR) 3 image scanner (image input means) 4 CPU 5 input / output interface 6 text memory 7 output buffer memory 8 display Memory (display means) 9: fixed image area 10: input image area 20: printed document (text data) 40: handy image scanner

Claims

[Claims]

1. A document processing system for creating and outputting a fixed document by using fixed document format information managed as file information, wherein the document data recorded on the document is converted into image data and input. An image input unit that converts input image data input by the image input unit into text data; image data of a desired fixed document format from the fixed document format information; and the input image data, respectively. Display means for displaying on the screen, text data designation means for extracting text data corresponding to the image range designated in the input image data on the screen from the text data obtained by the data conversion means, Image of the standard document format on the screen And output means for outputting text data extracted by the text data specifying means in association with the input target item as document data in a standard document format when an input target item is specified from the data. Characteristic document processing system.

2. A memory for storing text data converted by the data conversion means, wherein the text data specification means calculates coordinate information of an image range specified on the screen, and calculates the coordinate information. 2. A function for specifying and searching for text data stored in the memory means based on the data.
Document processing system as described.

3. An output buffer means for temporarily storing the text data extracted by the text data designating means, wherein the output means includes an output buffer designated on the screen and an input target item of a standard document format. 2. The document processing system according to claim 1, further comprising a function of creating and displaying document data in a fixed document format by inputting the text data stored in the output buffer means.

4. The data conversion means converts input image data input from the image input means into text data corresponding to an item unit of a fixed document format, and the output means outputs a fixed document specified on the screen. 2. The document processing system according to claim 1, further comprising a function of inputting text data in item units obtained by said data conversion means to a format input target item and outputting the data as document data in a standard document format. .

5. Image input means for converting document data recorded on a document into image data and inputting the data, data converting means for converting the input image data into text data, image data in a fixed document format, and Display means for displaying each of the input image data on a screen, and a fixed document creation method applied to a document processing system for creating and outputting a fixed document using fixed document format information managed as file information Converting the document data recorded on the document into image data by the image input unit and inputting the image data; converting the input image data into text data by the data conversion unit; Displaying the input image data on a screen by A step of designating a desired image range from the input image data on the screen; and a step of extracting text data corresponding to the image range designated on the screen from text data obtained by the data conversion unit. Displaying, on a screen, the image data of the standard document format selected by the display means; and, when an item to be input is specified from the image data of the standard document format on the screen, the extracted text data is displayed. Outputting the data as document data in a fixed document format in association with the input target item.