JPH103516A

JPH103516A - Method and device for processing information

Info

Publication number: JPH103516A
Application number: JP8155507A
Authority: JP
Inventors: Tomotoshi Kanatsu; 知俊金津
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1996-06-17
Filing date: 1996-06-17
Publication date: 1998-01-06

Abstract

PROBLEM TO BE SOLVED: To improve the efficiency of checking and modifying work for character recognition processing by specifying a required area and changing a language mode attribute displayed in accordance with a language mode attribute instruction. SOLUTION: A document picture is inputted by a scanner 4 or the like. A preprocessing part 7 analyzes the input picture, divides the picture into a plurality of areas, judges the attributes (texts, graphics, table, ruled lines, the sorts of the texts, etc.) of respective areas and stores the positional information and attribute information of respective areas (blocks) in a memory 3. A recognition part 8 recognizes characters included in the inputted picture and stores information related to the input document such as the texts (character code strings) of a recognized result and area information in the memory 3 or a storage device 5 as a file. An attribute changing part 6 changes an attribute and a block symbol following the attribute by the operation of a pointing device or the like. A display part 9 displays an image obtained by each processing.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、入力画像の文字を
認識する情報処理方法及び装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information processing method and apparatus for recognizing characters in an input image.

【０００２】本発明は、異なる言語が存在し得る原稿画
像を処理し得る情報処理方法及び装置に関するものであ
る。[0002] The present invention relates to an information processing method and apparatus capable of processing a document image in which different languages can exist.

【０００３】本発明は、汎用コンピュータ上で動作する
アプリケーションの起動を制御する情報処理方法及び装
置に関するものである。[0003] The present invention relates to an information processing method and apparatus for controlling activation of an application running on a general-purpose computer.

【０００４】[0004]

【従来の技術】２種類以上の言語によって書かれた文書
を対象に文字認識を行う場合、言語別に最適な固有処理
を行なう複数の認識処理系を用意し、入力文書別あるい
は入力文書中の文字領域別にユーザーが言語を指定する
操作を行い、各言語に特化した認識処理を行うことで、
どの言語の文字に対しても高い認識率を持つ文字認識装
置が得られる。このような文字認識装置においては、操
作者が文字認識装置に、文書ごとあるいは文書中の文字
領域ごとに正しい言語モードを指定した後、文字認識処
理を行なわせている。2. Description of the Related Art In the case of performing character recognition on a document written in two or more languages, a plurality of recognition processing systems for performing optimal proper processing for each language are prepared, and characters for each input document or characters in the input document are prepared. By performing the operation of specifying the language for each area and performing recognition processing specialized for each language,
A character recognition device having a high recognition rate for characters in any language can be obtained. In such a character recognition device, the operator causes the character recognition device to perform a character recognition process after designating a correct language mode for each document or each character region in the document.

【０００５】文字認識処理の出力結果に対しては確認お
よび修正が必要である。従来の文字認識アプリケーショ
ンでは、認識結果に対して、１．文字認識アプリケーションプログラム内で簡単なテ
キストエディタを持ち、それを用いて確認、修正を行
う。２．ファイルに出力結果を保存する。操作者は他のテキ
ストエディタや文書エディタなどのアプリケーションを
起動し、前記ファイルを開いて確認、修正を行う。とい
う２種類の方法を提供していた。It is necessary to confirm and correct the output result of the character recognition process. In the conventional character recognition application, for the recognition result: We have a simple text editor in the character recognition application program, and use it to check and correct. 2. Save the output results to a file. The operator activates another application such as a text editor or a document editor, opens the file, checks and corrects the file. Two types of methods were provided.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
装置では、設定された言語モードはプログラム内部の変
数値としてのみ存在していた。この変数値は領域を指定
して領域情報のダイアログを開くことで確認できるが、
画像中の各領域毎の言語モードを知る為には、領域ひと
つひとつを指定して領域情報のダイアログを開き値を確
認するという操作を必要としていた。However, in the conventional device, the set language mode exists only as a variable value in the program. This variable value can be confirmed by specifying the area and opening the area information dialog.
In order to know the language mode for each area in the image, it was necessary to specify each area and open an area information dialog to check the value.

【０００７】このため、言語の異なる複数の文書を連続
処理する場合や、文書中の多数の文字領域に異なる言語
モードを指定した後などに、注目領域に対する言語モー
ドが正しく設定されているかどうかを確認する作業は非
常に繁雑であり、操作者の負担となっていた。For this reason, in a case where a plurality of documents having different languages are continuously processed, or after specifying a different language mode for a large number of character areas in the document, it is determined whether or not the language mode for the attention area is correctly set. The task of checking is very complicated and burdens the operator.

【０００８】また、認識結果に対する修正等の方法とし
て提供されるもののうち、１．の方法は、主に誤認識文
字を個々に修正することを目的としたものである。文字
認識処理によって派生した下位候補を操作者に提示する
などして、効果的な誤認識文字修正環境が提供される。[0008] Of the methods provided as a method for correcting the recognition result, etc., 1. The method is mainly intended to individually correct misrecognized characters. An effective misrecognized character correction environment is provided, for example, by presenting the lower candidate derived by the character recognition processing to the operator.

【０００９】しかし、認識結果に単純なテキストエディ
タでは処理できない情報が含まれる場合、上記方法では
不十分である。例えば、表や写真も混在する複雑なフォ
ーマット文書に対する認識結果など、認識結果に文字だ
けではなくイメージ情報、及び文書の構造情報を含む場
合である。However, when the recognition result includes information that cannot be processed by a simple text editor, the above method is insufficient. For example, there are cases where the recognition result includes not only characters but also image information and document structure information, such as a recognition result for a complex format document in which tables and photographs are also mixed.

【００１０】その点、２．の方法では、アプリケーショ
ンを適当に選択することで、複雑なフォーマットの認識
結果も確認・修正することが出来る。例えば、前述の構
造情報を含む認識結果を表示する際には、構造情報に対
応する文書エディタを用いればよい。In that regard, 2. In the method of (1), the recognition result of a complicated format can be confirmed and corrected by appropriately selecting an application. For example, when displaying a recognition result including the above-described structure information, a document editor corresponding to the structure information may be used.

【００１１】しかしこの際には、文字認識結果のファイ
ル出力を待って所望のアプリケーションを起動し、その
アプリケーション上で保存された出力結果ファイルを開
く、という操作を必要とするために、出力結果の確認に
手間がかかった。In this case, however, it is necessary to start the desired application after waiting for the output of the character recognition result file, and to open the output result file saved on the application. It took time to confirm.

【００１２】[0012]

【課題を解決する手段】上記課題を解決する為に、本発
明は、入力画像と、該画像を構成する複数の領域を表す
枠と、前記領域の言語モード属性を表示し、前記表示さ
れている領域における所望の領域を指定し、前記指定さ
れた領域の言語モード属性を指示し、前記指示に応じて
前記表示されている言語モード属性を変更する情報処理
方法及び装置。In order to solve the above problems, the present invention displays an input image, a frame representing a plurality of regions constituting the image, and a language mode attribute of the region. An information processing method and apparatus for designating a desired area in an existing area, indicating a language mode attribute of the specified area, and changing the displayed language mode attribute in accordance with the instruction.

【００１３】上記課題を解決する為に、本発明は好まし
くは、前記言語モード属性に応じて異なる文字認識処理
を行う。In order to solve the above problems, the present invention preferably performs different character recognition processing according to the language mode attribute.

【００１４】上記課題を解決する為に、本発明は好まし
くは、言語モード属性に応じて、異なるシンボルパター
ンを表示する。In order to solve the above problem, the present invention preferably displays different symbol patterns according to the language mode attribute.

【００１５】上記課題を解決する為に、本発明は好まし
くは、前記言語モード属性に応じて言語毎に異なる処理
を前記画像に対して行う。[0015] In order to solve the above problem, the present invention preferably performs different processing for each language on the image according to the language mode attribute.

【００１６】上記課題を解決する為に、本発明は好まし
くは、前記言語モード属性に応じて、異なる色のシンボ
ルパターンを表示する。In order to solve the above problem, the present invention preferably displays a symbol pattern of a different color according to the language mode attribute.

【００１７】上記課題を解決する為に、本発明は好まし
くは、前記言語モード属性に応じて、異なる形状のシン
ボルパターンを表示する。In order to solve the above problem, the present invention preferably displays a symbol pattern having a different shape according to the language mode attribute.

【００１８】上記課題を解決する為に、本発明は好まし
くは、前記画像をスキャナにより入力する。In order to solve the above problems, the present invention preferably inputs the image by a scanner.

【００１９】上記課題を解決する為に、本発明は、汎用
コンピュータ上で動作する文字認識アプリケーションに
おける情報処理方法であって、入力画像を文字認識し、
前記文字認識の結果を保存し、前記文字認識結果の保存
後、他のアプリケーションを自動起動する情報処理方法
及び装置を提供する。In order to solve the above-mentioned problem, the present invention is an information processing method in a character recognition application operating on a general-purpose computer.
Provided is an information processing method and apparatus for storing a result of the character recognition and automatically starting another application after storing the character recognition result.

【００２０】上記課題を解決する為に、本発明は好まし
くは、前記文字認識は、認識結果として複数の出力形式
を持つ。In order to solve the above problems, the present invention preferably has the character recognition having a plurality of output formats as a recognition result.

【００２１】上記課題を解決する為に、本発明は好まし
くは、前記自動起動するアプリケーションは、前記出力
形式に対応したアプリケーションを選択起動する。In order to solve the above-mentioned problem, the present invention preferably provides that the application to be automatically started selects and starts an application corresponding to the output format.

【００２２】上記課題を解決する為に、本発明は好まし
くは、前記自動起動するアプリケーションを、所望のも
のに指定する。In order to solve the above problem, the present invention preferably designates the application to be automatically started to a desired one.

【００２３】上記課題を解決する為に、本発明は、アプ
リケーションを特定し、指定された処理の実行を指示
し、前記指示に応答して前記指定された処理を実行した
後、前記特定されているアプリケーションを起動する情
報処理方法及び装置を提供する。In order to solve the above problems, the present invention specifies an application, instructs execution of a specified process, executes the specified process in response to the instruction, and then executes the specified process. To provide an information processing method and apparatus for starting an application.

【００２４】上記課題を解決する為に、本発明は好まし
くは、前記アプリケーションの特定は、前記処理の実行
指示画面上にアプリケーションを特定する手段を設け、
そこで行う。[0024] In order to solve the above-mentioned problem, the present invention preferably includes the step of specifying the application on a screen for instructing execution of the processing, by specifying the application.
So do it.

【００２５】上記課題を解決する為に、本発明は好まし
くは、前記特定されたアプリケーションは、前記処理の
実行指示画面に対して記憶され、該処理の実行指示画面
が表示される度に該記憶されているアプリケーションを
表示し、編集可能とする。In order to solve the above-mentioned problem, the present invention preferably stores the specified application in an execution instruction screen of the processing, and stores the identified application every time the execution instruction screen of the processing is displayed. The application that is being displayed is displayed and can be edited.

【００２６】上記課題を解決する為に、本発明は好まし
くは、前記アプリケーションの起動は、前記指定された
処理を終了した後に行う。In order to solve the above-mentioned problem, the present invention preferably starts the application after ending the specified processing.

【００２７】上記課題を解決する為に、本発明は好まし
くは、前記アプリケーションの特定は、アプリケーショ
ン名を入力することにより行う。[0027] In order to solve the above-mentioned problem, the present invention preferably specifies the application by inputting an application name.

【００２８】[0028]

【発明の実施の形態】以下、添付図面に従って本発明に
かかる実施の形態を詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the accompanying drawings.

【００２９】〔装置の構成〕図１は発明の実施の形態に
おける装置の構成を示すブロック図である。図１におい
て、１は装置であり、２は装置１全体を制御する演算処
理用の中央処理装置（以下、ＣＰＵという）であり、Ｃ
ＰＵ２は、メモリ３に記憶されている制御プログラムに
従って例えば後述するフローチャートに示すような、本
実施の形態において説明する各種処理を実行する。３は
ＲＯＭ、或はＲＡＭを含むメモリであって、ＣＰＵ３が
実行する為の制御プログラムや、各種データを記憶する
とともに、ＣＰＵ３のワークエリアや、入力部４より入
力された文書画像データを記憶する領域、更にはイメー
ジの中で認識を行う領域を表すブロック枠座標やその属
性情報を格納する場合にも使用する。[Configuration of Apparatus] FIG. 1 is a block diagram showing the configuration of an apparatus according to an embodiment of the present invention. In FIG. 1, reference numeral 1 denotes a device, 2 denotes a central processing unit (hereinafter referred to as a CPU) for arithmetic processing for controlling the entire device 1, and C
The PU 2 executes various processes described in the present embodiment, for example, as shown in a flowchart described later, according to a control program stored in the memory 3. Reference numeral 3 denotes a memory including a ROM or a RAM, which stores a control program to be executed by the CPU 3 and various data, and also stores a work area of the CPU 3 and document image data input from the input unit 4. It is also used to store the area, and also the block frame coordinates indicating the area to be recognized in the image and the attribute information thereof.

【００３０】４は文書画像を読取って入力する入力部
で、例えばスキャナ等により構成され、原稿画像データ
をデジタルデータとして入力する。４’はスキャナ４を
接続し、データの授受、及び制御を行うインターフェイ
ス部である。なお、画像の入力は装置１に直接接続され
たスキャナ４から行うものと限定されるものではなく、
他の情報処理装置で入力された画像のデジタルデータを
通信回線等を介したり、外部記憶装置に記憶されている
画像のデジタルデータをインターフェイス部４’から入
力しても良い。Reference numeral 4 denotes an input unit for reading and inputting a document image, which is constituted by, for example, a scanner or the like, and inputs document image data as digital data. Reference numeral 4 'denotes an interface unit to which the scanner 4 is connected, for exchanging data and controlling. Note that input of an image is not limited to input from the scanner 4 directly connected to the apparatus 1,
Digital data of an image input by another information processing apparatus may be input via a communication line or the like, or digital data of an image stored in an external storage device may be input from the interface unit 4 '.

【００３１】５は記憶装置（ハードディスク、ＦＤ、Ｃ
Ｄ−ＲＯＭ、磁気テープ、ＲＯＭなど）であって、イン
ターフェイス５’を介して装置本体１間でデータの授
受、制御が行われる。メモリ３に記憶される制御プログ
ラムも、この記憶装置から提供されても良い。5 is a storage device (hard disk, FD, C
D-ROM, magnetic tape, ROM, etc.), and data is exchanged and controlled between the main units 1 through the interface 5 '. The control program stored in the memory 3 may also be provided from this storage device.

【００３２】６は属性変更部であって、後述のポインテ
ィングデバイス等による操作によって、属性とそれに伴
うブロックシンボル（詳細は後述する）の変更を行う。
変更指示に応じて、属性変更部６はメモリ３に記憶され
ている属性情報の記憶更新も行う。７は文字認識前処理
部で、メモリ３に記憶された文書画像の画像的特徴を解
析して、ブロック切り出しや、更にそのブロックに含ま
れる１文字毎のパターン画像を切り出す等の、認識の前
処理を行う。８は認識部であり、８−１の日本語認識部
では、前処理部７によって前処理されたパターンが像か
ら幾何学的特徴を抽出し、あらかじめ日本語認識用辞書
８−２に格納されている標準パターンと照合して文書画
像の文字認識を行う。同様に８−３は英語認識部であ
り、英語辞書８−４に格納されているパターンを用いて
文字認識を行う。この２つの認識部のどちらを用いて認
識処理が行われるかはブロック毎に指定された言語モー
ド属性に従う。この言語モード属性は、メモリ３に記憶
されている情報に従う。日本語用認識辞書８−２及び英
語用認識辞書８−４は、各言語に使用される文字の標準
パターンデータ及び、その文字認識結果を言語解析する
認識後処理の為の言語毎の文法情報等が記憶されてい
る。この認識用辞書を他の言語の辞書に切り替えること
により、日本語、英語のみならず、フランス語、イタリ
ア語等、どのような言語にも対応できる。Reference numeral 6 denotes an attribute changing unit for changing an attribute and a corresponding block symbol (to be described later in detail) by an operation using a later-described pointing device or the like.
In response to the change instruction, the attribute change unit 6 also updates the storage of the attribute information stored in the memory 3. Reference numeral 7 denotes a character recognition preprocessing unit that analyzes image characteristics of the document image stored in the memory 3 and performs block pre-recognition, such as block extraction and further pattern extraction for each character included in the block. Perform processing. Reference numeral 8 denotes a recognition unit. In the Japanese recognition unit 8-1, the pattern preprocessed by the preprocessing unit 7 extracts a geometric feature from an image and is stored in advance in the Japanese recognition dictionary 8-2. The character recognition of the document image is performed by collating with the standard pattern. Similarly, an English recognition unit 8-3 performs character recognition by using patterns stored in the English dictionary 8-4. Which of the two recognition units is used for the recognition process depends on the language mode attribute specified for each block. This language mode attribute follows information stored in the memory 3. The Japanese-language recognition dictionary 8-2 and the English-language recognition dictionary 8-4 include standard pattern data of characters used in each language and grammatical information for each language for post-recognition processing for language analysis of the character recognition results. Etc. are stored. By switching the dictionary for recognition to a dictionary of another language, any language such as French, Italian, etc. can be supported, in addition to Japanese and English.

【００３３】９は表示部で、入力された文書画像を表示
するイメージ表示部９−１、その文書画像を認識した結
果のテキストを表示する認識結果表示部９−２、文書画
像から前処理部７により切り出されたブロックを識別で
きるようにブロックの境界線を表示する為のブロック枠
表示部９−３、各ブロックから判定された属性を識別で
きるように属性表示部９−４からなる。１０はＬＣＤや
ＣＲＴ等のディスプレイであって、表示部９の制御によ
り、画像や図形、文字、等を様々な属性を付加して表示
することができる。Reference numeral 9 denotes a display unit, which is an image display unit 9-1 for displaying an input document image, a recognition result display unit 9-2 for displaying a text as a result of recognizing the document image, and a pre-processing unit based on the document image. 7 includes a block frame display section 9-3 for displaying block boundaries so that the blocks cut out by the block 7 can be identified, and an attribute display section 9-4 for identifying attributes determined from each block. Reference numeral 10 denotes a display such as an LCD or a CRT, which can display images, graphics, characters, and the like with various attributes added thereto under the control of the display unit 9.

【００３４】１１は外部の出力装置、例えばＬＢＰやイ
ンクジェットプリンタ等のプリンタであり、インターフ
ェイス部１１’の制御のもと、各種データを印字する。
１２はキーボード及びマウス等の、ユーザが各種データ
の入力、指示を行う為のものであって、インターフェイ
ス部１２’の制御のもと、指示がなされる。１３はシス
テムバスであって、ＣＰＵ２のデータバス、アドレスバ
ス、及び制御信号バス等を含んでいる。Reference numeral 11 denotes an external output device, for example, a printer such as an LBP or an ink jet printer, which prints various data under the control of the interface unit 11 '.
Reference numeral 12 denotes a keyboard and a mouse for the user to input and give various data, and gives instructions under the control of the interface unit 12 '. Reference numeral 13 denotes a system bus, which includes a data bus, an address bus, a control signal bus, and the like of the CPU 2.

【００３５】以上のような構成において、入力部４から
原稿画像を入力し、メモリ３に格納する。そして、格納
された原画像とは別個に、縮小画像を生成し、メモリ３
に格納する。この縮小画像は、原画像のｎ×ｎの画素ブ
ロックを１つの縮小画素とするもので、ｎ×ｎの画素ブ
ロック中に１つでも黒画素が存在する場合に縮小画素を
黒画素として決定するものである。この処理を行うと、
結局原画像を縦横それぞれ１／ｎ倍にしたのと同じこと
になり、かつ連接する文字イメージ（文字パターンのド
ットイメージ）が互いに連結された状態となる。この連
結されたドット分布に外接する矩形を順次定義してい
き、領域分割していく。In the above configuration, a document image is input from the input unit 4 and stored in the memory 3. Then, a reduced image is generated separately from the stored original image and stored in the memory 3.
To be stored. In this reduced image, an n × n pixel block of the original image is used as one reduced pixel. When at least one black pixel exists in the n × n pixel block, the reduced pixel is determined as a black pixel. Things. When you do this,
Eventually, this is the same as when the original image is vertically and horizontally multiplied by 1 / n, and the connected character images (dot images of the character pattern) are connected to each other. A rectangle circumscribing the connected dot distribution is sequentially defined, and the area is divided.

【００３６】以下の説明では、領域分割処理を上記のよ
うにして処理したものとして説明するが、領域分割処理
はこれに限らずいかなる処理で行っても良い。In the following description, it is assumed that the area division processing has been performed as described above, but the area division processing is not limited to this and may be any processing.

【００３７】図２は本発明の実施の形態における、図１
の装置（コンピュータ）１上で動作するソフトウェアの
ブロック図である。装置１のオペレーションシステム２
０１が、各アプリケーションに対して、ＣＰＵ２やメモ
リ３、及び各種外部機器の使用を管理する。このオペレ
ーションシステム２０１の管理下で、本発明に係る文字
認識アプリケーション２０２、テキストエディタ２０
３、文書エディタ２０４、イメージ表示アプリケーショ
ン２０５が選択的に、あるいは同時に実行される。FIG. 2 shows an embodiment of the present invention.
3 is a block diagram of software operating on the device (computer) 1 of FIG. Operation system 2 of device 1
01 manages the use of the CPU 2, the memory 3, and various external devices for each application. Under the control of the operation system 201, the character recognition application 202 and the text editor 20 according to the present invention are used.
3. The document editor 204 and the image display application 205 are selectively or simultaneously executed.

【００３８】図３は、本発明の実施の形態における全体
的な処理を表すフローチャートであって、文書画像がス
キャナ４等により入力され（Ｓ３０１）、文字認識アプ
リケーション２０２により入力画像を前処理部７が解析
して領域分割し、各領域の属性（テキスト、図、表、罫
線、テキストの種類等）を判断し、各領域（ブロック）
の位置情報や、属性情報をメモリ３に記憶する（Ｓ３０
２）。Ｓ３０１で入力された画像に含まれる文字の認識
を文字認識アプリケーション２０２により行い（Ｓ３０
３）、認識結果のテキスト（文字コード列）や、領域情
報等、入力文書に関する情報を一つのファイルとして保
存（Ｓ３０４）する。各処理において行われるイメージ
の表示（例えばＳ５０２等）は、イメージ表示アプリケ
ーション２０５により実行しても良い。FIG. 3 is a flowchart showing the overall processing in the embodiment of the present invention. A document image is input by the scanner 4 or the like (S301), and the input image is converted by the character recognition application 202 into the pre-processing unit 7. Analyzes and divides the area, determines the attributes (text, figure, table, ruled line, text type, etc.) of each area, and determines each area (block)
Is stored in the memory 3 (S30).
2). Characters included in the image input in S301 are recognized by the character recognition application 202 (S30).
3) Information about the input document, such as the text (character code string) of the recognition result and area information, is stored as one file (S304). Display of an image (for example, S502 or the like) performed in each process may be executed by the image display application 205.

【００３９】図４は、Ｓ３０２で行われた領域分割処理
における、領域分割の結果を表示するウインドウの例で
ある。FIG. 4 is an example of a window for displaying the result of the area division in the area division processing performed in S302.

【００４０】入力された文書画像（メモリ３に格納され
ている）と領域分割結果及び領域の属性とを合せてイメ
ージウインドウとしてイメージ表示部９−１の処理によ
りディスプレイ１０に表示する。The input document image (stored in the memory 3), the area division result and the attribute of the area are combined and displayed on the display 10 by the processing of the image display unit 9-1 as an image window.

【００４１】図４において、４１は入力部４で読み取ら
れた文書画像を表示していることを示すイメージウイン
ドウである。４２は領域分割の結果である枠（１つの領
域のサイズを明示する枠）であり、ここではテキスト部
分の段落の位置を示している。４３、４４及び４５はブ
ロックの属性を示すブロックシンボルである。ブロック
の属性とは、ブロック枠内の文字の性質を示すもので、
ブロックの順序、内容（表題、文章）、組み方向（横、
縦）、言語モード（日本語、英語…）などがある。In FIG. 4, reference numeral 41 denotes an image window showing that a document image read by the input unit 4 is displayed. Reference numeral 42 denotes a frame (frame specifying the size of one region) as a result of the region division, which indicates the position of the paragraph in the text portion. 43, 44 and 45 are block symbols indicating the attributes of the block. Block attributes indicate the character of the characters in the block frame.
Block order, content (title, sentence), composition direction (horizontal,
Vertical), language mode (Japanese, English ...).

【００４２】ここではブロックシンボルの色によって、
前述の属性のうち言語モードを表示させている。図４の
４３、４４は枠内の言語モード属性が日本語であること
を示す白地のブロックシンボル、４５は枠内の言語モー
ド属性が英語であることを示す黒地のブロックシンボル
である。Here, depending on the color of the block symbol,
The language mode among the attributes described above is displayed. In FIG. 4, reference numerals 43 and 44 denote block symbols on a white background indicating that the language mode attribute in the frame is Japanese, and reference numeral 45 denotes a block symbol on a black background indicating that the language mode attribute in the frame is English.

【００４３】〔ブロック属性の表示〕以下、イメージウ
インドウにブロック毎の属性を表示し、属性の修正を受
けつけた上で各属性に応じた認識処理を行う例につい
て、Ｓ３０１〜Ｓ３０３の処理ステップを更に詳細にし
た処理を図５のフローチャートに示し、以下に説明す
る。尚、この処理を実行する制御プログラムはメモリ３
に記憶されている。[Display of Block Attribute] Hereinafter, in the example in which the attribute of each block is displayed in the image window, and the attribute correction is accepted, and the recognition process corresponding to each attribute is performed, the processing steps of S301 to S303 are further performed. The detailed processing is shown in the flowchart of FIG. 5 and will be described below. The control program for executing this process is stored in the memory 3
Is stored in

【００４４】まずステップＳ３０１で操作者によりセッ
トされた文書画像を入力部４（スキャナ）により読み取
って入力し、メモリ３に記憶する。First, in step S 301, the document image set by the operator is read and input by the input unit 4 (scanner) and stored in the memory 3.

【００４５】次にステップＳ５０１に進み、前処理部７
により領域分割を行う。領域分割において、テキストの
段落、図、表、罫線などの各ブロックの属性も取り出
す。Next, the process proceeds to step S501, where the preprocessing unit 7
To divide the area. In the region division, attributes of each block such as a paragraph, a figure, a table, and a ruled line of a text are also extracted.

【００４６】ステップＳ５０２では、Ｓ５０１で取り出
され、メモリ３に記憶された属性情報に従って図４に示
したように、領域の属性をブロックシンボルという形式
で表示すると共に、イメージ表示部で読み取った画像を
表示する。初期状態では全ブロックの属性はデフォルト
値に設定される。In step S502, as shown in FIG. 4, the attribute of the area is displayed in the form of a block symbol in accordance with the attribute information stored in the memory 3 and stored in the memory 3, and the image read by the image display unit is displayed. indicate. In the initial state, the attributes of all blocks are set to default values.

【００４７】ステップＳ５０３では操作者が表示部に示
された属性情報を確認し、修正の必要があれば、マウス
やキーボード１２を用いて領域の属性の修正情報を入力
する。属性が修正された領域があるとＳ５０３で判断さ
れた場合は、指示に応じてメモリ３に格納されている属
性情報を変更し、再びステップＳ５０２を実行して修正
されたブロックシンボルを再表示する。Ｓ５０３で属性
の修正がなされていないと判断された後、文字認識処理
に移行する。In step S503, the operator checks the attribute information shown on the display unit, and inputs correction information of the attribute of the area using the mouse or the keyboard 12 if necessary. If it is determined in step S503 that there is a region whose attribute has been corrected, the attribute information stored in the memory 3 is changed in accordance with the instruction, and step S502 is executed again to display the corrected block symbol again. . After it is determined in S503 that the attribute has not been modified, the process proceeds to character recognition processing.

【００４８】ステップＳ５０５では、ブロック毎に指定
された属性をメモリ３より参照し、言語モード属性が日
本語の場合は日本語認識部８−１によりステップＳ５０
６の日本語文字認識を、言語モード属性が英語の場合は
英語認識部８−３によりステップＳ５０７の英語文字認
識を行う。In step S505, the attribute specified for each block is referred to from the memory 3. If the language mode attribute is Japanese, the Japanese recognition unit 8-1 executes step S50.
In step S507, English character recognition is performed by the English recognition unit 8-3 when the language mode attribute is English.

【００４９】最後にステップＳ５０８において認識結果
を出力する。認識結果は操作者の指定によってディスプ
レイ１０の表示画面に表示するだけでなく、記憶装置５
やプリンタ１１に出力することが可能である。Finally, in step S508, the recognition result is output. The recognition result is displayed not only on the display screen of the display 10 according to the designation of the operator but also on the storage device 5.
Or to the printer 11.

【００５０】最後にステップＳ５０８において認識結果
を出力する。認識結果は操作者の指定によってディスプ
レイ１０の表示画面に表示するだけでなく、記憶装置５
やプリンタ１１に出力することが可能である。Finally, in step S508, the recognition result is output. The recognition result is displayed not only on the display screen of the display 10 according to the designation of the operator but also on the storage device 5.
Or to the printer 11.

【００５１】図６はステップＳ５０２におけるブロック
シンボル表示部の処理を示すフローチャートである。Ｓ
６１において各ブロックの属性をメモリ３より読み出
し、ブロックの言語モード属性が日本語のときはＳ６２
に進み白地のブロックシンボルを、ブロックの言語モー
ド属性が英語のときはＳ６３に進み黒地のブロックシン
ボルを表示する。FIG. 6 is a flowchart showing the processing of the block symbol display unit in step S502. S
At 61, the attribute of each block is read from the memory 3, and if the language mode attribute of the block is Japanese, S62
The process proceeds to step S63 to display a block symbol on a white background and a block symbol on a black background when the language mode attribute of the block is English.

【００５２】図７は本発明の実施の形態における属性の
修正処理を可能とする表示の例示図である。操作パネル
７２とイメージウインドウ７１を表示している。イメー
ジウインドウ７に示されているのは領域分割処理後のＳ
５０２の状態で、全ブロックの言語モード属性はデフォ
ルト値にセットされており、ここでは全て日本語となっ
ている。このため図７中のブロックシンボル７３、７４
及び７５の全てが、枠内の文章が日本語であることを示
す白色地で表示されている。FIG. 7 is a view showing an example of a display enabling the attribute correction processing in the embodiment of the present invention. An operation panel 72 and an image window 71 are displayed. What is shown in the image window 7 is S after the area division processing.
In the state of 502, the language mode attribute of all blocks is set to a default value, and here all are in Japanese. Therefore, the block symbols 73 and 74 in FIG.
And 75 are all displayed on a white background indicating that the text in the frame is in Japanese.

【００５３】ここで、ブロックシンボル７５の示す領域
は、実際には英語の文章の領域であるから、認識の前に
ブロックの言語モード属性を英語に変更する必要があ
る。このことは操作者がイメージウインドウの画像とそ
れに上書きされたブロックシンボルの色を比較すること
で容易に確認できる。Here, since the area indicated by the block symbol 75 is actually an English sentence area, it is necessary to change the language mode attribute of the block to English before recognition. This can be easily confirmed by the operator comparing the color of the block symbol overwritten with the image in the image window.

【００５４】操作者は、マウス１２でブロックシンボル
７５を指定し、更に操作パネル７２中の、ブロック属性
を英語に変更するボタン７６をクリックすることで、ブ
ロック７８の言語モード属性値を英語に変更してメモリ
３を更新する。本操作によってブロックシンボルは枠内
の文章が英語であることを示す黒地のブロックシンボル
に変更され、イメージウインドウは図４の４１と同じ状
態になる。なお、本属性変更操作は上記に限らず、他の
操作によって行われるようにしてもよい。The operator changes the language mode attribute value of the block 78 to English by designating the block symbol 75 with the mouse 12 and clicking the button 76 for changing the block attribute to English on the operation panel 72. To update the memory 3. By this operation, the block symbol is changed to a black block symbol indicating that the text in the frame is in English, and the image window is in the same state as 41 in FIG. The attribute change operation is not limited to the above, and may be performed by another operation.

【００５５】以上のように、本発明においては、図７に
おけるような表示によって操作者は表示された画像とブ
ロックシンボルの色との比較から、領域毎に指定すべき
言語モードと、現在設定されている言語モードとの不整
合を容易に発見することができ、多言語文字認識処理の
際の言語モード確認操作の繁雑さを軽減することが出来
る。As described above, in the present invention, by comparing the displayed image with the colors of the block symbols by the display as shown in FIG. 7, the language mode to be designated for each area and the currently set language mode are set. Inconsistency with the current language mode can be easily found, and the complexity of the language mode confirmation operation during multilingual character recognition processing can be reduced.

【００５６】尚、図４においては、ブロックシンボルの
色を、認識に用いられる言語モードの種類により、日本
語では白地、英語では黒地としたが、他の区別の容易な
様々な色を用いることも可能である。In FIG. 4, the color of the block symbol is white in Japanese and black in English, depending on the type of language mode used for recognition. However, various other easily distinguishable colors are used. Is also possible.

【００５７】尚、図７においては、領域分割直後にはす
べてのブロックに等しくデフォルトの言語モードを設定
していたが、枠内の画像を基に言語モードを判別するよ
うな言語判別手段を用い、ブロック毎に言語判別を行っ
た結果を各ブロックのそれぞれの言語モードデフォルト
値としてもよい。このときも、本発明における属性表示
手段によって、操作者はブロックシンボルの色と画像を
比較することで前記の言語判別の正否を容易に確認出来
るので、操作の負担は軽減される。In FIG. 7, the default language mode is set equally for all blocks immediately after the area division. However, a language discriminating means for discriminating the language mode based on the image in the frame is used. Alternatively, the result of performing the language determination for each block may be used as the default language mode value for each block. Also at this time, the operator can easily confirm the correctness of the language discrimination by comparing the color of the block symbol with the image by the attribute display means of the present invention, so that the operation burden is reduced.

【００５８】尚、図４においては、ブロックシンボルを
区別するために色を用いたが、視覚的に区別が容易であ
るように、言語モード属性によってブロックシンボルの
形を変更してもよい。例えば図８で示すように８１と８
２のように形状を変えたり、大きさを変えたり、あるい
は点滅の有無をもって区別してもよい。In FIG. 4, the colors are used to distinguish the block symbols, but the shape of the block symbols may be changed according to the language mode attribute so that the symbols can be visually distinguished easily. For example, as shown in FIG.
The shape may be changed, the size may be changed, or the presence or absence of blinking may be distinguished as shown in FIG.

【００５９】尚、図７においては、言語モードは日本語
と英語の２つであったが、各種多言語を対象とする文字
認識装置においては、それぞれの言語モードを示すブロ
ックシンボル毎に別の色もしくは形状を割り当ててもよ
い。In FIG. 7, there are two language modes, Japanese and English. However, in a character recognition apparatus for various languages, a different language mode is used for each block symbol indicating each language mode. Colors or shapes may be assigned.

【００６０】〔アプリケーション起動制御〕次に、文字
認識アプリケーション２０２の処理を図９のフローチャ
ートに沿って説明する。しかし、ここでは、特に保存処
理Ｓ３０４についてＳ９０１〜Ｓ９０６に詳細に説明す
る。Ｓ３０１〜Ｓ３０３は図５及び図６のフローチャー
トで示した処理と同様であるので、ここでは省略する。[Application Activation Control] Next, the processing of the character recognition application 202 will be described with reference to the flowchart of FIG. However, here, the saving process S304 will be described in detail in S901 to S906. Steps S301 to S303 are the same as the processes shown in the flowcharts of FIGS.

【００６１】まずステップＳ３０１で、操作者はスキャ
ナ４を用いて文書画像データを文字認識プログラムに入
力する。スキャナのかわりに、入力画像として外部記憶
装置５中の、あるいは図示外のネットワークを介して受
信された文書画像ファイルを指定してもよい。入力され
た画像データはメモリ３に格納される。First, in step S301, the operator uses the scanner 4 to input document image data to a character recognition program. Instead of the scanner, a document image file in the external storage device 5 or received via a network (not shown) may be designated as an input image. The input image data is stored in the memory 3.

【００６２】次にステップＳ３０２では、ＣＰＵ２は文
書画像に対し領域分割を行う。領域分割によって文書中
のテキスト、表、罫線、図、写真などの各領域を取り出
し、その文書内での座標や大きさなどからなる構造情報
をメモリ３に記憶する。また図や写真のように文字認識
処理が行えない領域に関してはイメージデータをメモリ
３に記憶する。Next, in step S302, the CPU 2 performs region division on the document image. Each area such as a text, a table, a ruled line, a figure, and a photograph in the document is extracted by the area division, and the structure information including the coordinates and the size in the document is stored in the memory 3. Image data is stored in the memory 3 for an area where character recognition cannot be performed, such as a figure or a photograph.

【００６３】図１０は入力文書の例、図１１は図１０の
文書画像に対する領域分割の結果の例を示し、各々イメ
ージ表示アプリケーション２０５によりディスプレイ１
０に表示される。１１０１、１１０２、１１０３はテキ
スト領域、１１０４は表領域、１１０５は図の領域、１
１０６は罫線である。FIG. 10 shows an example of an input document, and FIG. 11 shows an example of the result of area division for the document image of FIG.
Displayed as 0. 1101, 1102, 1103 are text areas, 1104 is a table area, 1105 is a figure area, 1
106 is a ruled line.

【００６４】ここで、前記領域分割は、文書画像の縮小
によって得られた連結成分の分布から文字の集合を推定
する方式によって行われるが、この方式に限らずいかな
る処理で行ってもよい。Here, the region division is performed by a method of estimating a set of characters from the distribution of connected components obtained by reducing the size of a document image, but the present invention is not limited to this method and may be performed by any process.

【００６５】ステップＳ３０３では、ＣＰＵ２はテキス
トとして属性が与えられた領域内の文字を文字認識す
る。文字認識は文字パターンから幾何学的特徴を抽出
し、認識部８において認識用辞書中の標準パターンと照
合し、文字コードに変換される。In step S303, the CPU 2 recognizes a character in a region whose attribute is given as text. In character recognition, a geometric feature is extracted from a character pattern, collated with a standard pattern in a recognition dictionary in a recognition unit 8, and converted into a character code.

【００６６】ステップＳ９０１において、操作者は認識
結果を保存するかどうかを選択し、キーボード・マウス
１２により指示する。保存される認識結果は、Ｓ３０３
で得たテキスト領域内の文字認識結果である文字コー
ド、Ｓ３０１で記憶された図・写真イメージ、及びそれ
らの構造情報を総合した、文書全体の認識結果である。In step S 901, the operator selects whether or not to save the recognition result, and instructs using the keyboard / mouse 12. The stored recognition result is S303
This is a recognition result of the entire document obtained by synthesizing the character code which is the character recognition result in the text region obtained in step S301, the figure / photograph image stored in step S301, and their structural information.

【００６７】Ｓ９０１で保存を選択するよう指示された
場合、Ｓ９０２において、操作者は保存ファイル名をキ
ーボード１２により入力する。さらにＳ９０３におい
て、認識結果を保存後に結果の確認・修正のために他の
アプリケーションを起動するかどうかを選択し、キーボ
ード或いはマウス１２により指示する。When an instruction to select save is given in S901, the operator inputs a save file name using the keyboard 12 in S902. Further, in step S903, after saving the recognition result, the user selects whether to start another application for checking and correcting the result, and instructs using the keyboard or the mouse 12.

【００６８】認識結果の保存後にアプリケーションを起
動させることが指示されている場合、Ｓ９０４に進み認
識結果をファイルとしてメモリ３或いは記憶装置５に保
存した後、Ｓ９０５において指示に応じて他のアプリケ
ーションを起動する。この際に、他のアプリケーション
に対し、保存された認識結果のファイルを指定して、起
動された他のアプリケーション上で直ちに認識結果内容
が表示されるようにする。If it is instructed to start the application after storing the recognition result, the process proceeds to S904, where the recognition result is stored as a file in the memory 3 or the storage device 5, and then in S905, another application is started according to the instruction. I do. At this time, the saved recognition result file is designated for the other application so that the content of the recognition result is immediately displayed on the started other application.

【００６９】アプリケーションを起動しない場合は、Ｓ
９０６で認識結果をメモリ３或いは記憶装置５にファイ
ル保存して、終了する。If the application is not started, S
In step 906, the recognition result is stored in a file in the memory 3 or the storage device 5, and the process ends.

【００７０】図９のＳ９０１〜Ｓ９０３において表示さ
れているウインドウ例を、図１２に示し、操作について
説明する。FIG. 12 shows an example of the window displayed in S901 to S903 in FIG. 9, and the operation will be described.

【００７１】操作者はキーボードやマウス１２を用いて
ダイアログ１２０２中のテキストボックス１２０３に認
識結果保存のためのファイル名を入力する。The operator uses the keyboard and mouse 12 to enter a file name for storing the recognition result in a text box 1203 in the dialog 1202.

【００７２】また、チェックボックス１２０３によって
アプリケーションの自動起動を有無を選択する。チェッ
クボックス１２０３の状態は、マウス１２でカーソル１
２０８をあわせクリックすることによって、クリックす
る度にオン→オフ→オンと交互に変化する。A check box 1203 is used to select whether to automatically start the application. The state of the check box 1203 is determined by
When the user clicks on the button 208, the state changes from on to off to on each time the click is made.

【００７３】更に１２０５に起動したいアプリケーショ
ン名をキーボード１２により入力することで、操作者の
望む任意のアプリケーションが起動されるようになる。
この入力値は保存用ウインドウ１２０２に対して保存す
ることが出来、変更のない場合は認識操作の際に毎回入
力しておかなくてもよいようになっている。Further, by inputting the name of the application to be started into the keyboard 1205 through the keyboard 12, an arbitrary application desired by the operator is started.
This input value can be stored in the storage window 1202, and if there is no change, it is not necessary to input the value every time the recognition operation is performed.

【００７４】保存ファイル名とアプリケーション起動の
有無を指示した後保存のボタン１２０６をマウス１２に
よりクリックすることで、動作はステップＳ９０４また
はＳ９０６に移る。保存を望まない場合はキャンセルボ
タン１２０７をマウス１２によりクリックすればよい。When the save button 1206 is clicked with the mouse 12 after instructing the save file name and whether or not to start the application, the operation proceeds to step S904 or S906. If saving is not desired, the cancel button 1207 may be clicked with the mouse 12.

【００７５】図１３中の１３０２は、Ｓ９０２において
保存用ウインドウ１２０２の起動アプリケーションとし
て文書エディタを指示したことにより、ステップＳ９０
５で起動されたアプリケーションの例である。このアプ
リケーションは複雑な文書構造を再現出来、文書中の図
や写真などもイメージとして取り込める文書エディタで
ある。In FIG. 13, reference numeral 1302 denotes a step S90 in which a document editor is designated as an application to start the save window 1202 in step S902.
5 is an example of an application started in FIG. This application is a document editor that can reproduce complicated document structures and can also capture figures and pictures in documents as images.

【００７６】以上のように、本発明においては、認識結
果の形式に合った任意のアプリケーションをユーザが指
示できるので、その都度所望のアプリケーションを認識
結果の確認・修正に用いることが可能である上に、それ
らのアプリケーションが認識結果の保存時に自動的に起
動されるので、認識内容を直ちに確認することが出来
る。よって認識後の作業の効率が向上する。As described above, according to the present invention, since the user can specify an arbitrary application conforming to the format of the recognition result, the desired application can be used for checking and correcting the recognition result each time. In addition, since those applications are automatically started when the recognition result is stored, the recognition contents can be immediately confirmed. Therefore, the efficiency of the work after recognition is improved.

【００７７】尚、文字認識結果の出力形式が、テキスト
のみ、構造情報付きテキスト、あるいはイメージ画像の
ように複数選べる場合、その出力形式の選択によってそ
れぞれ異なるアプリケーションが起動するようにしても
よい。その為には、図１４に示すような、文字認識結果
出力形式を選択できるウインドウ１４０２を表示すれば
良い。認識結果出力の設定ダイアログボックス１４０２
中の出力形式選択メニュー１４０３をマウス１２で指定
することで出力形式の選択が出来る。また、１４０４、
１４０５、１４０６はそれぞれの形式の出力を表示する
のに用いるアプリケーションで、その内容は操作者によ
ってキーボード１２から文字列を入力することによって
変更可能である。When a plurality of output formats of the character recognition result can be selected, such as text only, text with structure information, or image images, different applications may be activated depending on the selection of the output format. For this purpose, a window 1402 for selecting a character recognition result output format as shown in FIG. 14 may be displayed. Recognition result output setting dialog box 1402
The output format can be selected by designating the output format selection menu 1403 in the mouse with the mouse 12. Also, 1404,
Reference numerals 1405 and 1406 denote applications used to display the output in each format, the contents of which can be changed by inputting a character string from the keyboard 12 by the operator.

【００７８】上記の選択を認識前、あるいは認識結果保
存前にしておけば、認識結果保存時には出力形式に適合
したアプリケーションが自動的に起動される。よって操
作者は認識結果の形式が如何様であっても内容を直ちに
確認できる。If the above selection is made before the recognition or before the storage of the recognition result, an application suitable for the output format is automatically started when the recognition result is stored. Therefore, the operator can immediately confirm the content regardless of the format of the recognition result.

【００７９】尚、テキスト領域の文字認識処理の直後
に、誤認識文字修正処理を加えてもよい。図１５はその
処理のフローチャートを示す図であり、図９のフローチ
ャートに対して追加されたステップＳ１５００で誤認識
文字修正処理を行う。It is to be noted that an erroneously recognized character correcting process may be added immediately after the character recognizing process of the text area. FIG. 15 is a diagram showing a flowchart of the process. In step S1500 added to the flowchart of FIG. 9, an erroneously recognized character correction process is performed.

【００８０】誤認識文字修正処理として、文字修正エデ
ィタを用いる例を図１６に示す。文字修正エディタウイ
ンドウ１６０３は文字認識アプリケーションプログラム
ウインドウ１６０１の一部として提供されるエディタで
あり、プログラム内部の情報を利用して注目文字毎に候
補１６０４を提示し、操作者に誤認識を効果的に修正す
る環境を提供する。FIG. 16 shows an example in which a character correction editor is used as the erroneously recognized character correction processing. The character correction editor window 1603 is an editor provided as a part of the character recognition application program window 1601, and presents a candidate 1604 for each target character by using information in the program to effectively prevent the operator from erroneous recognition. Provide an environment for modification.

【００８１】このように、誤認識文字の修正に優れたエ
ディタで文字単位の確認・修正を行ない、その後、文字
修正エディタでは確認・修正出来ない部分を他のアプリ
ケーションを用いて確認・修正を行うことが出来る。As described above, an editor excellent in correcting erroneously recognized characters is used for checking and correcting characters, and thereafter, a part which cannot be checked and corrected by the character correcting editor is checked and corrected using another application. I can do it.

【００８２】例えば、文書構造を再現出来る文書エディ
タを他のアプリケーションとし起動し、文書の認識結果
のさらなる確認・修正をおこなうことが出来る。この
際、アプリケーションは認識結果の保存時に自動的に起
動され、認識内容を直ちに確認することが出来るので、
認識後の作業の効率は向上する。For example, a document editor capable of reproducing a document structure can be started as another application to further confirm and correct the recognition result of the document. At this time, the application is automatically started when the recognition result is saved, and the recognition content can be checked immediately.
The efficiency of the work after recognition is improved.

【００８３】[0083]

【発明の効果】以上説明したように、本発明によれば、
入力画像と、該画像を構成する複数の領域を表す枠と、
前記領域の言語モード属性を表示し、前記表示されてい
る領域における所望の領域を指定し、前記指定された領
域の言語モード属性を指示し、前記指示に応じて前記表
示されている言語モード属性を変更することにより、文
字認識処理の確認、修正作業の効率を向上できる。As described above, according to the present invention,
An input image, a frame representing a plurality of regions constituting the image,
Displaying a language mode attribute of the area, specifying a desired area in the displayed area, indicating a language mode attribute of the specified area, and displaying the language mode attribute according to the instruction; , The efficiency of the character recognition processing confirmation and correction work can be improved.

【００８４】以上説明したように、本発明によれば、前
記言語モード属性に応じて異なる文字認識処理を行うこ
とにより、文字認識処理の精度を向上させられる。As described above, according to the present invention, the accuracy of character recognition processing can be improved by performing different character recognition processing according to the language mode attribute.

【００８５】以上説明したように、本発明によれば、言
語モード属性に応じて、異なるシンボルパターンを表示
することにより、言語モード属性の識別を容易にするこ
とができる。As described above, according to the present invention, it is possible to easily identify a language mode attribute by displaying different symbol patterns according to the language mode attribute.

【００８６】以上説明したように、本発明によれば、前
記言語モード属性に応じて言語毎に異なる処理を前記画
像に対して行うことにより、言語に合わせた適切な処理
を行うことができる。As described above, according to the present invention, by performing processing different for each language on the image in accordance with the language mode attribute, it is possible to perform appropriate processing according to the language.

【００８７】以上説明したように、本発明によれば、前
記言語モード属性に応じて、異なる色のシンボルパター
ンを表示することにより、言語モード属性の識別を容易
に行える。As described above, according to the present invention, by displaying symbol patterns of different colors according to the language mode attribute, the language mode attribute can be easily identified.

【００８８】以上説明したように、本発明によれば、前
記言語モード属性に応じて、異なる形状のシンボルパタ
ーンを表示することにより、言語モード属性の識別を容
易に行える。As described above, according to the present invention, by displaying a symbol pattern having a different shape according to the language mode attribute, the language mode attribute can be easily identified.

【００８９】以上説明したように、本発明によれば、前
記画像をスキャナにより入力することにより、どのよう
な原稿画像に対しても処理を可能とする。As described above, according to the present invention, any document image can be processed by inputting the image with a scanner.

【００９０】以上説明したように、本発明によれば、汎
用コンピュータ上で動作する文字認識アプリケーション
における情報処理方法であって、入力画像を文字認識
し、前記文字認識の結果を保存し、前記文字認識結果の
保存後、他のアプリケーションを自動起動することによ
り、文字認識処理後の修正作業等を効率よく行え、ま
た、その為のアプリケーションを起動させる為の作業能
率を向上させられる。As described above, according to the present invention, there is provided an information processing method for a character recognition application operating on a general-purpose computer, comprising the steps of recognizing a character in an input image, storing the result of the character recognition, After the recognition result is saved, by automatically starting another application, correction work after the character recognition processing can be performed efficiently, and work efficiency for starting the application for that purpose can be improved.

【００９１】以上説明したように、本発明によれば、前
記文字認識は、認識結果として複数の出力形式を持つこ
とにより、必要に応じて必要なデータを得ることができ
る。As described above, according to the present invention, the character recognition has a plurality of output formats as recognition results, so that necessary data can be obtained as needed.

【００９２】以上説明したように、本発明によれば、前
記自動起動するアプリケーションは、前記出力形式に対
応したアプリケーションを選択起動することにより、出
力形式に適したアプリケーションを選択することができ
る。As described above, according to the present invention, the application to be automatically started can select an application suitable for the output format by selecting and starting an application corresponding to the output format.

【００９３】以上説明したように、本発明によれば、前
記自動起動するアプリケーションを、所望のものに指定
することにより、ユーザの所望するアプリケーションを
指定できる。As described above, according to the present invention, an application desired by a user can be designated by designating a desired application as the application to be automatically started.

【００９４】以上説明したように、本発明によれば、ア
プリケーションを特定し、指定された処理の実行を指示
し、前記指示に応答して前記指定された処理を実行した
後、前記特定されているアプリケーションを起動するこ
とにより、アプリケーションを起動する時の操作性を向
上させられる。As described above, according to the present invention, an application is specified, an instruction to execute a specified process is issued, and after executing the specified process in response to the instruction, the specified application is executed. By activating the application, the operability at the time of activating the application can be improved.

【００９５】以上説明したように、本発明によれば、前
記アプリケーションの特定は、前記処理の実行指示画面
上にアプリケーションを特定する手段を設け、そこで行
うことにより、アプリケーションの指定操作が容易にな
る。As described above, according to the present invention, the specification of the application is provided on the execution instruction screen of the processing by means for specifying the application, and the application is easily specified. .

【００９６】以上説明したように、本発明によれば、前
記特定されたアプリケーションは、前記処理の実行指示
画面に対して記憶され、該処理の実行指示画面が表示さ
れる度に該記憶されているアプリケーションを表示し、
編集可能とすることにより、デフォルトとして、以前指
定したアプリケーションを提供するので、指定操作が容
易になる。As described above, according to the present invention, the specified application is stored in the processing execution instruction screen, and is stored each time the processing execution instruction screen is displayed. Display the applications that are
By allowing editing, the previously specified application is provided as a default, thereby facilitating the specifying operation.

【００９７】以上説明したように、本発明によれば、前
記アプリケーションの起動は、前記指定された処理を終
了した後に行うことにより、次に起動するアプリケーシ
ョンを指示する為に、指定された処理の終了を待つ必要
がなくなり、操作性が向上する。As described above, according to the present invention, the activation of the application is performed after the termination of the designated process, so that the designated process is started in order to indicate the application to be activated next. There is no need to wait for termination, and operability is improved.

【００９８】以上説明したように、本発明によれば、前
記アプリケーションの特定は、アプリケーション名を入
力することにより行うことにより、アプリケーションの
特定作業が簡単になる。As described above, according to the present invention, by specifying the application by inputting the application name, the operation of specifying the application is simplified.

[Brief description of the drawings]

【図１】発明の実施の形態における装置の構成を示すブ
ロック図FIG. 1 is a block diagram showing a configuration of an apparatus according to an embodiment of the present invention.

【図２】発明の実施の形態におけるソフトウェア構成図FIG. 2 is a software configuration diagram according to the embodiment of the invention;

【図３】全体的な処理のフローチャートFIG. 3 is a flowchart of an overall process;

【図４】イメージウインドウの例示図FIG. 4 is an exemplary view of an image window.

【図５】属性修正及び認識処理のフローチャートFIG. 5 is a flowchart of an attribute correction and recognition process.

【図６】属性表示処理のフローチャートFIG. 6 is a flowchart of an attribute display process.

【図７】属性修正処理時の表示例示図FIG. 7 is an exemplary display at the time of attribute correction processing.

【図８】属性表示の第二の例FIG. 8 shows a second example of attribute display.

【図９】認識結果保存処理のフローチャートFIG. 9 is a flowchart of a recognition result storing process.

【図１０】入力原稿画像の例示図FIG. 10 is a view showing an example of an input document image;

【図１１】図１０の画像を領域分割した結果の図FIG. 11 is a view showing a result of dividing the image of FIG. 10 into regions;

【図１２】保存ウインドウ例示図FIG. 12 is an exemplary diagram of a save window.

【図１３】他のアプリケーションが起動された時の表示
例示図FIG. 13 is a diagram showing an example of display when another application is activated.

【図１４】認識結果出力の設定ウインドウ例示図FIG. 14 is a view showing an example of a setting window of a recognition result output.

【図１５】誤認識文字修正ステップを加えた処理のフロ
ーチャートFIG. 15 is a flowchart of processing in which an erroneously recognized character correction step is added.

【図１６】誤認識文字修正時の表示例示図FIG. 16 is a view showing an example of display when correcting a misrecognized character.

Claims

[Claims]

1. An input image, a frame representing a plurality of areas constituting the image, and a language mode attribute of the area are displayed, a desired area in the displayed area is specified, and the specified area is displayed. An information processing method comprising: instructing a language mode attribute of an area; and changing the displayed language mode attribute in accordance with the instruction.

2. The information processing method according to claim 1, wherein different character recognition processing is performed according to the language mode attribute.

3. The information processing method according to claim 1, wherein different symbol patterns are displayed according to the language mode attribute.

4. The information processing method according to claim 1, wherein processing different for each language is performed on the image according to the language mode attribute.

5. The information processing method according to claim 1, wherein a symbol pattern of a different color is displayed according to the language mode attribute.

6. The information processing method according to claim 1, wherein a symbol pattern having a different shape is displayed according to the language mode attribute.

7. The information processing method according to claim 1, wherein the image is input by a scanner.

8. An input image, frames representing a plurality of regions constituting the image, display means for displaying a language mode attribute of the region, and region designation for designating a desired region in the displayed region Information processing apparatus, comprising: means for specifying a language mode attribute of the designated area.

9. An information processing method for a character recognition application operating on a general-purpose computer, wherein the input image is subjected to character recognition, the result of the character recognition is stored, and after storing the character recognition result, another application is executed. An information processing method characterized by automatically starting.

10. The information processing method according to claim 9, wherein the character recognition has a plurality of output formats as a recognition result.

11. The information processing method according to claim 10, wherein the automatically started application selects and starts an application corresponding to the output format.

12. The information processing method according to claim 9, wherein the application to be automatically started is designated as a desired one.

13. The method according to claim 1, wherein an application is specified, execution of a specified process is instructed, and after executing said specified process in response to said instruction, said specified application is started. Information processing method.

14. The information processing method according to claim 13, wherein a means for specifying the application is provided on the execution instruction screen of the processing, and the specification of the application is performed there.

15. The specified application,
15. The information according to claim 14, wherein the stored application is stored with respect to the processing execution instruction screen, and the stored application is displayed and edited each time the processing execution instruction screen is displayed. Processing method.

16. The information processing method according to claim 1, wherein the activation of the application is performed after the specified processing is completed.

17. The information processing method according to claim 13, wherein the specification of the application is performed by inputting an application name.

18. An application specifying unit for specifying an application, an execution instructing unit for instructing execution of a specified process, and executing the specified process in response to the instruction, and then executing the specified application. An information processing apparatus, comprising: an application activating unit for activating the application.