JPH0981583A

JPH0981583A - Device and method for document filing

Info

Publication number: JPH0981583A
Application number: JP7234429A
Authority: JP
Inventors: Mieko Asano; 三恵子浅野; Shigeyoshi Shimotsuji; 成佳下辻
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1995-09-12
Filing date: 1995-09-12
Publication date: 1997-03-28

Abstract

PROBLEM TO BE SOLVED: To provide the device and method for document filing which facilitate registration and retrieval of a document picture in a filing device. SOLUTION: With respect to the document filing device which takes in pictures of various documents to file them, the picture of a document is inputted by a picture input part 11 and is stored in a memory 12, and a picture processing part 13 segments a desired partial picture from the entire picture of the document stored in this memory 12, and the entire picture and the partial picture are preliminarily stored as document data in a preservation part 14 correspondingly to each other, and this partial picture is shown in a display part 15 as the index at the time or retrieval of the document.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は各種ドキュメント
をファイリングするドキュメントファイリング装置およ
び方法に係り、特に簡単な構成を付加するのみでドキュ
メント画像のファイル装置への登録および検索を容易と
するドキュメントファイリング装置および方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document filing apparatus and method for filing various documents, and a document filing apparatus and a method for facilitating registration and retrieval of a document image in a file apparatus only by adding a simple structure, Regarding the method.

【０００２】[0002]

【従来の技術】従来のドキュメントファイリング装置で
は、ドキュメント画像の保存方法として、ドキュメント
１枚分の画像を取り込んで、このファイルに名前をつけ
て保存する方法や、ドキュメント上に記載された文字お
よび図形を画像情報ではなくコード情報に変換してコー
ドファイルとして保存する方法などが多く採用されてい
る。また、近年、ドキュメント１枚全体の画像を保存す
るのではなく、部分的に一部の画像を記憶する手法が提
案されてきている。たとえば、特開平４−３２１１８８
号公報には、未記入帳票のフォーマットデータを用い
て、帳票に記入された画像のみを記憶する方法が開示さ
れている。2. Description of the Related Art In a conventional document filing apparatus, as a method of saving a document image, a method of capturing an image of one document and giving a name to this file and saving it, and characters and figures described on the document A method of converting the image information into code information instead of image information and saving it as a code file is often used. Further, in recent years, a method of partially storing an image of a document instead of storing the entire image of a document has been proposed. For example, JP-A-4-321188
The publication discloses a method of storing only the image entered in the form by using the format data of the blank form.

【０００３】また、従来のドキュメントファイリング装
置では、保存したドキュメント画像の検索方法として、
予め設定されて呈示されるキーワードの中から、いずれ
かのキーワードを指定する方法や、オペレータがキーボ
ードなどから任意のキーワードを入力する方法などが多
く採用されている。Further, in the conventional document filing apparatus, as a method of searching the stored document image,
A method of designating one of the keywords set and presented in advance and a method of allowing an operator to input an arbitrary keyword from a keyboard or the like are often employed.

【０００４】ところが、ドキュメント１枚分の画像を記
憶し管理する方法では、検索のためのキーワード情報と
して、オペレータがマニュアル操作で各々の画像に対し
固有の名前をつけるなどといったことが必要となり、作
業が著しく煩雑であった。However, in the method of storing and managing the image of one document, it is necessary for the operator to manually give a unique name to each image as keyword information for the search. Was extremely complicated.

【０００５】一方、ドキュメント上に記載された文字、
図形を画像情報からコード情報に変換する方法では、変
換後のコードデータを検索情報として用いることができ
るものの、これらの変換作業の多くをオペレータのマニ
ュアル操作で行う必要があり、その労力は非常に多大で
ある。On the other hand, the characters written on the document,
In the method of converting a figure from image information to code information, the code data after conversion can be used as search information, but many of these conversion operations need to be performed manually by an operator, which is extremely labor intensive. It's a lot.

【０００６】さらに、従来の検索方法では、ファイリン
グされているドキュメントを表示して検索する際に、帳
票の量が多いと膨大な時間を費やしてしまうといった問
題があった。Further, the conventional retrieval method has a problem that a huge amount of time is spent when a filed document is displayed and retrieved, if the amount of the form is large.

【０００７】[0007]

【発明が解決しようとする課題】前述したように、従来
のドキュメントファイリング装置においては、ドキュメ
ント１枚分の画像に名前を付けて保存するといったもの
や、ドキュメントをコード情報に変換して保存するとい
ったものが存在するが、いずれにしてもドキュメントを
検索する前に、使用する情報を予めオペレータがマニュ
アル操作でドキュメントに与えておく必要があるので、
その作業が著しく煩雑であり、多大な労力を必要とする
問題があった。As described above, in the conventional document filing apparatus, one image of a document is named and saved, or the document is converted into code information and saved. There are things, but in any case, before searching for a document, it is necessary for the operator to manually give the information to be used to the document in advance,
There is a problem that the work is remarkably complicated and requires a lot of labor.

【０００８】また、従来の検索システムでは、ファイリ
ングされているドキュメントを表示して検索する際に、
帳票の量が多いと膨大な時間を費やしてしまうといった
問題があった。Further, in the conventional search system, when displaying and searching a filed document,
There was a problem that a huge amount of time was spent when the amount of forms was large.

【０００９】本発明は、このような事情に鑑みてなされ
たものであり、ドキュメント画像のファイル装置への登
録を容易とし、かつドキュメントの検索を容易にするド
キュメントファイリング装置および方法を提供すること
を目的とする。The present invention has been made in view of the above circumstances, and it is an object of the present invention to provide a document filing apparatus and method that facilitate registration of a document image in a file device and facilitate document retrieval. To aim.

【００１０】[0010]

【課題を解決するための手段】本発明は、各種ドキュメ
ントの画像を取り込んでファイリングするドキュメント
ファイリング装置において、前記ドキュメントの画像を
入力する画像入力手段と、この画像入力手段により入力
された前記ドキュメントの全体画像から部分画像を切り
出す部分画像切り出し手段と、前記全体画像と部分画像
とを対応づけてドキュメントデータとして格納するファ
イル手段とを具備し、前記ドキュメントを検索する際
に、前記部分画像をインデックスとしてユーザに呈示す
ることを特徴とする。SUMMARY OF THE INVENTION The present invention is a document filing apparatus for fetching and filing images of various documents, and image input means for inputting the image of the document, and the document input by the image input means. It comprises a partial image cutout means for cutting out a partial image from the entire image, and a file means for storing the entire image and the partial image in association with each other as document data, and when the document is searched, the partial image is used as an index. It is characterized by being presented to the user.

【００１１】本発明によれば、ドキュメントを文字デー
タおよび画像データとしてファイルに登録する際に、登
録したい所望のドキュメントを画像として入力し、入力
された画像データからドキュメントのインデックスとな
る部分画像を切り出してドキュメントと対応づけて保存
する。そして、この部分画像をユーザに呈示して、ドキ
ュメント検索に利用する。According to the present invention, when a document is registered in a file as character data and image data, a desired document to be registered is input as an image, and a partial image serving as a document index is cut out from the input image data. Save it in association with the document. Then, this partial image is presented to the user and used for document retrieval.

【００１２】すなわち、本発明によれば、ドキュメント
をファイルに保存する際に、自動的に部分画像インデッ
クスをつけて保存が行なえるので、ドキュメントのファ
イルへの登録が容易となるとともに、部分画像を利用し
たファイル内のドキュメントの検索が容易に行なえるこ
ととなる。That is, according to the present invention, when a document is saved in a file, the partial image index can be automatically added and saved, which facilitates registration of the document in the file and saves the partial image. The documents in the used files can be easily searched.

【００１３】前記部分画像切り出し手段としては、ユー
ザの指示または予め設定された切り出し情報に基づいて
前記全体画像から前記部分画像を切り出す手段をりよう
できる。The partial image cutout means may be a means for cutting out the partial image from the entire image based on a user's instruction or preset cutout information.

【００１４】また、前記ファイル手段は、複数の部分画
像を前記全体画像に対応づける手段を含み、前記ドキュ
メントを検索する際に、前記複数の部分画像をインデッ
クスとしてユーザに順次呈示することが好ましい。Further, it is preferable that the file means includes means for associating a plurality of partial images with the whole image, and when the document is searched, the plurality of partial images are sequentially presented to the user as an index.

【００１５】また、前記部分画像切り出し手段は、ユー
ザまたは前記切り出し情報により指示される切り出しパ
ターンを複数種類有し、前記ファイル手段は、前記切り
出された部分画像と、その部分画像を切り出した前記切
り出しパターンの種別とを対応づけて格納する手段を含
むことが好ましい。Further, the partial image cutout means has a plurality of kinds of cutout patterns instructed by a user or the cutout information, and the file means is the cutout partial image and the cutout obtained by cutting out the partial image. It is preferable to include means for storing the pattern types in association with each other.

【００１６】また、前記全体画像に対応づけられた前記
部分画像に対し文字認識または図形認識を施してコード
情報に変換する認識手段をさらに具備し、前記ファイル
手段は、このコード情報を前記ドキュメントに対応づけ
て格納する手段を含むことが好ましい。The file means further comprises a recognizing means for performing character recognition or graphic recognition on the partial image associated with the whole image and converting the partial image into code information. It is preferable to include means for storing in association with each other.

【００１７】また、前記部分画像切り出し手段は、前記
全体画像から線分を抽出する線分抽出手段と、前記全体
画像から文字列領域を抽出する文字列抽出手段とを含
み、前記抽出した線分上に存在する文字列領域の画像を
切り出すことが好ましい。Further, the partial image cutout means includes a line segment extraction means for extracting a line segment from the whole image and a character string extraction means for extracting a character string area from the whole image, and the extracted line segment. It is preferable to cut out the image of the character string region existing above.

【００１８】また、前記部分画像切り出し手段は、前記
全体画像から修飾文字領域を抽出する修飾文字抽出手段
を含み、前記抽出した修飾文字領域内の画像を切り出す
ことが好ましい。Further, it is preferable that the partial image cutout means includes a modified character extraction means for extracting a modified character area from the entire image, and cuts out an image in the extracted modified character area.

【００１９】また、前記修飾文字抽出手段としては、文
字枠抽出手段、文字列抽出手段、線分抽出手段および領
域内の線幅、黒画素密度、輪郭線分もしくは黒画素の連
結領域を計測する計測手段の少なくともいずれか１つを
含み、網掛け文字領域、下線領域、斜体文字領域、白抜
き文字領域、拡大文字領域および太線文字領域の少なく
ともいずれか１つを抽出することを特徴とする。As the modified character extraction means, a character frame extraction means, a character string extraction means, a line segment extraction means, and a line width, a black pixel density, a contour line segment or a connected area of black pixels in the area are measured. It is characterized in that at least one of the measuring means is included and at least one of a shaded character area, an underlined area, an italic character area, an outlined character area, an enlarged character area, and a bold character area is extracted.

【００２０】また、前記部分画像切り出し手段は、前記
全体画像から文字列領域を抽出する手段を含み、予め指
定された位置に存在する文字列領域内の画像を切り出す
ように構成してもよい。The partial image cutting means may include means for extracting a character string area from the entire image, and may be configured to cut out an image in the character string area existing at a position designated in advance.

【００２１】また、前記画像に対し文字認識を施してコ
ード情報に変換する認識手段をさらに具備し、前記部分
画像切り出し手段は、前記全体画像から文字列領域を抽
出する文字抽出手段を含み、この抽出した文字列領域内
の文字コードを前記文字認識手段により取得して、この
取得した文字コードが予め登録された文字コードを含む
ときにその文字列領域内の画像を切り出すことが望まし
い。Further, a recognition means for performing character recognition on the image to convert it into code information is further provided, and the partial image cutout means includes a character extraction means for extracting a character string region from the whole image. It is preferable that the character code in the extracted character string area is acquired by the character recognition means, and an image in the character string area is cut out when the acquired character code includes a preregistered character code.

【００２２】また、前記画像に対し文字認識を施してコ
ード情報に変換する認識手段をさらに具備し、前記部分
画像切り出し手段は、前記全体画像から文字列領域を抽
出する文字抽出手段を含み、この抽出した文字列領域内
の文字コードを前記文字認識手段により取得して、この
取得した文字コードが予め登録された文字コードを含む
ときに、部分画像として抽出すべき領域を示す位置情報
にて指定された位置に存在する部分画像を切り出すこと
がさらに好ましい。Further, the image processing apparatus further comprises a recognizing means for performing character recognition on the image to convert it into code information, and the partial image cutting means includes a character extracting means for extracting a character string area from the whole image. A character code in the extracted character string area is acquired by the character recognition means, and when the acquired character code includes a preregistered character code, it is designated by position information indicating an area to be extracted as a partial image. It is more preferable to cut out the partial image existing at the specified position.

【００２３】また、前記部分画像切り出し手段は、前記
全体画像から図、表および絵の領域を抽出する手段と、
前記全体画像から文字列領域を抽出する文字列抽出手段
とを含み、図、表および絵の領域の上または下に位置す
る文字列領域内の画像を切り出すことを特徴とする。Further, the partial image cutout means extracts means for drawing, table and picture areas from the whole image,
A character string extracting unit for extracting a character string area from the entire image, and cutting out an image in the character string area located above or below the figure, table and picture areas.

【００２４】また、前記部分画像切り出し手段は、前記
全体画像から図、表および絵の領域を抽出する手段を含
み、図、表および絵の領域の画像を切り出すことを特徴
とする。Further, the partial image cutout means includes means for extracting a figure, table and picture area from the whole image, and cuts out an image of the figure, table and picture area.

【００２５】また、前記ファイル手段は、前記部分画像
切り出し手段により切り出された部分画像を部分画像自
体またはその部分画像の全体画像上の位置情報のいずれ
かで格納することを特徴とする。Further, the file means is characterized in that the partial image cut out by the partial image cutout means is stored as either the partial image itself or position information on the entire image of the partial image.

【００２６】[0026]

【発明の実施の形態】以下、図面を参照して本発明の一
実施形態を説明する。BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of the present invention will be described below with reference to the drawings.

【００２７】図１は同実施形態に係るドキュメントファ
イリング装置の基本的な処理機能を示すブロック図であ
る。FIG. 1 is a block diagram showing the basic processing functions of the document filing apparatus according to the embodiment.

【００２８】図１に示すように、同実施形態に係るドキ
ュメントファイリング装置は、制御部１０、画像入力部
１１、メモリ１２、画像処理部１３、保存部１４、表示
部１５、入力部１６および文字認識部１７を具備してな
る。As shown in FIG. 1, the document filing apparatus according to the embodiment has a control unit 10, an image input unit 11, a memory 12, an image processing unit 13, a storage unit 14, a display unit 15, an input unit 16 and characters. The recognition unit 17 is provided.

【００２９】制御部１０は、ドキュメントファイリング
装置全体の制御を司る。また、画像入力部１１は、図面
や文書などの各種ドキュメントの画像を入力する。な
お、この画像入力部１１は、ラインセンサなどのイメー
ジスキャナを用いて原稿上に記載されたドキュメント情
報を光学的に画像入力するものであっても良いし、また
画像ファイルから二値画像化されているドキュメント情
報を読み込むようなものであってもよい。The control unit 10 controls the entire document filing apparatus. The image input unit 11 also inputs images of various documents such as drawings and documents. The image input unit 11 may be one that optically inputs the document information described on the document using an image scanner such as a line sensor, or a binary image is converted from the image file. It may be one that reads the document information that is stored.

【００３０】この画像入力部１１から入力されたドキュ
メントの画像は、メモリ１２に格納される。そして、画
像処理部１３は、このメモリ１２に格納された画像デー
タに対して部分画像の切り出しなどを行なう。また、保
存部１４は、入力されたドキュメントの画像、切り出し
画像および認識結果のコードデータなどを保存する。そ
して、表示部１５は、保存部１４に保存されているドキ
ュメントや切り出された部分画像を表示する。The image of the document input from the image input unit 11 is stored in the memory 12. Then, the image processing unit 13 cuts out a partial image from the image data stored in the memory 12. Further, the storage unit 14 stores the image of the input document, the cut-out image, the code data of the recognition result, and the like. Then, the display unit 15 displays the document stored in the storage unit 14 and the cut out partial image.

【００３１】入力部１６は、表示部１５に複数表示され
たドキュメントからいずれかのドキュメントを選択する
際に使用されるキーボードやマウスなどである。また、
文字認識部１７は、画像処理部１３により切り出された
部分画像の文字コードを取得する。したがって、この文
字認識部１７は、部分画像を画像情報のまま保存部１４
にて保持する際には用いられないことになる。The input unit 16 is a keyboard, a mouse or the like used when selecting one of the documents displayed on the display unit 15. Also,
The character recognition unit 17 acquires the character code of the partial image cut out by the image processing unit 13. Therefore, the character recognition unit 17 stores the partial image as the image information in the storage unit 14.
It will not be used when holding.

【００３２】次に、図２を参照して同実施形態のドキュ
メントファイリング装置の動作を説明する。Next, the operation of the document filing apparatus of the same embodiment will be described with reference to FIG.

【００３３】まず、画像入力部１１が、ファイリングす
るドキュメントの画像をスキャンして、得られた画像信
号をメモリ１２に格納する（図２のステップＡ１）。次
に、画像処理部１３が、メモリ１２に格納されたドキュ
メントの画像信号から、切り出し情報に基づいて所望の
部分画像を切り出す（図２のステップＡ２）。この切り
出し情報は、入力部１６を介してユーザが指示しても良
いし、後述する自動処理により得られたものを用いても
よい。そして、画像処理部１３は、この切り出した部分
画像とドキュメントとを対応づけて、保存部１４にファ
イリングする（図２のステップＡ３）。First, the image input section 11 scans the image of the document to be filed and stores the obtained image signal in the memory 12 (step A1 in FIG. 2). Next, the image processing unit 13 cuts out a desired partial image from the image signal of the document stored in the memory 12 based on the cutout information (step A2 in FIG. 2). The cutout information may be instructed by the user via the input unit 16 or may be obtained by an automatic process described later. Then, the image processing unit 13 associates the cut-out partial image with the document and files in the storage unit 14 (step A3 in FIG. 2).

【００３４】この対応づけの例としては、たとえば入力
画像名が［入力００１］であれば、部分画像を［入力０
０１部分画像００１］などといった名前にして保存して
おけば、ファイル名から部分画像、および部分画像から
ファイル名といった相互の検索が行なえることになる。
なお、画像ファイリング方法はどのような方法を用いて
も構わない。As an example of this correspondence, if the input image name is [input 001], the partial image is [input 0
01 partial image 001] and the like, it becomes possible to perform mutual searches such as a partial image from a file name and a partial image from a file name.
Any method may be used as the image filing method.

【００３５】図３に部分画像と文書とを対応づけた例を
示す。なお、ここで保存される部分画像については、入
力画像の一部の画像をそのまま別に保持するといった方
法や、入力画像上の位置情報のみを保持するといった方
法などが考えられる。FIG. 3 shows an example in which partial images and documents are associated with each other. As for the partial image to be stored here, a method of holding a part of the input image as it is, a method of holding only the positional information on the input image, and the like can be considered.

【００３６】一方、保存部１４にファイリングされてい
るドキュメントを検索する際には、図４に示すように、
表示部１５に部分画像それぞれを１つのドキュメントを
示すアイコンに関連づけて一覧表示する。そして、ユー
ザは、この表示された部分画像のいずれかを入力部１６
を介して指示することによりドキュメントを保存部１４
からロードする。たとえば、表示部１５に表示されてい
る部分画像［部分画像００１］が入力部１６を介してユ
ーザから指示された際には、保存部１４に保持されてい
る［入力００１部分画像００１］の対応関係より、ドキ
ュメント［入力００１］を保存部１４からロードする。
また、この表示されている部分画像について、保存部１
４に保持されているものが複数個ある際には、一定時間
経過後に表示する部分画像を切り替えるように表示する
か、または入力部１６を介して「次」の部分画像の表示
指示が行なわれた際に部分画像を切り替えるように表示
する。なお、この保存部１４に保存されているドキュメ
ントの部分画像の表示方法は、図４に示した以外に、図
５に示したように一覧表にして、ドキュメントの名前、
作成日付およびキーワードとともに並べて表示してもよ
い。On the other hand, when searching a document filed in the storage unit 14, as shown in FIG.
Each partial image is displayed as a list on the display unit 15 in association with an icon indicating one document. Then, the user inputs one of the displayed partial images into the input unit 16
The document is stored in the storage unit 14 by instructing via the
Load from. For example, when the partial image [partial image 001] displayed on the display unit 15 is instructed by the user via the input unit 16, the correspondence of the [input 001 partial image 001] stored in the storage unit 14 is obtained. Based on the relationship, the document [input 001] is loaded from the storage unit 14.
In addition, with respect to this displayed partial image, the storage unit 1
When there are a plurality of items held in 4, the partial images to be displayed are switched so that they will be displayed after a certain period of time, or the “next” partial image is instructed to be displayed via the input unit 16. The partial image is displayed so as to be switched. In addition to the method shown in FIG. 4, the method of displaying the partial images of the document stored in the storage unit 14 is not limited to that shown in FIG.
It may be displayed side by side with the creation date and keywords.

【００３７】図６に同実施形態の線上に位置する文字列
画像を切り出す際の動作手順を示す。FIG. 6 shows an operation procedure for cutting out the character string image located on the line of the embodiment.

【００３８】この切り出しは、線上に位置する文字列画
像が、そのドキュメント全体を代表する部分画像となり
うる可能性が高いことに基づいている。This clipping is based on the possibility that the character string image located on the line can be a partial image representing the entire document.

【００３９】画像処理部１３は、まず画像中の線分を抽
出する（図６のステップＢ１）。この線分抽出は、たと
えば図７に示したような手順で行なう。The image processing section 13 first extracts a line segment in the image (step B1 in FIG. 6). This line segment extraction is performed, for example, by the procedure shown in FIG.

【００４０】すなわち、まず、入力データをラスタスキ
ャンしてその追跡開始点を検出した後、その追跡開始点
から時計周りあるいは反時計周りに図形境界がつくる閉
曲線を追跡することによって輪郭を抽出する（図７のス
テップＣ１）。このような閉曲線追跡により抽出された
輪郭の情報は、座標点列、あるいは方向コードの列であ
るチェインコードとして保存される。なお、この輪郭抽
出処理については、従来より種々提唱されている他の輪
郭抽出の手法を用いても良いことは言うまでもない。That is, first, the input data is raster-scanned to detect the tracking start point thereof, and then the contour is extracted by tracing the closed curve formed by the figure boundary clockwise or counterclockwise from the tracking start point ( Step C1 of FIG. 7). The contour information extracted by such closed curve tracing is stored as a chain code that is a sequence of coordinate points or a sequence of direction codes. It is needless to say that other contour extraction methods conventionally proposed variously may be used for this contour extraction processing.

【００４１】次に、この輪郭線データをもとに、画像の
交差部や角点という特徴点を検出するが、これは、たと
えば輪郭点列から凸／凹の形状をする点列を特徴点とし
て抽出する。この特徴点抽出は、たとえば曲率算出処理
（図７のステップＣ２）によって求められた輪郭点列の
曲率ｋを所定の閾値Ｋ０，Ｋ１と比較して、（ｋ≦Ｋ
１）となる点を凸点列とすることにより求める（図７の
ステップＣ３）。Next, based on this contour line data, characteristic points such as intersections and corner points of the image are detected. For example, the characteristic points are a point sequence having a convex / concave shape from the contour point sequence. To extract. This feature point extraction is performed by, for example, comparing the curvature k of the contour point sequence obtained by the curvature calculation process (step C2 in FIG. 7) with predetermined threshold values K0 and K1, and (k ≦ K
The points 1) are obtained as a convex point sequence (step C3 in FIG. 7).

【００４２】次に、これら凸／凹部の輪郭点列を線分と
してその始点／終点を計算する。これにより入力画像中
の直線線分部分を検出することができ、閥値Ｔより長い
線分を以下の処理対象とする（図７のステップＣ４）。Next, the start point / end point are calculated by using these convex / concave contour point sequences as line segments. Thereby, the straight line segment portion in the input image can be detected, and the line segment longer than the threshold value T is subjected to the following processing (step C4 in FIG. 7).

【００４３】なお、線分抽出方法は、画像の水平方向の
黒画素の連りを計測し、閾値Ｔより長い部分を検出し水
平線分を検出する方式をとってもよいし、従来より種々
提唱されている手法など他の方法を用いてもよい。The line segment extraction method may be a method of measuring a series of black pixels in the horizontal direction of an image, detecting a portion longer than a threshold T to detect a horizontal line segment, and various methods have been proposed in the past. You may use other methods, such as the method.

【００４４】次に、画像処理部１３は、画像中の文字列
を抽出する（図６のステップＢ２）。この文字列抽出
は、たとえば以下の方法で行う。Next, the image processing unit 13 extracts the character string in the image (step B2 in FIG. 6). This character string extraction is performed by the following method, for example.

【００４５】まず、画像を連結領域ごとにラベルづけし
て外接図形を求める。または黒領域の輪郭をすべて求め
て、この各々の輪郭の外接長方形を求める。次に、隣接
する外接図形がある閾値内の距離にある場合には併合し
てみる。そして、この併合を行った場合には、再度外接
図形を計算し、下線の方向の変動がある閾値以内の場合
に同一文字列であるとする。なお、文字列抽出方法は従
来より種々提唱されている手法など他の方法を用いても
よい。First, the image is labeled for each connected area to obtain a circumscribing figure. Alternatively, all contours of the black area are obtained, and a circumscribed rectangle of each contour is obtained. Next, if adjacent circumscribing figures are within a certain threshold distance, they are merged. Then, when this merging is performed, the circumscribing figure is calculated again, and if the variation in the direction of the underline is within a certain threshold value, the same character string is determined. As the character string extraction method, other methods such as variously proposed methods may be used.

【００４６】次に、画像処理部１３は、このように抽出
した線分と文字列との対応づけを行なう（図６のステッ
プＢ３）。この対応づけは、たとえば図８に示したよう
な手順で行なう。Next, the image processing unit 13 associates the line segment thus extracted with the character string (step B3 in FIG. 6). This association is performed, for example, by the procedure shown in FIG.

【００４７】すなわち、まず、閾値Ｔｈより長い線分を
検出し（図８のステップＤ１）、この検出した線分の上
または下部領域を検索して選択した線分と文字列枠のベ
ースラインとの距離が閾値ｄ以下のときに対応づける
（図８のステップＤ２）。That is, first, a line segment longer than the threshold Th is detected (step D1 in FIG. 8), and the upper or lower region of the detected line segment is searched to select the selected line segment and the baseline of the character string frame. When the distance is less than or equal to the threshold value d (step D2 in FIG. 8).

【００４８】また、これとは別に、この線分の近傍に位
置する文字領域の抽出は、図９に示すように、図７で示
した線分検出により求まった線分の上部を探索し、黒画
素の連結領域の大きさが閾値以下のものを求めて、これ
らの外接長方形を部分画像とする処理で構成してもよ
い。Separately from this, extraction of the character area located in the vicinity of this line segment is performed by searching the upper portion of the line segment obtained by the line segment detection shown in FIG. 7, as shown in FIG. It may be configured such that the size of the connected region of black pixels is equal to or smaller than a threshold value and these circumscribed rectangles are set as partial images.

【００４９】次に、画像処理部１３は、線上に位置する
文字列の検出を行なう（図６のステップＢ４）。Next, the image processing unit 13 detects the character string located on the line (step B4 in FIG. 6).

【００５０】このような検出手順によれば、たとえば図
１０に示したドキュメント画像を例にすると、図１１の
ような文字列画像が切り出されることとなる。According to such a detection procedure, when the document image shown in FIG. 10 is taken as an example, the character string image as shown in FIG. 11 is cut out.

【００５１】図１２に同実施形態の修飾文字画像を切り
出す際の動作手順を示す。FIG. 12 shows an operation procedure for cutting out the modified character image of the embodiment.

【００５２】この切り出しは、編掛けなどの修飾が施さ
れた文字列が、そのドキュメントを代表する部分画像と
なりうる可能性が高いことに基づいている。This cutout is based on the possibility that a character string that has been modified, such as by knitting, can be a partial image representing the document.

【００５３】画像処理部１３は、網掛け文字領域、斜体
文字領域、拡大文字領域および太線文字領域などの修飾
文字領域を検出し（図１２のステップＥ１）、この検出
した領域の画像を切り出す（図１２のステップＥ２）。The image processing unit 13 detects modified character areas such as a shaded character area, an italic character area, an enlarged character area and a thick line character area (step E1 in FIG. 12), and cuts out an image of the detected area ( Step E2 in FIG. 12).

【００５４】以下に、修飾文字領域それぞれの検出方法
の一例を示す。An example of a method of detecting each modified character area will be described below.

【００５５】網掛け文字領域の抽出は、たとえば図１３
に示したような手順で行なう。Extraction of the shaded character area is performed by, for example, FIG.
Follow the procedure shown in.

【００５６】まず、画像処理部１３は、黒画素の連結領
域を求める（図１３のステップＦ１）。そして、その連
結領域の外接長方形の各辺の長さが閾値ｈ以下の領域に
マークをつける（図１３のステップＦ２）。次に、画像
処理部１３は、連結領域の外接長方形の各辺の長さが閾
値ｃ以下の領域を求め、もし、他のこのような連結領域
とその中心との距離が閾値Ｄ以下の際には、この双方の
領域を１つの領域として統合する（図１３のステップＦ
３）。この処理を連結領域の外接長方形の各辺の長さが
閾値ｃ以下のものすべてについて行なう。First, the image processing section 13 obtains a connected area of black pixels (step F1 in FIG. 13). Then, a mark is attached to a region where the length of each side of the circumscribed rectangle of the connected region is equal to or less than the threshold value h (step F2 in FIG. 13). Next, the image processing unit 13 obtains an area in which the length of each side of the circumscribed rectangle of the connected area is less than or equal to the threshold value c, and if the distance between another such connected area and its center is less than or equal to the threshold value D. Both areas are integrated as one area (step F in FIG. 13).
3). This process is performed for all the lengths of the sides of the circumscribed rectangle of the connected area that are equal to or less than the threshold value c.

【００５７】次に、統合された領域内部にあるマーク付
けされた領域の個数を計測し、この個数を統合された領
域の外接長方形の面積で割った値を計算する（図１３の
ステップＦ４）。そして、この値が閾値ｐ以上のとき
に、統合された領域を網掛け文字領域として出力する
（図１３のステップＦ５）。もちろん、網掛け文字領域
の抽出方法は他のどの方法を用いてもよい。たとえば、
入力画像を大きさｍの正方形領域にまず分割し、この正
方形領域内部にある外接長方形の各辺の長さが閾値ｈ以
下の連結領域の個数を計測する。そして、この個数が閾
値ｑ以上の正方形領域を網掛け文字領域が存在する領域
としてマークする。次に、このマークされた各正方形領
域の中で隣接するものを求め、互いに隣接する正方形を
一つの領域としてまとめあげる。これによりできた領域
を網掛け文字領域が存在する領域として出力する。Next, the number of marked areas inside the integrated area is measured, and the value is divided by the area of the circumscribed rectangle of the integrated area to calculate a value (step F4 in FIG. 13). . Then, when this value is equal to or greater than the threshold value p, the integrated area is output as a shaded character area (step F5 in FIG. 13). Of course, any other method may be used to extract the shaded character area. For example,
The input image is first divided into square areas of size m, and the number of connected areas in which the length of each side of the circumscribed rectangle inside the square area is equal to or less than the threshold value h is measured. Then, the square area of which the number is equal to or larger than the threshold value q is marked as an area in which a shaded character area exists. Next, adjacent ones of the marked square areas are obtained, and the squares adjacent to each other are grouped as one area. The area thus created is output as an area in which a shaded character area exists.

【００５８】斜体文字領域の抽出は、たとえば以下のよ
うに行う。The extraction of the italic character area is performed as follows, for example.

【００５９】まず、画像処理部１３は、文字を構成する
黒領域を抽出する。この抽出は、たとえば、連結する黒
画素の外接図形が閾値以下のものを求めることにより行
なう。この文字構成要素の抽出は、他のどのような方法
を用いてもよい。次に、画像処理部１３は、文字列抽出
を行なう。この文字列抽出は、先に求まった文字構成要
素の中で近傍に位置するものを統合することにより行な
う。そして、文字列領域内の２×２の画素のパターンを
観測し、図１４（ａ）〜（ｈ）に示すパターンの個数を
求める。そして、この数を文字列領域の黒画素数で割っ
た値を算出し、これが閾値以上の際には、この部分を斜
体文字領域と判断する。First, the image processing unit 13 extracts a black area forming a character. This extraction is performed, for example, by obtaining a circumscribing figure of black pixels to be connected that is equal to or smaller than a threshold. Any other method may be used to extract this character component. Next, the image processing unit 13 extracts a character string. This character string extraction is performed by integrating the character components found previously and located in the vicinity. Then, the pattern of 2 × 2 pixels in the character string region is observed, and the number of patterns shown in FIGS. 14A to 14H is obtained. Then, a value obtained by dividing this number by the number of black pixels in the character string area is calculated, and when this value is equal to or larger than the threshold value, this portion is determined to be an italic character area.

【００６０】もちろん、斜体文字領域の抽出方法は他の
どの方法を用いてもよい。たとえば、入力画像を大きさ
ｍの正方形領域にまず分割し、この正方形領域内部の２
×２の画素のパターンを観測して、図１４（ａ）〜
（ｈ）に示すパターンの個数を計測する。そして、この
個数が正方形領域内部の黒画素に占める割合を計算し、
これが閾値ｑ以上のときに、正方形領域を斜体文字が存
在する領域としてマークする。次に、マークされた各正
方形領域の中で隣接するものを求めて、互いに隣接する
正方形を一つの領域のまとめあげる。これによりできた
領域を斜体文字領域が存在する領域として出力する。Of course, any other method may be used to extract the italic character area. For example, the input image is first divided into square areas of size m, and 2
By observing the pattern of the pixel of × 2, FIG.
The number of patterns shown in (h) is measured. Then, the ratio of this number to the black pixels inside the square area is calculated,
When this is greater than or equal to the threshold q, the square area is marked as an area in which italic characters are present. Next, in each of the marked square areas, the adjacent ones are obtained, and the squares adjacent to each other are grouped into one area. The area thus created is output as an area in which an italic character area exists.

【００６１】拡大文字領域の抽出は、前述した文字列抽
出により求まった領域の外接長方形の大きさが閾値以上
のものを拡大文字領域として出力する。In the extraction of the enlarged character area, the area in which the size of the circumscribed rectangle of the area obtained by the above-mentioned character string extraction is not less than the threshold value is output as the enlarged character area.

【００６２】太線文字領域の抽出は、たとえば以下のよ
うに行う。The extraction of the thick line character area is performed as follows, for example.

【００６３】まず、画像処理部１３は、黒画素の連結領
域を求める。連結領域の外接長方形の各辺の長さが閾値
ｃ以下の領域を求め、もし他のこのような連結領域とそ
の中心との距離が閾値ｄ以下のときには、この双方の領
域を１つの領域として統合する。この処理を連結領域の
外接長方形の各辺の長さが閾値ｃ以下のものすべてにつ
いて行なう。次に、統合された領域内部の黒領域に対し
て白画素と隣接するものを白画素に変換する処理を行な
う。この変換処理をｎ回繰り返す。次に、この変換後の
画像の黒領域を囲む最小の長方形を求める。この長方形
の各辺の長さと最初の文字列領域を囲む外接長方形各辺
の長さとの差が閾値以下のときには、この領域を太線文
字領域と判断する。First, the image processing section 13 obtains a connected area of black pixels. A region in which the length of each side of the circumscribed rectangle of the connected region is less than or equal to the threshold value c is obtained. If the distance between the other connected region and its center is less than or equal to the threshold value d, both regions are regarded as one region. Integrate. This process is performed for all the lengths of the sides of the circumscribed rectangle of the connected area that are equal to or less than the threshold value c. Next, the black area inside the integrated area is converted into white pixels adjacent to the white pixel. This conversion process is repeated n times. Next, the smallest rectangle that encloses the black area of this converted image is obtained. When the difference between the length of each side of the rectangle and the length of each side of the circumscribing rectangle surrounding the first character string region is equal to or less than the threshold value, this region is determined to be a thick line character region.

【００６４】これにより、網掛け文字領域、斜体文字領
域、拡大文字領域および太線文字領域などの修飾文字領
域が、部分画像として検出され切り出されることにな
る。As a result, modified character areas such as shaded character areas, italic character areas, enlarged character areas and thick line character areas are detected and cut out as partial images.

【００６５】図１５に同実施形態の指定文字領域画像部
分の切り出しを行なう際の動作手順を示す。FIG. 15 shows an operation procedure for cutting out the designated character area image portion of the embodiment.

【００６６】まず、ユーザは入力部１６を介して切り出
したい文字領域を指示する（図１５のステップＧ１）。
これは、たとえば、図１６（ａ）に示すようなメニュー
ボタンを準備し、左上、中央上、右上、左下、中央下お
よび右下などのように、切り出す文字の領域を指定する
ことにより行なう。また、図１６（ｂ）に示すように
（開始点、幅、高さ）などの絶対座標を記入しておいて
指示してもよい。もちろん、直接座標値を入力してもよ
い。また、指示の方法はキーボードや音声など、いずれ
の手段を用いても構わない。すなわち、切り出したい文
字領域を指定できればよい。First, the user designates a character area to be cut out via the input unit 16 (step G1 in FIG. 15).
This is done, for example, by preparing a menu button as shown in FIG. 16A and designating the region of the character to be cut out, such as upper left, upper center, upper right, lower left, lower center and lower right. Alternatively, as shown in FIG. 16 (b), absolute coordinates such as (starting point, width, height) may be entered and designated. Of course, the coordinate values may be directly input. Further, any method such as a keyboard or voice may be used as the method of instructing. That is, it suffices if the character area to be cut out can be designated.

【００６７】そして、画像処理部１３は、この指示に従
って、部分画像を切り出す（図１５のステップＧ２）。Then, the image processing section 13 cuts out the partial image in accordance with this instruction (step G2 in FIG. 15).

【００６８】図１７は同実施形態のすでに登録されてい
る文字コードを含む文字列領域の画像を切り出す際の動
作手順である。FIG. 17 shows an operation procedure for cutting out an image of a character string area including a character code already registered in the embodiment.

【００６９】ここでは、文字コードを登録する登録モー
ドと、部分画像を切り出す部分画像切り出しモードとを
もつものとする。Here, it is assumed that there are a registration mode for registering a character code and a partial image cutout mode for cutting out a partial image.

【００７０】登録モードでは、部分画像を切り出すため
の文字コード情報を登録する。そして、ユーザは、登録
したい文字を入力する（図１７のステップＨ１）。この
入力は、キーボードから行なってもよいし、すでに登録
されている文字情報の中からマウスなどで指定してもよ
い。具体的には、たとえば、図１８に示したような文字
コードが登録される。なお、この登録の方法や登録の仕
方は別の手段を用いても構わない。In the registration mode, character code information for cutting out a partial image is registered. Then, the user inputs the character to be registered (step H1 in FIG. 17). This input may be performed from the keyboard or may be designated with a mouse or the like from the registered character information. Specifically, for example, the character code as shown in FIG. 18 is registered. It should be noted that this registration method and the registration method may use other means.

【００７１】次に、部分画像切り出しモードについて説
明する。Next, the partial image cutout mode will be described.

【００７２】まず、画像処理部１３は、文字列を抽出す
る（図１７のステップＨ２）。この文字列検出は、たと
えば以下の方法で行なう。First, the image processing section 13 extracts a character string (step H2 in FIG. 17). This character string detection is performed by the following method, for example.

【００７３】まず、画像を連結領域ごとにラベル付け
し、外接図形を求める。次に、隣接する外接図形がある
閾値内の距離にあるときには併合してみる。この併合を
行なった場合には、再度外接図形を計算し、下線の方向
の変動がある閾値以内の場合に同一文字列であるとす
る。なお、文字列抽出方法は従来より種々提唱されてい
る手法など他の方法を用いてもよい。First, the image is labeled for each connected region, and the circumscribing figure is obtained. Next, if adjacent circumscribing figures are within a certain threshold distance, they are merged. When this merging is performed, the circumscribing figure is calculated again, and if the variation in the direction of the underline is within a certain threshold value, the same character string is determined. As the character string extraction method, other methods such as variously proposed methods may be used.

【００７４】次に、画像処理部１３は、文字列内の文字
を認識する（図１７のステップＨ３）。この文字認識の
手段は、既存のどの手法を用いてもよい。そして、登録
モードにより予め登録されている登録文字が、認識した
文字列内にないかどうかを判断し、認識した文字が登録
文字の中に存在した場合には、先の登録情報にしたがっ
て部分画像を切り出す（図１７のステップＨ４）。たと
えば、図１８に示すような文字列を登録モードで登録し
た場合に、図１９に示したような文書が入力されたと
き、［第、章］という登録文字がある場合には、図２０
に示したような文字列が切り出されることになる。Next, the image processing section 13 recognizes the character in the character string (step H3 in FIG. 17). Any existing method may be used for this character recognition means. Then, it is determined whether the registered character registered in advance in the registration mode is not in the recognized character string, and if the recognized character exists in the registered character, the partial image is displayed according to the previous registration information. Is cut out (step H4 in FIG. 17). For example, when a character string as shown in FIG. 18 is registered in the registration mode and a document as shown in FIG.
The character string shown in is cut out.

【００７５】図２１は同実施形態のすでに登録されてい
る文字コードを含む文字列領域内の指定された位置の画
像を切り出す際の動作手順である。FIG. 21 shows an operation procedure for cutting out an image at a designated position in a character string area including a character code which is already registered in the embodiment.

【００７６】ここでは、前述と同様に文字コードを登録
する登録モードと、部分画像を切り出す部分画像切り出
しモードとをもつものとする。Here, it is assumed that there are a registration mode for registering a character code and a partial image cutout mode for cutting out a partial image, as described above.

【００７７】登録モードでは、部分画像を切り出すため
の文字コード情報を登録する。そして、ユーザは、登録
したい文字と、切り出したい文字の位置とを入力する
（図２１のステップＩ１）。この入力は、キーボードか
ら行なってもよいし、すでに登録されている文字情報の
中からマウスなどで指定してもよい。具体的には、たと
えば、図２２に示したような文字コードと切り出し位置
とが登録される。なお、この登録の方法や登録の仕方は
別の手段を用いても構わない。In the registration mode, character code information for cutting out a partial image is registered. Then, the user inputs the character to be registered and the position of the character to be cut out (step I1 in FIG. 21). This input may be performed from the keyboard or may be designated with a mouse or the like from the registered character information. Specifically, for example, the character code and the cutout position as shown in FIG. 22 are registered. It should be noted that this registration method and the registration method may use other means.

【００７８】次に、部分画像切り出しモードについて説
明する。Next, the partial image cutout mode will be described.

【００７９】まず、画像処理部１３は、文字列を抽出す
る（図２１のステップＩ２）。この文字列検出は、たと
えば以下の方法で行なう。First, the image processing section 13 extracts a character string (step I2 in FIG. 21). This character string detection is performed by the following method, for example.

【００８０】まず、画像を連結領域ごとにラベル付け
し、外接図形を求める。次に、隣接する外接図形がある
閾値内の距離にあるときには併合してみる。この併合を
行なった場合には、再度外接図形を計算し、下線の方向
の変動がある閾値以内の場合に同一文字列であるとす
る。なお、文字列抽出方法は従来より種々提唱されてい
る手法など他の方法を用いてもよい。First, the image is labeled for each connected region, and the circumscribing figure is obtained. Next, if adjacent circumscribing figures are within a certain threshold distance, they are merged. When this merging is performed, the circumscribing figure is calculated again, and if the variation in the direction of the underline is within a certain threshold value, the same character string is determined. As the character string extraction method, other methods such as variously proposed methods may be used.

【００８１】次に、画像処理部１３は、文字列内の文字
を認識する（図２１のステップＩ３）。この文字認識の
手段は、既存のどの手段を用いてもよい。そして、登録
モードにより予め登録されている登録文字が、認識した
文字列内にないかどうかを判断し、認識した文字が登録
文字の中に存在した場合には、先の登録情報にしたがっ
て部分画像を切り出す（図２１のステップＩ４）。たと
えば、図２２に示すような文字列を登録した場合に、図
２３に示したような文書が入力されたとき、［氏名］と
いう登録文字がある場合には、図２４に示したような文
字列が切り出されることになる。Next, the image processing section 13 recognizes the character in the character string (step I3 in FIG. 21). As the means for character recognition, any existing means may be used. Then, it is determined whether the registered character registered in advance in the registration mode is not in the recognized character string, and if the recognized character exists in the registered character, the partial image is displayed according to the previous registration information. Is cut out (step I4 in FIG. 21). For example, when a character string as shown in FIG. 22 is registered, when a document as shown in FIG. 23 is input and there is a registered character [name], the character as shown in FIG. Rows will be cut out.

【００８２】図２５は同実施形態の図、表または絵の領
域の上もしくは下に位置する文字列内の画像を切り出す
例である。FIG. 25 is an example of cutting out an image in a character string located above or below the drawing, table or picture area of the same embodiment.

【００８３】画像処理部１３は、まず、領域分離を行な
う（図２５のステップＪ１）。この領域分離では、画像
中の要素を文字列と、図、表および絵の領域とに分離す
る。この分離は、文字列抽出および図領域抽出（図２５
のステップＪ２〜ステップＪ３）によって実施する。The image processing section 13 first performs area separation (step J1 in FIG. 25). In this area separation, elements in the image are separated into character strings and figures, tables and pictures areas. This separation includes character string extraction and drawing area extraction (see FIG. 25).
Step J2 to step J3).

【００８４】文字列抽出は、たとえば以下の方法で行な
う。The character string is extracted by the following method, for example.

【００８５】まず、画像を連結領域ごとにラベル付け
し、外接図形を求める。次に、隣接する外接図形がある
閾値内の距離にあるときには併合してみる。この併合を
行なった場合には、再度外接図形を計算し、下線の方向
の変動がある閾値以内の場合に同一文字列であるとする
（図２５のステップＪ２）。なお、文字列抽出方法は、
従来より種々提唱されている手法など他の方法を用いて
もよい。そして、併合を行なった最終矩形の大きさがあ
る閾値以上の場合には、図、表または絵の候補とする
（図２５のステップＪ３）。First, the image is labeled for each connected region, and the circumscribed figure is obtained. Next, if adjacent circumscribing figures are within a certain threshold distance, they are merged. When this merging is performed, the circumscribing figure is calculated again, and when the variation in the direction of the underline is within a certain threshold value, the same character string is determined (step J2 in FIG. 25). In addition, the character string extraction method is
Other methods such as various conventionally proposed methods may be used. Then, when the size of the final rectangle obtained by merging is equal to or larger than a certain threshold value, it is set as a candidate for a figure, a table or a picture (step J3 in FIG. 25).

【００８６】次に、図、表または絵の周辺に文字列がな
いかどうかを探索して修飾文字列の検出を行なう（図２
５のステップＪ４）。これは、たとえば、着目する図、
表および絵領域の周辺（上または下）に文字列がないか
どうかを探索するものである。具体的には、着目する
図、表または絵領域の上辺にもっとも近い文字列と上辺
との距離がある閾値以下である場合には、この文字列が
着目する図、表または絵領域に対応していると見做す。
下辺についても同様の処理を行う。そして、この結果か
ら得られた文字領域の画像を切り出す（図２５のステッ
プＪ５）。図、表または絵の領域に対応する文字列の対
応づけは、他の方法を用いてもよい。これらの図領域を
修飾する文字列領域が検出できればよい。図２６は原画
像例、図２７は図領域が抽出された例、図２８はそれを
修飾する文字列を切り出した例である。Next, the modified character string is detected by searching for a character string around the figure, table or picture (FIG. 2).
5 step J4). This is, for example,
It searches for a character string around (above or below) the table and the picture area. Specifically, if the distance between the character string closest to the upper side of the figure, table or picture area of interest and the upper side is less than or equal to a threshold value, this character string corresponds to the figure, table or picture area of interest. It is considered to be.
Similar processing is performed on the lower side. Then, the image of the character area obtained from this result is cut out (step J5 in FIG. 25). Other methods may be used for associating the character strings corresponding to the areas of the figure, table, or picture. It suffices if a character string region that modifies these drawing regions can be detected. FIG. 26 is an example of an original image, FIG. 27 is an example in which a drawing area is extracted, and FIG. 28 is an example in which a character string that modifies it is cut out.

【００８７】図２９は図、表または絵の領域の画像を切
り出す例である。FIG. 29 shows an example of cutting out an image of a drawing, table or picture area.

【００８８】画像処理部１３は、まず、領域分離を行な
う（図２９のステップＫ１）。この領域分離では、画像
中の要素を文字列と、図、表および絵の領域とに分離す
る。この分離は、文字列抽出および図領域抽出（図２９
のステップＫ２〜ステップＫ３）によって実施する。The image processing section 13 first performs area separation (step K1 in FIG. 29). In this area separation, elements in the image are separated into character strings and figures, tables and pictures areas. This separation is performed by extracting character strings and drawing areas (see FIG. 29).
Step K2 to step K3).

【００８９】文字列抽出は、たとえば以下の方法で行な
う。The character string is extracted by the following method, for example.

【００９０】まず、画像を連結領域ごとにラベル付け
し、外接図形を求める。次に、隣接する外接図形がある
閾値内の距離にあるときには併合してみる。この併合を
行なった場合には、再度外接図形を計算し、下線の方向
の変動がある閾値以内の場合に同一文字列であるとする
（図２９のステップＫ２）。なお、文字列抽出方法は、
従来より種々提唱されている方法など他の方法を用いて
もよい。そして、併合を行なった最終矩形の大きさがあ
る閾値以上の場合には、図、表または絵の候補とする
（図２９のステップＫ３）。First, the image is labeled for each connected region, and the circumscribed figure is obtained. Next, if adjacent circumscribing figures are within a certain threshold distance, they are merged. When this merging is performed, the circumscribing figure is calculated again, and if the variation in the direction of the underline is within a certain threshold value, the same character string is determined (step K2 in FIG. 29). In addition, the character string extraction method is
Other methods such as variously proposed methods may be used. Then, if the size of the final rectangle obtained by the merging is equal to or larger than a certain threshold value, the candidate is selected as a figure, table or picture (step K3 in FIG. 29).

【００９１】そして、この結果得られた図、表または絵
の画像領域を切り出す（図２９のステップＫ４）。図３
０は原画像例、図３１は図領域を切り出した例である。Then, the image area of the figure, table or picture obtained as a result is cut out (step K4 in FIG. 29). FIG.
0 is an example of the original image, and FIG. 31 is an example of cutting out the drawing area.

【００９２】これにより、ドキュメントをファイルに保
存する際に、所望する部分画像、もしくはそのドキュメ
ントを代表する蓋然性の高い部分画像をインデックスと
して付加して保存が行なえることとなり、かつこの部分
画像インデックスを利用すれば、その検索が簡便とな
る。As a result, when saving a document in a file, a desired partial image or a highly probable partial image representing the document can be added as an index and saved, and this partial image index can be stored. If it is used, the search becomes easy.

【００９３】[0093]

【発明の効果】以上詳述したように、本発明によれば、
ドキュメントの画像をファイル装置に保存する際に、自
動的に部分画像インデックスを付加して保存が行なえる
ので、ドキュメントのファイル装置への登録が容易にな
るとともに、部分画像を利用してファイル装置内のドキ
ュメントの検索を容易に行うことができ、ユーザーの手
を煩わせることなく、処理工程の軽減が図れることとな
る。As described in detail above, according to the present invention,
When saving the image of the document to the file device, the partial image index can be automatically added and saved, which makes it easy to register the document in the file device and uses the partial image to save in the file device. The documents can be easily searched for, and the number of processing steps can be reduced without bothering the user.

[Brief description of drawings]

【図１】本発明の一実施形態に係るドキュメントファイ
リング装置の基本的な処理機能を示すブロック図。FIG. 1 is a block diagram showing basic processing functions of a document filing device according to an embodiment of the present invention.

【図２】同実施形態のドキュメントファイリング装置の
動作を説明するためのフローチャート。FIG. 2 is an exemplary flowchart for explaining the operation of the document filing apparatus of the same embodiment.

【図３】同実施形態の部分画像と文書とを対応づけた例
を示す図。FIG. 3 is a diagram showing an example in which a partial image and a document of the same embodiment are associated with each other.

【図４】同実施形態の部分画像表示例を示す図。FIG. 4 is a view showing a partial image display example of the same embodiment.

【図５】同実施形態の部分画像表示例を示す図。FIG. 5 is a view showing a partial image display example of the same embodiment.

【図６】同実施形態の線上に位置する文字列画像を切り
出す際の動作手順を示すフローチャート。FIG. 6 is a flowchart showing an operation procedure for cutting out a character string image located on a line in the embodiment.

【図７】同実施形態の線分抽出処理の流れを示すフロー
チャート。FIG. 7 is a flowchart showing a flow of line segment extraction processing of the same embodiment.

【図８】同実施形態の線上文字列の対応づけ処理の流れ
を示すフローチャート。FIG. 8 is a flowchart showing a flow of a process of associating a line character string of the embodiment.

【図９】同実施形態の線分の近傍に位置する文字領域抽
出の例を示す概念図。FIG. 9 is a conceptual diagram showing an example of extracting a character region located in the vicinity of a line segment according to the same embodiment.

【図１０】同実施形態の線上文字列を抽出する原画像例
を示す図。FIG. 10 is a diagram showing an example of an original image for extracting a line character string according to the first embodiment.

【図１１】同実施形態の線上文字列を抽出した部分画像
例を示す図。FIG. 11 is a view showing an example of a partial image in which a linear character string of the embodiment is extracted.

【図１２】同実施形態の修飾文字画像を切り出す際の動
作手順を示すフローチャート。FIG. 12 is a flowchart showing an operation procedure when cutting out a modified character image of the same embodiment.

【図１３】同実施形態の網掛け文字領域抽出処理を説明
するためのフローチャート。FIG. 13 is a flowchart for explaining a shaded character region extraction process of the same embodiment.

【図１４】同実施形態の画素パターンを示す概念図。FIG. 14 is a conceptual diagram showing a pixel pattern of the same embodiment.

【図１５】同実施形態の指定文字領域画像部分を切り出
す際の動作手順を示すフローチャート。FIG. 15 is a flowchart showing an operation procedure when cutting out a designated character area image portion of the embodiment.

【図１６】同実施形態の指定文字領域画像の領域を指定
するメニュー例を示す図。FIG. 16 is a view showing an example of a menu for designating an area of a designated character area image according to the same embodiment.

【図１７】同実施形態の文字列領域内にすでに登録され
ている文字コートが含まれるときにその文字列領域内の
画像を切り出す場合の動作手順を示すフローチャート。FIG. 17 is an exemplary flowchart showing an operation procedure for cutting out an image in a character string area when a character code already registered is included in the character string area according to the embodiment.

【図１８】同実施形態の文字コードの登録例を示す図。FIG. 18 is a diagram showing an example of registration of character codes of the same embodiment.

【図１９】同実施形態の原画像例を示す図。FIG. 19 is a diagram showing an example of an original image of the same embodiment.

【図２０】同実施形態の部分画像切り出し結果を示す
図。FIG. 20 is a view showing a partial image cutout result of the embodiment.

【図２１】同実施形態の文字列領域内にすでに登録され
ている文字コードが含まれるときに、指定された位置の
文字列領域内部の画像を切り出す場合の動作手順を示す
フローチャート。FIG. 21 is an exemplary flowchart showing an operation procedure for cutting out an image inside a character string area at a specified position when a character code already registered is included in the character string area of the embodiment.

【図２２】同実施形態の文字コードと切り出し位置との
登録例を示す図。FIG. 22 is a diagram showing an example of registration of a character code and a cutout position according to the same embodiment.

【図２３】同実施形態の原画像例を示す図。FIG. 23 is a diagram showing an example of an original image of the same embodiment.

【図２４】同実施形態の部分画像切り出し結果を示す
図。FIG. 24 is a view showing a partial image cutout result of the embodiment.

【図２５】同実施形態の図、表または絵の領域上もしく
は下に位置する文字列内の画像を切り出す例を示すフロ
ーチャート。FIG. 25 is a flowchart showing an example of cutting out an image in a character string located above or below the drawing, table or picture area of the embodiment.

【図２６】同実施形態の原画像例を示す図。FIG. 26 is a diagram showing an example of an original image of the same embodiment.

【図２７】同実施形態の図、表または絵の領域を抽出し
た結果の例を示す図。FIG. 27 is a diagram showing an example of a result of extracting a diagram, table, or picture region of the embodiment.

【図２８】同実施形態の画像を切り出した結果の例を示
す図。FIG. 28 is a diagram showing an example of a result obtained by cutting out an image according to the same embodiment.

【図２９】図、表または絵の領域の画像を切り出す例を
示すフローチャート。FIG. 29 is a flowchart showing an example of cutting out an image of a drawing, table or picture area.

【図３０】同実施形態の原画像例を示す図。FIG. 30 is a diagram showing an example of an original image of the same embodiment.

【図３１】同実施形態の図、表または絵の領域を抽出し
た結果の例を示す図。FIG. 31 is a diagram showing an example of a result of extracting a diagram, table, or picture region of the embodiment.

[Explanation of symbols]

１０…制御部、１１…画像入力部、１２…メモリ、１３
…画像処理部、１４…保存部、１５…表示部、１６…入
力部、１７…文字認識部。10 ... Control unit, 11 ... Image input unit, 12 ... Memory, 13
... image processing section, 14 ... storage section, 15 ... display section, 16 ... input section, 17 ... character recognition section.

Claims

[Claims]

1. A document filing apparatus for capturing and filing images of various documents, and an image input unit for inputting the images of the documents, and a portion for cutting out partial images from the entire image of the document input by the image input unit. An image cutting means and a file means for storing the whole image and the partial image in association with each other as document data, and presenting the partial image as an index to the user when searching the document. Document filing device.

2. A document filing method for capturing and filing images of various documents, wherein an entire image of the document is input, a partial image is cut out from the input entire image, and the entire image and the partial image are extracted. Is stored as document data in association with each other, and when the document is searched, the partial image is presented to the user as an index.