JP3373068B2

JP3373068B2 - Optical character recognition device

Info

Publication number: JP3373068B2
Application number: JP30050294A
Authority: JP
Inventors: 和弘石川; 直人青木; 保彦清水; 志津子川田; 充瀧口; 貴之加藤; 幸代黒澤; 雅史下山
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1994-12-05
Filing date: 1994-12-05
Publication date: 2003-02-04
Anticipated expiration: 2018-02-04
Also published as: JPH08161427A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は新聞、雑誌等の一般文書
画像パタ−ンから文字を読み取り電子ファイルに変換す
る光学式文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an optical character recognition device for reading characters from a general document image pattern of newspapers, magazines, etc. and converting them into electronic files.

【０００２】[0002]

【従来の技術】従来、光学式文字認識装置は、新聞、雑
誌等の一般文書の情報媒体をイメ−ジスキャナ等の画像
取得装置にセットして読み取りを開始させると、情報媒
体を画像読取手段により光学的に読み取って文書画像パ
タ−ンを文書画像パタ−ン記憶手段に記憶し、文書画像
パタ−ンに文字図形分離、行切り出し、文字切り出し等
の前処理と文字認識処理とを施して文字図形情報（文
字、図形、位置、属性等からなる）を文字図形情報記憶
手段に記憶する。2. Description of the Related Art Conventionally, an optical character recognizing device sets an information medium of a general document such as a newspaper or a magazine in an image acquiring device such as an image scanner and starts reading the information medium by an image reading means. The document image pattern is optically read and stored in the document image pattern storage means, and the document image pattern is subjected to preprocessing such as character / figure separation, line segmentation, and character segmentation, and character recognition process, and character recognition is performed. Graphic information (consisting of characters, figures, positions, attributes, etc.) is stored in the character / graphic information storage means.

【０００３】[0003]

【発明が解決しようとする課題】従来の光学式文字認識
装置にあっては、雑誌のように見開き２頁分からなる一
般文書（以後ブックタイプと記す）の情報媒体を画像取
得装置にセットして認識処理させると、たとえ欲しい情
報が雑誌の頁単位であっても、２頁分の情報を認識処理
し、１頁分の情報として記憶するので、雑誌の頁単位で
情報を記憶するには、記憶された情報を人手により編集
し直さなければならず、手間がかかるという問題点があ
った。In the conventional optical character recognition device, an information medium of a general document (hereinafter referred to as a book type) having a two-page spread like a magazine is set in the image acquisition device. When the recognition processing is performed, even if the desired information is the page unit of the magazine, the information for two pages is recognized and stored as the information for one page. Therefore, in order to store the information for each page of the magazine, The stored information has to be manually edited again, which is troublesome.

【０００４】あるいは、情報媒体を画像取得装置にセッ
トする際、頁単位で画像取得装置にセットして画像を取
得する方法もあるが、いずれにせよ手間を要するという
問題点があった。Alternatively, when the information medium is set in the image acquisition device, there is also a method of setting the page in the image acquisition device to acquire the image, but there is a problem that it takes time in any case.

【０００５】本発明は、複数頁分の情報を画像取得さ
せ、文字図形情報を頁単位で記憶できる光学式文字認識
装置を提供することを目的としている。It is an object of the present invention to provide an optical character recognition device capable of acquiring information for a plurality of pages as an image and storing character graphic information in page units.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するため
に本発明の光学式文字認識装置においては、文書画像パ
タ−ン記憶手段から文書画像パタ−ンの領域範囲を示す
値を入力し、処理して頁分割情報を算出する頁分割情報
処理手段と、文書画像パタ−ン記憶手段から文書画像パ
タ−ンを入力し、文書画像パタ−ンに前処理及び文字認
識処理を施し、頁分割情報に基づく頁単位で文字図形情
報を文字図形情報記憶手段に出力する頁単位文字認識処
理手段とを備える。In order to achieve the above object, in the optical character recognition apparatus of the present invention, a value indicating the area range of the document image pattern is input from the document image pattern storage means, The page division information processing means for processing and calculating the page division information and the document image pattern from the document image pattern storage means are input, and the document image pattern is subjected to preprocessing and character recognition processing, and page division is performed. And a page unit character recognition processing means for outputting character and graphic information to the character and graphic information storage means on a page basis based on the information.

【０００７】[0007]

【作用】上記のように構成された光学式文字認識装置の
頁単位文字認識処理手段は文書画像パタ−ンに前処理及
び文字認識処理を施し、認識処理された文字図形情報を
頁分割情報処理手段から入力した頁分割情報に基づく頁
単位で文字図形情報記憶手段に出力する。The page-based character recognition processing means of the optical character recognition apparatus configured as described above performs preprocessing and character recognition processing on the document image pattern, and performs page division information processing on the recognized character graphic information. The data is output to the character / graphic information storage means in page units based on the page division information input from the means.

【０００８】従って本発明よれば、複数頁分の情報を画
像取得させ、文字図形情報を頁単位で記憶できるのであ
る。Therefore, according to the present invention, information of a plurality of pages can be obtained as an image and the character / graphic information can be stored in page units.

【０００９】[0009]

【実施例】本発明の実施例について図面を参照しながら
説明する。尚、各図面に共通な要素には同一符号を付
す。Embodiments of the present invention will be described with reference to the drawings. Elements common to the drawings are given the same reference numerals.

【００１０】第１実施例図１は本発明の基本的構成を示す機能ブロック図であ
る。画像読取手段１は情報媒体を光学的に読み取って文
書画像パタ−ン記憶手段２に出力する。頁分割情報処理
手段３は文書画像パタ−ン記憶手段２から入力した文書
画像パタ−ンの領域範囲を示す値を処理して頁分割情報
を算出する。頁単位文字認識処理手段４は文書画像パタ
−ン記憶手段２から文書画像パタ−ンを入力して文書画
像パタ−ンに文字図形分離、行切り出し、文字切り出し
等の前処理及び文字認識処理を施し、認識処理された文
字図形情報を頁分割情報処理手段３から入力した頁分割
情報に基づく頁単位で文字図形情報記憶手段５に出力す
る。 First Embodiment FIG. 1 is a functional block diagram showing the basic configuration of the present invention. The image reading means 1 optically reads the information medium and outputs it to the document image pattern storage means 2. The page division information processing means 3 processes a value indicating the area range of the document image pattern input from the document image pattern storage means 2 to calculate page division information. The page unit character recognition processing means 4 inputs the document image pattern from the document image pattern storage means 2 and performs preprocessing such as character pattern separation, line segmentation, and character segmentation on the document image pattern and character recognition process. The applied and recognized character / graphic information is output to the character / graphic information storage means 5 in page units based on the page division information input from the page division information processing means 3.

【００１１】図２は第１実施例の構成を示すブロック図
である。中央処理装置１１（以後ＣＰＵ１１と記す）に
は画像メモリ１２、メインメモリ１３（以後メモリ１３
と記す）が接続されてあり、画像メモリ１２の入力部に
はイメ−ジスキャナ等の画像取得装置１４が接続されて
ある。央処理装置１１と画像取得装置１４とで画像読取
手段１を構成し、中央処理装置１１と画像メモリ１２と
で文書画像パタ−ン記憶手段２を構成し、中央処理装置
１１とメモリ１３とで頁分割情報処理手段３、頁単位文
字認識処理手段４を構成する。FIG. 2 is a block diagram showing the configuration of the first embodiment. The central processing unit 11 (hereinafter referred to as CPU 11) includes an image memory 12 and a main memory 13 (hereinafter referred to as the memory 13).
The image acquisition device 14 such as an image scanner is connected to the input section of the image memory 12. The central processing device 11 and the image acquisition device 14 constitute the image reading means 1, the central processing device 11 and the image memory 12 constitute the document image pattern storage means 2, and the central processing device 11 and the memory 13 constitute the document image pattern storage means 2. The page division information processing means 3 and the page unit character recognition processing means 4 are configured.

【００１２】図３は第１実施例の文書画像パタ−ンの模
式図である。文書画像パタ−ンはＣＰＵ１１が画像メモ
リ１２、画像取得装置１４に読取りタイミング信号を出
力し、画像取得装置１４が情報媒体を光学的に読み取っ
て画像メモリ１２に記憶させる。文書画像パタ−ンは原
点０からＸ軸正方向へ幅Ｗ、Ｙ軸負方向へ高さＨの領域
範囲を有し、情報媒体が本等であれば、文書画像エリア
２１、２２の間に陰影部２０が存在する。FIG. 3 is a schematic diagram of the document image pattern of the first embodiment. In the document image pattern, the CPU 11 outputs a read timing signal to the image memory 12 and the image acquisition device 14, and the image acquisition device 14 optically reads the information medium and stores it in the image memory 12. The document image pattern has an area range of width W in the positive direction of the X axis and height H in the negative direction of the Y axis from the origin 0. If the information medium is a book or the like, it is between the document image areas 21 and 22. There is a shaded portion 20.

【００１３】図４は図３に示した文書画像パタ−ンのヒ
ストグラムである。横軸をＸ、縦軸をヒストグラム値Ｈ
（Ｘ）とし、図３に示した文書画像パタ−ンのＹ軸を予
め決めた間隔でＸ軸正方向へ走査して黒デ−タの度数分
布をヒストグラム値Ｈ（Ｘ）として表したものである。
文書画像エリア２１、２２は黒デ−タの度数分布が少な
いので予め決めてある閾値Ｈ（ｓ）以下となり、陰影部
２０は黒デ−タの度数分布が多いので閾値Ｈ（ｓ）を越
える。FIG. 4 is a histogram of the document image pattern shown in FIG. Horizontal axis is X, vertical axis is histogram value H
(X), the Y-axis of the document image pattern shown in FIG. 3 is scanned in the positive direction of the X-axis at a predetermined interval, and the frequency distribution of black data is represented as a histogram value H (X). Is.
Since the document image areas 21 and 22 have a small frequency distribution of black data, they are below a predetermined threshold value H (s), and the shaded area 20 has a large frequency distribution of black data, and therefore exceeds the threshold value H (s). .

【００１４】図５は第１実施例の動作を説明するフロ−
チャ−トである。ステップＳ1 でＣＰＵ１１は画像メモ
リ１２、画像取得装置１４に読取りタイミング信号を出
力して画像取得装置１４が光学的に読み取った情報を画
像メモリ１２に文書画像パタ−ンとして記憶させる。FIG. 5 is a flow chart for explaining the operation of the first embodiment.
It is a chart. In step S1, the CPU 11 outputs a read timing signal to the image memory 12 and the image acquisition device 14 to store the information optically read by the image acquisition device 14 in the image memory 12 as a document image pattern.

【００１５】ステップＳ2 でＣＰＵ１１は画像メモリ１
２から図３に示した文書画像パタ−ンの領域範囲を示す
値Ｗ、Ｈを入力してメモリ１３に記憶する。In step S2, the CPU 11 causes the image memory 1
The values W and H indicating the area range of the document image pattern shown in FIGS. 2 to 3 are input and stored in the memory 13.

【００１６】ステップＳ3 でＣＰＵ１１は、図３に示し
たように、文書画像パタ−ンを走査してヒストグラムを
求め、メモリ１３に記憶する。In step S3, the CPU 11 scans the document image pattern to obtain a histogram and stores it in the memory 13, as shown in FIG.

【００１７】ステップＳ4 でＣＰＵ１１は閾値Ｈ（ｓ）
を越える陰影部２０をカウントして分割頁数を算出す
る。分割頁数をＰとすると、Ｐ＝陰影部の数＋１で表さ
れ、本実施例では、陰影部２０が１箇所なので、分割頁
数Ｐ＝２である。In step S4, the CPU 11 sets the threshold value H (s).
The number of shaded portions 20 exceeding the number is counted to calculate the number of divided pages. When the number of divided pages is P, it is represented by P = the number of shaded portions + 1, and in this embodiment, since there is one shaded portion 20, the number of divided pages P = 2.

【００１８】ステップＳ5 でＣＰＵ１１はメモリ１３に
記憶してある文書画像パタ−ンの領域範囲を示す値Ｗを
読み出し、図３に示すように、分割頁数Ｐで除算して仮
の頁分割位置Ｂ（Ｘk 、０）を求める。本実施例ではＸ
k ＝Ｗ／Ｐ＝Ｗ／２であるから仮の頁分割位置Ｂ（Ｗ／
２、０）となる。In step S5, the CPU 11 reads out the value W indicating the area range of the document image pattern stored in the memory 13, and divides it by the number of divided pages P as shown in FIG. Find B (Xk, 0). In this embodiment, X
Since k = W / P = W / 2, provisional page division position B (W /
2, 0).

【００１９】ステップＳ6 でＣＰＵ１１はヒストグラム
を用いて、図４に示すように、仮の頁分割位置Ｂ（Ｘk
、０）から真の頁分割位置Ｓ（Ｘs 、０）を求める。
即ち、ＣＰＵ１１はＸ軸方向の仮の頁分割位置Ｘk を中
心に＋方向、−方向にそれぞれ予め設定したδ、δの範
囲を有しており、その範囲内でヒストグラム値Ｈ（Ｘ）
が最大となるＸの値Ｘs を求め、真の頁分割位置Ｓ（Ｘ
s 、０）とし、各頁の開始位置と終了位置とをメモリ１
３に記憶する。本実施例では第１頁の開始位置、終了位
置をそれぞれ（０、０）、（Ｘs 、Ｈ）とし、第２頁の
開始位置、終了位置をそれぞれ（Ｘs 、０）、（Ｗ、
Ｈ）としてメモリ１３に記憶する。In step S6, the CPU 11 uses the histogram to generate a temporary page division position B (Xk as shown in FIG.
, 0) to obtain the true page division position S (Xs, 0).
That is, the CPU 11 has preset ranges of δ and δ in the + direction and − direction centering on the temporary page division position Xk in the X-axis direction, and the histogram value H (X) is within the range.
The value Xs of X that maximizes the
s, 0) and the start and end positions of each page are stored in the memory 1
Store in 3. In this embodiment, the start position and end position of the first page are (0, 0) and (Xs, H), and the start position and end position of the second page are (Xs, 0) and (W, respectively).
H) is stored in the memory 13.

【００２０】ステップＳ7 でＣＰＵ１１は画像メモリ１
２から各頁の開始位置、終了位置に基づいて文書画像パ
タ−ンを読み出して文字と図形との分離処理、行切出し
処理、文字切出し処理等の前処理を施し、さらに切出し
処理された文字に対して認識処理を施し、頁単位でメモ
リ１３に記憶する。In step S7, the CPU 11 causes the image memory 1
The document image pattern is read based on the start position and end position of each page from 2, and pre-processing such as character and figure separation processing, line cutting processing, and character cutting processing is performed, and the cut-out processed characters are further processed. The recognition process is performed on the page, and the page is stored in the memory 13 page by page.

【００２１】ステップＳ8 でＣＰＵ１１は全分割頁につ
いて処理したか否かをチェックし、全分割頁処理ならば
終了し、否ならばステップＳ7 に戻る。In step S8, the CPU 11 checks whether or not all divided pages have been processed. If all divided pages have been processed, the processing ends, and if no, the process returns to step S7.

【００２２】尚、本実施例では画像取得装置がブックタ
イプの情報媒体を光学的に読み取って画像メモリに記憶
させた文書画像パタ−ンを頁分割する動作について説明
したが、情報媒体を複数頁に渡ってこの動作を行う場合
にはステップＳ1 からステップＳ8 までを複数頁分繰り
返す。この際、ＣＰＵ１１が最初の分割頁に付与する頁
番号１をメモリ１３に記憶しておき、頁分割した最初の
分割頁に頁番号１を付与し、以後順次＋１インクリメン
トして頁分割した各頁に自動的に頁番号を付与するよう
にしてもよい。In this embodiment, the operation in which the image acquisition device optically reads a book type information medium and divides the document image pattern stored in the image memory into pages is described. When this operation is performed over a period of time, steps S1 to S8 are repeated for a plurality of pages. At this time, the page number 1 given to the first divided page by the CPU 11 is stored in the memory 13, the page number 1 is given to the first divided page, and then each page is sequentially incremented by +1. Alternatively, the page number may be automatically assigned.

【００２３】また、ステップＳ6 でメモリ１３に記憶し
た領域に基づきステップＳ7 、ステップＳ8 で頁単位毎
に前処理、認識処理を施したが、文書画像パタ−ン全体
に対して文字と図形との分離処理を施し、その後メモリ
１３に記憶した各頁の開始位置、終了位置に基づき、文
字図形分離処理、行切出し処理、文字切出し処理等の前
処理を施し、さらに切出し処理された各文字に対して認
識処理を施し、頁単位でメモリ１３に記憶するようにし
てもよい。Further, although preprocessing and recognition processing are performed for each page unit in steps S7 and S8 based on the area stored in the memory 13 in step S6, the entire document image pattern is composed of characters and figures. Separation processing is performed, and then, based on the start position and end position of each page stored in the memory 13, pre-processing such as character / figure separation processing, line cutout processing, and character cutout processing is performed. The recognition processing may be performed and stored in the memory 13 page by page.

【００２４】また、文書画像パタ−ン全体に対して前処
理、認識処理を施した後、メモリ１３に記憶した各頁の
開始位置、終了位置に基づき頁単位でメモリ１３に記憶
するようにしてもよい。Further, after the preprocessing and recognition processing are performed on the entire document image pattern, it is stored in the memory 13 page by page based on the start position and end position of each page stored in the memory 13. Good.

【００２５】第１実施例よれば、陰影部から分割頁数、
頁分割位置を自動的に算出するので、操作に不慣れな人
でも簡単に操作できる。According to the first embodiment, from the shaded area to the number of divided pages,
Since the page division position is automatically calculated, even a person who is unfamiliar with the operation can easily operate it.

【００２６】第２実施例図６は第２実施例の構成を示すブロック図であり、第１
実施例と異なるところは入力装置１５を設けた点であ
る。入力装置１５は、例えばキ−スイッチであり、分割
頁数、頁付与方向、最初の付与頁番号、全体の付与頁数
を入力する。頁付与方向とは画像メモリに記憶させた文
書画像パタ−ンを頁分割した後、例えば左から右に向か
って頁を付与するのか、右から左に向かって頁を付与す
るのかを決めるパラメ−タである。第１実施例では分割
頁数を自動的にＣＰＵ１１が文書画像パタ−ンに存在す
る陰影部から求めたが、第２実施例では入力装置１５か
ら指定する。 Second Embodiment FIG. 6 is a block diagram showing the configuration of the second embodiment.
The difference from the embodiment is that the input device 15 is provided. The input device 15 is, for example, a key switch, and inputs the number of divided pages, the page addition direction, the first added page number, and the total number of added pages. The page addition direction is a parameter for deciding whether to add a page from the left to the right or a page from the right to the left after dividing the document image pattern stored in the image memory into pages. It is In the first embodiment, the number of divided pages is automatically obtained by the CPU 11 from the shaded area existing in the document image pattern, but in the second embodiment, it is designated by the input device 15.

【００２７】図７は第２実施例の文書画像パタ−ンの模
式図である。文書画像パタ−ンはＣＰＵ１１が画像メモ
リ１２、画像取得装置１４に読取りタイミング信号を出
力し、画像取得装置１４が情報媒体を光学的に読み取っ
て画像メモリ１２に記憶させる。文書画像パタ−ンは原
点０からＸ軸正方向へ幅Ｗ、Ｙ軸負方向へ高さＨの領域
範囲を有する。第１実施例と異なり、陰影部は存在しな
い。FIG. 7 is a schematic diagram of the document image pattern of the second embodiment. In the document image pattern, the CPU 11 outputs a read timing signal to the image memory 12 and the image acquisition device 14, and the image acquisition device 14 optically reads the information medium and stores it in the image memory 12. The document image pattern has a range of a width W from the origin 0 in the positive direction of the X axis and a height H in the negative direction of the Y axis. Unlike the first embodiment, there is no shaded area.

【００２８】図８は図７に示した文書画像パタ−ンのヒ
ストグラムである。横軸をＸ、縦軸をヒストグラム値Ｈ
（Ｘ）とし、図７に示した文書画像パタ−ンのＹ軸を予
め決めた間隔でＸ軸正方向へ走査して黒デ−タの度数分
布をヒストグラム値Ｈ（Ｘ）として表したものである。FIG. 8 is a histogram of the document image pattern shown in FIG. Horizontal axis is X, vertical axis is histogram value H
(X), the Y-axis of the document image pattern shown in FIG. 7 is scanned in the positive direction of the X-axis at a predetermined interval, and the frequency distribution of black data is represented as a histogram value H (X). Is.

【００２９】図９は第２実施例の動作を説明するフロ−
チャ−トである。分割頁数、頁付与方向、最初の付与頁
番号、全体の付与頁数は入力装置１５から入力され、メ
モリ１３に記憶されている。本実施例では分割頁数Ｐ＝
２、頁付与方向は左から右に向かって頁を付与するもの
とする。FIG. 9 is a flow chart for explaining the operation of the second embodiment.
It is a chart. The number of divided pages, the page addition direction, the first added page number, and the total number of added pages are input from the input device 15 and stored in the memory 13. In this embodiment, the number of divided pages P =
2. As for the page giving direction, pages are given from left to right.

【００３０】ステップＳ1 でＣＰＵ１１は画像取得装置
１４にセットされた最初の情報媒体を画像メモリ１２、
画像取得装置１４に読取りタイミング信号を出力し、画
像取得装置１４が光学的に読み取った情報を画像メモリ
１２に文書画像パタ−ンとして記憶させる。In step S1, the CPU 11 sets the first information medium set in the image acquisition device 14 to the image memory 12,
A read timing signal is output to the image acquisition device 14, and the information optically read by the image acquisition device 14 is stored in the image memory 12 as a document image pattern.

【００３１】ステップＳ2 でＣＰＵ１１は画像メモリ１
２から図７に示した文書画像パタ−ンの領域範囲を示す
値Ｗ、Ｈを入力してメモリ１３に記憶する。In step S2, the CPU 11 causes the image memory 1
The values W and H indicating the area range of the document image pattern shown in FIGS. 2 to 7 are input and stored in the memory 13.

【００３２】ステップＳ3 でＣＰＵ１１は文書画像パタ
−ンを走査し、図８に示したように、ヒストグラムを求
めてメモリ１３に記憶する。In step S3, the CPU 11 scans the document image pattern, obtains a histogram and stores it in the memory 13, as shown in FIG.

【００３３】ステップＳ4 でＣＰＵ１１は仮の頁分割位
置を求める。即ち、メモリ１３に記憶してある分割頁数
Ｐ＝２と文書画像パタ−ンの領域範囲を示す値Ｗとを読
み出して除算し、仮の頁分割位置Ｂ（Ｘk 、０）を求め
る。本実施例ではＸk ＝Ｗ／Ｐ＝Ｗ／２であるから仮の
頁分割位置Ｂ（Ｗ／２、０）となる。In step S4, the CPU 11 obtains a temporary page division position. That is, the number of divided pages P = 2 stored in the memory 13 and the value W indicating the area range of the document image pattern are read and divided to obtain a temporary page division position B (Xk, 0). In this embodiment, Xk = W / P = W / 2, so that the temporary page division position B (W / 2, 0) is obtained.

【００３４】ステップＳ5 でＣＰＵ１１はヒストグラム
を用いて、図８に示すように、仮の頁分割位置Ｂ（Ｘk
、０）から真の頁分割位置Ｓ（Ｘs 、０）を求める。
即ち、ＣＰＵ１１はＸ軸方向の仮の頁分割位置Ｘk を中
心に＋方向、−方向にそれぞれ予め決めた間隔δで移動
して、その位置（Ｘk ±Ｎ×δ：Ｎは整数）でのヒスト
グラム値Ｈ（Ｘk ±Ｎ×δ）と予め決めた閾値Ｈ（ｓ）
とを比較して閾値Ｈ（ｓ）を越える位置（Ｘk ＋Ｎ1 ×
δ：Ｎ1 は整数）、（Ｘk −Ｎ2 ×δ：Ｎ2 は整数）を
それぞれ求め、Ｘs ＝Ｘk ＋｛（Ｎ1 −Ｎ2 ）×δ｝／
２を求め、真の頁分割位置Ｓ（Ｘs 、０）とし、各頁の
開始位置と終了位置とをメモリ１３に記憶する。本実施
例では第１頁の開始位置、終了位置をそれぞれ（０、
０）、（Ｘs、Ｈ）とし、第２頁の開始位置、終了位置
をそれぞれ（Ｘs 、０）、（Ｗ、Ｈ）としてメモリ１３
に記憶する。In step S5, the CPU 11 uses the histogram to generate a temporary page division position B (Xk
, 0) to obtain the true page division position S (Xs, 0).
That is, the CPU 11 moves around the provisional page division position Xk in the X-axis direction in the + direction and the − direction at predetermined intervals δ, and the histogram at that position (Xk ± N × δ: N is an integer). Value H (Xk ± N × δ) and predetermined threshold H (s)
And the position (Xk + N1 ×
.delta.: N1 is an integer) and (Xk-N2.times..delta.: N2 is an integer), and Xs = Xk + {(N1-N2) .times..delta.} /
2 is obtained and the true page division position S (Xs, 0) is set, and the start position and end position of each page are stored in the memory 13. In this embodiment, the start position and end position of the first page are (0,
0) and (Xs, H), and the start and end positions of the second page are (Xs, 0) and (W, H), respectively.
Remember.

【００３５】ステップＳ6 でＣＰＵ１１は画像メモリ１
２から各頁の開始位置、終了位置、頁付与方向、付与頁
番号に基づいて文書画像パタ−ンを読み出して文字と図
形との分離処理、行切出し処理、文字切出し処理等の前
処理を施し、さらに切出し処理された文字に対して認識
処理を施し、付与頁番号を付加して頁単位でメモリ１３
に記憶する。ＣＰＵ１１は最初の付与頁番号を最初の分
割頁に付与したのち、＋１インクリメントして次の分割
頁に付与するとともに頁番号を全体の付与頁数から引い
ていく。In step S6, the CPU 11 causes the image memory 1
From 2, the document image pattern is read out based on the start position, end position of each page, the page giving direction, and the given page number, and pre-processing such as character and figure separation processing, line cutting processing, and character cutting processing is performed. Further, recognition processing is performed on the characters that have been cut out, the added page numbers are added, and the memory 13 is added in page units.
Remember. The CPU 11 assigns the first assigned page number to the first divided page, increments it by +1 to assign it to the next divided page, and subtracts the page number from the total number of assigned pages.

【００３６】ステップＳ7 でＣＰＵ１１は情報媒体を全
て処理したか否かを全体の付与頁数が０になったか否か
でチェックし、全て処理ならば終了し、否ならばステッ
プＳ6 に戻る。In step S7, the CPU 11 checks whether or not all the information media have been processed, depending on whether or not the total number of pages to be added has become 0. If all the processing has been completed, the process ends.

【００３７】尚、ステップＳ5 でメモリ１３に記憶した
領域に基づきステップＳ6 、ステップＳ7 で頁単位毎に
前処理、認識処理を施したが、文書画像パタ−ン全体に
対して文字と図形との分離処理を施し、その後メモリ１
３に記憶した各頁の開始位置、終了位置に基づき行切出
し処理、文字切出し処理等の前処理を施し、さらに切出
し処理された各文字に対して認識処理を施し、頁単位で
メモリ１３に記憶するようにしてもよい。It should be noted that although pre-processing and recognition processing have been carried out page by page in steps S6 and S7 based on the area stored in the memory 13 in step S5, the entire document image pattern has characters and figures. Separation processing is performed, and then memory 1
Preprocessing such as line cutout processing and character cutout processing is performed based on the start position and end position of each page stored in No. 3, and recognition processing is performed on each cut out character, and stored in the memory 13 in page units. You may do it.

【００３８】また、文書画像パタ−ン全体に対して前処
理、認識処理を施した後、メモリ１３に記憶した各頁の
開始位置、終了位置に基づき頁単位でメモリ１３に記憶
するようにしてもよい。Further, after the preprocessing and recognition processing are performed on the entire document image pattern, it is stored in the memory 13 page by page based on the start position and end position of each page stored in the memory 13. Good.

【００３９】また、本実施例では仮の頁分割位置を求
め、図８に示したヒストグラムの黒地の部分を利用して
真の頁分割位置を求めたが、逆に白地の最も広い範囲の
中点の位置を真の頁分割位置としてもよい。Further, in this embodiment, the temporary page division position is obtained, and the true page division position is obtained by utilizing the black background portion of the histogram shown in FIG. 8. However, conversely, in the widest range of the white background. The position of the point may be the true page division position.

【００４０】また、本実施例では頁分割位置を補正する
ようにしたが、補正を必要としない場合には頁分割位置
は（Ｗ／Ｐ、０）とする。Although the page division position is corrected in this embodiment, the page division position is (W / P, 0) when the correction is not required.

【００４１】また、本実施例では入力装置から分割頁
数、頁付与方向、最初の付与頁番号、全体の付与頁数を
入力したが、頁付与方向、最初の付与頁番号を自動的に
設定するようにしておけば分割頁数のみでよい。また、
最初の付与頁番号を自動的に設定するようにしておき、
情報媒体の頁付与方向に合わせて頁を付与する場合には
分割頁数、頁付与方向を入力するだけでよい。Further, in this embodiment, the number of divided pages, the page giving direction, the first giving page number, and the total giving page number are inputted from the input device, but the page giving direction and the first giving page number are automatically set. If so, only the number of divided pages is required. Also,
Make sure to set the first page number automatically,
When pages are added in accordance with the page giving direction of the information medium, it is only necessary to input the number of divided pages and the page giving direction.

【００４２】また、本実施例では分割頁数を入力した
が、分割頁サイズを入力して文書画像パタ−ンの領域範
囲を示す値Ｗ、Ｈを除算して頁分割位置を求めてもよ
い。分割頁サイズ入力にあたっては、印刷用紙サイズに
合わせてＡ5 、Ａ4 、Ａ3 の用紙サイズをメモリに記憶
しておき、入力はＡ5 、Ａ4 、Ａ3 とする。Although the number of divided pages is input in this embodiment, the divided page size may be input and the values W and H indicating the area range of the document image pattern may be divided to obtain the page division position. . When inputting the divided page size, the paper sizes A5, A4 and A3 are stored in the memory in accordance with the print paper size, and the input is A5, A4 and A3.

【００４３】第２実施例よれば、情報媒体が画像取得装
置に多少ずれてセットされても頁分割位置を補正して頁
分割する。According to the second embodiment, the page division is performed by correcting the page division position even if the information medium is set in the image acquisition device with some deviation.

【００４４】[0044]

【発明の効果】本発明は、以上説明したように構成され
ているので以下に記載される効果を奏する。Since the present invention is configured as described above, it has the following effects.

【００４５】文書画像パタ−ン記憶手段から入力した文
書画像パタ−ンの領域範囲を示す値を処理して頁分割情
報を出力する頁分割情報処理手段と、文書画像パタ−ン
記憶手段から入力した文書画像パタ−ンに前処理及び文
字認識処理を施して頁分割情報処理手段から入力した頁
分割情報に基づく頁単位で文字図形情報を出力する頁単
位文字認識処理手段と、頁単位文字認識処理手段から文
字図形情報を入力して記憶する頁単位情報記憶手段とを
備えたことにより、頁単位文字認識処理手段は文書画像
パタ−ンに前処理及び文字認識処理を施し、認識処理さ
れた文字図形情報を頁分割情報処理手段から入力した頁
分割情報に基づく頁単位で頁単位記憶手段に出力するの
で、複数頁分の情報を画像取得させ、文字図形情報を頁
単位で記憶できる。Input from the document image pattern storage means and page division information processing means for processing a value indicating the area range of the document image pattern input from the document image pattern storage means and outputting page division information. A page unit character recognition processing unit for performing preprocessing and character recognition processing on the formed document image pattern and outputting character graphic information in page units based on page division information input from the page division information processing unit, and page unit character recognition By providing the page unit information storage unit for inputting and storing character graphic information from the processing unit, the page unit character recognition processing unit performs preprocessing and character recognition processing on the document image pattern, and the recognition processing is performed. Since the character / graphic information is output to the page unit storage unit in page units based on the page division information input from the page division information processing unit, information for a plurality of pages can be acquired as an image and the character / graphic information can be stored in page units.

【００４６】また、頁分割情報処理手段は文書画像パタ
−ン記憶手段から文書画像パタ−ンを入力し、文書画像
パタ−ンの頁間に有する陰影部から分割頁数を算出し、
さらに文書画像パタ−ンの領域範囲を示す値を分割頁数
で除算して頁分割位置を算出し、頁分割情報として頁単
位文字認識処理手段に出力するようにしたことにより、
分割頁数、頁分割位置を自動的に算出するので、操作に
不慣れな人でも簡単に操作できる。Further, the page division information processing means inputs the document image pattern from the document image pattern storage means, calculates the number of divided pages from the shaded portion between the pages of the document image pattern,
Further, the value indicating the area range of the document image pattern is divided by the number of divided pages to calculate the page division position, and is output as page division information to the page unit character recognition processing means.
Since the number of pages to be divided and the page division position are automatically calculated, even a person unfamiliar with the operation can easily perform the operation.

【００４７】また、分割頁数と頁付与方向とを入力する
頁分割情報入力手段を有し、文書画像パタ−ン記憶手段
から入力した文書画像パタ−ンの領域範囲を示す値を頁
分割情報入力手段から入力した分割頁数で除算して頁分
割位置を算出し、頁分割情報として頁分割位置と頁付与
方向とを頁単位文字認識処理手段に出力するようにした
ことにより、媒体の頁付与方向に合わせて分割頁に頁を
付与できるので、頁付与方向の異なる雑誌や本等をその
頁単位で編集するのに適している。Further, it has a page division information input means for inputting the number of divided pages and a page giving direction, and a value indicating the area range of the document image pattern inputted from the document image pattern storage means is set as the page division information. The page division position is calculated by dividing by the number of division pages input from the input means, and the page division position and the page giving direction are output to the page unit character recognition processing means as page division information. Since pages can be added to the divided pages in accordance with the adding direction, it is suitable for editing magazines, books, etc. having different page adding directions in page units.

【００４８】また、分割頁数と頁付与方向と最初の付与
頁番号と全体の付与頁数とを入力する頁分割情報入力手
段を有し、文書画像パタ−ン記憶手段から入力した文書
画像パタ−ンの領域範囲を示す値を頁分割情報入力手段
から入力した分割頁数で除算して頁分割位置を算出し、
頁分割情報として頁分割位置と頁付与方向と付与頁番号
とを頁単位文字認識処理手段に出力するようにしたこと
により、媒体の頁付与方向に合わせて分割頁に自動的に
頁番号が付与されるので、さらに頁付与方向の異なる雑
誌や本等をその頁単位で編集するのに適している。Further, it has a page division information input means for inputting the number of division pages, the page addition direction, the first addition page number and the total number of addition pages, and the document image pattern input from the document image pattern storage means. -Dividing the value indicating the area range of the page division by the number of divided pages input from the page division information input means to calculate the page division position,
By outputting the page division position, the page giving direction, and the giving page number as the page dividing information to the page unit character recognition processing means, the page numbers are automatically given to the divided pages in accordance with the page giving direction of the medium. Therefore, it is suitable for editing magazines, books, etc. having different page addition directions page by page.

【００４９】また、頁分割位置補正手段を備えたことに
より、情報媒体が画像取得装置に多少ずれてセットされ
ても頁分割位置を補正して頁分割することができる。Further, by providing the page division position correction means, even if the information medium is set in the image acquisition device with some deviation, the page division position can be corrected and page division can be performed.

[Brief description of drawings]

【図１】本発明の基本的構成を示す機能ブロック図であ
る。FIG. 1 is a functional block diagram showing a basic configuration of the present invention.

【図２】第１実施例の構成を示すブロック図である。FIG. 2 is a block diagram showing the configuration of the first embodiment.

【図３】第１実施例の文書画像パタ−ンの模式図であ
る。FIG. 3 is a schematic diagram of a document image pattern according to the first embodiment.

【図４】図３に示した文書画像パタ−ンのヒストグラム
である。FIG. 4 is a histogram of the document image pattern shown in FIG.

【図５】第１実施例の動作を説明するフロ−チャ−トで
ある。FIG. 5 is a flowchart for explaining the operation of the first embodiment.

【図６】第２実施例の構成を示すブロック図である。FIG. 6 is a block diagram showing a configuration of a second embodiment.

【図７】第２実施例の文書画像パタ−ンの模式図であ
る。FIG. 7 is a schematic diagram of a document image pattern according to a second embodiment.

【図８】図７に示した文書画像パタ−ンのヒストグラム
である。8 is a histogram of the document image pattern shown in FIG.

【図９】第１実施例の動作を説明するフロ−チャ−トで
ある。FIG. 9 is a flowchart for explaining the operation of the first embodiment.

[Explanation of symbols]

１画像読取手段２文書画像パタ−ン記憶手段３頁分割情報処理手段４頁単位文字認識処理手段５文字図形情報記憶手段 1 Image reading means 2 Document image pattern storage means 3 Page division information processing means 4 Page unit character recognition processing means 5 Character / graphic information storage means

───────────────────────────────────────────────────── フロントページの続き (72)発明者川田志津子東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (72)発明者瀧口充東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (72)発明者加藤貴之東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (72)発明者黒澤幸代東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (72)発明者下山雅史東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (56)参考文献特開平５−298477（ＪＰ，Ａ) 特開平５−135201（ＪＰ，Ａ) 特開平３−201177（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/82 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Shizuko Kawara 1-7-12 Toranomon, Minato-ku, Tokyo Within Oki Electric Industry Co., Ltd. (72) Inventor Mitsuru Takiguchi 1-7-12 Toranomon, Minato-ku, Tokyo Oki Denki Kogyo Co., Ltd. (72) Inventor Takayuki Kato 1-7-12 Toranomon, Minato-ku, Tokyo Oki Denki Kogyo Co., Ltd. (72) Inventor Sachiyo Kurosawa 1-12-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd. (72) Inventor Masafumi Shimoyama 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd. (56) Reference JP-A-5-298477 (JP, A) JP HEI 5-135201 (JP, A) JP-A-3-201177 (JP, A) (58) Fields investigated (Int.Cl. ⁷ , DB name) G06K 9/00-9/82

Claims

(57) [Claims]

1. An information medium is optically read by an image reading means to store a document image pattern in a document image pattern storage means, and character / image separation, line segmentation, character segmentation, etc. in the document image pattern. In the optical character recognition device which performs the pre-processing and the character recognition processing and stores it in the character / graphic information storage means, a value indicating the area range of the document image pattern is input from the document image pattern storage means, and the processing is performed. The page division information processing means for calculating page division information and the document image pattern from the document image pattern storage means are inputted, and the document image pattern is subjected to pre-processing and character recognition processing to obtain the page division information. An optical character recognition device, comprising: a page unit character recognition processing means for outputting character and graphic information to the character and graphic information storage means based on page.

2. The page division information processing means inputs the document image pattern from the document image pattern storage means and calculates the number of divided pages from the shaded areas between the pages of the document image pattern,
The optical character recognition according to claim 1, further comprising: dividing a value indicating the area range of the document image pattern by the number of divided pages to calculate a page division position, and outputting the page division information to the page unit character recognition processing means. apparatus.

3. The page division information processing means has page division information input means for inputting the number of divided pages, and a page indicating a value indicating the area range of the document image pattern input from the document image pattern storage means. 2. The optical character recognition device according to claim 1, wherein a page division position is calculated by dividing by the number of divided pages input from the division information input means, and is output to the page unit character recognition processing means as page division information.

4. The page division information processing means has page division information input means for inputting the number of divided pages and a page giving direction, and the area range of the document image pattern inputted from the document image pattern storage means. The value indicating is divided by the number of divided pages input from the page division information input means to calculate the page division position, and the page division position and the page addition direction are output to the page unit character recognition processing means as page division information. Item 1. The optical character recognition device according to item 1.

5. The page division information processing means has page division information input means for inputting the number of divided pages, the page giving direction, the first added page number and the total number of added pages, and stores the document image pattern. The page division position is calculated by dividing the value indicating the area range of the document image pattern input from the means by the page division information input from the page division information input means, and the page division position and the page addition direction are calculated as the page division information. The optical character recognition device according to claim 1, wherein the assigned page number and the page unit character recognition processing means are output.

6. The optical character recognition apparatus according to claim 3, 4, or 5, wherein the divided page size is input instead of the number of divided pages. apparatus.

7. The page division information processing means divides the value indicating the area range of the document image pattern by the number of divided pages to obtain a temporary page division position, and the temporary page division position is determined from the temporary page division position. Claim 2 or Claim 3 or Claim 4 or Claim 5 or Claim 5 provided with a page division position correcting means for obtaining
Alternatively, the optical character recognition device according to claim 6.