JPH04241074A

JPH04241074A - Automatic document clean copying device

Info

Publication number: JPH04241074A
Application number: JP9123691A
Authority: JP
Inventors: Yoshiyuki Namitsuka; 義幸波塚
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1991-01-25
Filing date: 1991-01-25
Publication date: 1992-08-28
Anticipated expiration: 2016-01-09
Also published as: JP3122476B2

Abstract

PURPOSE: To automatically make a clean copy of a handwritten document without requiring the operator to input any clean-copy data. CONSTITUTION: Handwritten characters are read by a read part 1 and converted by an A/D conversion part 2 into a digital signal, which is stored in a memory 3. A character line retrieval part 4 cuts the handwritten character string into lines, a character area segmentation part 5 segments the individual characters of each line, and a character constituent element detection part 6 detects the segment elements of each character and stores them in a data storage part 7. A size varying part 8 then scales the vertical and horizontal lengths of each character to a proper size, and a position correction part 9 corrects the horizontal position of each character so that the characters in a string are evenly spaced. A document image shaping and reconstitution part 10 reconstitutes the character strings of the respective lines so that they are spaced at specific intervals; the result is stored in a memory 11 and then output through an output part 12.

Description

[Detailed description of the invention]

[0001]

[Industrial Field of Application] The present invention relates to an automatic document clean copying device that automatically makes a clean copy of a document.

[0003]

[Prior Art] Conventionally, as shown in Japanese Unexamined Patent Publications No. H2-127775 and No. H2-127776, a document clean copying device of this kind is configured to perform calculations, based on the vertical and horizontal sizes of an image data display area on a document and the vertical and horizontal sizes of the image data, so that no missing portion occurs in the display area and the aspect ratio of the image data is 1:1. Accordingly, when the operator inputs clean-copy data such as the vertical and horizontal sizes of the image data display area, the device can make a clean copy of a character string by calculating so that no missing portion occurs in the display area and the aspect ratio of the image data is 1:1.

[0004]

[Problems to Be Solved by the Invention] However, because the conventional document clean copying device described above performs its calculations from the vertical and horizontal sizes of the image data display area, the operator must input this clean-copy data. A further problem is that the device cannot automatically make a clean copy of a handwritten document.

[0005] In view of the above problems of the prior art, an object of the present invention is to provide an automatic document clean copying device that can automatically make a clean copy of a handwritten document without the operator inputting any clean-copy data.

[0006]

[Means for Solving the Problems] To achieve the above object, a first means comprises: reading means for optically reading a handwritten character string; extracting means for searching the handwritten character string read by the reading means line by line; cutting-out means for cutting out, character by character, the handwritten character string of each line extracted by the searching means; determining means for determining the size of each handwritten character cut out by the cutting-out means; scaling means for scaling the vertical and horizontal lengths of each character determined by the determining means so that the character has an appropriate size; arranging means for arranging the characters scaled by the scaling means so that the handwritten character strings of each line extracted by the searching means are appropriately spaced; and output means for outputting the character strings of each line arranged by the arranging means.

[0007] In a second means, the scaling means of the first invention determines and vectorizes the line segment elements of each character determined by the determining means, and scales the vectorized line segment elements.

[0008] In a third means, the determining means of the second invention recognizes each handwritten character cut out by the cutting-out means by matching it against dictionary patterns, and vectorizes the outline of the font data by referring to the character font of the dictionary pattern.

[0009]

[Operation] With the above configuration, in the first means the handwritten character string is extracted line by line, the handwritten character string of each line is separated into individual characters, the size of each character is determined and its vertical and horizontal lengths are scaled to an appropriate size, and the characters are arranged so that the handwritten character strings of each line are appropriately spaced. A clean copy of a handwritten document can therefore be made automatically without the operator inputting any clean-copy data.

[0010] In the second means, the line segment elements of each character are vectorized and the vectorized line segment elements are scaled, so the clean copy is made in the handwriting the writer intended.

[0011] In the third means, handwritten characters are recognized by matching against dictionary patterns, and the font data of the dictionary pattern is vectorized. Because these vectorized line segment elements are then scaled, handwritten characters can be clean-copied into the character font of the dictionary pattern.

[0012]

[Embodiments] Embodiments of the present invention are described below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the automatic document clean copying device according to the present invention. FIG. 2 is an explanatory diagram showing a handwritten manuscript read by the device of FIG. 1 and its histogram. FIG. 3 is an explanatory diagram showing a line image extracted by the character line search unit of FIG. 1. FIG. 4 is an explanatory diagram showing a character cut out by the character area extraction unit of FIG. 1. FIG. 5 is an explanatory diagram showing the constituent elements of the character of FIG. 4. FIG. 6 is a flowchart for explaining the operation of the character component detection unit of FIG. 1. FIG. 7 is a flowchart for explaining the operation of the character area size changing unit and the position correction unit of FIG. 1. FIG. 8 is an explanatory diagram showing the scaling operation of the character area size changing unit of FIG. 1. FIG. 9 is an explanatory diagram showing vector data before and after scaling by the character area size changing unit of FIG. 1. FIG. 10 is an explanatory diagram showing the operation of the document image shaping and reconstruction unit of FIG. 1.

[0013] In FIG. 1, an image such as a handwritten manuscript is optically read by the reading unit 1 and converted into an electrical signal; this signal is converted into a digital signal by the A/D conversion unit 2 and stored in the memory 3. The left side of FIG. 2 shows an example of a handwritten manuscript P, which contains horizontally written handwritten characters 13 and a rectangular image 14 such as a photograph. The right side of FIG. 2 shows a histogram 15 obtained by accumulating the handwritten characters 13 and the rectangular image 14 of the manuscript P in the horizontal direction. Within the rectangular photographic image 14 the histogram 15 is regarded as being added uniformly, so the break between the image and the handwritten characters 13 is preserved.

[0014] The character line search unit 4 shown in FIG. 1 uses the image data stored in the memory 3 to find the points at which the cumulative frequency of the histogram 15 changes sharply along the vertical direction, treats these points as the cut-out points of character lines, and extracts line images. The upper part of FIG. 3 shows a line image 17 extracted by the character line search unit 4; this line image 17 contains the horizontally written character areas 16. The lower part of FIG. 3 shows a histogram 18 obtained by accumulating the black pixels of the character areas 16 in the vertical direction.
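The line extraction described above can be sketched as a row projection followed by a search for runs of non-empty rows. This is only an illustration under stated assumptions: the image is a binary list of lists (1 = black), a fixed `threshold` stands in for the "sharp change" test that the text does not specify, and the names `row_histogram` and `find_runs` are hypothetical.

```python
def row_histogram(image):
    """Count the black pixels (1s) in each row of a binary image."""
    return [sum(row) for row in image]


def find_runs(histogram, threshold=0):
    """Return (start, end) pairs, end exclusive, where the histogram exceeds threshold.

    Assumption: a simple threshold approximates the 'sharp change' points.
    """
    runs, start = [], None
    for i, count in enumerate(histogram):
        if count > threshold and start is None:
            start = i                      # entering a text line
        elif count <= threshold and start is not None:
            runs.append((start, i))        # leaving a text line
            start = None
    if start is not None:
        runs.append((start, len(histogram)))
    return runs


# Tiny hypothetical page with two text lines separated by a blank row.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
line_spans = find_runs(row_histogram(page))
print(line_spans)
```

Each returned pair gives the first row of a text line and the row just past its last row, so `page[start:end]` would be the corresponding line image.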

[0015] The character area extraction unit 5 shown in FIG. 1 identifies each character area 16 by finding the points at which the cumulative frequency of the histogram 18 changes sharply along the horizontal direction, and then calculates the position and the vertical and horizontal lengths of the character portion by removing the top and bottom margins from each character area 16. FIG. 4 shows the character "あ" cut out by the character area extraction unit 5. The character is described by, among other values, the x coordinate Xs of the lower left corner of the character area 16, the y coordinate Ys of the upper right corner, the horizontal length x_length, and the vertical length y_length. FIG. 5 shows the character components of this character "あ"; each character component consists of a plurality of pixels 25 and is represented by stroke end points 23 and vectors 24.
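A rough sketch of the character cut-out step follows: project the line image onto the x axis, split it at empty columns, then trim the empty rows above and below each character. It assumes a binary list-of-lists image with the origin at the top left (the figure in the source measures from the lower left), and the function names are hypothetical.

```python
def column_histogram(line_image):
    """Count the black pixels (1s) in each column of a binary line image."""
    return [sum(column) for column in zip(*line_image)]


def find_runs(histogram):
    """Return (start, end) pairs, end exclusive, of consecutive non-zero entries."""
    runs, start = [], None
    for i, count in enumerate(histogram):
        if count and start is None:
            start = i
        elif not count and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(histogram)))
    return runs


def character_boxes(line_image):
    """Return (x_start, y_start, x_length, y_length) for each character."""
    boxes = []
    for x0, x1 in find_runs(column_histogram(line_image)):
        # Rows that contain ink inside this column range; the rest is margin.
        inked = [y for y, row in enumerate(line_image) if any(row[x0:x1])]
        y0, y1 = inked[0], inked[-1] + 1
        boxes.append((x0, y0, x1 - x0, y1 - y0))
    return boxes


# Hypothetical line image holding two characters of different heights.
line = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 1, 0],
    [0, 1, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
print(character_boxes(line))
```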

[0016] As shown in FIG. 6, the character component detection unit 6 shown in FIG. 1 thins the character image currently being processed, which consists of the plurality of pixels 25 (step 19), then samples the line width of the character image and calculates the average line width (step 20). Next, the character component detection unit 6 detects the end points 23 of the character's strokes (step 21), vectorizes the character image by calculating the vectors 24 from these end points 23 and the stroke intersections (step 22), and stores the position, size, and vector data of the character area 16 in the data storage unit 7.
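The end point detection of step 21 can be illustrated on an already thinned image. The source does not say how end points are found; the sketch below assumes the common rule that a skeleton pixel with exactly one 8-connected neighbour is a stroke end point. The function name and sample data are hypothetical.

```python
def stroke_endpoints(skeleton):
    """Return (x, y) of skeleton pixels that have exactly one 8-connected neighbour.

    Assumption: the input is a one-pixel-wide binary skeleton (1 = stroke).
    """
    height, width = len(skeleton), len(skeleton[0])
    endpoints = []
    for y in range(height):
        for x in range(width):
            if not skeleton[y][x]:
                continue
            neighbours = 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    if dx == 0 and dy == 0:
                        continue           # skip the pixel itself
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < height and 0 <= nx < width and skeleton[ny][nx]:
                        neighbours += 1
            if neighbours == 1:
                endpoints.append((x, y))
    return endpoints


# Hypothetical thinned stroke: a horizontal run that turns downward.
skeleton = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 1, 0],
    [0, 0, 0, 1, 0],
]
print(stroke_endpoints(skeleton))
```

Pixels with three or more neighbours could be treated in the same pass as candidate stroke intersections, which the text uses together with the end points to form the vectors.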

[0017] As shown in FIG. 7, the character area size changing unit 8 shown in FIG. 1 first classifies the heights of the cut-out characters into a plurality of categories (step 26): for example, relatively large characters such as headings, normal-sized characters in body text, and relatively small characters in body text such as the small sokuon character "っ". It then calculates the average height of the characters representing each category, determines which category each character belongs to by comparing this average height with the character's height, and scales each character to the average height of its category (step 27). For example, as shown in FIG. 8, a cut-out character 32 of height y_length is scaled, using the average height y_ave of its category, to a character 34 of height y_ave. Next, as shown in FIGS. 7 and 8, the size changing unit 8 compares the aspect ratio of the character (step 28) and scales the horizontal length x_length to a length x_reform by the same ratio as the vertical scaling factor (step 29). The horizontal length x_reform can be obtained from equation (1).

x_reform = (x_length / y_length) * y_ave ... (1)
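As a worked illustration of equation (1), the following sketch scales one character box to the average height of its category while keeping its aspect ratio. The category heights are made-up sample values, and the function names are hypothetical; how characters are assigned to categories is not specified in the text and is not modelled here.

```python
def category_average(heights):
    """Average height of the characters assigned to one category."""
    return sum(heights) / len(heights)


def rescale_character(x_length, y_length, y_ave):
    """Return (x_reform, y_ave): the box scaled to height y_ave per equation (1)."""
    x_reform = (x_length / y_length) * y_ave
    return x_reform, y_ave


body_text_heights = [40, 50, 60]           # hypothetical heights in pixels
y_ave = category_average(body_text_heights)
print(rescale_character(x_length=30, y_length=40, y_ave=y_ave))
```

With these sample values the vertical scaling factor is 50 / 40 = 1.25, and the width 30 grows by the same factor to 37.5.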

[0018] As shown in FIG. 9, the start and end points of the vector data of the character "あ" are expressed as addresses within the image area 39 before scaling and the image area 40 after scaling. The displacement of each vector in the vertical and horizontal directions is therefore calculated from the vertical and horizontal scaling factors of the cut-out character area 39 before scaling, and the position addresses of the vectors are corrected in the same way according to the position correction of the cut-out area 39. Scaling each stroke of the character in this manner yields the strokes of the character area 40 after scaling.
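The stroke scaling just described amounts to mapping each vector end point from the old character area into the new one. A minimal sketch, assuming each stroke is a (start, end) pair of (x, y) addresses and that each area is identified by the address of one reference corner; `scale_stroke` and its parameters are hypothetical names.

```python
def scale_stroke(stroke, old_origin, new_origin, scale_x, scale_y):
    """Map a stroke's start and end points from the old character area to the new one."""
    def map_point(point):
        x, y = point
        # Offset inside the old area, scaled, then re-anchored at the new area.
        return (new_origin[0] + (x - old_origin[0]) * scale_x,
                new_origin[1] + (y - old_origin[1]) * scale_y)

    start, end = stroke
    return map_point(start), map_point(end)


stroke = ((12, 22), (16, 30))              # hypothetical stroke addresses
scaled = scale_stroke(stroke, old_origin=(10, 20), new_origin=(100, 20),
                      scale_x=1.5, scale_y=1.5)
print(scaled)
```

Separate `scale_x` and `scale_y` are kept because the text refers to vertical and horizontal scaling factors individually, even though equation (1) makes them equal.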

[0019] Next, as shown in FIG. 7, the character area position correction unit 9 shown in FIG. 1 calculates the average length of the horizontal blanks between characters (step 30), and uses the blanks to correct the positions so that each character in the cut-out area is appropriately placed and the number of characters in a line is the same before and after scaling (step 31). This correction is performed by summing the average horizontal lengths x_reform of the areas after scaling, subtracting this total from the length of the cut-out line image 17 to obtain the blank length available within the line image 17, and dividing this blank length by the number of characters to obtain the average blank length between character areas.
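The blank length calculation above is simple arithmetic and can be sketched directly. The placement rule in `place_characters` (start at the left edge and advance by width plus average blank) is an assumption added for illustration; the text only defines how the average blank length is obtained.

```python
def average_blank_length(line_length, reformed_widths):
    """Blank space left in the line after scaling, divided by the character count."""
    available_blank = line_length - sum(reformed_widths)
    return available_blank / len(reformed_widths)


def place_characters(line_length, reformed_widths):
    """Return the left x position of each character (assumed left-to-right layout)."""
    blank = average_blank_length(line_length, reformed_widths)
    positions, x = [], 0.0
    for width in reformed_widths:
        positions.append(x)
        x += width + blank
    return positions


widths = [20, 30, 20]                      # hypothetical x_reform values
print(average_blank_length(100, widths))
print(place_characters(100, widths))
```

Because the number of characters is unchanged, the same count appears in the line before and after scaling, as the paragraph requires.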

[0020] Next, as shown in FIG. 10, the document image shaping and reconstruction unit 10 shown in FIG. 1 reconstructs the character strings, whose vertical positions do not match, into character strings 36 whose bottoms coincide with lines 35 set at a predetermined interval. The horizontal distance 37 between characters is the average blank length calculated by the position correction unit 9. The image data corrected in this way is temporarily stored in the memory 11 and is output, for example by printing, by the output unit 12.
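The baseline alignment performed by the reconstruction unit can be sketched as follows. The sketch assumes boxes of the form (x, y_top, width, height) with y increasing downward, and evenly pitched baselines; both function names and the sample values are hypothetical.

```python
def baselines(first_baseline, pitch, line_count):
    """Y positions of ruled lines spaced at a fixed pitch."""
    return [first_baseline + i * pitch for i in range(line_count)]


def align_to_baseline(boxes, baseline_y):
    """Shift each box vertically so that its bottom edge sits on baseline_y."""
    return [(x, baseline_y - height, width, height)
            for x, _y_top, width, height in boxes]


# Two characters of one line whose bottoms do not line up.
line_boxes = [(0, 3, 10, 20), (15, 7, 10, 18)]
rules = baselines(first_baseline=40, pitch=30, line_count=3)
print(align_to_baseline(line_boxes, rules[0]))
```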

[0021] According to the above embodiment, therefore, the character line search unit 4 cuts out the handwritten character string line by line, the character area extraction unit 5 cuts out each character of the handwritten character string of each line, the character component detection unit 6 detects the line segment elements of each character, the size changing unit 8 determines the size of each character and scales its vertical and horizontal lengths 38 to an appropriate size, the position correction unit 9 corrects the horizontal position of each character so that the horizontal spacing within the character string is constant, and the reconstruction unit 10 corrects the lines so that the spacing between the character strings of the lines is constant. A clean copy of a handwritten document can therefore be made automatically without the operator inputting any clean-copy data.

[0022] In this case, the number of characters in each line is unchanged, and characters deliberately handwritten at a particular size are scaled at a ratio corresponding to that size, so the output has the layout the writer intended. In addition, because the line segments of the handwritten characters are vectorized, blurring, breaks, added noise and the like can be prevented, and the writer's handwriting can be preserved.

[0023] FIG. 11 is a flowchart for explaining the operation of the character component detection unit 61 in a second embodiment of the automatic document clean copying device according to the present invention, and FIG. 12 is an explanatory diagram showing its scaling operation. In FIG. 11, the character component detection unit 61 recognizes each character cut out by the character area extraction unit 5 of the first embodiment by matching against dictionary patterns, together with structural analysis based on the character components, semantic analysis of the characters within the text, and the like (step 41). It then vectorizes the outline of the font data by referring to the character font of the dictionary pattern (step 42). Accordingly, as shown in FIG. 12, when the size changing unit 9 enlarges the font data 43 of the dictionary pattern, a character 44 resembling printed type is output, so the handwritten character string can be completely clean-copied.
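The dictionary matching of step 41 can be illustrated with a nearest-template search. The source does not specify a matching metric; this sketch assumes equal-sized binary bitmaps compared by Hamming distance, and it leaves out the structural and semantic analysis mentioned in the text. The dictionary contents and function names are hypothetical.

```python
def hamming_distance(a, b):
    """Number of pixels that differ between two equal-sized binary bitmaps."""
    return sum(pa != pb
               for row_a, row_b in zip(a, b)
               for pa, pb in zip(row_a, row_b))


def recognize(glyph, dictionary):
    """Return the label of the dictionary pattern closest to the glyph."""
    return min(dictionary, key=lambda label: hamming_distance(glyph, dictionary[label]))


# Hypothetical 3x3 dictionary patterns and a slightly distorted handwritten glyph.
dictionary = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
}
handwritten = [[0, 1, 0], [0, 1, 0], [1, 1, 0]]
print(recognize(handwritten, dictionary))
```

Once a label is chosen, the font outline stored for that label, rather than the handwritten strokes, would be the data that is vectorized and scaled.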

[0024]

[Effects of the Invention] As described above, the invention according to claim 1 comprises: reading means for optically reading a handwritten character string; extracting means for searching the handwritten character string read by the reading means line by line; cutting-out means for cutting out, character by character, the handwritten character string of each line extracted by the searching means; determining means for determining the size of each handwritten character cut out by the cutting-out means; scaling means for scaling the vertical and horizontal lengths of each character determined by the determining means so that the character has an appropriate size; arranging means for arranging the characters scaled by the scaling means so that the handwritten character strings of each line extracted by the searching means are appropriately spaced; and output means for outputting the character strings of each line arranged by the arranging means. A clean copy of a handwritten document can therefore be made automatically without the operator inputting any clean-copy data.

[0025] The invention according to claim 2 determines and vectorizes the line segment elements of each handwritten character and scales the vectorized line segment elements, so the clean copy can be made in the handwriting the writer intended.

[0026] The invention according to claim 3 recognizes each handwritten character by matching it against dictionary patterns and vectorizes the outline of the font data by referring to the character font of the dictionary pattern, so handwritten characters can be clean-copied into the character font of the dictionary pattern.

[Brief description of the drawings]

FIG. 1 is a block diagram showing an embodiment of the automatic document clean copying device according to the present invention.

FIG. 2 is an explanatory diagram showing a handwritten manuscript read by the automatic document clean copying device of FIG. 1 and its histogram.

FIG. 3 is an explanatory diagram showing a line image extracted by the character line search unit of FIG. 1.

FIG. 4 is an explanatory diagram showing a character cut out by the character area extraction unit of FIG. 1.

FIG. 5 is an explanatory diagram showing the constituent elements of the character of FIG. 4.

FIG. 6 is a flowchart for explaining the operation of the character component detection unit of FIG. 1.

FIG. 7 is a flowchart for explaining the operation of the character area size changing unit and the position correction unit of FIG. 1.

FIG. 8 is an explanatory diagram showing the scaling operation of the character area size changing unit of FIG. 1.

FIG. 9 is an explanatory diagram showing vector data before and after scaling by the character area size changing unit of FIG. 1.

FIG. 10 is an explanatory diagram showing the operation of the document image shaping and reconstruction unit of FIG. 1.

FIG. 11 is a flowchart for explaining the operation of the character component detection unit in a second embodiment of the automatic document clean copying device according to the present invention.

FIG. 12 is an explanatory diagram showing the scaling operation in the second embodiment.

[Explanation of symbols]

1 Reading unit
4 Character line search unit
5 Character area extraction unit
6 Character component detection unit
8 Size changing unit
9 Position correction unit
10 Document image shaping and reconstruction unit
12 Output unit
61 Character component detection unit

Claims

[Claims]

1. An automatic document clean copying device comprising: reading means for optically reading a handwritten character string; extracting means for searching the handwritten character string read by the reading means line by line; cutting-out means for cutting out, character by character, the handwritten character string of each line extracted by the searching means; determining means for determining the size of each handwritten character cut out by the cutting-out means; scaling means for scaling the vertical and horizontal lengths of each character determined by the determining means so that the character has an appropriate size; arranging means for arranging the characters scaled by the scaling means so that the handwritten character strings of each line extracted by the searching means are appropriately spaced; and output means for outputting the character strings of each line arranged by the arranging means.

2. The automatic document clean copying device according to claim 1, wherein the scaling means determines and vectorizes the line segment elements of each character determined by the determining means, and scales the vectorized line segment elements.

3. The automatic document clean copying device according to claim 1, wherein the determining means recognizes each handwritten character cut out by the cutting-out means by matching it against dictionary patterns, and vectorizes the outline of the font data by referring to the character font of the dictionary pattern.