JP3025382B2

JP3025382B2 - Document processing device

Info

Publication number: JP3025382B2
Application number: JP4239308A
Authority: JP
Inventors: 由治高橋; 克典竹田
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1992-09-08
Filing date: 1992-09-08
Publication date: 2000-03-27
Anticipated expiration: 2015-03-27
Also published as: JPH0689276A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Industrial Field of Application] The present invention relates to a document processing apparatus provided with character reading means used for input to a personal computer, for example input to a Japanese word processor that can also be applied to a DTP system, input to an optical filing system, or data entry such as card creation.

[0002]

[Prior Art] Conventionally, a document processing apparatus that optically reads the document data of an original with an OCR (Optical Character Reader) and applies various control processes to that document data generally processes the data as follows.

[0003] That is, the OCR first scans the character original and stores a bit image. The bit image is segmented into rectangular regions, one per character, and the distribution and peculiarities of the bit image are extracted. Each character is then recognized against a matching dictionary to obtain a character code. The character codes are converted into transmission data and sent to the document processing unit, which applies various control processes to the document data.

[0004]

[Problems to be Solved by the Invention] However, in the conventional document processing apparatus described above, the only data read by the OCR is the character code, so the transmission data sent to the document processing unit contains no data on the point size (number of points) or typeface of the characters. Consequently, after the document processing unit receives the data, the user has to use the document editing functions to redo the document so that the point sizes, typefaces, and so on match the original.

[0005]

[Means for Solving the Problems] To solve the above problem, the document processing apparatus of the present invention employs the following means.

[0006] That is, the document processing apparatus of the present invention comprises: character reading means for reading each character of an original as a character code and also reading the point size of each character as data; and point number adjusting means for aligning, in units of independent words, the point size of each character read by the character reading means to the largest point size within the independent word.

[0007] The document processing apparatus of the present invention also comprises: character reading means for reading each character of an original as a character code and also reading the point size of each character as data; a point number information table in which a plurality of point sizes are registered; and point number adjusting means for adjusting the point size of each character read by the character reading means so that it is uniform within each independent word, and for aligning the adjusted point size to the nearest point size registered in the point number information table.

[0008] Further, the document processing apparatus of the present invention comprises: character reading means for reading each character of an original as a character code and also reading the typeface of each character as data; and typeface adjusting means for aligning, in units of independent words, the typeface of each character read by the character reading means to either the typeface chosen by majority among the characters in the independent word or the typeface that appears first.

[0009] Further, the document processing apparatus of the present invention comprises: character reading means for reading each character of an original as a character code and also reading the typeface of each character as data; a font type information table in which a plurality of typefaces, including the basic typeface of the document processing apparatus, are registered; and typeface adjusting means for adjusting the typeface of each character read by the character reading means so that it is uniform within each independent word, and, if the adjusted typeface is not registered in the font type information table, for aligning the adjusted typeface to the basic typeface.

[0010]

[Operation] With the above configuration, the character reading means reads the character code of each character of the original together with the point size and typeface data of each character. The point size and typeface data read by the character reading means are then adjusted by the point number adjusting means and the typeface adjusting means, respectively, so that they are uniform within each independent word. In addition, the adjusted point size is aligned to the nearest registered point size, and if the adjusted typeface is not registered, it is aligned to the basic typeface. As a result, when the document data is printed out, for example, the point size and typeface settings need not be entered manually, which improves operability.

[0011]

[Embodiment] An embodiment of the present invention is described below with reference to FIGS. 1 to 4.

[0012] The document processing apparatus according to this embodiment comprises a character reading unit (character reading means, point number adjusting means, and typeface adjusting means) that reads each character (document data) of an original as a character code, and a document processing unit (point number adjusting means and typeface adjusting means) that applies various control processes to the document data read by the character reading unit.

[0013] As shown in FIG. 1, the character reading unit first reads each character of the original optically as a bit image using an OCR (Optical Character Reader), not shown (S1). Along with reading the characters as bit images for character-code recognition, this OCR reads the point size and typeface of each character as data. Then, to prepare the bit image for matching, rectangular regions are designated, based on the distribution of the bit image and similar cues, so that each region forms a character, and the bit image is divided into these minimum units (S2).

[0014] Next, the rectangular bit-image regions are compared against the matching dictionary on the basis of their features and peculiarities. This yields the character code, point size, typeface, and so on of the most similar characters. Because matching each character in isolation may produce a result that makes no sense as a word, the recognition results for the preceding and following characters are consulted to correct and reinforce the feature information, which raises the recognition rate. As a safeguard against misrecognition, several candidate character codes are output for each character, and if an independent word can be formed from the candidate character lists, the character codes forming it are moved to the first-candidate position (S3).
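The candidate promotion in S3 can be sketched as follows. This is only an illustration under stated assumptions: the function name `promote_word_candidates`, the `lexicon` set standing in for a dictionary of independent words, and the exhaustive search order are all hypothetical, since the patent does not describe how the word check is carried out.

```python
from itertools import product


def promote_word_candidates(candidates, lexicon):
    """Move candidates that jointly form a known word to first position.

    candidates: one list of candidate characters per position, best first.
    lexicon:    hypothetical set of independent words (an assumption; the
                patent does not say how word formation is checked).
    """
    for combo in product(*candidates):
        if "".join(combo) in lexicon:
            # Put the chosen character first and keep the rest in order.
            return [[ch] + [c for c in cands if c != ch]
                    for ch, cands in zip(combo, candidates)]
    # No word could be formed: leave the candidate lists unchanged.
    return [list(cands) for cands in candidates]
```

For instance, if the OCR ranks 「白」 above 「日」 for the first character but 「日本」 is in the lexicon, 「日」 is moved to the front of its candidate list. The exhaustive product is acceptable only for short words and short candidate lists.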

[0015] After the matching process, the point size is adjusted (S4) and the typeface is adjusted (S5) so that both are uniform within each independent word. The point size is adjusted by aligning every character to the largest point size in the independent word. The typeface is adjusted by aligning every character either to the typeface chosen by majority within the independent word or to the typeface that appears first.
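A minimal sketch of the per-word adjustment in S4 and S5 follows. The `Char` record, the function name `unify_word`, and the `typeface_rule` parameter are hypothetical names introduced for illustration, and the segmentation of text into independent words is assumed to have been done already. Tie-breaking in the majority vote is not specified in the source; here a tie falls to the typeface that appears first.

```python
from collections import Counter
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Char:
    code: str        # recognized character
    points: float    # point size read by the OCR
    typeface: str    # typeface read by the OCR


def unify_word(chars, typeface_rule="majority"):
    """Align all characters of one independent word (hypothetical helper)."""
    if not chars:
        return []
    # S4: every character takes the largest point size in the word.
    size = max(c.points for c in chars)
    # S5: majority typeface, or the first typeface that appears.
    if typeface_rule == "majority":
        # Counter keeps first-seen order for equal counts, so a tie
        # resolves to the earliest typeface (an assumption, not the source).
        face = Counter(c.typeface for c in chars).most_common(1)[0][0]
    else:
        face = chars[0].typeface
    return [replace(c, points=size, typeface=face) for c in chars]
```

Calling `unify_word` once per independent word yields characters whose size and typeface no longer vary inside a word, which is the precondition for the run-based transmission data described next.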

[0016] The point size, typeface, character code, and so on obtained as recognition results are then sent as transmission data to the document processing unit described below (S6), which completes the processing in the character reading unit.

[0017] As shown in FIG. 2, consider for example the sentence 「日本の首都は東京である」 ("The capital of Japan is Tokyo"), in which 「日本の首都は」 is set in 7.5-point Kyokasho (textbook) typeface, 「東京」 in 10.5-point Gothic typeface, and 「である」 in 8-point Kyokasho typeface. The transmission data for this sentence is sent in the format shown in FIG. 3. The format in FIG. 3 is a provisional example that omits the character count, candidate character codes, checksum, and so on.
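The example sentence can be modeled as a sequence of runs, each carrying one point size, one typeface, and the characters that share them. The sketch below is an assumption for illustration only: the actual layout of FIG. 3 is not reproduced in this text, and `to_runs` and `SENTENCE` are hypothetical names. "Kyokasho" denotes the textbook typeface.

```python
from itertools import groupby

# The example sentence as (character, point size, typeface) triples.
SENTENCE = (
    [(ch, 7.5, "Kyokasho") for ch in "日本の首都は"]
    + [(ch, 10.5, "Gothic") for ch in "東京"]
    + [(ch, 8.0, "Kyokasho") for ch in "である"]
)


def to_runs(chars):
    """Group consecutive characters with equal size and typeface.

    Returns (points, typeface, text) tuples. This is a hypothetical
    stand-in for the transmission format, not the layout of FIG. 3.
    """
    return [
        (points, face, "".join(c[0] for c in group))
        for (points, face), group in groupby(chars, key=lambda c: (c[1], c[2]))
    ]
```

Applied to `SENTENCE`, this produces three runs, one for each of the three differently formatted segments.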

[0018] Meanwhile, as shown in FIG. 4, the document processing unit comprises a reception buffer 1 that stores the transmission data from the character reading unit as received data, a point number information table (point number recognition means) 2 for recognizing the point size of each character in the received data, a font type information table (typeface recognition means) 3 for recognizing the typeface of each character in the received data, and a document buffer (document storage means) 4 that stores, as document data, the character data recognized through information tables 2 and 3.

[0019] The data processing in the document processing unit is described below.

[0020] First, in S10, the data sent from the character reading unit is received and stored in the reception buffer 1 as received data.

[0021] Candidate character selection is then performed on the character codes in the received data (S11). This is needed because, for example, when the character 「日」 is read by the OCR, it is not necessarily recognized as 「日」. The received data therefore carries several candidate character codes for each character. In the case of 「日」, for example, candidates such as 「白」, 「臼」, and 「目」 accompany it, and the user selects the appropriate character code from these candidates on the screen and enters it into the document buffer 4.

[0022] For the point size data in the received data, the point number information table 2 is searched for the point size taken from the reception buffer 1 (S12). If that point size is not in the point number information table 2, the nearest registered point size is selected. For example, if five point sizes (6.5, 8, 9, 10, and 12 points) are registered in the point number information table 2 and the point size taken from the reception buffer 1 is 7.5 points, it is processed as 8 points, the nearest registered size.
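The lookup in S12 amounts to a nearest-value search over the registered sizes. The sketch below uses the five sizes from the example; the names `POINT_TABLE` and `nearest_registered_size` are hypothetical. The source does not say what happens when a size is equally close to two registered sizes, so the tie-breaking here (the smaller size wins, because `min` returns the first minimum in an ascending table) is an assumption.

```python
# Registered sizes from the example in the text, in ascending order.
POINT_TABLE = (6.5, 8.0, 9.0, 10.0, 12.0)


def nearest_registered_size(points, table=POINT_TABLE):
    """Return `points` if registered, else the closest registered size."""
    if points in table:
        return points
    # Assumption: on a tie, the earlier (smaller) table entry is chosen.
    return min(table, key=lambda p: abs(p - points))
```

With this table, 7.5 points maps to 8 points as in the text, and the 10.5-point text of the earlier example sentence would map to 10 points.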

[0023] Next, the point size currently in effect for input is obtained from the input setting state at the cursor position in the document buffer 4 and compared with the point size obtained by the replacement process. If they are the same, no change is made to the document buffer 4. If they differ, the input setting state of the document buffer 4 is set to the new point size (S13). This setting remains in effect until the next point size or the last item of received data has been processed. After that, because leaving it in place would affect the point size of text following the pre-reception cursor position, the setting automatically reverts to the original point size.
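The set-and-revert behavior of S13 can be sketched as below. The `DocumentBuffer` class, its fields, and the `receive` function are hypothetical names introduced for illustration; the patent describes the behavior, not an interface. The same pattern would apply to the typeface setting.

```python
class DocumentBuffer:
    """Toy model of the document buffer's input setting (an assumption)."""

    def __init__(self, points):
        self.points = points        # input setting at the cursor position
        self.text = []              # stored (character, point size) pairs
        self.setting_changes = 0    # how often the setting was rewritten

    def set_points(self, points):
        # If the size already matches, nothing is written to the buffer.
        if points != self.points:
            self.points = points
            self.setting_changes += 1

    def insert(self, s):
        self.text.extend((ch, self.points) for ch in s)


def receive(buf, runs):
    """Insert (points, text) runs, then restore the original setting."""
    original = buf.points
    try:
        for points, s in runs:
            buf.set_points(points)
            buf.insert(s)
    finally:
        # Revert so text after the pre-reception cursor is unaffected.
        buf.points = original
```

A buffer that starts at 8 points and receives runs at 8, 10, and 8 points changes its setting twice and ends at 8 points again.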

[0024] For the typeface data in the received data, the font type information table 3 is searched for the typeface taken from the reception buffer 1 (S14). If that typeface is not in the font type information table 3, Mincho, the basic typeface of the document processing apparatus, is selected. For example, if five typefaces (Mincho, Gothic, Seikaisho, Gyosho, and Maru Gothic) are registered in the font type information table 3 and the typeface taken from the reception buffer 1 is Kyokasho (textbook typeface), it is processed as Mincho, the basic typeface.
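The typeface lookup in S14 is a membership test with a fallback. In this sketch the romanized typeface names and the identifiers `TYPEFACE_TABLE`, `BASIC_TYPEFACE`, and `resolve_typeface` are hypothetical; the table contents follow the example in the text.

```python
BASIC_TYPEFACE = "Mincho"

# The five registered typefaces from the example in the text.
TYPEFACE_TABLE = frozenset(
    {"Mincho", "Gothic", "Seikaisho", "Gyosho", "Maru Gothic"}
)


def resolve_typeface(name, table=TYPEFACE_TABLE, basic=BASIC_TYPEFACE):
    """Return `name` if registered, else the basic typeface."""
    return name if name in table else basic
```

Unlike point sizes, typefaces have no notion of "nearest", which is presumably why an unregistered typeface falls back to a single default rather than to a similar one.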

[0025] Next, the typeface currently in effect for input is obtained from the input setting state at the cursor position in the document buffer 4 and compared with the typeface obtained by the replacement process. If they are the same, no change is made to the document buffer 4. If they differ, the input setting state of the document buffer 4 is set to that typeface (S15). This setting remains in effect until the next typeface or the last item of received data has been processed. After that, because leaving it in place would affect the typeface of text following the pre-reception cursor position, the setting automatically reverts to the original typeface.

[0026] As described above, the document processing apparatus of this embodiment comprises a character reading unit that reads each character (document data) of an original as a character code and also reads the point size and typeface of each character as data, and a document processing unit that stores the transmission data from the character reading unit as received data and applies various control processes to the received data. The document processing unit comprises the point number information table 2 for recognizing the point size of each character in the received data, the font type information table 3 for recognizing the typeface of each character in the received data, and the document buffer 4 that stores the character data recognized through information tables 2 and 3.

[0027] Therefore, in addition to the character code of each character making up the document data, the point size and typeface of each character can be stored as data in the document buffer 4 of the document processing unit. Accordingly, when a printer is connected to the document processing apparatus to print the document data, the document can be printed out automatically in the point sizes and typefaces of the original.

[0028]

[Effects of the Invention] As described above, the document processing apparatus of the present invention comprises: character reading means for reading each character of an original as a character code and also reading the point size of each character as data; and point number adjusting means for aligning, in units of independent words, the point size of each character read by the character reading means to the largest point size within the independent word.

[0029] The document processing apparatus of the present invention also comprises: character reading means for reading each character of an original as a character code and also reading the point size of each character as data; a point number information table in which a plurality of point sizes are registered; and point number adjusting means for adjusting the point size of each character read by the character reading means so that it is uniform within each independent word, and for aligning the adjusted point size to the nearest point size registered in the point number information table.

[0030] Further, the document processing apparatus of the present invention comprises: character reading means for reading each character of an original as a character code and also reading the typeface of each character as data; and typeface adjusting means for aligning, in units of independent words, the typeface of each character read by the character reading means to either the typeface chosen by majority among the characters in the independent word or the typeface that appears first.

[0031] Further, the document processing apparatus of the present invention comprises: character reading means for reading each character of an original as a character code and also reading the typeface of each character as data; a font type information table in which a plurality of typefaces, including the basic typeface of the document processing apparatus, are registered; and typeface adjusting means for adjusting the typeface of each character read by the character reading means so that it is uniform within each independent word, and, if the adjusted typeface is not registered in the font type information table, for aligning the adjusted typeface to the basic typeface.

[0032] As a result, the point size and typeface data read by the character reading means are adjusted by the point number adjusting means and the typeface adjusting means, respectively, so that they are uniform within each independent word. In addition, the adjusted point size is aligned to the nearest registered point size, and if the adjusted typeface is not registered, it is aligned to the basic typeface. Accordingly, when a printer is connected to the document processing apparatus to print the document data, the document can be printed out automatically in the point sizes and typefaces of the original, which makes it possible to provide a convenient document processing apparatus with improved operability.

[Brief description of the drawings]

FIG. 1 is a flowchart showing the data processing in the character reading unit of the document processing apparatus according to an embodiment of the present invention.

FIG. 2 is an explanatory diagram showing the data processed by the character reading unit.

FIG. 3 is an explanatory diagram showing the transmission format of the data.

FIG. 4 is a block diagram showing the document processing unit of the document processing apparatus.

[Explanation of symbols]

2 Point number information table
3 Font type information table
4 Document buffer

Continued from the front page

(58) Fields searched (Int. Cl.7, DB name): G06F 17/21 - 17/28; G06K 9/20

Claims

(57) [Claims]

1. A document processing apparatus comprising: character reading means for reading each character of an original as a character code and also reading the point size of each character as data; and point number adjusting means for aligning, in units of independent words, the point size of each character read by the character reading means to the largest point size within the independent word.

2. A document processing apparatus comprising: character reading means for reading each character of an original as a character code and also reading the point size of each character as data; a point number information table in which a plurality of point sizes are registered; and point number adjusting means for adjusting the point size of each character read by the character reading means so that it is uniform within each independent word, and for aligning the adjusted point size to the nearest point size registered in the point number information table.

3. A document processing apparatus comprising: character reading means for reading each character of an original as a character code and also reading the typeface of each character as data; and typeface adjusting means for aligning, in units of independent words, the typeface of each character read by the character reading means to either the typeface chosen by majority among the characters in the independent word or the typeface that appears first.

4. A document processing apparatus comprising: character reading means for reading each character of an original as a character code and also reading the typeface of each character as data; a font type information table in which a plurality of typefaces, including the basic typeface of the document processing apparatus, are registered; and typeface adjusting means for adjusting the typeface of each character read by the character reading means so that it is uniform within each independent word, and, if the adjusted typeface is not registered in the font type information table, for aligning the adjusted typeface to the basic typeface.