JPH10161825A

JPH10161825A - Illegal character check method, and device for generating illegal character check data

Info

Publication number: JPH10161825A
Application number: JP8315145A
Authority: JP
Inventors: Hideki Shibata; 英樹柴田
Original assignee: Dainippon Screen Manufacturing Co Ltd
Current assignee: Dainippon Screen Manufacturing Co Ltd
Priority date: 1996-11-26
Filing date: 1996-11-26
Publication date: 1998-06-19
Anticipated expiration: 2016-11-26
Also published as: JP3402971B2

Abstract

PROBLEM TO BE SOLVED: To reduce the man-hours needed for an illegal character check when a document is printed in an environment different from the one in which it was generated. SOLUTION: On the document generation side, a character type extraction part 10 refers to a candidate information table 14 to extract the characters that must undergo the illegal character check from a document file 100, and registers each combination of font name and character code (called a character type) in a character type list 12. A check data generation part 16 edits the character type data in the list 12, composes the edited result based on style information 18, and produces a check data file 200 that describes the composition result. The file 200 is stored on a recording medium 202 and is also printed on paper as a check sheet 204. The medium 202 and the sheet 204 are sent to the printing side. On the printing side, the file 200 is printed on paper, and illegal characters arising between the document generation side and the printing side are checked by comparing the printed result with the sheet 204.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION 1. Field of the Invention: The present invention relates to a method for checking for garbled characters when a document created in one information processing environment is printed in another information processing environment.

[0002]

2. Description of the Related Art: In general, printed matter is produced through the steps of planning, manuscript creation, editing, typesetting, and printing. In recent years, the steps from editing onward have been increasingly digitized and are widely used in the form of, for example, desktop publishing (DTP) systems.

[0003] Page description languages (hereinafter, PDLs) have also become widespread as a format for print data transferred from a computer to a printer. A PDL can describe the print image of each page in a form that does not depend on the capabilities (resolution, etc.) of the printer, and the printer prints the image described in the PDL according to its own resolution and other characteristics. On the computer side, a document is created with document editing software or the like, and at print time PDL data is generated from the document data and sent to the printer. The printer has a PDL interpreter; it interprets the PDL data, generates a raster image of each page in memory, and prints on paper or the like according to that raster image. In recent years, PostScript (a trademark of Adobe Systems of the United States) has become widespread as the de facto standard PDL, and a wide variety of PostScript-compatible printing devices, from personal to commercial use, are on the market.

[0004] Under these circumstances, services have appeared that receive document data from customers in the form of a PDL such as PostScript and carry out the printing on their behalf. In the printing industry as well, a division of labor is developing between businesses that electronically edit manuscripts and businesses known as "output centers," which receive documents from them as PDL data and output them to a typesetter. Thus, in recent years, documents are increasingly printed in an environment different from the one in which they were created.

[0005] However, it is often the case that a font present in the document creation environment is missing from the printing environment, or that the character codes of user-defined characters (gaiji) and the like do not match between the document creation side and the printing side. In such cases, the characters intended by the document creator are not printed correctly on the printing side, and so-called garbled characters occur. Conventionally, therefore, the print result output on the printing side was compared with the original manuscript to check whether every character was printed as the document creator intended.

[0006]

Problem to Be Solved by the Invention: However, the time and labor required for such checking are enormous, and a way to reduce the labor has been sought.

[0007] SUMMARY OF THE INVENTION The present invention has been made to solve this problem, and its object is to provide a method and an apparatus that reduce the labor of checking for garbled characters when a document is printed in an environment different from the one in which it was created.

[0008]

Means for Solving the Problem: To achieve the above object, the garbled character inspection method according to the present invention is a method for inspecting garbled characters when a document created in a first information processing environment is printed in a second information processing environment. In the first information processing environment, the mutually distinct character types contained in the created document are extracted to create inspection data, the inspection data is printed, and the inspection data and its print result are conveyed to the second information processing environment. In the second information processing environment, the conveyed inspection data is printed, and garbled characters are inspected by comparing this print result with the print result from the first information processing environment.

[0009] In this configuration, a character type means an individual "character" specified by a character code, a font, and the like. In the first information processing environment, where the document was created, only the mutually distinct character types are extracted from the document to create the inspection data. The print result of this inspection data in the first information processing environment is compared with the print result of the same inspection data in the second information processing environment, where the document is to be printed, to detect garbled characters between the two environments. Because the inspection data is the document with duplicate characters removed, its size, that is, its number of characters, is far smaller than that of the original document. Garbled characters can therefore be checked with much less time and effort than when the full text of the document is compared.
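
The deduplication idea in this paragraph can be illustrated with a minimal Python sketch. It is not taken from the patent: the function name `distinct_char_types`, the representation of a character type as a `(font name, character code)` tuple, and the sample values are assumptions made only for illustration.

```python
from typing import Iterable, List, Tuple

# Assumed representation: a character type is a (font name, character code) pair.
CharType = Tuple[str, int]

def distinct_char_types(chars: Iterable[CharType]) -> List[CharType]:
    """Return each (font, code) pair once, in order of first appearance."""
    seen = set()
    result = []
    for char_type in chars:
        if char_type not in seen:
            seen.add(char_type)
            result.append(char_type)
    return result

# Hypothetical document: four characters, one of which repeats in the same font.
document = [
    ("FONT-A", 0x8354),
    ("FONT-A", 0x8393),
    ("FONT-A", 0x8354),  # duplicate of the first entry, dropped
    ("FONT-B", 0x8354),  # same code in another font is a different character type
]
print(distinct_char_types(document))
```

The inspection data built from such a list grows with the number of distinct character types rather than with the length of the document, which is the source of the labor saving described above.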

[0010] The present invention is also characterized in that, in the first information processing environment, the character types that belong to predetermined inspection target character types are extracted from the created document to create inspection data, the inspection data is printed, and the inspection data and its print result are conveyed to the second information processing environment; and in the second information processing environment, the conveyed inspection data is printed and garbled characters are inspected by comparing this print result with the print result from the first information processing environment.

[0011] In this configuration, not all character types contained in the created document are subjected to the garbled character inspection; only those that fall under the predetermined inspection target character types are. For example, characters such as those of JIS level 1 are standardized, and the possibility of their being garbled is extremely low. By excluding such low-risk character types, treating only the character types likely to be garbled as inspection targets, and extracting only the character types that fall under them, the size of the inspection data can be reduced further and the inspection can be performed efficiently.

[0012] To achieve the above object, a garbled character inspection data creating apparatus according to the present invention comprises character type extracting means for extracting the mutually distinct character types contained in document data to be printed, and data generating means for generating inspection data based on the extracted character type data. With this configuration, all character types contained in the document data to be printed can be extracted and inspection data can be created.

[0013] A garbled character inspection data creating apparatus according to the present invention also comprises a candidate information table in which information for specifying the character types to be inspected is registered; character type extracting means for extracting, from the document data to be printed, the character types that belong to the inspection target character types specified by the information in the candidate information table; and data generating means for generating inspection data based on the extracted character type data. In this configuration, information for specifying the character types that should be inspected for garbling is registered in the candidate information table. Based on this information, the character type extracting means extracts the character types that qualify as inspection targets from the document data to be printed. With this configuration, only the characters to be inspected, out of all the characters contained in the document to be printed, are extracted to generate the inspection data.

[0014] In a preferred aspect of the present invention, the data generating means arranges the character types extracted by the character type extracting means grouped by font. With this configuration, the character types in the inspection data are grouped and arranged font by font, so the print result of the inspection data shows the character types to be inspected laid out by font, which makes the inspection easier.

[0015] More preferably, the data generating means has a function of adding, to each per-font arrangement of character types, character string data representing the corresponding font name. With this configuration, the font name corresponding to each per-font arrangement is printed in the print result of the inspection data, so the font can be identified easily during the inspection.
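
The per-font arrangement with a font-name label can be pictured with the following Python sketch. It is only an illustration under stated assumptions: the function name `layout_by_font`, the bracketed heading format, the alphabetical ordering of fonts, and the sample characters are not specified by the patent, and a real implementation would emit PDL rather than plain text lines.

```python
from collections import defaultdict

def layout_by_font(char_types):
    """Group (font, character) pairs by font and label each group.

    Returns plain text lines: a heading with the font name, followed by the
    characters of that font. The heading format and the sort order are
    assumptions for illustration only.
    """
    groups = defaultdict(list)
    for font, char in char_types:
        groups[font].append(char)

    lines = []
    for font in sorted(groups):
        lines.append(f"[{font}]")               # font-name label for this group
        lines.append(" ".join(groups[font]))    # characters drawn in this font
    return lines

# Hypothetical extracted character types, in document order.
sheet = layout_by_font([("FONT-B", "プ"), ("FONT-A", "サ"), ("FONT-A", "ン")])
for line in sheet:
    print(line)
```

Printing the label in a font known to exist in both environments would keep the label itself readable even when the labeled font is missing on the printing side; the patent text here does not say which font is used for the label.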

[0016] The present invention also provides a recording medium on which is recorded a program for causing a computer to function as means for extracting all character types contained in document data to be printed, and means for generating inspection data based on the extracted character type data.

[0017] The present invention also provides a recording medium on which is recorded a program for causing a computer to function as means for extracting, from document data to be printed, the character types contained in a candidate information table in which the character types subject to garbled character inspection are registered, and means for generating inspection data based on the extracted character type data.

[0018] The concept of the recording medium includes every machine-readable medium on which a program is recorded, such as magnetic media (for example, flexible disks), optically read media (for example, CD-ROMs and magneto-optical disks), and semiconductor storage media (for example, ROM and flash memory). A method of providing and recording the above program via a communication medium is also included in the aspects of the present invention.

[0019]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the present invention are described below with reference to the drawings. First, the overall processing procedure of the garbled character inspection according to the present invention is described with reference to FIG. 1.

[0020] As a premise of the processing shown in FIG. 1, it is assumed that printing devices capable of processing a common PDL (for example, PostScript) are connected to both the document creation environment and the printing environment. The printing device on the document creation side is used to confirm that the character content of the created document data contains no errors when compared with the manuscript of the document; it is a device, such as a CRT display or a laser printer, that can output at least all the characters used in the document. The printing device on the printing side is a device that can output the final product or its equivalent. When the final product is printed paper, it is a proofing device that produces proof prints, or a CRT display, a simple proof printer, or the like that can generate and output the same characters as that device. When the final product is an electronic publication displayed on the CRT of a personal computer or the like, the printing device on the printing side is the display system of the target personal computer or the like, or a CRT device, a simple proof printer, or the like that can generate and output the same characters as that system.

[0021] In FIG. 1, in the document creation environment, the document creator first edits and creates a document on a computer using document editing software or the like (S10). When a print instruction is entered for the created document, the document data is converted by conversion software into a description in a predetermined PDL such as PostScript (S12), and this PDL data is input to the printing device connected to that environment and printed (S14). The document creator inspects the print result for garbled characters, for example by comparing it with the image of the document displayed on the screen (S16). If garbled characters are found at this stage, the document is corrected with, for example, the document editing software (S18), and the corrected result is again converted to PDL, printed, and inspected (S12, S14, S16). Repeating this until no garbled characters remain yields a PDL document file 100 that correctly reflects the document creator's intention. Once the document file 100 is complete, it is analyzed to generate an inspection data file 200 for garbled character inspection (S20). This inspection data is PDL data; the procedure for creating it is described in detail later. The inspection data is also input to the printing device and printed out as an inspection sheet 204 (S22). The document file 100 and the inspection data file 200 are stored on a recording medium 202 such as a flexible disk and sent to the printing environment together with the inspection sheet 204.

[0022] In the printing environment, when the recording medium 202 and the inspection sheet 204 sent from the document creation side are received (S30), the inspection data file 200 is read from the recording medium 202, input to the printing device of that environment, and printed (S32). Each character of this print result is then compared with the corresponding character on the inspection sheet 204 received from the document creation environment (S34) to check for garbled characters. If it is confirmed that no characters are garbled, the document file 100 on the recording medium 202 is input to the printing device and printed (S36). If garbled characters are found, countermeasures are taken (S38), such as registering the fonts or character data needed to print the garbled characters correctly in the printing device of the printing environment, or asking the document creation side to remove the garbled characters from the document.

[0023] Next, the apparatus configuration and processing procedure for generating the garbled character inspection data in this embodiment are described.

[0024] FIG. 2 is a functional block diagram showing the configuration of the garbled character inspection data generating apparatus applied to the method of this embodiment. In FIG. 2, the character type extraction unit 10 extracts, from the document file 100 to be printed, which is described in a PDL, the character types that need to be inspected for garbling. In this embodiment, a kind of character specified by the combination of a font name and a character code is called a character type. Information on each extracted character type (that is, a pair of font and character code) is registered one after another in the character type list 12. The candidate information table 14 holds information indicating the character types that should be inspected for garbling (in other words, the character types that may be garbled). The character type extraction unit 10 performs the extraction while referring to this candidate information table 14. The inspection data generation unit 16 receives the information in the character type list 12 and edits it to create the inspection data file 200. In doing so, the inspection data generation unit 16 performs editing such as grouping the character types registered in the list by font, typesets the edited result according to style information 18 registered in advance, and describes the typeset result in the PDL to create the inspection data file 200. The PDL used here is the same as the PDL that describes the document file. The created inspection data file 200 is stored on the recording medium 202 and is also printed on paper to become the inspection sheet 204.

[0025] This garbled character inspection data generating apparatus can be built on a computer system by loading into memory a program that describes the functions of the character type extraction unit 10 and the inspection data generation unit 16 and executing that program on the CPU. The character type list 12 is built, for example, in a work area reserved in memory. The candidate information table 14 is created in advance by a user or the like and loaded, for example, into memory so that the program can refer to it. Such a program or table data is provided stored on a medium; for example, a flexible disk, a CD-ROM, or a memory card can be used. The program and data recorded on the medium are installed on a storage device built into the computer system, for example a hard disk drive, and thereby contribute to building a garbled character inspection data generating apparatus that executes the program and realizes the functions described in this embodiment. Such a program for generating garbled character inspection data can also be incorporated, for example, into document editing or typesetting software as one of its utilities.

[0026] Next, the procedure by which the apparatus of FIG. 2 generates the garbled character inspection data is described in detail.

[0027] FIG. 3 shows a document used as a concrete example in the following description. FIG. 4 shows an example of the PDL document file for printing the document of FIG. 3, that is, an example of the document file 100 in FIG. 2. This example uses PostScript as the PDL, and part of it is omitted to avoid complexity.

[0028] In FIG. 4, the right column lists the PDL description representing the document of FIG. 3 in order, and the left column shows the meaning of each PDL description, such as the corresponding character in the document. For example, "/FONT-A ..." on line 1 is a description that specifies a font: "FONT-A" is the font name; "ff" is an operator that loads the font corresponding to the font name; "[70 ..]" is a matrix representing a coordinate transformation applied to the font (used, for example, for scaling); "mf" is an operator that generates a new font by applying the matrix to the font; and "setf" is an operator that sets the font as the font used for drawing characters. Line 1 of the document file therefore means that the font named "FONT-A" is loaded, transformed by the specified matrix, and the resulting font is set as the font for drawing characters. A font specification remains in effect until the next font specification is made.

[0029] Lines 2 and 3 of the document file are descriptions that instruct drawing of the character "サ" (sa) in the document of FIG. 3. On line 2, "-07" is a coordinate, and "lcmt" is an operator that sets that coordinate as the reference position for drawing. On line 3, "\203T" is the character code for "サ", and "sh" is an operator that draws the character with that character code using the font in effect at that point. According to lines 2 and 3, therefore, the character "サ" with the character code specified on line 3 is drawn in the font "FONT-A" (set on line 1), with the coordinate specified on line 2 as the reference position. In the same way, lines 4 and 5 describe the character "ン" (n), lines 6 and 7 the character "プ" (pu), and so on: each pair of lines, one specifying the position and one specifying the character code, represents the drawing instruction for one character.
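
As a small worked example of the character code notation, the Python sketch below turns a backslash-octal string such as "\203T" into raw bytes. The patent does not name the character encoding; decoding the bytes as Shift JIS is an assumption made here, chosen because the two bytes 0x83 0x54 are the Shift JIS code for the katakana "サ". The helper name `decode_octal_escapes` is hypothetical.

```python
import re

def decode_octal_escapes(text: str) -> bytes:
    """Turn backslash-octal escapes such as \\203 into single bytes.

    Characters outside an escape are taken as single Latin-1 bytes.
    """
    out = bytearray()
    pos = 0
    for match in re.finditer(r"\\([0-7]{1,3})", text):
        out += text[pos:match.start()].encode("latin-1")
        out.append(int(match.group(1), 8) & 0xFF)  # keep only the low 8 bits
        pos = match.end()
    out += text[pos:].encode("latin-1")
    return bytes(out)

code = decode_octal_escapes(r"\203T")
print(code.hex())                # prints 8354
# Assumption: the string is Shift JIS encoded (not stated in the patent).
print(code.decode("shift_jis"))  # prints サ
```

Octal 203 is 0x83 and the letter "T" is 0x54, so the two-byte code is 0x8354.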

[0030] In the garbled character inspection data generating apparatus of FIG. 2, the character type extraction unit 10 reads and interprets the PDL description of such a document file line by line from the top, identifies the character type of each character in the document, and extracts the character types that require garbled character inspection. In this embodiment, as described above, a character type is identified by the combination of a font and a character code.
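
The line-by-line reading described here might be sketched as follows in Python. This is not the patent's implementation and not a PostScript interpreter: it assumes a simplified, whitespace-separated listing in which a font line starts with "/NAME" and ends with "setf", and a drawing line ends with "sh", modeled loosely on the operators explained for FIG. 4. The sample lines are hypothetical, and the check against the candidate information table is deliberately left out.

```python
def extract_char_types(pdl_lines):
    """Collect distinct (font, code) pairs from a simplified PDL listing.

    Assumed line shapes (illustrative only):
      "/FONT-A ff [70 ..] mf setf"  -> sets the current font to FONT-A
      "-07 lcmt"                    -> sets a position; ignored here
      "\\203T sh"                   -> draws code \\203T in the current font
    """
    current_font = None
    seen = set()
    char_types = []
    for line in pdl_lines:
        tokens = line.split()
        if not tokens:
            continue
        if tokens[-1] == "setf" and tokens[0].startswith("/"):
            current_font = tokens[0][1:]      # font stays valid until the next setf
        elif tokens[-1] == "sh":
            code = " ".join(tokens[:-1])      # raw character code text
            key = (current_font, code)
            if key not in seen:               # register each character type once
                seen.add(key)
                char_types.append(key)
    return char_types

# Hypothetical listing: the same code drawn twice in FONT-A, once in FONT-B.
sample = [
    r"/FONT-A ff [70 ..] mf setf",
    r"-07 lcmt",
    r"\203T sh",
    r"-07 lcmt",
    r"\203T sh",
    r"/FONT-B ff [70 ..] mf setf",
    r"-07 lcmt",
    r"\203T sh",
]
print(extract_char_types(sample))
```

Tracking the current font as state mirrors the rule that a font specification stays in effect until the next one, so each drawn code can be paired with the font active at that point.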

【００３１】また図５は、図２における候補情報テーブ
ル１４の内容の一例を概念的に示したものである。実際
の候補情報テーブル１４は、同様の内容をコンピュータ
で読み取り可能な形式で記述したものとなる。図５の例
では、文字化け検査が不要なフォントのフォント名が欄
３００に、文字化け検査が不要な文字コードの範囲が欄
３０２に、文字化け検査が必要な文字コードの範囲が欄
３０４に、それぞれ登録されている。検査不要のフォン
トとしては、例えば文書作成環境及び印刷環境の両方に
インストールされ、文字化けがないことが分かっている
ものなどが考えられる。文字化け検査が不要な文字コー
ドとしては、例えばＪＩＳの第１水準や第２水準など、
フォントメーカー間で統一されている文字コードが考え
られる。図５の例ではその様な統一された文字コードの
範囲が、検査不要の文字コード範囲として、ＪＩＳ区点
コードの形で登録されている（０１区から１０区、及び
１６区〜８３区）。本実施形態では、文字コード範囲に
該当する文字種は、基本的に文字化けの検査対象から外
す。ただし、ＪＩＳの第１水準や第２水準のコードの範
囲内でも、例えば０２区２６点〜０２区末尾や０８区０
１点〜０８区末尾などのように対応文字が未定義の範囲
があり、このような範囲の文字コードは各フォントメー
カーが自由に利用できる。このため、このようなＪＩＳ
に未定義の範囲の文字コードには、フォントごとに異な
った文字が割り当てられている可能性が高く、文字化け
の可能性がある。そこで、図５では、このような範囲
が、文字化け検査が不要な文字コード範囲の中の例外と
して、文字化け検査が必要な文字コード範囲の欄３０４
に登録されている。また、ＪＩＳの新旧規格間で文字の
形が異なっている文字コード（例えば２２区３８点）も
あり、そのような文字コードも文字化け検査が必要な文
字コード範囲の欄３０４に登録されている。図５では、
文字コードがＪＩＳ区点コードで表されているが、文字
種抽出部１０は、これをＰＤＬが採用する例えば８進あ
るいは１６進などのコード表現に変換して解釈する。こ
れら候補情報テーブル１４の登録情報は、各文字種が文
字化け検査対象か否かを判定する際の判定条件として用
いられる。これら判定条件の適用の仕方については、後
述する具体的な処理手順の説明において詳しく述べる。
なお、候補情報テーブル１４は、ユーザやシステム管理
者が予めエディタなどを用いて作成しておく。FIG. 5 conceptually shows an example of the contents of the candidate information table 14. The actual candidate information table 14 describes the same contents in a computer-readable format. In the example of FIG. 5, font names of fonts that do not require garbled character inspection are registered in column 300, character code ranges that do not require inspection in column 302, and character code ranges that require inspection in column 304. A font that does not require inspection is, for example, a font that is installed in both the document creation environment and the printing environment and is known not to produce garbled characters. Character codes that do not require inspection are, for example, the JIS level 1 and level 2 character codes, which are unified among font makers. In the example of FIG. 5, this unified range is registered in column 302 in the form of JIS kuten (row-cell) codes (rows 01 to 10 and rows 16 to 83). In the present embodiment, character types whose codes fall in this range are basically excluded from inspection. However, even within the JIS level 1 and level 2 range there are ranges in which no character is defined, such as row 02 cell 26 to the end of row 02 and row 08 cell 01 to the end of row 08, and each font maker is free to use the character codes in such ranges. Character codes that JIS leaves undefined are therefore likely to be assigned different characters in different fonts, and so may be garbled. In FIG. 5, such ranges are accordingly registered in column 304, the character code ranges requiring inspection, as exceptions within the range that does not require inspection. There are also character codes (for example, row 22 cell 38) whose character shapes differ between the old and new JIS standards, and these are likewise registered in column 304. Although FIG. 5 expresses character codes as JIS kuten codes, the character type extraction unit 10 interprets them after converting them into the code representation used by the PDL, for example octal or hexadecimal. The information registered in the candidate information table 14 is used as the conditions for determining whether each character type is subject to garbled character inspection. How these conditions are applied is described in detail in the explanation of the specific processing procedure below. The candidate information table 14 is created in advance by a user or a system administrator using an editor or the like.
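
The conversion from kuten codes and the range test described above can be illustrated roughly as follows. This Python sketch is not part of the embodiment: the names `kuten_to_jis`, `EXEMPT_RANGES`, `EXCEPTION_RANGES`, and `code_needs_inspection` are hypothetical, only the ranges quoted in the text are listed, and the use of the 7-bit JIS encoding (each of row and cell plus 0x20) is an assumption, since the PDL in question may use another encoding such as Shift JIS or EUC-JP.

```python
def kuten_to_jis(row: int, cell: int) -> int:
    """Convert a JIS kuten (row, cell) pair, each in 1..94, to a 7-bit JIS code."""
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must be in 1..94")
    return ((row + 0x20) << 8) | (cell + 0x20)


# Ranges quoted in the text, as inclusive ((row, cell), (row, cell)) pairs.
# Column 302: ranges that do not require inspection.
EXEMPT_RANGES = [((1, 1), (10, 94)), ((16, 1), (83, 94))]
# Column 304: exceptions that do require inspection.
EXCEPTION_RANGES = [((2, 26), (2, 94)), ((8, 1), (8, 94)), ((22, 38), (22, 38))]


def in_ranges(row, cell, ranges):
    # Tuples compare lexicographically, which matches row-then-cell ordering.
    return any(lo <= (row, cell) <= hi for lo, hi in ranges)


def code_needs_inspection(row, cell):
    """Apply the column 302 / column 304 conditions to one kuten code."""
    if not in_ranges(row, cell, EXEMPT_RANGES):
        return True
    return in_ranges(row, cell, EXCEPTION_RANGES)
```

For instance, under these assumptions row 22 cell 38 maps to the hexadecimal code 0x3646 and is reported as requiring inspection, while row 16 cell 01 is not.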

[0032] Taking the document described above (FIGS. 3 and 4) and the candidate information table (FIG. 5) as concrete examples, the procedure by which the apparatus of FIG. 2 generates garbled character inspection data will now be described. FIG. 6 is a flowchart of this procedure. In FIG. 6, steps S202 to S220 are executed by the character type extraction unit 10, and steps S222 to S238 are executed by the inspection data generation unit 16. The description below refers to FIGS. 2 to 6 as appropriate.

[0033] When a PDL document file 100 is supplied to the garbled character inspection data generation apparatus and an instruction to generate inspection data is input, the character type extraction unit 10 first initializes to 1 a count value n that indicates the ordinal position of the character being processed (S202). In addition to the count value n, the character type extraction unit 10 maintains, as a data structure representing the character currently being processed, a structure containing character string data giving the font name of the currently active font and octal or hexadecimal integer data giving the character code of that character. This structure is hereinafter called the "processing target character data". Next, the character type extraction unit 10 reads the document file sequentially and takes out the data of the n-th character (S204). Here, the character type extraction unit 10 identifies the data of each character by treating, for example, the "sh" operator as the character delimiter. In S204, the font name and the character code are cut out of the extracted character data and set in the processing target character data. If the character data contains no font specification, the font name in the processing target character data is left unchanged. Once the data of the processing target character has been read from the document file in this way, it is determined whether the font name of that character matches a font name registered in the candidate information table 14 as not requiring inspection (S206). If it does, the character is judged not to need extraction as an inspection target. It is then determined whether the end of the document file has been reached (S218); if not, the count value n is incremented by 1 (S220) and processing moves on to the next character.

[0034] If it is determined in S206 that the font is not one that is exempt from inspection, it is determined whether the character code of the processing target character falls within a character code range registered in the candidate information table 14 as not requiring inspection (S208). If it does, it is further checked whether the character code is one that, even within that range, exceptionally requires inspection (S210). If the character code is determined not to be such an exception, the processing target character is judged not to need extraction as an inspection target. It is then determined whether the end of the document has been reached (S218); if not, the count value n is incremented by 1 (S220) and processing moves on to the next character.

[0035] If it is determined in S208 that the character code does not fall within a range that does not require inspection, or in S210 that the character code is one that requires inspection, the processing target character is judged to be an inspection target, and its font name and character code are registered in the character type list 12. In the present embodiment, however, to avoid registering the same character type in the character type list 12 more than once, the font name and character code of the processing target character are compared with each entry currently in the character type list 12 (S212) to check whether the same character type has already been registered (S214). If the processing target character is found not to be registered, the pair of its font name and character code is registered in the character type list 12 (S216). If it is determined in S214 that the same character type is already registered, the processing target character is not registered. In either case, it is then determined whether the end of the document file has been reached (S218); if not, the count value n is incremented by 1 (S220) and processing moves on to the next character.

[0036] By repeating the above procedure until the end of the document file is reached, every character type that is subject to garbled character inspection can be extracted from the document file.
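
The extraction loop of S202 to S220 can be sketched roughly as follows. This is an illustrative Python sketch under stated assumptions, not the implementation of the embodiment: the PDL parsing of S204 is assumed to have already produced a sequence of (font name or None, character code) pairs, and `CandidateTable` and `extract_character_types` are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class CandidateTable:
    """Hypothetical in-memory form of the candidate information table 14."""
    exempt_fonts: set = field(default_factory=set)          # column 300
    exempt_ranges: list = field(default_factory=list)       # column 302, inclusive (lo, hi)
    exception_ranges: list = field(default_factory=list)    # column 304, inclusive (lo, hi)


def _in_ranges(code, ranges):
    return any(lo <= code <= hi for lo, hi in ranges)


def extract_character_types(chars, table):
    """Return the distinct (font, code) pairs that are inspection targets.

    chars: iterable of (font_name_or_None, code) pairs in document order.
    """
    character_types = []      # plays the role of the character type list 12
    current_font = None       # font name held in the processing target character data
    for font, code in chars:                          # S204, S218, S220
        if font is not None:
            current_font = font                       # no font given: keep the previous one
        if current_font in table.exempt_fonts:        # S206
            continue
        if (_in_ranges(code, table.exempt_ranges)     # S208
                and not _in_ranges(code, table.exception_ranges)):  # S210
            continue
        entry = (current_font, code)
        if entry not in character_types:              # S212, S214
            character_types.append(entry)             # S216
    return character_types
```

The list-membership test mirrors the entry-by-entry comparison of S212; for large documents a set could be used instead without changing the result.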

[0037] FIG. 7 shows the data contents of the character type list 12 obtained when the document file shown in FIG. 4 is processed according to the above procedure using the candidate information table shown in FIG. 5. For clarity, the character code column in FIG. 7 shows the corresponding character itself, but in the actual data an integer code is registered.

[0038] When extraction of the inspection target character types from the document file is complete, the inspection data generation unit 16 sorts the character type data in the character type list 12 into a predetermined order (S222). In the present embodiment, the character type data are first grouped by font on the basis of the font name, and then, within each font, arranged by character code, for example in ascending order.
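
The sort of S222 amounts to ordering by font name and then by character code. A minimal Python sketch follows, assuming each entry is a (font name, code) pair; `sort_character_types` is a hypothetical name, and ordering the fonts alphabetically is an assumption, since the text only requires that entries of the same font be grouped together.

```python
def sort_character_types(character_types):
    """Group entries by font name, then order each group by ascending code (S222)."""
    return sorted(character_types, key=lambda entry: (entry[0], entry[1]))
```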

[0039] The inspection data generation unit 16 then generates the inspection data file 200 from the sorted character type list 12 as follows. First, a count value k is initialized to 1 (S224). Next, the data of the k-th character (that is, its font name and character code) is taken from the character type list 12 (S226). It is then determined whether k is 1 (S228). If k = 1, a PDL description for printing the character string representing the font name of that character (that is, the first character in the character type list) is generated and written to the inspection data file 200 (S232). If k is not 1 in S228, the font name of that character (the k-th character) is compared with that of the preceding character (the (k-1)-th character) (S230); if they differ, a PDL description for printing the font name of the k-th character is generated and written to the inspection data file 200 (S232). Once the font name has been written, a PDL description for printing the k-th character is generated from its character code and written to the inspection data file 200 (S234). If, on the other hand, S230 finds that the font of the k-th character is the same as that of the (k-1)-th character, no font name is written, and only the PDL description for printing the k-th character is output to the inspection data file 200 (S234). In S232 and S234, the inspection data generation unit 16 refers to the style information 18 to determine the necessary style, such as the character size, and generates the PDL description according to that style. With the processing of S228 to S234, the PDL description of a font's name is written when the first of the characters belonging to that font is read from the character type list 12. As a result, a printout is obtained in which, for each font, the font name is followed by the inspection target characters belonging to that font. Such a layout makes it easy to identify the font in which garbling has occurred during inspection.

[0040] When the processing of S234 is finished, it is determined whether the end of the character type list 12 has been reached (S236). If not, the count value k is incremented by 1 (S238) and the process returns to S226 to repeat the above processing. Repeating this processing to the end of the character type list 12 yields the inspection data file 200, which consists of data for displaying a list of the inspection target characters organized by font.
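
The loop of S224 to S238 can be sketched roughly as follows. This Python sketch is illustrative only: instead of real PDL descriptions, whose syntax is not reproduced here, it emits abstract records, and the names `generate_check_records`, `"font_name"`, `"char"`, and the default style dictionary are all hypothetical stand-ins.

```python
def generate_check_records(sorted_types, style=None):
    """Emit a font-name record before the first character of each font.

    sorted_types: list of (font, code) pairs already sorted by font, then code.
    style: stand-in for the style information 18; only a size is modeled.
    """
    style = style or {"size": 12}
    records = []                 # stand-in for the inspection data file 200
    previous_font = None
    for k, (font, code) in enumerate(sorted_types, start=1):    # S224, S226, S236, S238
        if k == 1 or font != previous_font:                     # S228, S230
            records.append(("font_name", font, style["size"]))  # S232
        records.append(("char", font, code, style["size"]))     # S234
        previous_font = font
    return records
```

Because the input is sorted by font, comparing each entry only with its predecessor is enough to emit every font name exactly once.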

[0041] FIG. 8 shows an example of the PDL description of the inspection data file 200 obtained in this way. This example is data generated from the document file of FIG. 4. In FIG. 8, for example, the PDL description corresponding to the character string giving the font name "FONT-B" is followed by the PDL descriptions corresponding to the inspection target characters belonging to FONT-B. When the inspection data of FIG. 8 is input to a printing apparatus, the print result shown in FIG. 9 is obtained.

[0042] The procedure for creating the inspection data file 200 in the present embodiment has been described above. The inspection data file 200 obtained by this procedure is stored on a recording medium 202 and sent to the printing side, and is also sent to the printing side in the form of an inspection sheet 204 printed on paper. On the printing side, the inspection data file 200 is supplied to a printing apparatus and printed on paper, and the print result is compared with the inspection sheet 204, so that garbling between the document creation side and the printing side can be inspected.

[0043] As described above, according to the present embodiment, only the inspection target character types determined by the conditions registered in the candidate information table 14 are extracted, without duplication, from the document file 100 to be printed. By printing only the character types extracted in this way on both the document creation side and the printing side and comparing the print results, the presence or absence of garbling between the two sides can be inspected. Because only characters that are likely to be garbled are extracted and inspected, the time and labor required for garbled character inspection can be greatly reduced.

[0044] In the present embodiment, the information registered in the candidate information table 14 is not limited to the format of FIG. 5; any format may be used as long as it can identify the character types to be inspected. For example, a table that enumerates the font names, the character codes, or the combinations of these that represent the character types to be inspected may be used.

[0045] Alternatively, the user may use document editing software to create, in the same way as ordinary document editing and creation, a document listing the character types to be inspected for garbling, and the candidate information table 14 may be generated automatically from this document. In this case, a candidate information table generation tool extracts font names and character codes from the user-created document and registers this information in the candidate information table 14. This method allows the user to customize the candidate information table 14 as needed.
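
A table of this enumerated kind, built from a user-created listing document, might be sketched as follows. This Python sketch is only an illustration: it assumes the listing has already been parsed into (font name or None, character code) pairs, and `build_candidate_table` and `is_inspection_target` are hypothetical names.

```python
def build_candidate_table(listing_chars):
    """Collect every (font, code) pair found in the user's listing document."""
    table = set()
    current_font = None
    for font, code in listing_chars:
        if font is not None:
            current_font = font      # a character without a font keeps the previous font
        table.add((current_font, code))
    return table


def is_inspection_target(font, code, table):
    """A character type is inspected only if the user listed it."""
    return (font, code) in table
```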

[0046] Use of the candidate information table 14 is not essential to the present invention. FIG. 10 is a flowchart showing the procedure for creating inspection data when no candidate information table is used, in particular the processing procedure of the character type extraction unit 10. In the method of FIG. 10, when an instruction to create inspection data is input, the character type extraction unit 10 initializes the count value n to 1 (S240) and takes the n-th character from the document file (S242). In the embodiment described above, the candidate information table 14 was then used to narrow down the inspection targets; in this method, no such narrowing is performed. Instead, the n-th character is compared with the data registered in the character type list 12 (S244), and it is checked only whether the character is already registered in the character type list 12 (S246). If the character is not registered, it is newly registered in the character type list 12 (S248). This processing is repeated to the end of the document file (S250, S252). As a result, every character type contained in the document file is extracted into the character type list 12. Thereafter, the inspection data generation unit 16 performs the processing from S222 of FIG. 6 onward and creates the inspection data from the information in the character type list 12. Even with this method, in which the distinct character types contained in the document file are extracted without narrowing by a candidate information table and garbling is inspected using a printout of that list, the labor and time of checking for garbling are far lower than when the printout of the full text of the document is inspected.
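
The procedure of FIG. 10 reduces to collecting distinct character types in document order. A minimal Python sketch follows, assuming the document has already been parsed into (font name or None, character code) pairs; `extract_all_character_types` is a hypothetical name, and the set used for the membership test is a substitution for the entry-by-entry comparison of S244.

```python
def extract_all_character_types(chars):
    """Return every distinct (font, code) pair, in order of first appearance."""
    seen = set()                 # fast lookup of already registered character types
    character_types = []         # plays the role of the character type list 12
    current_font = None
    for font, code in chars:                 # S242, S250, S252
        if font is not None:
            current_font = font
        entry = (current_font, code)
        if entry not in seen:                # S244, S246
            seen.add(entry)
            character_types.append(entry)    # S248
    return character_types
```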

[0047] In the above embodiment, a character type is identified by the combination of font name and character code, but the way of identifying a character type is not limited to this. For example, a character type may be identified by a combination that includes, in addition to the font name and character code, other attributes of the character such as the character size.

[0048] In the above embodiment, if each inspection target character is printed on the inspection sheet or the like at the same size as in the actual printed document, the user can inspect for garbling on a printout that is close to the actual printed state. To achieve this, character size information is extracted along with the font name and character code when characters are extracted from the document file, and a PDL description reflecting that character size information is generated when the inspection data is created.

[0049] The inspection data for garbled character inspection may also be applied as follows. In general, when a document is created with a DTP system or the like, the display on the screen is checked against the print result to confirm that the correct print result has been obtained, as in S14 to S18 of FIG. 1. Applying the inspection data to this confirmation work improves its efficiency. In this method, the created inspection data is shown on the display and also printed by a printer, and the display and the print result are compared to check for garbling between them. This method saves the labor and time required to inspect for garbling between the display and the print result in the document creation environment.

[0050] The present invention is applicable not only to document files described in a PDL but also to document files expressed in other data formats.

[0051] The inspection data file 200 may also be conveyed to the printing environment without a recording medium, for example by data communication.

[0052]

[Effect of the Invention] As described above, according to the present invention, only the mutually different characters, or only the inspection target character types that may be garbled, are extracted from the document to be printed, and garbling is inspected by comparing the print results of the extracted character types. The labor and time of checking for garbling are therefore far lower than when the printout of the full text of the document is inspected.

[Brief description of the drawings]

FIG. 1 is a flowchart showing the overall flow of the garbled character inspection method according to the present invention.

FIG. 2 is a functional block diagram showing the configuration of the garbled character inspection data generation apparatus according to the present invention.

FIG. 3 is a diagram showing an example of a document to be printed.

FIG. 4 is a diagram showing an example of a page description language (PDL) description representing the document of FIG. 3.

FIG. 5 is a diagram showing an example of the contents of the candidate information table.

FIG. 6 is a flowchart showing the processing procedure of the garbled character inspection data generation apparatus.

FIG. 7 is a diagram showing the data contents of the character type list.

FIG. 8 is a diagram showing a description example of the inspection data file.

FIG. 9 is a diagram showing a print example of the inspection data file of FIG. 8.

FIG. 10 is a flowchart showing the main part of the procedure for creating inspection data when no candidate information table is used.

[Explanation of symbols]

10 character type extraction unit, 12 character type list, 14 candidate information table, 16 inspection data generation unit, 18 style information, 100 document file, 200 inspection data file, 202 recording medium, 204 inspection sheet.

Claims

[Claims]

1. A garbled character inspection method for inspecting garbled characters when a document created in a first information processing environment is printed in a second information processing environment, the method comprising: extracting the mutually different character types included in the document created in the first information processing environment to create inspection data; printing the inspection data; transmitting the inspection data and its print result to the second information processing environment; printing the transmitted inspection data in the second information processing environment; and inspecting for garbled characters by comparing that print result with the print result from the first information processing environment.

2. A garbled character inspection method for inspecting garbled characters when a document created in a first information processing environment is printed in a second information processing environment, the method comprising: extracting, from the document created in the first information processing environment, the character types included in predetermined inspection target character types to create inspection data; printing the inspection data; transmitting the inspection data and its print result to the second information processing environment; printing the transmitted inspection data in the second information processing environment; and inspecting for garbled characters by comparing that print result with the print result from the first information processing environment.

3. A garbled character inspection data creating device comprising: character type extracting means for extracting the mutually different character types included in document data to be printed; and data generating means for generating inspection data based on the data of the extracted character types.

4. A garbled character inspection data creating device comprising: a candidate information table in which information for specifying the character types subject to garbled character inspection is registered; character type extracting means for extracting, from document data to be printed, the character types included in the inspection target character types specified by the information in the candidate information table; and data generating means for generating inspection data based on the data of the extracted character types.

5. The garbled character inspection data creating device according to claim 3, wherein the data generating means arranges and sorts the extracted character types for each font.

6. The garbled character inspection data creating device according to claim 5, wherein the data generating means has a function of adding, to each sequence of character types arranged by font, character string data representing the font name corresponding to that sequence.

7. A recording medium on which a program for causing a computer to function as means for extracting all character types included in document data to be printed and means for generating inspection data based on the extracted character type data is recorded.

8. A recording medium on which is recorded a program for causing a computer to function as means for extracting, from document data to be printed, the character types included in a candidate information table in which the character types subject to garbled character inspection are registered, and means for generating inspection data based on the data of the extracted character types.