JP4924990B2

JP4924990B2 - Document processing apparatus and document processing program

Info

Publication number: JP4924990B2
Application number: JP2008063855A
Authority: JP
Inventors: 伸隆加藤; 昌史金子
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2008-03-13
Filing date: 2008-03-13
Publication date: 2012-04-25
Anticipated expiration: 2028-03-13
Also published as: JP2009223363A

Description

本発明は、文書処理装置および文書処理プログラムに関する。 The present invention relates to a document processing apparatus and a document processing program.

近年、スキャンした紙文書から検索可能な電子文書を生成する、といったことが行われている。検索可能な電子文書としては、サーチャブルＸＰＳ（XML Paper Specification）と呼ばれるものや、サーチャブルＰＤＦ（Portable Document Format）またはテキスト付ＰＤＦと呼ばれるもの等が知られている。これらの電子文書は、イメージレイヤとテキストレイヤとが重なるように構成されており、背景となる文書画像がイメージレイヤに表示され、その上層側のテキストレイヤに透明の検索用文字がレイアウトされるようになっている。 In recent years, a searchable electronic document is generated from a scanned paper document. As a searchable electronic document, what is called a searchable XPS (XML Paper Specification), what is called a searchable PDF (Portable Document Format), or a PDF with text is known. These electronic documents are configured so that the image layer and text layer overlap, so that the background document image is displayed on the image layer, and transparent search characters are laid out on the upper text layer. It has become.

また、スキャンした文字データについては、置き換え文字用のアウトラインフォントを用意しておき、操作者の判断および操作に基づいて、スキャンした文字と同等のサイズに変倍したアウトラインフォントに置き換えることが提案されている（例えば、特許文献１参照。）。 For scanned character data, it is suggested to prepare an outline font for replacement characters and replace it with an outline font that has been scaled to the same size as the scanned character based on the judgment and operation of the operator. (For example, refer to Patent Document 1).

特開平５−１２４０２号公報JP-A-5-12402

ところで、上述した検索可能な電子文書では、イメージレイヤとテキストレイヤとが重なる構成のため、当該イメージレイヤに表示される文書画像における文字と、当該テキストレイヤに配される検索用の文字との位置が合致していないと、当該電子文書についての検索を実施した場合に、その検索によって抽出された文字列が文書中のどこに存在しているか正しく特定できないといったことが起こり得る。特に、テキストレイヤで用いるフォントデータの種類によっては、同じポイント数であっても文字幅や文字間距離等が相違することもあるため、イメージレイヤとテキストレイヤとの間で、想定されていなかった文字の位置ずれが生じてしまうことが考えられる。 By the way, in the above-described searchable electronic document, since the image layer and the text layer overlap each other, the position of the character in the document image displayed on the image layer and the search character arranged on the text layer. If they do not match, when a search is performed on the electronic document, it may occur that the character string extracted by the search cannot be correctly specified in the document. In particular, depending on the type of font data used in the text layer, even if the number of points is the same, the character width, distance between characters, etc. may be different, so it was not assumed between the image layer and the text layer. It is conceivable that character misalignment occurs.

ここで、図５（ａ）に示す「Abc Def」という文字列を例に挙げて考える。この「Abc Def」という文字列について、図５（ｂ）に示すように、イメージレイヤにおける文字（図中における黒色文字参照）と、テキストレイヤにおける文字（図中における灰色文字参照）とは、それぞれの位置が必ずしも合致するとは限らず、それぞれの間にずれが生じてしまうことが考えられる。そして、例えばイメージレイヤにおける文字よりテキストレイヤにおける文字のほうが大きくなるようなずれが生じていると、文書出力を行うビューワによっては、図５（ｃ）に示すように、「Abc」についてのハイライト表示部分が「Def」についてのハイライト表示部分に被ってしまう、いわゆる文字被りが発生してしまうおそれがある。つまり、イメージレイヤとテキストレイヤとの間で文字の位置ずれが生じていると、テキスト検索が可能であっても、その検索結果についての文字選択がし辛かったり、文字位置特定がし難くなったりするおそれがある。 Here, a character string “Abc Def” shown in FIG. Regarding the character string “Abc Def”, as shown in FIG. 5B, a character in the image layer (see black characters in the figure) and a character in the text layer (see gray characters in the figure) are respectively It is conceivable that the positions of do not necessarily match, and a shift occurs between them. Then, for example, if there is a shift that causes the characters in the text layer to be larger than the characters in the image layer, depending on the viewer that outputs the document, as shown in FIG. There is a possibility that a so-called character covering, in which the display part covers the highlight display part of “Def”, may occur. In other words, if there is a character misalignment between the image layer and the text layer, even if text search is possible, it is difficult to select characters for the search results, and it is difficult to specify the character position. There is a risk.

この点については、上記特許文献１に開示されているようにフォント置き換えの際に文字毎にサイズ変倍を行うようにしても、必ずしも解消し得るとは限らない。文字毎のサイズ変倍では、文字列を構成する各文字が固定ピッチとなるため、当該文字列の先頭と後尾の位置が合っても一文字毎の位置ずれが発生し得るからであり、また文字列内でフォント種類やポイント数が異なるものに認識される可能性があり、結果として文字列全体としてのバランスが崩れて見栄えが悪くなることもあり得るからである。さらに、上記特許文献１に開示された技術では、操作者の判断および操作を必要とするため、当該操作者、すなわち電子文書の利用者が煩わしさを感じてしまうことも考えられる。 Regarding this point, even if the size is changed for each character at the time of font replacement as disclosed in Patent Document 1, it is not always possible to eliminate this problem. This is because each character constituting the character string has a fixed pitch in the size scaling for each character, so that even if the position of the beginning and the end of the character string match, a positional shift for each character may occur. This is because there is a possibility that the font type and the number of points in the column are different, and as a result, the balance of the entire character string may be lost and the appearance may deteriorate. Furthermore, since the technique disclosed in Patent Document 1 requires the operator's judgment and operation, the operator, that is, the user of the electronic document may feel annoyed.

そこで、本発明は、イメージレイヤとテキストレイヤとが重なる構成の電子文書について、これを生成または画像出力する際に、イメージレイヤとテキストレイヤとの間での文字の位置ずれを抑制する文書処理装置および文書処理プログラムを提供することを目的とする。 Therefore, the present invention provides a document processing device that suppresses character positional deviation between an image layer and a text layer when generating or outputting an electronic document having a configuration in which an image layer and a text layer overlap. And a document processing program.

請求項１に係る発明は、イメージレイヤとテキストレイヤとが重なる構成の電子文書について、当該電子文書における所定単位の文字列毎に、当該文字列の前記イメージレイヤにおける描画領域に関する位置情報を取得する領域情報取得手段と、前記文字列を構成する各文字について、前記テキストレイヤで使用されるフォントのメトリクス情報を取得するフォント情報取得手段と、前記領域情報取得手段での情報取得結果から特定される前記文字列の描画領域の大きさと前記フォント情報取得手段での情報取得結果から特定される当該文字列のフォントの大きさとの比率を算出する描画計算手段と、前記描画計算手段での算出結果に基づいて、前記文字列の描画領域の大きさと当該文字列のフォントの大きさとが合うように、当該フォントの大きさを変倍して前記テキストレイヤについての描画を行う描画処理手段とを備えることを特徴とする文書処理装置である。
請求項２に係る発明は、コンピュータを、イメージレイヤとテキストレイヤとが重なる構成の電子文書について、当該電子文書における所定単位の文字列毎に、当該文字列の前記イメージレイヤにおける描画領域に関する位置情報を取得する領域情報取得手段と、前記文字列を構成する各文字について、前記テキストレイヤで使用されるフォントのメトリクス情報を取得するフォント情報取得手段と、前記領域情報取得手段での情報取得結果から特定される前記文字列の描画領域の大きさと前記フォント情報取得手段での情報取得結果から特定される当該文字列のフォントの大きさとの比率を算出する描画計算手段と、前記描画計算手段での算出結果に基づいて、前記文字列の描画領域の大きさと当該文字列のフォントの大きさとが合うように、当該フォントの大きさを変倍して前記テキストレイヤについての描画を行う描画処理手段として機能させることを特徴とする文書処理プログラムである。 The invention according to claim 1 acquires, for an electronic document having a configuration in which an image layer and a text layer overlap, for each character string of a predetermined unit in the electronic document, positional information regarding the drawing area in the image layer of the character string. The area information acquisition means, the font information acquisition means for acquiring the metric information of the font used in the text layer for each character constituting the character string, and the information acquisition result by the area information acquisition means The drawing calculation means for calculating the ratio between the size of the drawing area of the character string and the font size of the character string specified from the information acquisition result in the font information acquisition means, and the calculation result in the drawing calculation means Based on the size of the drawing area of the character string and the font size of the character string. A document processing apparatus characterized by comprising a drawing processing means to scale the size for drawing the said text layer.
According to a second aspect of the present invention, for an electronic document having a configuration in which an image layer and a text layer overlap each other, for each character string of a predetermined unit in the electronic document, positional information regarding a drawing area in the image layer of the character string From the information acquisition result in the area information acquisition means, the font information acquisition means for acquiring the metric information of the font used in the text layer for each character constituting the character string, and the information acquisition result in the area information acquisition means A drawing calculation unit that calculates a ratio between the size of the drawing area of the specified character string and the font size of the character string specified from the information acquisition result in the font information acquisition unit; Based on the calculation result, the size of the drawing area of the character string matches the font size of the character string. A document processing program for causing to function as drawing processing means to scale the size of the font for drawing for the text layer.

請求項１，２に係る発明によれば、利用者の判断や操作等を必要とすることなく、またテキストレイヤで使用されるフォント種類にもよらずに、所定単位の文字列について、イメージレイヤにおける文字とテキストレイヤにおける文字との描画位置および大きさを合わせることができ、当該イメージレイヤと当該テキストレイヤとが重なる構成の電子文書の閲覧や検索等をする利用者にとっての利便性向上に貢献することが可能となる。さらには、例えば外国語文（特に、アルファベットにより記述される英文）のように文字毎に字幅や文字間距離等が異なるフォント（プロポーショナルフォント）を用いた場合について、一文字毎の位置ずれ調整をする場合に比べて、文字列のバランスを崩すことなく見栄えの良いものとすることができる。 According to the first and second aspects of the present invention, an image layer is used for character strings in a predetermined unit without requiring user judgment or operation, and without depending on the font type used in the text layer. The drawing position and size of characters in the text layer and characters in the text layer can be matched, contributing to improved convenience for users who browse and search electronic documents with the image layer and text layer overlapping. It becomes possible to do. Furthermore, for example, when using fonts (proportional fonts) with different character widths and distances between characters, such as foreign language sentences (particularly English written in alphabets), the positional deviation of each character is adjusted. Compared to the case, it is possible to improve the appearance without breaking the balance of the character string.

以下、図面に基づき本発明に係る文書処理装置および文書処理プログラムについて説明する。 Hereinafter, a document processing apparatus and a document processing program according to the present invention will be described with reference to the drawings.

先ず、文書処理装置の機能構成例について説明する。ここで例に挙げて説明する文書処理装置は、スキャンした紙文書から、検索可能な電子文書、すなわちイメージレイヤとテキストレイヤとが重なる構成の電子文書を生成するものである。このような文書処理装置としては、スキャン機能およびデータ処理機能を有したデジタル複写機、当該複写機としての機能に他装置（プリンタ装置やファクシミリ装置等）としての機能を統合したもの、スキャナ装置に接続して用いられるコンピュータ装置等が挙げられる。 First, a functional configuration example of the document processing apparatus will be described. The document processing apparatus described as an example here generates a searchable electronic document, that is, an electronic document having a configuration in which an image layer and a text layer overlap from a scanned paper document. Such a document processing apparatus includes a digital copying machine having a scanning function and a data processing function, a function in which the function as the copying machine is integrated with a function as another device (printer device, facsimile device, etc.), and a scanner device. For example, a computer device that is used in connection.

図１は、本発明に係る文書処理装置の機能構成例を示すブロック図である。
図例の文書処理装置は、画像入力部１と、設定部２と、画像処理部３と、蓄積部４と、描画処理部５と、データ転送部６と、を備えて構成されている。 FIG. 1 is a block diagram showing a functional configuration example of a document processing apparatus according to the present invention.
The document processing apparatus shown in the figure includes an image input unit 1, a setting unit 2, an image processing unit 3, a storage unit 4, a drawing processing unit 5, and a data transfer unit 6.

画像入力部１は、例えばスキャナ装置としての機能によって実現されるもので、原稿となる紙文書に対するスキャンを行って、当該紙文書からの画像データの読み取りを行うものである。 The image input unit 1 is realized by, for example, a function as a scanner device, and scans a paper document that is a document and reads image data from the paper document.

設定部２は、例えば文書処理装置の利用者が操作するユーザインタフェースパネルによって実現されるもので、当該利用者が、画像入力部１での画像データの読み取りや画像処理部３での画像データの処理に必要となるパラメータ設定を行うためのものである。 The setting unit 2 is realized by, for example, a user interface panel operated by a user of the document processing apparatus. The user can read image data with the image input unit 1 or read image data with the image processing unit 3. This is for setting parameters necessary for processing.

画像処理部３は、所定プログラムを実行するコンピュータ装置としての機能によって実現されるもので、画像入力部１が読み取った画像データに対して、所定の画像処理を行うものである。
画像処理部３が行う画像処理としては、その一つに、画像データに対する文字認識（Optical Character Reader、以下「ＯＣＲ」と略す。）処理がある。すなわち、画像処理部３は、文字認識手段３ａとしての機能を備えている。この文字認識手段３ａは、画像処理部３が所定プログラム（例えば、ＯＣＲ用ソフトウエア）を実行することによって実現されるものである。
なお、文字認識手段３ａが行うＯＣＲ処理の手法については、公知技術を利用すればよいため、ここではその詳細な説明を省略する。
また、画像処理部３が行うＯＣＲ処理以外の画像処理についても、公知技術を利用したものであればよく、ここではその詳細な説明を省略する。 The image processing unit 3 is realized by a function as a computer device that executes a predetermined program, and performs predetermined image processing on the image data read by the image input unit 1.
Image processing performed by the image processing unit 3 includes character recognition (Optical Character Reader, hereinafter abbreviated as “OCR”) processing for image data. That is, the image processing unit 3 has a function as the character recognition means 3a. The character recognition means 3a is realized by the image processing unit 3 executing a predetermined program (for example, OCR software).
The OCR processing method performed by the character recognition unit 3a may be a known technique, and thus detailed description thereof is omitted here.
Further, the image processing other than the OCR processing performed by the image processing unit 3 may be any one using a known technique, and detailed description thereof is omitted here.

蓄積部４は、例えばハードディスク装置といった記憶装置によって実現されるもので、各種情報の記憶蓄積を行うものである。
この蓄積部４が記憶蓄積する各種情報としては、例えば画像入力部１が読み取った画像データまたは画像処理部３での画像処理後の画像データが挙げられる。また、文字認識手段３ａによる文字認識結果に関する情報についても、ここでいう各種情報に含まれる。
さらには、文書処理装置を機能させるために必要となる所定プログラムや、文書画像を作成する上で必要となるフォントデータ４ａ等も、ここでいう各種情報に含まれるものとする。すなわち、蓄積部４は、フォントデータ４ａを記憶蓄積しているものとする。
なお、ここでいうフォントデータ４ａは、フォントそのものを特定するデータの他に、当該フォントのメトリクス（メトリック）情報をも含む。メトリクス情報とは、フォントが占めるスペースの大きさを定義する情報で、カーニング情報も含まれる。 The storage unit 4 is realized by a storage device such as a hard disk device, and stores various types of information.
Examples of various information stored and accumulated by the accumulation unit 4 include image data read by the image input unit 1 or image data after image processing by the image processing unit 3. Information relating to the character recognition result by the character recognition means 3a is also included in the various types of information referred to herein.
Furthermore, it is assumed that a predetermined program necessary for causing the document processing apparatus to function, font data 4a necessary for creating a document image, and the like are also included in the various types of information herein. That is, the storage unit 4 stores and stores font data 4a.
Here, the font data 4a includes metric information of the font in addition to data specifying the font itself. Metric information is information that defines the amount of space occupied by a font, and includes kerning information.

描画処理部５は、所定プログラムを実行するコンピュータ装置としての機能によって実現されるもので、画像入力部１での画像読み取り結果や画像処理部３での画像処理結果等を用いて、イメージレイヤとテキストレイヤとが重なる構成の電子文書の生成を行うものである。
ただし、描画処理部５は、電子文書の生成を行うために、領域情報取得手段５ａ、フォント情報取得手段５ｂ、描画計算手段５ｃおよび描画処理手段５ｄとしての機能を備えている。
領域情報取得手段５ａは、生成すべき電子文書における所定単位の文字列毎に、当該文字列のイメージレイヤにおける描画領域に関する位置情報を取得するものである。位置情報は、詳細を後述するように、画像処理部３の文字認識手段３ａから取得することが考えられる。また、文字列の所定単位としては、文字認識手段３ａでの文字認識結果から特定される単語単位とすることが考えられるが、必ずしも単語単位である必要はなく、文字認識手段３ａでの文字認識結果から特定される文節単位や行単位等といった他の単位であっても構わない。
フォント情報取得手段５ｂは、所定単位の文字列を構成する各文字について、テキストレイヤで使用されるフォントのメトリクス情報を取得するものである。メトリクス情報の取得は、蓄積部４のフォントデータ４ａにアクセスすることによって行うことが考えられる。
描画計算手段５ｃは、領域情報取得手段５ａでの情報取得結果から特定される文字列の描画領域の大きさと、フォント情報取得手段５ｂでの情報取得結果から特定される当該文字列のフォントの大きさとについて、これらの比率を算出するものである。
描画処理手段５ｄは、電子文書生成のための描画処理を行うものである。ただし、描画処理手段５ｄでは、描画計算手段５ｃでの算出結果に基づいて、所定単位の文字列の描画領域の大きさと当該文字列のフォントの大きさとが合うように、当該フォントの大きさを変倍して、テキストレイヤについての描画を行うようになっている。
なお、描画処理部５が生成する電子文書は、イメージレイヤとテキストレイヤとが重なる構成のものであれば、そのデータフォーマットが特に限定されることはなく、例えばサーチャブルＸＰＳに準拠したものであってもよいし、サーチャブルＰＤＦに準拠したものであってもよいし、あるいはこれら以外のデータフォーマットに準拠したものであってもよい。 The drawing processing unit 5 is realized by a function as a computer device that executes a predetermined program, and uses an image reading result in the image input unit 1, an image processing result in the image processing unit 3, etc. An electronic document having a configuration overlapping with a text layer is generated.
However, the drawing processing unit 5 has functions as a region information acquisition unit 5a, a font information acquisition unit 5b, a drawing calculation unit 5c, and a drawing processing unit 5d in order to generate an electronic document.
The area information acquisition unit 5a acquires position information regarding a drawing area in the image layer of the character string for each character string of a predetermined unit in the electronic document to be generated. It is conceivable that the position information is acquired from the character recognition means 3a of the image processing unit 3 as will be described in detail later. Further, the predetermined unit of the character string may be a word unit specified from the character recognition result in the character recognition unit 3a, but is not necessarily a word unit, and the character recognition in the character recognition unit 3a is not necessarily performed. Other units such as a phrase unit or a line unit specified from the result may be used.
The font information acquisition means 5b acquires the metric information of the font used in the text layer for each character constituting a predetermined unit character string. It is conceivable that the metrics information is acquired by accessing the font data 4a of the storage unit 4.
The drawing calculation unit 5c determines the size of the drawing region of the character string specified from the information acquisition result in the region information acquisition unit 5a and the font size of the character string specified from the information acquisition result in the font information acquisition unit 5b. For Sato, these ratios are calculated.
The drawing processing unit 5d performs drawing processing for generating an electronic document. However, in the drawing processing means 5d, the size of the font is adjusted so that the size of the drawing area of the character string in a predetermined unit matches the font size of the character string based on the calculation result in the drawing calculation means 5c. Scaling to draw on the text layer.
The electronic document generated by the drawing processing unit 5 is not particularly limited as long as the image layer and the text layer overlap each other. For example, the electronic document conforms to the searchable XPS. Alternatively, it may be compliant with the searchable PDF, or may be compliant with other data formats.

データ転送部６は、描画処理部５が生成した電子文書について、これをその出力先である外部装置に対して転送するものである。外部装置としては、電子文書の表示出力を行う表示装置、当該電子文書の印刷出力を行う印刷装置、当該電子文書を記憶蓄積するファイルサーバ表示装置等が挙げられるが、特に限定されるものではない。また、当該外部装置への電子文書の転送については、公知のデータ転送技術を用いればよいため、ここでその詳細な説明を省略する。 The data transfer unit 6 transfers the electronic document generated by the drawing processing unit 5 to an external device that is the output destination. Examples of the external device include a display device that displays and outputs an electronic document, a printing device that prints and outputs the electronic document, and a file server display device that stores and stores the electronic document, but is not particularly limited. . In addition, since a known data transfer technique may be used for transferring the electronic document to the external apparatus, detailed description thereof is omitted here.

以上のような構成の文書処理装置において、特に描画処理部５が備える各手段５ａ〜５ｄは、当該文書処理装置におけるコンピュータとしての機能が、所定プログラムを実行することによって実現されるものとする。つまり、の文書処理装置は、所定プログラムを実行するＣＰＵ（Central Processing Unit）や当該所定プログラムを記憶する記憶装置等を備え、当該所定プログラムの実行によって種々の機能を実現し得るように構成されており、このような文書処理装置上で実現される上述の各手段５ａ〜５ｄは、当該文書処理装置にインストールされた所定プログラム（文書処理プログラム）によって実現されるものとする。なお、当該文書処理プログラムは、文書処理装置へのインストールに先立ち、コンピュータ読み取り可能な記憶媒体に格納されて提供されるものであっても、または通信回線を介して外部から配信されるものであってもよい。 In the document processing apparatus having the above-described configuration, each of the units 5a to 5d included in the drawing processing unit 5 is assumed to be realized by executing a predetermined program as a computer function in the document processing apparatus. That is, the document processing apparatus includes a CPU (Central Processing Unit) that executes a predetermined program, a storage device that stores the predetermined program, and the like, and is configured to realize various functions by executing the predetermined program. The above-described means 5a to 5d realized on such a document processing apparatus are realized by a predetermined program (document processing program) installed in the document processing apparatus. The document processing program may be provided by being stored in a computer-readable storage medium prior to installation in the document processing apparatus, or distributed from the outside via a communication line. May be.

次に、以上のように構成された文書処理装置における処理動作例について説明する。
図２は、本発明に係る文書処理装置の処理動作例を示すフローチャートである。 Next, an example of processing operation in the document processing apparatus configured as described above will be described.
FIG. 2 is a flowchart showing an example of processing operation of the document processing apparatus according to the present invention.

上述した構成の文書処理装置では、電子文書を生成するのにあたり、先ず、当該文書処理装置の利用者が当該電子文書の基（原稿）となる紙文書を用意して、当該紙文書を画像入力部１にセットするとともに、設定部２でのパラメータ設定を行い、その後にスタートボタン押下等による動作開始指示を行う（ステップ０１、以下ステップを「Ｓ」と略す。）。利用者による動作開始指示があると、文書処理装置では、設定部２で設定されたパラメータ（カラー／白黒の別や解像度の指定等）に従いつつ、画像入力部１がセットされた紙文書からの画像データの読み取りを当該紙文書の各ページについて行い（Ｓ０２）、そのページ毎の画像データに対して画像処理部３が所定の画像処理（解像度変換や色補正等）を行い（Ｓ０３）、さらに蓄積部４が画像処理後の画像データの記憶蓄積を行う（Ｓ０４）。そして、原稿となる紙文書の全ページについての処理が終了するまで（Ｓ０５）、上述した一連の処理を繰り返し行う（Ｓ０２〜Ｓ０５）。 In the document processing apparatus having the above-described configuration, when generating an electronic document, first, a user of the document processing apparatus prepares a paper document as a base (original) of the electronic document and inputs the paper document as an image. In addition to setting in the unit 1, parameter setting is performed in the setting unit 2, and then an operation start instruction is issued by pressing the start button or the like (step 01, step is hereinafter abbreviated as “S”). When the user gives an operation start instruction, the document processing apparatus follows the parameters set by the setting unit 2 (color / black and white, resolution designation, etc.), and from the paper document in which the image input unit 1 is set. The image data is read for each page of the paper document (S02), the image processing unit 3 performs predetermined image processing (resolution conversion, color correction, etc.) on the image data for each page (S03), and further The storage unit 4 stores and stores the image data after image processing (S04). Then, the above-described series of processing is repeated (S02 to S05) until the processing for all the pages of the paper document as the manuscript is completed (S05).

その後、文書処理装置では、蓄積部４が記憶蓄積している画像データ（すなわち、イメージレイヤに表示される文書画像を特定する画像データ）について、当該画像データにはテキスト（文字）部分とイメージ（画像）部分との両方が含まれている場合があることから、ＯＣＲ処理として、画像処理部３が、１ページ分毎に、テキスト部分とイメージ部分との分離を行う。そして、テキスト部分については、文字認識手段３ａがＯＣＲ処理を行う（Ｓ０６）。テキスト／イメージ分離処理およびＯＣＲ処理の手法は、いずれも、公知技術を利用すればよい。
このＯＣＲ処理によって、文字認識手段３ａは、テキスト部分を構成する文字列を、所定単位である単語単位で、抽出することになる。ここでは、所定単位が単語単位である場合を例に挙げるが、当該所定単位は、予め設定されているものであれば、既に述べたように、文節単位や行単位等であっても構わない。なお、ここで例に挙げる「単語」とは、それぞれ意味をもって文節を構成する一つ一つの言葉のことである。
さらに、このＯＣＲ処理によって、文字認識手段３ａは、単語単位での文字列の抽出に併せて、当該文字列の描画領域に関する位置情報をも、抽出することになる。文字列の描画領域に関する位置情報とは、画像１ページ分上にて当該文字列を描画すべき領域の大きさを特定するための情報のことをいい、具体的には当該文字列が属する矩形領域の左上座標値および右下座標値からなる情報が挙げられる。ただし、当該文字列の描画領域の大きさを特定し得るものであれば、必ずしも矩形領域の左上座標値および右下座標値からなる情報に限定されることはなく、他の情報（例えば、左下座標値および領域幅の値からなる情報）を用いても構わない。 Thereafter, in the document processing apparatus, for the image data stored and accumulated in the accumulation unit 4 (that is, image data specifying a document image displayed on the image layer), the image data includes a text (character) portion and an image ( Since there are cases where both the (image) portion is included, the image processing unit 3 separates the text portion and the image portion for each page as OCR processing. And about the text part, the character recognition means 3a performs an OCR process (S06). Any known technique may be used for the text / image separation process and the OCR process.
By this OCR processing, the character recognition means 3a extracts the character string constituting the text portion in units of words that are predetermined units. Here, a case where the predetermined unit is a word unit will be described as an example. However, as long as the predetermined unit is set in advance, it may be a phrase unit or a line unit as described above. . The “words” mentioned here are each one of the words that make up a phrase with meaning.
Furthermore, by this OCR process, the character recognition means 3a extracts the position information regarding the drawing area of the character string in conjunction with the extraction of the character string in units of words. The position information related to the drawing area of the character string refers to information for specifying the size of the area in which the character string is to be drawn on one page of the image, specifically, the rectangle to which the character string belongs. Information including the upper left coordinate value and the lower right coordinate value of the region is given. However, as long as the size of the drawing area of the character string can be specified, the information is not necessarily limited to the information including the upper left coordinate value and the lower right coordinate value of the rectangular area, and other information (for example, lower left Information consisting of coordinate values and area width values) may be used.

文字認識手段３ａでのＯＣＲ処理の結果、テキスト部分を構成する文字列が抽出された場合には（Ｓ０７）、文書処理装置では、続いて、電子文書を構成するテキストレイヤとなる部分の生成のために、描画処理部５が蓄積部４に記憶蓄積されているフォントデータ４ａを用いて当該文字列についての描画処理を行う。このとき、描画処理部５は、単語単位での文字列毎に、以下に述べるような処理を行う。 When a character string constituting the text part is extracted as a result of the OCR process in the character recognition means 3a (S07), the document processing apparatus subsequently generates a part to be a text layer constituting the electronic document. Therefore, the drawing processing unit 5 performs drawing processing for the character string using the font data 4 a stored and accumulated in the accumulation unit 4. At this time, the drawing processing unit 5 performs processing as described below for each character string in units of words.

すなわち、描画処理部５では、描画処理対象となる単語単位の文字列（以下、単に「処理対象文字列」という。）について、領域情報取得手段５ａがその描画領域に関する位置情報を文字認識手段３ａから取得するとともに、フォント情報取得手段５ｂが蓄積部４のフォントデータ４ａにアクセスして当該処理対象文字列を構成する各文字について使用されるフォントのメトリクス情報を取得する。そして、領域情報取得手段５ａが描画領域に関する位置情報を取得し、フォント情報取得手段５ｂがフォントのメトリクス情報を取得すると、描画計算手段５ｃが、領域情報取得手段５ａでの情報取得結果から特定される処理対象文字列の描画領域の大きさと、フォント情報取得手段５ｂでの情報取得結果から特定される当該処理対象文字列を構成する各文字のフォントの大きさとについて、これらの比率を算出する（Ｓ０８）。
このような比率の算出を描画計算手段５ｃが行うと、描画処理部５では、描画処理手段５ｄが処理対象文字列を構成する各文字の描画処理を行う。ただし、このとき、描画処理手段５ｄは、描画計算手段５ｃでの算出結果に基づいて、当該処理対象文字列の描画領域の大きさと当該処理対象文字列を構成する各文字のフォントの大きさとが合うように、当該フォントの大きさを変倍して、当該各文字の描画を行う（Ｓ０９）。
描画処理部５では、以上のような処理対象文字列についての描画処理を、文字認識手段３ａが抽出した全ての文字列について終了するまで、繰り返し行う（Ｓ０７〜Ｓ０９）。なお、描画処理部５による各文字の描画結果（フォント文字画像の展開結果）は、例えば蓄積部４内に確保されたバッファ領域に保存しておくことが考えられる。 That is, in the drawing processing unit 5, for a character string in units of words (hereinafter simply referred to as “processing target character string”) to be drawn, the region information acquisition unit 5 a obtains position information regarding the drawing region as character recognition unit 3 a. And the font information acquisition means 5b accesses the font data 4a of the storage unit 4 to acquire the metric information of the font used for each character constituting the processing target character string. Then, when the area information acquisition unit 5a acquires the position information regarding the drawing area and the font information acquisition unit 5b acquires the metrics information of the font, the drawing calculation unit 5c is identified from the information acquisition result by the area information acquisition unit 5a. These ratios are calculated for the size of the drawing area of the processing target character string and the font size of each character constituting the processing target character string specified from the information acquisition result in the font information acquisition means 5b ( S08).
When the drawing calculation unit 5c calculates such a ratio, in the drawing processing unit 5, the drawing processing unit 5d performs drawing processing of each character constituting the processing target character string. However, at this time, the drawing processing unit 5d determines the size of the drawing area of the processing target character string and the size of the font of each character constituting the processing target character string based on the calculation result of the drawing calculation unit 5c. The size of the font is changed so as to match, and the characters are drawn (S09).
The drawing processing unit 5 repeatedly performs the drawing process for the processing target character string as described above until all the character strings extracted by the character recognition unit 3a are finished (S07 to S09). It should be noted that the drawing result of each character (font character image development result) by the drawing processing unit 5 may be stored in a buffer area secured in the storage unit 4, for example.

全ての文字列についての描画処理を終了すると、その後、文書処理装置では、描画処理部５が、当該描画処理の結果と蓄積部４が記憶蓄積している画像データとについて、イメージレイヤとテキストレイヤとが重なる構成の電子文書としてのフォーマット化を行う（Ｓ１０）。つまり、描画処理部５は、当該描画処理の結果と当該画像データとを基にして、イメージレイヤとテキストレイヤとが重なる構成の電子文書の生成を行うのである。具体的には、例えばサーチャブルＸＰＳに準拠する場合であれば、イメージレイヤに表示される背景となる文書画像データ、その上層側のテキストレイヤに表示される検索用文字、および、その検索用文字として使用されるフォントデータそのものを、それぞれフォーマット化して電子文書の生成を行う。また、例えばサーチャブルＰＤＦに準拠する場合であれば、イメージレイヤに表示される背景となる文書画像データ、および、その上層側のテキストレイヤに表示される検索用文字を、それぞれフォーマット化して電子文書の生成を行う。なお、このときのフォーマット化の手法およびフォーマットそのものについては、公知技術を利用したものであればよく、ここではその詳細な説明を省略する。 When the drawing processing for all the character strings is completed, the drawing processing unit 5 thereafter performs an image layer and a text layer on the result of the drawing processing and the image data stored and accumulated in the storage unit 4 in the document processing apparatus. Is formatted as an electronic document having an overlapping structure (S10). That is, the drawing processing unit 5 generates an electronic document having a configuration in which the image layer and the text layer overlap based on the result of the drawing process and the image data. Specifically, for example, when conforming to the searchable XPS, the document image data as the background displayed in the image layer, the search characters displayed in the upper text layer, and the search characters The font data itself is formatted to generate an electronic document. For example, when conforming to the searchable PDF, the document image data as the background displayed in the image layer and the search characters displayed in the text layer on the upper layer are respectively formatted to form an electronic document. Generate. It should be noted that the formatting method and the format itself at this time may be those utilizing a known technique, and detailed description thereof is omitted here.

そして、描画処理部５が電子文書の生成を行うと、文書処理装置では、データ転送部６が当該電子文書をその出力先である外部装置に対して転送する（Ｓ１１）。すなわち、データ転送部６は、当該電子文書についてのデータ転送を転送すべきデータがなくなるまで継続的に行い、転送すべきデータがなくなると当該データ転送を完了する。 When the drawing processing unit 5 generates an electronic document, in the document processing apparatus, the data transfer unit 6 transfers the electronic document to the external device that is the output destination (S11). That is, the data transfer unit 6 continuously performs data transfer for the electronic document until there is no more data to be transferred, and completes the data transfer when there is no more data to be transferred.

次に、以上のような一連の処理動作例のうち、描画処理部５が行う処理動作例について、具体例を挙げてさらに詳しく説明する。
図３および図４は、文字描画処理の一具体例を示す説明図である。 Next, among the above-described series of processing operation examples, the processing operation examples performed by the drawing processing unit 5 will be described in more detail with specific examples.
3 and 4 are explanatory diagrams showing a specific example of the character drawing process.

例えば、図３（ａ）に示す「Abc」という文字列を例に挙げて考える。この「Abc」という文字列について、描画処理部５は、図３（ｂ）に示すように、その描画領域に関する位置情報として、当該「Abc」という文字列を描画すべき矩形領域の左上座標値（X₀，Y₀）および右下座標値（X₁，Y₁）を取得する。さらには、当該文字列を構成する「A」、「b」および「c」の各文字について使用されるフォントのメトリクス情報として、当該各文字の高さ方向寸法値Ｈとそれぞれの幅方向寸法値Wa，Wb，Wcを取得する。 For example, consider the character string “Abc” shown in FIG. For the character string “Abc”, the drawing processing unit 5 uses the upper left coordinate value of the rectangular area in which the character string “Abc” should be drawn as position information related to the drawing area, as shown in FIG. Get (X ₀ , Y ₀ ) and lower right coordinate (X ₁ , Y ₁ ). Furthermore, as the metric information of the font used for each of the characters “A”, “b” and “c” constituting the character string, the height direction dimension value H and the width direction dimension value of each character Get Wa, Wb, Wc.

ここで、各文字についてのフォントは、蓄積部４が記憶蓄積しているフォントデータ４ａが一種類のみであれば、そのフォントデータ４ａによるものが使用される。また、蓄積部４が複数種類のフォントデータ４ａを記憶蓄積している場合であれば、所定基準に基づいて選択された種類のフォントデータ４ａによるものが使用される。なお、所定基準としては、原画像の文字形状との類似度によるものや、予め設定された各種類別の優先度によるもの等を用いることが考えられるが、特に限定されるものではなく、他の公知技術によるものであっても構わない。 Here, as for the font for each character, if there is only one type of font data 4a stored and stored in the storage unit 4, the font based on the font data 4a is used. If the storage unit 4 stores and stores a plurality of types of font data 4a, the type of font data 4a selected based on a predetermined standard is used. Note that, as the predetermined standard, it is conceivable to use the one based on the similarity to the character shape of the original image, or one based on the preset priority for each type, but is not particularly limited. It may be based on a known technique.

ところで、フォントデータ４ａの種類によっては、同じポイント数であっても、文字幅や文字間距離等が相違することが知られている。そのため、「A」、「b」および「c」の各文字について使用されるフォントをそのまま描画すると、図３（ｃ）に示すように、電子文書の生成後において、イメージレイヤとテキストレイヤとの間で、想定されていなかった文字の位置ずれが生じてしまうことが考えられる。 By the way, it is known that, depending on the type of font data 4a, the character width, the distance between characters, and the like are different even with the same number of points. Therefore, when the fonts used for the characters “A”, “b”, and “c” are drawn as they are, as shown in FIG. It is conceivable that an unforeseen character misalignment occurs.

このことから、描画処理部５では、図４（ａ）に示す「Abc」という文字列であれば、当該「Abc」という文字列を描画すべき矩形領域の左上座標値（X₀，Y₀）および右下座標値（X₁，Y₁）から、座標値X₀と座標値X₁との差の絶対値を算出して、当該矩形領域の幅方向の大きさを求める。さらには、「A」、「b」および「c」の各文字の幅方向寸法値Wa，Wb，Wcから、これらの和を算出して、当該各文字のフォント群の幅方向の大きさを求める。そして、図４（ｂ）に示すように、各文字のフォント群の幅方向の大きさ［Wa＋Wb＋Wc］について、これを矩形領域の幅方向の大きさ｜X₁−X₀｜で除して、これらの間の比率Magを算出する。 Therefore, in the drawing processing unit 5, if the character string “Abc” shown in FIG. 4A is used, the upper left coordinate value (X ₀ , Y _{0) of the} rectangular area in which the character string “Abc” is to be drawn. ) And the lower right coordinate value (X ₁ , Y ₁ ), the absolute value of the difference between the coordinate value X ₀ and the coordinate value X ₁ is calculated to determine the size of the rectangular area in the width direction. Furthermore, the sum of these from the width direction dimension values Wa, Wb, Wc of each character of “A”, “b” and “c” is calculated, and the size of the font group of each character in the width direction is calculated. Ask. Then, as shown in FIG. 4B, the size [Wa + Wb + Wc] of the font group of each character is divided by the size | X ₁ −X ₀ | of the width direction of the rectangular area, The ratio Mag between these is calculated.

その後、描画処理部５では、「A」、「b」および「c」の各文字について使用されるそれぞれのフォントに対して、算出した比率Magを変倍率として用いて変倍（拡大または縮小のいずれか）を行って、当該フォントについての描画を行う。つまり、描画処理部５は、図４（ｃ）に示すように、「Abc」という文字列の描画領域の大きさ｜X₁−X₀｜と、変倍後における「A」、「b」および「c」の各文字のフォント群の大きさ［Wa*Mag＋Wb*Mag＋Wc*Mag］とが合うように、各フォントの大きさを変倍して、当該「A」、「b」および「c」の各文字の描画を行うのである。 Thereafter, the drawing processing unit 5 uses the calculated ratio Mag as a scaling factor for each of the fonts used for the characters “A”, “b”, and “c”. Any one) to draw the font. That is, as shown in FIG. 4C, the drawing processing unit 5 determines the drawing area size | X ₁ −X ₀ | of the character string “Abc” and “A” and “b” after scaling. And the size of each font so that it matches the size of the font group [Wa * Mag + Wb * Mag + Wc * Mag] of each character of “c”, and the “A”, “b” and “c” "Is drawn.

なお、ここでは、各文字の幅方向寸法のみを変倍の対象とし、各文字の高さ方向寸法値Ｈについては変倍の対象としていないが、当該高さ方向寸法値Ｈについても変倍の対象としても構わない。すなわち、高さ方向寸法値Ｈを矩形領域の高さ方向の大きさ｜Y₀−Y₁｜で除して、これらの間の変倍率を求め、その変倍率をフォントの描画に反映させることも考えられる。 Here, only the width direction dimension of each character is the object of scaling, and the height direction dimension value H of each character is not subject to scaling, but the height direction dimension value H is also subject to scaling. It does not matter as the target. That is, the height direction dimension value H is divided by the height direction size | Y ₀ −Y ₁ | of the rectangular area to obtain a scaling factor between them, and the scaling factor is reflected in the font drawing. Is also possible.

以上のような処理手順を経て生成される電子文書は、文書処理装置の利用者による文字変倍のための判断や操作等を必要とすることなく、またテキストレイヤで使用されるフォント種類にもよらずに、所定単位の一例である単語単位の文字列について、イメージレイヤにおける文字とテキストレイヤにおける文字との描画位置および大きさが合致したものとなる。したがって、例えば外国語文（英文、仏文、独文等。特に、アルファベットにより記述される英文。）のように文字毎に字幅や文字間距離等が異なるフォント（プロポーショナルフォント）を用いた場合であっても、一文字毎の位置ずれ調整をする場合に比べて、文字列単位でバランスよく表示できるようになる。 The electronic document generated through the processing procedure described above does not require judgment or operation for character scaling by the user of the document processing apparatus, and the font type used in the text layer. Regardless, for the character string in units of words, which is an example of the predetermined unit, the drawing positions and sizes of the characters in the image layer and the characters in the text layer match. Therefore, for example, when using fonts (proportional fonts) with different character widths and distances between characters, such as foreign language sentences (English, French, German, etc., especially English written in alphabets). However, as compared with the case of adjusting the positional deviation for each character, the character string unit can be displayed in a balanced manner.

なお、本実施形態では、本発明の好適な実施具体例について説明したが、本発明はその内容に限定されるものではない。 In addition, although this embodiment demonstrated the suitable Example of this invention, this invention is not limited to the content.

例えば、本実施形態では、電子文書の生成を行う場合を例に挙げて説明したが、電子文書の画像出力を行う場合についても、全く同様に本発明を適用することが考えられる。すなわち、イメージレイヤとテキストレイヤとが重なる構成の電子文書につき、その表示出力または印刷出力を行う文書処理装置または文書処理プログラムにおいて、当該表示出力または当該印刷出力のための出力データを生成するのにあたり、本実施形態と同様の手順でフォントの変倍を行えば、出力側デバイスが保持するフォントデータの種類によらずに、イメージレイヤとテキストレイヤとの間で文字の位置ずれが生じてしまうことのない表示出力または印刷出力を行うことが実現可能となる。 For example, in the present embodiment, the case of generating an electronic document has been described as an example. However, the present invention can be applied to the case of outputting an image of an electronic document in exactly the same manner. That is, for an electronic document having a configuration in which an image layer and a text layer overlap, a document processing apparatus or document processing program that performs display output or print output generates output data for the display output or print output. If the font scaling is performed in the same procedure as in this embodiment, the character position shifts between the image layer and the text layer regardless of the type of font data held by the output device. It is possible to perform display output or print output without any problem.

このように、本発明は、本実施形態で説明した内容に限定されるものではなく、その要旨を逸脱しない範囲で変更することが可能である。 Thus, the present invention is not limited to the contents described in the present embodiment, and can be changed without departing from the gist thereof.

本発明に係る文書処理装置の機能構成例を示すブロック図である。It is a block diagram which shows the function structural example of the document processing apparatus which concerns on this invention. 本発明に係る文書処理装置の処理動作例を示すフローチャートである。It is a flowchart which shows the processing operation example of the document processing apparatus which concerns on this invention. 本発明に係る文書処理装置が行う文字描画処理の一具体例を示す説明図（その１）である。It is explanatory drawing (the 1) which shows a specific example of the character drawing process which the document processing apparatus concerning this invention performs. 本発明に係る文書処理装置が行う文字描画処理の一具体例を示す説明図（その２）である。It is explanatory drawing (the 2) which shows a specific example of the character drawing process which the document processing apparatus concerning this invention performs. 検索可能な電子文書の一具体例を示す説明図である。It is explanatory drawing which shows a specific example of the electronic document which can be searched.

Explanation of symbols

１…画像入力部、２…設定部、３…画像処理部、３ａ…文字認識手段、４…蓄積部、４ａ…フォントデータ、５…描画処理部、５ａ…領域情報取得手段、５ｂ…フォント情報取得手段、５ｃ…描画計算手段、５ｄ…描画処理手段、６…データ転送部 DESCRIPTION OF SYMBOLS 1 ... Image input part, 2 ... Setting part, 3 ... Image processing part, 3a ... Character recognition means, 4 ... Accumulation part, 4a ... Font data, 5 ... Drawing process part, 5a ... Area information acquisition means, 5b ... Font information Acquisition means, 5c... Drawing calculation means, 5d... Drawing processing means, 6.

Claims

Position information including a first value indicating a size in a width direction of a drawing area in which a character string of a predetermined unit is drawn in the image layer of an electronic document configured by superimposing an image layer and a text layer. Area information acquisition means to acquire;
Font information acquisition means for acquiring metric information including a second value indicating the size in the width direction of each font of the characters constituting the character string in the text layer;
The first value is obtained by dividing the sum of the second values of the fonts of the characters constituting the character string included in the acquired metrics information by the first value included in the acquired position information. And a drawing calculation means for calculating a ratio of the total of the second values,
By multiplying the size in the width direction of each font of the characters constituting the character string in the text layer by the calculated ratio, the size in the width direction between the drawing area of the character string and the character string And a drawing processing unit for drawing the character string on the text layer .

Computer
Position information including a first value indicating a size in a width direction of a drawing area in which a character string of a predetermined unit is drawn in the image layer of an electronic document configured by superimposing an image layer and a text layer. Area information acquisition means to acquire;
Font information acquisition means for acquiring metric information including a second value indicating the size in the width direction of each font of the characters constituting the character string in the text layer;
The first value is obtained by dividing the sum of the second values of the fonts of the characters constituting the character string included in the acquired metrics information by the first value included in the acquired position information. And a drawing calculation means for calculating a ratio of the total of the second values,
By multiplying the size in the width direction of each font of the characters constituting the character string in the text layer by the calculated ratio, the size in the width direction between the drawing area of the character string and the character string And a document processing program that functions as a drawing processing means for drawing the character string on the text layer .