JP2013004094A

JP2013004094A - Text emphasis method and device and text extraction method and device

Info

Publication number: JP2013004094A
Application number: JP2012132919A
Authority: JP
Inventors: Yie-Hwon Pan; パン・イーフォン; Yutaka Katsuyama; 裕勝山; Junu Sunu; スヌ・ジュヌ; Satoshi Naoi; 聡直井
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2011-06-16
Filing date: 2012-06-12
Publication date: 2013-01-07
Anticipated expiration: 2032-06-12
Also published as: CN102831579A; CN102831579B; JP5939047B2

Abstract

PROBLEM TO BE SOLVED: To provide a method of making a computer emphasize text in an image. SOLUTION: Disclosed is a method of making a computer execute: a current image acquisition step of acquiring an original image including at least one line of text; an update value acquisition step of performing two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point, on the basis of the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, and acquiring the updated luminance value and/or updated color value of the original image after the filtering; and an emphasized image generation step of replacing the corresponding original luminance value and/or original color value with the updated luminance value and/or updated color value, and generating a text emphasis image corresponding to the original image. The neighborhood set covers a square centered on the original pixel point with a side length of w, where w is smaller than the height of the original image.

Description

The present invention relates to image processing, and more particularly to a method and apparatus for enhancing text in an image and a method and apparatus for extracting text from an image.

When a video is played or an image is viewed, the video or image often contains a text description, for example a description of the time and place of an event in the video, or an explanation of the image. Because such text is closely related to the video or image, extracting text from a video or image is a very important technique.

In the prior art, text in a video or image can be extracted by methods based on binarization, or on edge color clustering and detection techniques.

However, when text is extracted with the prior art, excessive noise in the video or image, a blurred image or video, or even illumination changes in part of the video blur the boundary between the text and the background or make the text itself unclear, which degrades the extraction result.

Therefore, how to apply enhancement processing to the text in an original image or video, so that the text is emphasized and the text extraction result is further improved, is a problem to be solved in the prior art.

In view of the above problems, an object of the present invention is to provide a text enhancement method and apparatus, and a text extraction method and apparatus, that can apply enhancement processing to the text in an original image containing at least one line of text, so that the text becomes clearer and the text extraction result can be further improved.

According to one embodiment of the present invention, a method is provided by which a computer enhances text in an image. In this method, the computer executes: a current image acquisition step of acquiring an original image containing at least one line of text; an update value acquisition step of performing two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, and thereby acquiring the updated luminance value and/or updated color value of the original image after filtering; and an enhanced image generation step of replacing the corresponding original luminance value and/or original color value with the updated luminance value and/or updated color value, respectively, and thereby generating a text-enhanced image corresponding to the original image. The neighborhood set covers a square centered on the original pixel point with side length w, where w is smaller than the height of the original image.

According to the embodiments of the present invention, a method and apparatus for enhancing text in an image, and a method and apparatus for extracting text from an image, can be provided.

FIG. 1 is a flowchart showing a first type of text enhancement method according to an embodiment of the present invention.
FIG. 2 is a flowchart showing step S102 of the first type of text enhancement method.
FIG. 3 is a flowchart showing a second type of text enhancement method according to an embodiment of the present invention.
FIG. 4 is a flowchart showing step S302 of the second type of text enhancement method.
FIG. 5 is another flowchart showing step S302 of the second type of text enhancement method.
FIG. 6 is another flowchart showing step S302 of the second type of text enhancement method.
FIG. 7 is another flowchart showing step S302 of the second type of text enhancement method.
FIG. 8 is a flowchart showing step S304 of the second type of text enhancement method.
FIG. 9 is a schematic diagram showing a first type of text enhancement apparatus according to an embodiment of the present invention.
FIG. 10 is a schematic diagram showing the filter module 902 of the first type of text enhancement apparatus.
FIG. 11 is a schematic diagram showing a second type of text enhancement apparatus according to an embodiment of the present invention.
FIG. 12 is a schematic diagram showing the stroke polarity estimation module 1101 of the second type of text enhancement apparatus.
FIG. 13 is another schematic diagram showing the stroke polarity estimation module 1101 of the second type of text enhancement apparatus.
FIG. 14 is another schematic diagram showing the stroke polarity estimation module 1101 of the second type of text enhancement apparatus.
FIG. 15 is another schematic diagram showing the stroke polarity estimation module 1101 of the second type of text enhancement apparatus.
FIG. 16 is a schematic diagram showing the judgment module 1102 of the second type of text enhancement apparatus.
FIG. 17 is a flowchart showing a text extraction method according to an embodiment of the present invention.
FIG. 18 is a schematic diagram showing a text extraction apparatus according to an embodiment of the present invention.
FIG. 19 is a block diagram showing the schematic structure of a computer serving as an information processing apparatus according to an embodiment of the present invention.

Embodiments of the present invention are described in detail below with reference to the drawings.

FIG. 1 is a flowchart showing a first type of text enhancement method according to an embodiment of the present invention. As shown in FIG. 1, the first type of text enhancement method includes the following steps. The method may be executed by a computer, for example.

S101: Acquire an original image containing at least one line of text.

In the embodiments of the present invention, text enhancement means applying enhancement processing to the text in an original image containing at least one line of text. Enhancement here can be understood as sharpening the edges of the text or making the text stand out from the background. When applied, the embodiments take into account both the appearance of the text (for example, luminance or color) and its shape (for example, that text forms stripe-like patterns), so as to strengthen the consistency of the pixels inside a stroke while increasing the difference between text and background.

S102: Based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, perform two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point to acquire the updated luminance value and/or updated color value of the original image after filtering. The neighborhood set covers a square centered on the original pixel point with side length w, where w is smaller than the height of the original image.

The direct difference degree in this step expresses the direct difference in appearance, for example in color or luminance, between an arbitrary original pixel point and each neighboring pixel point in its neighborhood set. The indirect difference degree expresses the gradient magnitude (modulus) of the pixels traversed from the original pixel point to each neighboring pixel point in its neighborhood set. Using the direct and indirect difference degrees, two-dimensional stroke filtering is performed on the original luminance value and/or original color value of each original pixel point to acquire the updated luminance value and/or updated color value of the original image after filtering. The neighborhood set can be taken as the square centered on the original pixel point with side length w. Here w is smaller than the height of the original image and is preferably one eighth of that height.
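
The neighborhood set described above can be sketched as follows. This is only an illustrative sketch: the function name `neighborhood`, the clipping of the square at the image border, and the choice of extending `w // 2` pixels on each side of the center are assumptions that the text does not specify.

```python
def neighborhood(height, width, y, x, w=None):
    """Return the (row, col) coordinates of the neighborhood set of pixel (y, x)."""
    if w is None:
        # Preferred side length from the text: one eighth of the image height.
        w = max(1, height // 8)
    if w >= height:
        raise ValueError("w must be smaller than the height of the original image")
    # Assumption: the square extends w // 2 pixels on each side of the center
    # and is clipped where it would leave the image.
    half = w // 2
    rows = range(max(0, y - half), min(height, y + half + 1))
    cols = range(max(0, x - half), min(width, x + half + 1))
    return [(r, c) for r in rows for c in cols]
```

For an image 24 pixels high, the default gives w = 3, so an interior pixel has a 3 x 3 neighborhood set, while a corner pixel keeps only the part of the square that lies inside the image.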

FIG. 2 is a flowchart showing step S102 of the first type of text enhancement method. In practice, as shown in FIG. 2, S102 includes the following.

S201: Obtain the direct difference degree by algebraic subtraction of the original luminance values and/or original color values of the original pixel point and each neighboring pixel point.

In this embodiment, denoting the direct difference degree between pixels i and j by D₁(i,j), the direct difference degree of the luminance values can be calculated by the following equation (1).

Here, f'(i) denotes the mean luminance of the neighboring pixels in the neighborhood set of the target pixel; that is, the direct difference degree is calculated using the mean pixel value of the neighboring pixels in place of the pixel value of the target pixel. σ(i) denotes the local luminance standard deviation around pixel i and serves for normalization.

The direct difference degree between the luminance values of pixels i and j can also be calculated by the following equation (2).

Here, f(i) denotes the mean luminance of the neighboring pixels in the neighborhood set of the target pixel.

Note that, when calculating the direct difference degree between the color values of pixels i and j, the following equation (3) or (4) can be used, respectively.

In equations (3) and (4), n denotes the R, G, or B channel of the pixel's color information. The equations above for the direct difference degree are merely illustrative; those skilled in the art can adapt them as needed.
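
Equation (1) is not reproduced in this text, so the following is only a hedged sketch of a direct difference degree built from the quantities named above: the neighborhood mean f'(i), the local standard deviation σ(i), and the preset parameter a. The specific form |f'(i) − f(j)| / (a·σ(i)), the function name `direct_difference`, and the small constant `eps` that avoids division by zero are all assumptions, not the formula of the publication.

```python
from statistics import mean, pstdev


def direct_difference(image, j, neighbors, a=1.0, eps=1e-6):
    """Assumed direct difference degree between a target pixel and neighbor j.

    image     -- luminance image as a list of rows
    j         -- (row, col) of the neighboring pixel
    neighbors -- (row, col) coordinates of the target pixel's neighborhood set
    """
    values = [image[r][c] for r, c in neighbors]
    local_mean = mean(values)    # f'(i): mean luminance over the neighborhood set
    local_std = pstdev(values)   # sigma(i): local luminance standard deviation
    # Assumed form: absolute difference normalized by a * sigma(i).
    return abs(local_mean - image[j[0]][j[1]]) / (a * (local_std + eps))
```

With a 3 x 3 patch that is uniformly 10 except for one pixel of value 40, the outlier pixel receives a much larger difference degree than the others, which is the behavior the normalization by σ(i) is meant to make comparable across regions of different contrast.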

S202: Obtain the indirect difference degree based on the gradient magnitude from the original pixel point to each neighboring pixel point in its neighborhood set.

In this embodiment, denoting the indirect difference degree between pixels i and j by D₂(i,j), the indirect difference degree between the luminance values of pixels i and j can be calculated by the following equation (5).

Here, the gradient term in equation (5) denotes the gradient magnitude at pixel l along the direction from i to j. Both b in equation (5) and a in equation (1) are preset parameters; they are monotonic in the same sense, so that together they control the degree of smoothing of the filter.

Of course, in practice, the difference between the maximum and minimum luminance gradient values of the pixels traversed from i to j can be used in place of the maximum gradient value in equation (5). The calculation is given by the following equation (6).

Here, Max denotes the maximum gradient value and min denotes the minimum gradient value.

The indirect difference degree between the color values of pixels i and j can likewise be calculated by the following equations (7) and (8), respectively.

Here, n denotes the R, G, or B channel of the pixel's color information. The equations above for the indirect difference degree are merely illustrative; those skilled in the art can adapt them as needed.
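
Equations (5) and (6) are not reproduced in this text. The sketch below only illustrates the idea stated above: walk the pixels from i to j, take a gradient magnitude at each step, and use either the maximum (as in the description of equation (5)) or the difference between maximum and minimum (as in the description of equation (6)). The straight-line sampling in `line_pixels`, the use of the absolute difference between consecutive samples as the gradient magnitude, and the division by the parameter b are assumptions.

```python
def line_pixels(i, j):
    """Integer pixels on the straight segment from i to j, endpoints included."""
    (r0, c0), (r1, c1) = i, j
    steps = max(abs(r1 - r0), abs(c1 - c0))
    if steps == 0:
        return [i]
    return [
        (round(r0 + (r1 - r0) * t / steps), round(c0 + (c1 - c0) * t / steps))
        for t in range(steps + 1)
    ]


def indirect_difference(image, i, j, b=1.0, use_range=False):
    """Assumed indirect difference degree between pixels i and j."""
    path = line_pixels(i, j)
    # Assumed gradient magnitude: absolute luminance step between consecutive path pixels.
    grads = [
        abs(image[r1][c1] - image[r0][c0])
        for (r0, c0), (r1, c1) in zip(path, path[1:])
    ]
    if not grads:
        return 0.0
    value = max(grads) - min(grads) if use_range else max(grads)
    return value / b
```

On the one-row image `[[10, 10, 200, 10, 10]]`, pixels (0, 0) and (0, 4) have identical luminance, so their direct difference would be zero, yet the path between them crosses a strong edge and the sketch returns 190. This is the situation the indirect difference degree is meant to capture.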

S203: Based on the direct and indirect difference degrees, calculate the weight of the luminance value and/or color value of each neighboring pixel point with respect to the original pixel point.

Once the indirect and direct difference degrees have been obtained, the weight can be calculated by the following equation (9).

Here, D₁(i,j) denotes the direct difference degree between the luminance and/or color values of pixels i and j, and D₂(i,j) denotes the indirect difference degree between the luminance and/or color values of pixels i and j. w(i,j) denotes the luminance weight, and w_n(i,j) denotes the color-value weight.

S204: Calculate the updated luminance value of the original pixel point by the following two-dimensional stroke filter equation (10).

Here, N(i) denotes the neighborhood set of pixel point i, w(i,j) denotes the weight of the luminance value of neighboring pixel point j with respect to the original pixel point i, and f(j) is the luminance value of pixel point j in the neighborhood set.

S205: Calculate the updated color value of the original pixel point by the following two-dimensional stroke filter equation (11).

Here, w_n(i,j) denotes the weight, in channel n, of the color value of neighboring pixel point j with respect to the original pixel point i, and f_n(j) is the color value, in channel n, of pixel point j in the neighborhood set.

Note that steps S204 and S205 calculate the updated luminance value and the updated color value, respectively. In practice, this embodiment can therefore be implemented by executing either one of the two steps, or both of them together.
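
Equations (9) and (10) are not reproduced in this text. As a hedged sketch of steps S203 and S204, the code below assumes an exponential weight w(i,j) = exp(−(D₁(i,j) + D₂(i,j))) and a normalized weighted average over the neighborhood set N(i); both forms are assumptions chosen for illustration, as are the function name `stroke_filter_pixel` and the convention of passing the two difference degrees as callables `d1` and `d2`.

```python
import math


def stroke_filter_pixel(image, i, neighbors, d1, d2):
    """Updated luminance of pixel i as a weighted mean over its neighborhood set.

    d1, d2 -- callables returning the direct and indirect difference degrees
              for a pair of (row, col) coordinates (i, j)
    """
    numerator = 0.0
    denominator = 0.0
    for j in neighbors:
        # Assumed weight: neighbors that differ strongly from i contribute little.
        weight = math.exp(-(d1(i, j) + d2(i, j)))
        numerator += weight * image[j[0]][j[1]]
        denominator += weight
    return numerator / denominator
```

Under these assumptions a neighbor with a large direct or indirect difference degree receives a weight close to zero, so the filter averages mainly over pixels that appear to belong to the same stroke (or the same background region) as pixel i. The same function could be applied per channel with channel-specific difference degrees to mirror step S205.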

Refer again to FIG. 1.

S103: Replace the corresponding original luminance value and/or original color value with the filtered updated luminance value and/or updated color value, respectively, to generate a text-enhanced image corresponding to the original image.

After the filtered updated luminance value and/or updated color value has been obtained, it replaces the original luminance value and/or original color value, respectively. After this replacement, the text strokes among the pixel points of the original image are enhanced, the consistency of the pixels inside each stroke is strengthened, and the difference between text and background is increased.

FIG. 3 is a flowchart showing a second type of text enhancement method according to an embodiment of the present invention. As shown in FIG. 3, the second type of text enhancement method includes the following steps. The method may be executed by a computer, for example.

S301: Acquire an original image containing at least one line of text.

S302: Estimate the stroke polarity of the text in the original image. The polarity expresses the relative magnitude of the luminance values and/or color values of pixel points inside a stroke region and pixel points outside it.

In practice, stroke enhancement of text mainly relies on filtering, that is, on enhancing a target pixel value inside a stroke using the surrounding pixel values outside the stroke, so noise pixel points around a target pixel point inside a stroke degrade the enhancement. This effect is more pronounced when thin strokes or the gaps between strokes are processed. To avoid this loss of quality, stroke polarity estimation is introduced in this embodiment. The stroke polarity estimated in this step expresses the relative magnitude of the luminance values and/or color values of pixel points inside the stroke region and pixel points outside it.

FIG. 4 is a flowchart showing step S302 of the second type of text enhancement method. Specifically, when the polarity expresses the relative magnitude of the luminance values of pixel points inside and outside the stroke region, the step of estimating the stroke polarity of the text in the original image includes the following, as shown in FIG. 4.

S401: Calculate the stroke response intensity in each of the horizontal, vertical, and two diagonal directions (that is, the width direction, the height direction, and the two diagonal directions of the original image) by the following equation (12).

Here, w is one eighth of the height of the original image, and f(i) denotes the luminance value of pixel point i. This step yields four stroke response intensities, one each for the horizontal, vertical, and two diagonal directions.

S402: Determine whether the largest of the four calculated stroke response intensities satisfies the following two conditions: [f(i)−f(l)] and [f(i)−f(k)] have the same polarity, and this stroke response intensity is greater than a preset threshold. If so, execute step S403. If not, execute step S404.

In practice, the luminance or color values of pixel points inside a text stroke and of background pixel points are generally opposite, so if [f(i)−f(l)] and [f(i)−f(k)] have the same polarity, pixel point i is likely to lie inside a stroke. This step detects, among all pixel points of the original image, those that may lie inside a stroke. Having the same polarity means that the luminance value and/or color value of pixel point i is greater than those of both pixel point l and pixel point k, or smaller than those of both; that is, [f(i)−f(l)] and [f(i)−f(k)] are both greater than zero, or both less than or equal to zero. The threshold on the stroke response intensity can be adjusted according to actual needs, so the present invention does not limit its choice.

S403: Determine the estimated stroke polarity of the text based on the polarity of [f(i)−f(l)] or [f(i)−f(k)].

If pixel point i satisfies the above two conditions, the estimated stroke polarity of the text is determined as p(i) based on the polarity of [f(i)−f(l)] or [f(i)−f(k)]. Its value may be chosen arbitrarily as long as pixel points inside the stroke can be distinguished from those outside. For example, p(i) is set to 0 when the luminance value of the pixel points inside the text stroke is lower than that of the background pixel points; correspondingly, p(i) equal to 1 indicates that the luminance value of the pixel points inside the text stroke is greater than or equal to that of the background pixel points.

S404: Select the calculated stroke response intensities in descending order of magnitude and execute step S402.

If the largest stroke response intensity does not satisfy the above two conditions, the second largest is selected and the determination of step S402 is executed, and so on in descending order until a stroke response intensity satisfying both conditions is found. If none of the four stroke response intensities satisfies the two conditions, pixel point i is treated as a non-stroke pixel point.
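
The control flow of steps S402 to S404 can be sketched as follows. Because equation (12) is not reproduced in this text, the sketch takes the four stroke response intensities and the two differences [f(i)−f(l)] and [f(i)−f(k)] as already-computed inputs. The function name `estimate_polarity`, the tuple layout, and the use of `None` for a non-stroke pixel point are assumptions made for illustration.

```python
def estimate_polarity(candidates, threshold):
    """Estimate the stroke polarity p(i) of one pixel point.

    candidates -- one (response, d_l, d_k) tuple per direction, where
                  d_l = f(i) - f(l) and d_k = f(i) - f(k)
    Returns 1 (stroke brighter than background), 0 (stroke darker),
    or None when the pixel is treated as a non-stroke pixel point.
    """
    # S404: examine the responses in descending order of magnitude.
    for response, d_l, d_k in sorted(candidates, key=lambda c: c[0], reverse=True):
        # S402: both differences greater than zero, or both less than or equal to zero.
        same_polarity = (d_l > 0 and d_k > 0) or (d_l <= 0 and d_k <= 0)
        if same_polarity and response > threshold:
            # S403: the polarity follows the sign of the differences.
            return 1 if d_l > 0 else 0
    return None
```

For example, with candidates `[(50, 30, -5), (40, -20, -25), (10, 5, 5), (5, 1, 1)]` and a threshold of 20, the largest response is rejected because its two differences have opposite signs, and the second largest yields polarity 0.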

There is another embodiment for the case where the polarity expresses the relative magnitude of the luminance values of pixel points inside and outside the stroke region. FIG. 5 is another flowchart showing step S302 of the second type of text enhancement method. As shown in FIG. 5, the step of estimating the stroke polarity of the text in the original image includes the following.

S501: In one direction, calculate the stroke response intensity of each original pixel point in the original image by equation (12) above. The direction is one of the horizontal, vertical, and two diagonal directions.

This step first calculates the stroke response intensity in one of the horizontal, vertical, and two diagonal directions.

S502: Determine whether the stroke response intensity satisfies both of the following conditions: [f(i)−f(l)] and [f(i)−f(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold. If so, execute step S503. If not, the calculated stroke response intensity is not processed further, and the stroke response intensity of each original pixel point in the original image is calculated in another direction that has not yet been computed.

S503: Determine the initial polarity of the original pixel point i based on the polarity of [f(i)−f(l)] or [f(i)−f(k)].

If the calculated stroke response intensity satisfies the above two conditions, the initial polarity of the original pixel point i is determined according to the polarity of [f(i)−f(l)] or [f(i)−f(k)]. The initial polarity determined here is the same as the polarity of [f(i)−f(l)] or [f(i)−f(k)].

S504: Determine whether the stroke response intensities in all four directions have been calculated. If so, execute step S505. If not, execute step S501.

Next, it is determined whether the stroke response intensities in all four directions (horizontal, vertical, and the two diagonals) have been calculated. If not, one of the directions for which the stroke response intensity has not yet been calculated is selected and step S501 is executed.

S505: Determine, as the estimated stroke polarity of the text, the initial polarity corresponding to the largest stroke response intensity among the four directions.

When the stroke response intensities in all four directions have been calculated, the initial polarity corresponding to the largest of the stroke response intensities that satisfy the above two conditions is selected as the estimated stroke polarity of the text. That is, the estimated stroke polarity of the text in the original image is the same as the initial polarity corresponding to the largest stroke response intensity.
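
The direction-by-direction variant of steps S501 to S505 can be sketched as a loop that keeps a running best candidate. As before, equation (12) is not reproduced in this text, so the per-direction stroke response intensity and the differences [f(i)−f(l)] and [f(i)−f(k)] are taken as inputs. The direction names, the dictionary layout, and the function name `estimate_polarity_by_direction` are hypothetical.

```python
DIRECTIONS = ("horizontal", "vertical", "diagonal", "anti_diagonal")


def estimate_polarity_by_direction(per_direction, threshold):
    """Estimate the stroke polarity of one pixel point, one direction at a time.

    per_direction -- maps each direction name to (response, d_l, d_k), where
                     d_l = f(i) - f(l) and d_k = f(i) - f(k)
    Returns 1, 0, or None when no direction satisfies both conditions.
    """
    best = None  # (response, initial polarity) of the strongest valid direction
    for name in DIRECTIONS:                      # S501 and S504: visit all four directions
        response, d_l, d_k = per_direction[name]
        same_polarity = (d_l > 0 and d_k > 0) or (d_l <= 0 and d_k <= 0)
        if not (same_polarity and response > threshold):
            continue                             # S502: skip directions that fail
        initial_polarity = 1 if d_l > 0 else 0   # S503
        if best is None or response > best[0]:
            best = (response, initial_polarity)
    return None if best is None else best[1]     # S505
```

Under these assumptions the result coincides with examining the responses in descending order, since both procedures return the polarity of the largest response that satisfies the two conditions.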

図６は、第２種のテキスト強調方法におけるステップＳ３０２を示す他のフローチャートである。極性が筆画領域内部の画素点と筆画領域外部の画素点との間の色値の大きさの関係を示す場合に、原画像におけるテキストの筆画極性を推定するステップは、図６に示されたように、具体的に以下の内容を含む。 FIG. 6 is another flowchart showing step S302 in the second type of text enhancement method. The step of estimating the stroke polarity of the text in the original image when the polarity indicates the relationship of the magnitude of the color value between the pixel point inside the stroke area and the pixel point outside the stroke area is shown in FIG. Specifically, the following contents are included.

Ｓ６０１：水平方向、垂直方向及び二つの対角線方向において以下の式（１３）により、各チャネルの筆画応答強度をそれぞれ算出する。

S601: The stroke response strength of each channel is calculated by the following formula (13) in the horizontal direction, the vertical direction, and the two diagonal directions.

ただし、wは原画像の高さの８分の一であり、f_n(i)は画素点iのチャネルｎにおける色値を示す。例えば、ｎチャネルがそれぞれRチャネル、Gチャネル及びBチャネルの場合に、筆画応答強度がRチャネル、Gチャネル及びBチャネルにおける筆画応答強度の合計である。 Here, w is one eighth of the height of the original image, and f_n(i) denotes the color value of pixel point i in channel n. For example, when the channels n are the R, G, and B channels, the stroke response intensity is the sum of the stroke response intensities in the R, G, and B channels.

Ｓ６０２：算出した各チャネルの四つの筆画応答強度のうち、最大の筆画応答強度が以下の二つの条件、即ち、チャネルｎにおいて[f_n(i)−f_n(l)]と[f_n(i)−f_n(k)]との極性が何れも一致し、且つこの筆画応答強度が予め設定された所定の閾値より大きいことを満たしているか否かを判断する。肯定の場合に、ステップＳ６０３を実行し、否定の場合に、ステップＳ６０４を実行する。 S602: It is determined whether the largest of the four stroke response intensities calculated for each channel satisfies the following two conditions: in channel n, [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold. If so, step S603 is executed; if not, step S604 is executed.

本ステップにおいて、任意のチャネルにおいても、[f_n(i)−f_n(l)]と[f_n(i)−f_n(k)]との極性が一致するという条件を満たす必要がある。 In this step, the condition that [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity must be satisfied in every channel.

Ｓ６０３：[f_n(i)−f_n(l)]又は[f_n(i)−f_n(k)]の極性に基づいて、テキストの推定筆画極性を特定する。 S603: The estimated stroke polarity of the text is specified based on the polarity of [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)].

Ｓ６０４：算出した筆画応答強度を大きさに従って順に選択して、ステップＳ６０２を実行する。 S604: The calculated stroke response intensities are selected one at a time in descending order of magnitude, and step S602 is executed.

最大の筆画応答強度が上記二つの条件を満たさない場合に、本ステップにおいて、ある筆画応答強度が前記二つの条件を満たすまで、大きさに従って順に、二番目、三番目及び四番目の筆画応答強度を選択してステップＳ６０２を実行する。又は、四つの筆画応答強度がいずれも上記二つの条件を満たさない場合に、画素点iを非筆画画素点とする。例えば、二番目の大きい筆画応答強度が既に上記二つの条件を満たしている場合に、筆画極性推定の流れを中止する。 If the largest stroke response intensity does not satisfy the above two conditions, then in this step the second, third, and fourth largest stroke response intensities are selected in turn and step S602 is executed, until some stroke response intensity satisfies the two conditions. If none of the four stroke response intensities satisfies the two conditions, pixel point i is treated as a non-stroke pixel point. For example, if the second largest stroke response intensity already satisfies the two conditions, the stroke polarity estimation flow stops there.
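The selection loop of steps S602 to S604 can be sketched as follows. This is only an illustrative sketch, not the patented implementation: formula (13) is not reproduced here, so the stroke response intensities and the differences f_n(i)−f_n(l) and f_n(i)−f_n(k) are assumed to be precomputed. The names `Candidate`, `channel_polarities`, and `estimate_stroke_polarity`, the encoding of polarity as +1/−1 per channel, and the treatment of a zero difference as a mismatch are all assumptions made for illustration.

```python
from typing import List, Optional, Sequence, Tuple

# One candidate per direction (hypothetical layout):
# (stroke response intensity, [f_n(i)-f_n(l) per channel], [f_n(i)-f_n(k) per channel])
Candidate = Tuple[float, Sequence[float], Sequence[float]]


def sign(x: float) -> int:
    """Return +1, -1 or 0 according to the sign of x."""
    return (x > 0) - (x < 0)


def channel_polarities(diffs_l: Sequence[float],
                       diffs_k: Sequence[float]) -> Optional[Tuple[int, ...]]:
    """Per-channel polarity if both differences agree in every channel, else None."""
    signs: List[int] = []
    for d_l, d_k in zip(diffs_l, diffs_k):
        if sign(d_l) == 0 or sign(d_l) != sign(d_k):
            return None  # assumption: a zero difference counts as a mismatch
        signs.append(sign(d_l))
    return tuple(signs)


def estimate_stroke_polarity(candidates: Sequence[Candidate],
                             threshold: float) -> Optional[Tuple[int, ...]]:
    """Try candidates from strongest to weakest; None means a non-stroke pixel."""
    ordered = sorted(candidates, key=lambda c: c[0], reverse=True)
    for intensity, diffs_l, diffs_k in ordered:
        if intensity <= threshold:
            break  # every remaining candidate is weaker, so none can pass
        polarity = channel_polarities(diffs_l, diffs_k)
        if polarity is not None:
            return polarity  # S603: polarity taken from the agreeing differences
    return None
```

Because the candidates are visited in descending order, stopping at the first intensity that fails the threshold gives the same result as testing all four.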

実際の応用において、極性が筆画領域内部の画素点と筆画領域外部の画素点の間の色値の大きさの関係を示す場合に、他の実施形態も存在する。図７は、第２種のテキスト強調方法におけるステップＳ３０２を示す他のフローチャートである。図７に示されたように、原画像におけるテキストの筆画極性を推定するステップは、以下の内容を含む。 In an actual application, other embodiments exist when the polarity indicates the relationship of the magnitude of the color value between the pixel points inside the stroke area and the pixel points outside the stroke area. FIG. 7 is another flowchart showing step S302 in the second type of text enhancement method. As shown in FIG. 7, the step of estimating the stroke polarity of the text in the original image includes the following contents.

Ｓ７０１：一つの方向において、上記式（１３）により、原画像における各原画素点の各チャネルの筆画応答強度を算出し、この一つの方向は水平方向、垂直方向及び二つの対角線方向の何れかである。 S701: In one direction, the stroke response intensity of each channel of each original pixel point in the original image is calculated by the above formula (13). This one direction is any one of the horizontal direction, the vertical direction, and the two diagonal directions.

Ｓ７０２：一つの方向における各チャネルの筆画応答強度が以下の二つの条件、即ち、チャネルｎにおいて[f_n(i)−f_n(l)]と[f_n(i)−f_n(k)]との極性が一致し、且つこの筆画応答強度が予め設定された所定の閾値よりも大きいことを同時に満たしているか否かを判断する。肯定の場合に、ステップＳ７０３を実行する。否定の場合に、この方向において初期極性を設けない。 S702: It is determined whether the stroke response intensity of each channel in the one direction simultaneously satisfies the following two conditions: in channel n, [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold. If so, step S703 is executed. If not, no initial polarity is set for this direction.

Ｓ７０３：[f_n(i)−f_n(l)]又は[f_n(i)−f_n(k)]の極性に基づいて、原画素点iの初期極性を特定する。 S703: The initial polarity of the original pixel point i is specified based on the polarity of [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)].

ステップＳ７０１で算出した一つの方向の筆画応答強度が上記二つの条件を満たしている場合に、原画素点iのこの方向における初期極性を、[f_n(i)−f_n(l)]又は[f_n(i)−f_n(k)]の極性と同じように設置する。 When the stroke response intensity in the one direction calculated in step S701 satisfies the above two conditions, the initial polarity of the original pixel point i in this direction is set to the same polarity as [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)].

Ｓ７０４：四つの方向上の筆画応答強度の算出がすべて完了したか否かを判断し、肯定の場合に、ステップＳ７０５を実行し、否定の場合に、ステップＳ７０１を実行する。 S704: It is determined whether or not the calculation of the stroke response intensities in the four directions has been completed. If the result is affirmative, step S705 is executed, and if the result is negative, step S701 is executed.

Ｓ７０５：四つの方向上の最大の筆画応答強度が対応する初期極性を、テキストの推定筆画極性として特定する。 S705: The initial polarity corresponding to the maximum stroke response intensity in the four directions is specified as the estimated stroke polarity of the text.

四つの方向上の筆画応答強度の算出がすべて完了した場合に、四つの方向上の、上記二つの条件を満たしている筆画応答強度から、最大の筆画応答強度の対応する初期極性を選択して、上述のテキストの推定筆画極性とする。四つの方向上の筆画応答強度の算出がすべて完了していない場合に、さらに算出されていない方向の何れかを選択してステップＳ７０１を実行する。 Once the stroke response intensities have been calculated in all four directions, the initial polarity corresponding to the largest stroke response intensity among those that satisfy the above two conditions is selected as the estimated stroke polarity of the text. If the calculation has not yet been completed for all four directions, one of the remaining directions is selected and step S701 is executed for it.
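The per-direction flow of steps S701 to S705 might look like the sketch below. It is a simplified illustration under stated assumptions: the intensities and differences are taken as already computed (formula (13) is not reproduced), only a single channel is handled for brevity, and the dictionary layout, the function names, and the +1/−1 encoding of polarity are hypothetical.

```python
from typing import Dict, Optional, Tuple

# direction name -> (stroke response intensity, f(i)-f(l), f(i)-f(k)); hypothetical layout
Measurements = Dict[str, Tuple[float, float, float]]


def sign(x: float) -> int:
    """Return +1, -1 or 0 according to the sign of x."""
    return (x > 0) - (x < 0)


def initial_polarity(intensity: float, diff_l: float, diff_k: float,
                     threshold: float) -> Optional[int]:
    """S702/S703: +1 or -1 when both conditions hold, otherwise no initial polarity."""
    same_sign = sign(diff_l) != 0 and sign(diff_l) == sign(diff_k)
    if same_sign and intensity > threshold:
        return sign(diff_l)
    return None


def estimate_stroke_polarity(measurements: Measurements,
                             threshold: float) -> Optional[int]:
    """Keep the initial polarity of the strongest direction that passed both conditions."""
    best_intensity: Optional[float] = None
    best_polarity: Optional[int] = None
    for intensity, diff_l, diff_k in measurements.values():  # S701/S704: each direction
        polarity = initial_polarity(intensity, diff_l, diff_k, threshold)
        if polarity is None:
            continue  # no initial polarity is set for this direction
        if best_intensity is None or intensity > best_intensity:
            best_intensity, best_polarity = intensity, polarity
    return best_polarity  # S705; None if no direction qualified
```

Unlike the sorted variant of steps S601 to S604, this version evaluates every direction first and only then picks the maximum among the qualifying ones.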

再び図３を参照する。 Refer to FIG. 3 again.

Ｓ３０３：原画像における任意の原画素点からその近傍集合における各近傍画素点までの直接差異度及び間接差異度に基づいて、各原画素点の原輝度値又は／及び原色値に対して筆画二次元フィルタリングを行って、原画像のフィルタリング後の更新輝度値又は／及び更新色値を取得する。近傍集合の範囲は、原画素点を中心とし、且つ辺長がwである正方形となり、wが原画像の高さより小さい。 S303: Based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, stroke two-dimensional filtering is performed on the original luminance value and/or original color value of each original pixel point to obtain the filtered updated luminance value and/or updated color value of the original image. The neighborhood set is a square centered on the original pixel point with side length w, where w is smaller than the height of the original image.

本ステップの詳細については、本実施例の関連内容を参照することができるため、ここで省略される。ここで説明すべきは、本ステップはステップＳ３０２と同時に実行しても良く、又はステップＳ３０３を実行してから、ステップＳ３０２を実行しても良い。 The details of this step are omitted here because the related description in this embodiment can be referred to. Note that this step may be executed at the same time as step S302, or step S303 may be executed first and step S302 afterwards.

Ｓ３０４：原画像の各画素点に対して、フィルタリング後の更新輝度値又は／及び更新色値が上述の筆画極性と合わせるか否かを判断し、肯定の場合に、ステップＳ３０５を実行し、否定の場合に、置換を行わない。 S304: For each pixel point of the original image, it is determined whether or not the updated brightness value or / and the updated color value after filtering are matched with the above-described stroke polarity. If the result is affirmative, step S305 is executed and negative In this case, no replacement is performed.

算出により筆画極性が得られ、及び筆画フィルタリングが行われた後に、順に原画像の各画素点のフィルタリング後の更新輝度値又は／及び更新色値が上述の筆画極性と合わせるか否いかを判断することができる。ある画素点のフィルタリング後の更新輝度値又は／及び更新色値が上述の筆画極性と合わせない場合に、この画素点の原輝度値又は／及び色値を置換せず、次の画素点のフィルタリング後の更新輝度値又は／及び更新色値が上述の筆画極性と合わせるか否かを判断し続ける。 After the stroke polarity is obtained by calculation and the stroke filtering is performed, it is sequentially determined whether or not the updated brightness value or / and the updated color value after filtering of each pixel point of the original image matches the stroke polarity described above. be able to. If the updated luminance value or / and updated color value after filtering of a certain pixel point does not match the above-mentioned stroke polarity, the original luminance value or / and color value of this pixel point is not replaced, and the next pixel point is filtered. It continues to determine whether or not the later updated luminance value or / and updated color value match the above-described stroke polarity.

図８は、第２種のテキスト強調方法におけるステップＳ３０４を示すフローチャートである。図８に示されたように、前記ステップＳ３０４は具体的に、以下の内容を含む。 FIG. 8 is a flowchart showing step S304 in the second type of text enhancement method. As shown in FIG. 8, the step S304 specifically includes the following contents.

Ｓ８０１：フィルタリング後の更新輝度値又は／及び更新色値と、原輝度値又は／及び原色値との第１の大きさの関係を取得する。 S801: A first magnitude relationship between the filtered updated luminance value and/or updated color value and the original luminance value and/or original color value is acquired.

まず、フィルタリング後の更新輝度値又は／及び更新色値と、原輝度値又は／及び原色値との第１の大きさの関係を取得する。ここでの第１の大きさの関係について、例えば、更新輝度値が原輝度値の輝度より明るい場合に、第１の大きさの関係は更新輝度値が原輝度値より大きいことであり、又は、例えば、更新色値が原色値より大きい場合に、第１の大きさの関係は、更新色値が原色値より大きいことである。この第１の大きさの関係は、更新輝度値又は／及び更新色値と、原輝度値又は／及び原色値とに対して代数的減算を行って取得することができる。 First, the first magnitude relationship between the filtered updated luminance value and/or updated color value and the original luminance value and/or original color value is acquired. For example, when the updated luminance value is brighter than the original luminance value, the first magnitude relationship is that the updated luminance value is greater than the original luminance value; likewise, when the updated color value is greater than the original color value, the first magnitude relationship is that the updated color value is greater than the original color value. This first magnitude relationship can be obtained by algebraic subtraction between the updated luminance value and/or updated color value and the original luminance value and/or original color value.

Ｓ８０２：第１の大きさの関係が上述の筆画極性に示された第２の大きさの関係と合わせるか否かを判断する。 S802: It is determined whether or not the first magnitude relationship matches the second magnitude relationship indicated by the above-described stroke polarity.

上述の筆画極性が筆画内部の画素点の輝度値又は／及び色値と、筆画外部の画素点の輝度値又は／及び色値との大きさの関係を示すため、更新色値が原色値より大きい、且つ筆画極性が示す第２の大きさの関係も更新色値が原色値より大きいことである場合に、又は、更新色値が原色値より小さい、且つ筆画極性が示す第２の大きさの関係も更新色値が原色値より小さいことである場合に、第１の大きさの関係と、上述の筆画極性に示された第２の大きさの関係とが合わせると考えられる。逆に、第１の大きさの関係と、第２の大きさの関係とが合わせないと考えられる。 Because the stroke polarity indicates the magnitude relationship between the luminance and/or color values of pixel points inside the stroke and those of pixel points outside the stroke, the first magnitude relationship is considered to match the second magnitude relationship indicated by the stroke polarity in two cases: the updated color value is greater than the original color value and the second magnitude relationship is likewise "greater than", or the updated color value is less than the original color value and the second magnitude relationship is likewise "less than". Otherwise, the first and second magnitude relationships are considered not to match.
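The check in steps S801 and S802 can be illustrated with a short sketch. The encoding is an assumption made here for illustration: the stroke polarity is represented as +1 ("greater than") or −1 ("less than"), and an unchanged value (difference of zero) is treated as not matching. The function names are hypothetical.

```python
def first_relation(updated: float, original: float) -> int:
    """S801: sign of the algebraic difference (updated - original): +1, -1 or 0."""
    diff = updated - original
    return (diff > 0) - (diff < 0)


def matches_polarity(updated: float, original: float, stroke_polarity: int) -> bool:
    """S802: True when the first relation agrees with the polarity's second relation."""
    relation = first_relation(updated, original)
    # assumption: an unchanged value (relation 0) never counts as a match
    return relation != 0 and relation == stroke_polarity
```

For instance, with a polarity of −1, a pixel whose filtered value dropped from 150 to 100 matches, while one whose value rose to 200 does not.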

再び図３を参照する。 Refer to FIG. 3 again.

Ｓ３０５：フィルタリング後の更新輝度値又は／及び更新色値で、対応する原輝度値又は／及び原色値をそれぞれ置換して、原画像に対応するテキスト強調画像を生成する。 S305: The corresponding original luminance value and/or original color value is replaced with the filtered updated luminance value and/or updated color value, respectively, to generate a text-enhanced image corresponding to the original image.

第１の大きさの関係と、第２の大きさの関係とが合わせた場合に、フィルタリング後の更新輝度値又は／及び更新色値で、対応する原輝度値又は／及び原色値をそれぞれ置換することにより、原画像に対応するテキスト強調画像を取得することができる。本実施例は、筆画極性推定のステップにより、筆画極性がフィルタリング後の更新輝度値又は更新色値と合わせた場合に、さらにテキスト強調を行うようにすることができ、第１種のテキスト強調方法と比べてテキスト強調の効果がより顕著になり、後続のテキスト抽出の正確性の向上に寄与することができる。 When the first magnitude relationship matches the second magnitude relationship, the text-enhanced image corresponding to the original image is obtained by replacing the corresponding original luminance value and/or original color value with the filtered updated luminance value and/or updated color value, respectively. Through the stroke polarity estimation step, this embodiment performs text enhancement only where the filtered updated luminance value or updated color value is consistent with the stroke polarity. Compared with the first type of text enhancement method, the enhancement effect is more pronounced, which helps improve the accuracy of subsequent text extraction.
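The conditional replacement of steps S304 and S305 can be sketched on a single-channel image as follows. This is a minimal sketch under assumptions: the image is a list of rows of luminance values, the filtered image is supplied by the caller (the stroke two-dimensional filter itself is not reproduced), a single stroke polarity encoded as +1 or −1 applies to the whole image, and the function name `enhance` is hypothetical.

```python
from typing import List

Image = List[List[float]]  # rows of luminance values (single channel, for brevity)


def enhance(original: Image, filtered: Image, stroke_polarity: int) -> Image:
    """Replace a pixel with its filtered value only when the change agrees with the polarity."""
    enhanced: Image = []
    for row_o, row_f in zip(original, filtered):
        new_row = []
        for o, f in zip(row_o, row_f):
            agrees = (f > o and stroke_polarity > 0) or (f < o and stroke_polarity < 0)
            new_row.append(f if agrees else o)  # otherwise keep the original value
        enhanced.append(new_row)
    return enhanced
```

With `original = [[100, 120], [90, 200]]`, `filtered = [[80, 130], [90, 150]]`, and a polarity of −1, only the pixels that became darker are replaced, giving `[[80, 120], [90, 150]]`.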

本発明の実施例による上述のテキスト強調方法によれば、取得されたテキスト強調画像中の画素点におけるテキスト筆画が強調され、筆画内部の画素の一致性が強調され、且つテキストと背景との差異度が深まれ、即ち原画像に含まれたテキストが強調された。これにより、後続のこのテキスト強調画像に対するテキスト抽出がより精度良く且つ正確になる。 According to the above text enhancement method of the embodiment of the present invention, the text strokes at the pixel points of the resulting text-enhanced image are emphasized, the consistency of the pixels inside each stroke is strengthened, and the difference between the text and the background is increased; in other words, the text contained in the original image is enhanced. As a result, subsequent text extraction from this text-enhanced image becomes more precise and accurate.

また、本発明の一実施例は、上述の第１種のテキスト強調方法に対応する第１種のテキスト強調装置を提供する。図９は、本発明の一実施例による第１種のテキスト強調装置を示す模式図である。 In addition, an embodiment of the present invention provides a first type of text enhancement device corresponding to the above-described first type of text enhancement method. FIG. 9 is a schematic diagram showing a first type of text emphasizing apparatus according to an embodiment of the present invention.

図９に示されたように、第１種のテキスト強調装置は、少なくとも一行のテキストを含む原画像を取得するための取得モジュール９０１を備えることができる。 As shown in FIG. 9, the first type of text enhancement device can include an acquisition module 901 for acquiring an original image including at least one line of text.

さらに、第１種のテキスト強調装置は、図９に示されたように、原画像における任意な原画素点からその近傍集合における各近傍画素点までの直接差異度及び間接差異度に基づいて、各原画素点の原輝度値又は／及び原色値に対して筆画二次元フィルタリングを行って、原画像のフィルタリング後の更新輝度値又は／及び更新色値を取得するためのフィルタモジュール９０２を備えることができる。なお、近傍集合の範囲は、原画素点を中心とし、且つ辺長がwである正方形となり、前記wは原画像の高さより小さい。 Further, as shown in FIG. 9, the first type of text enhancement device can include a filter module 902 for performing stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, to obtain the filtered updated luminance value and/or updated color value of the original image. The neighborhood set is a square centered on the original pixel point with side length w, where w is smaller than the height of the original image.

図１０は、第１種のテキスト強調装置におけるフィルタモジュール９０２を示す模式図である。 FIG. 10 is a schematic diagram showing the filter module 902 in the first type of text enhancement device.

図１０に示されたように、フィルタモジュール９０２は、
原画素点と各近傍画素点の原輝度値又は／及び原色値に対して代数的減算を行って、直接差異度を取得するようにするための第１の取得サブモジュール１００１と、
原画素点からその近傍集合における各近傍画素点までのグラジエントモジュールに基づいて、間接差異度を取得するようにするための第２の取得サブモジュール１００２と、
直接差異度と間接差異度とに基づいて、各近傍画素点の原画素点に対する輝度値又は／及び色値の重み値を算出するための重み算出サブモジュール１００３と、
上述の筆画二次元フィルタ式（１０）により、原画素点の更新輝度値を算出するための更新輝度値算出サブモジュール１００４と、
上述の筆画二次元フィルタ式（１１）により、原画素点の更新色値を算出するための更新色値算出サブモジュール１００５と、を備えることができる。 As shown in FIG. 10, the filter module 902
can include:
a first acquisition submodule 1001 for obtaining the direct difference degree by algebraic subtraction between the original luminance values and/or original color values of the original pixel point and each neighboring pixel point;
a second acquisition submodule 1002 for obtaining the indirect difference degree based on the gradient modulus (magnitude) from the original pixel point to each neighboring pixel point in its neighborhood set;
a weight calculation submodule 1003 for calculating, based on the direct difference degree and the indirect difference degree, the weight of the luminance value and/or color value of each neighboring pixel point with respect to the original pixel point;
an updated luminance value calculation submodule 1004 for calculating the updated luminance value of the original pixel point by the above stroke two-dimensional filter formula (10); and an updated color value calculation submodule 1005 for calculating the updated color value of the original pixel point by the above stroke two-dimensional filter formula (11).

再び図９を参照する。第１種のテキスト強調装置は、フィルタリング後の更新輝度値又は／及び更新色値で、対応する原輝度値又は／及び色値をそれぞれ置換して、原画像に対応するテキスト強調画像を生成するための置換モジュール９０３を備えることができる。 Refer to FIG. 9 again. The first type of text enhancement device generates a text enhanced image corresponding to the original image by replacing the corresponding original luminance value or / and color value with the updated luminance value or / and updated color value after filtering, respectively. A replacement module 903 can be provided.

本発明の実施例による上述のテキスト強調装置によれば、フィルタリング後の更新輝度値又は／及び更新色値を取得した後に、更新輝度値又は／及び更新色値で、原輝度値又は／及び原色値をそれぞれ置換することにより、原画像における画素点中のテキスト筆画が強調され、さらに筆画内部の画素の一致性が強調され、且つテキストと背景の差異度が深まれ、これにより、後続のテキスト抽出のためによりよいテキスト強調画像を提供し、後続のテキスト抽出の正確性及び精度を向上することもできる。 According to the above text enhancement device of the embodiment of the present invention, after the filtered updated luminance value and/or updated color value is obtained, the original luminance value and/or original color value is replaced with it. This emphasizes the text strokes at the pixel points of the original image, strengthens the consistency of the pixels inside each stroke, and increases the difference between the text and the background. The device thereby provides a better text-enhanced image for subsequent text extraction and can improve its accuracy and precision.

また、本発明の一実施例は、上述の第２種のテキスト強調方法に対応する第２種のテキスト強調装置を更に提供する。図１１は、本発明の一実施例による第２種のテキスト強調装置を示す模式図である。 In addition, an embodiment of the present invention further provides a second type of text enhancement device corresponding to the above-described second type of text enhancement method. FIG. 11 is a schematic diagram showing a second type of text emphasizing apparatus according to an embodiment of the present invention.

図１１に示されたように、第２種のテキスト強調装置は、少なくとも一行のテキストを含む原画像を取得するための取得モジュール９０１と、原画像におけるテキストの筆画極性を推定するための筆画極性推定モジュール１１０１とを備えることができる。ここで、極性は、筆画領域内部に位置する画素点と、筆画領域外部に位置する画素点との間の輝度値又は／及び色値の大きさの関係を示す。 As shown in FIG. 11, the second type of text enhancement device includes an acquisition module 901 for acquiring an original image including at least one line of text, and a stroke polarity for estimating the stroke polarity of the text in the original image. An estimation module 1101. Here, the polarity indicates the relationship between the magnitude of the luminance value and / or the color value between the pixel point located inside the stroke area and the pixel point located outside the stroke area.

筆画極性推定モジュール１１０１は、異なる応用シーンにおいて異なる具体的な配置を行うことができる。図１２は、第２種のテキスト強調装置における筆画極性推定モジュール１１０１を示す模式図である。 The stroke polarity estimation module 1101 can perform different specific arrangements in different application scenes. FIG. 12 is a schematic diagram showing the stroke polarity estimation module 1101 in the second type of text enhancement device.

あるシーンにおいて、極性が筆画領域内部の画素点と、筆画領域外部の画素点との間の輝度値の大きさの関係を示す場合に、図１２に示されたように、筆画極性推定モジュール１１０１は、
水平方向、垂直方向及び二つの対角線方向上において上記式（１２）により、筆画応答強度をそれぞれ算出するための第１の算出サブモジュール１２０１と、
算出した四つの筆画応答強度のうち、最大の筆画応答強度が以下の二つの条件、即ち[f(i)−f(l)]と[f(i)−f(k)]の極性が同じ、且つこの筆画応答強度が予め設定された所定の閾値より大きいことを満たしているか否かを判断するための第１の判断サブモジュール１２０２と、
第１の判断サブモジュール１２０２による判断結果が肯定の場合に、[f(i)−f(l)]と[f(i)−f(k)]の極性に基づいて、テキストの推定筆画極性を特定するための第１の特定サブモジュール１２０３と、
第１の判断サブモジュール１２０２による判断結果が否定の場合に、ある筆画応答強度が上記二つの条件を満たすまで、算出した筆画応答強度を大きさに従って順番に選択し、且つ第１の判断サブモジュールをトリガし（起動し）、又は、上記四つの筆画応答強度がいずれも上記二つの条件を満たさない場合に、画素点iを非筆画画素点とするための第１のトリガサブモジュール１２０４と、を備えることができる。 In one scenario, when the polarity indicates the magnitude relationship of the luminance values between pixel points inside the stroke area and pixel points outside the stroke area, the stroke polarity estimation module 1101 can include, as shown in FIG. 12:
a first calculation submodule 1201 for calculating the stroke response intensity in each of the horizontal direction, the vertical direction, and the two diagonal directions by the above formula (12);
a first determination submodule 1202 for determining whether the largest of the four calculated stroke response intensities satisfies the following two conditions: [f(i)−f(l)] and [f(i)−f(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold;
a first identification submodule 1203 for specifying the estimated stroke polarity of the text based on the polarities of [f(i)−f(l)] and [f(i)−f(k)] when the determination by the first determination submodule 1202 is affirmative; and
a first trigger submodule 1204 for, when the determination by the first determination submodule 1202 is negative, selecting the calculated stroke response intensities in descending order of magnitude and triggering (activating) the first determination submodule 1202 until some stroke response intensity satisfies the two conditions, or, when none of the four stroke response intensities satisfies the two conditions, treating pixel point i as a non-stroke pixel point.

図１３は、第２種のテキスト強調装置における筆画極性推定モジュール１１０１を示す他の模式図である。 FIG. 13 is another schematic diagram showing the stroke polarity estimation module 1101 in the second type of text enhancement device.

他のシーンにおいて、極性が筆画領域内部の画素点と筆画領域外部の画素点の間の輝度値の大きさの関係を示す場合に、図１３に示されたように、筆画極性推定モジュール１１０１は、
水平方向、垂直方向及び二つの対角線方向の何れかにおいて、上記式（１２）により、原画像における各原画素点の筆画応答強度を算出するための第２の算出サブモジュール１３０１と、
筆画応答強度が以下の二つの条件、即ち[f(i)−f(l)]と[f(i)−f(k)]の極性が同じ、且つこの筆画応答強度が予め設定された所定の閾値より大きいことを同時に満たしているか否かを判断するための第２の判断サブモジュール１３０２と、
第２の判断サブモジュール１３０２による判断結果が肯定の場合に、[f(i)−f(l)]と[f(i)−f(k)]の極性に基づいて、原画素点iの初期極性を特定するための第２の特定サブモジュール１３０３と、
四つの方向上の筆画応答強度の算出がすべて完了したか否かを判断するための第３の判断サブモジュール１３０４と、
第３の判断サブモジュール１３０４による判断結果が肯定の場合に、四つの方向上の最大の筆画応答強度の対応する初期極性をテキストの推定筆画極性として特定するための第３の特定サブモジュール１３０５と、
第３の判断サブモジュール１３０４による判断結果が否定の場合に、第２の算出サブモジュールをトリガ（起動）するための第２のトリガサブモジュール１３０６と、を備えることができる。 In another scenario, when the polarity indicates the magnitude relationship of the luminance values between pixel points inside the stroke area and pixel points outside the stroke area, the stroke polarity estimation module 1101 can include, as shown in FIG. 13:
a second calculation submodule 1301 for calculating, by the above formula (12), the stroke response intensity of each original pixel point in the original image in any one of the horizontal direction, the vertical direction, and the two diagonal directions;
a second determination submodule 1302 for determining whether the stroke response intensity simultaneously satisfies the following two conditions: [f(i)−f(l)] and [f(i)−f(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold;
a second identification submodule 1303 for specifying the initial polarity of the original pixel point i based on the polarities of [f(i)−f(l)] and [f(i)−f(k)] when the determination by the second determination submodule 1302 is affirmative;
a third determination submodule 1304 for determining whether the stroke response intensities have been calculated in all four directions;
a third identification submodule 1305 for specifying, when the determination by the third determination submodule 1304 is affirmative, the initial polarity corresponding to the maximum stroke response intensity over the four directions as the estimated stroke polarity of the text; and
a second trigger submodule 1306 for triggering (activating) the second calculation submodule 1301 when the determination by the third determination submodule 1304 is negative.

図１４は、第２種のテキスト強調装置における筆画極性推定モジュール１１０１を示す他の模式図である。 FIG. 14 is another schematic diagram showing the stroke polarity estimation module 1101 in the second type of text enhancement device.

他のシーンにおいては、極性が筆画領域内部の画素点と、筆画領域外部の画素点との間の色値の大きさの関係を示す場合に、図１４に示されたように、筆画極性推定モジュール１１０１は、
水平方向、垂直方向及び二つの対角線方向において、上記式（１３）により、筆画応答強度をそれぞれ算出するための第３の算出サブモジュール１４０１と、
算出した筆画応答強度のうち、最大の筆画応答強度が以下の二つの条件、即ちチャネルｎにおいて[f_n(i)−f_n(l)]と[f_n(i)−f_n(k)]の極性が一致し、且つ前記筆画応答強度が予め設定された所定の閾値より大きいことを満たしているか否かを判断するための第４の判断サブモジュール１４０２と、
第４の判断サブモジュール１４０２による判断結果が肯定の場合に、[f_n(i)−f_n(l)]又は[f_n(i)−f_n(k)]の極性に基づいて、テキストの推定筆画極性を特定するための第４の特定サブモジュール１４０３と、
第４の判断サブモジュール１４０２による判断結果が否定の場合に、ある筆画応答強度が上記二つの条件を満たすまで、大きさの関係に従って順番に算出した筆画応答強度を選択し、且つ第４の判断サブモジュール１４０２をトリガし（起動し）、又は、四つの筆画応答強度がいずれも上記二つの条件を満たしていない場合に、画素点iを非筆画画素点とするための第３のトリガサブモジュール１４０４と、を備えることができる。 In another scenario, when the polarity indicates the magnitude relationship of the color values between pixel points inside the stroke area and pixel points outside the stroke area, the stroke polarity estimation module 1101 can include, as shown in FIG. 14:
a third calculation submodule 1401 for calculating the stroke response intensity in each of the horizontal direction, the vertical direction, and the two diagonal directions by the above formula (13);
a fourth determination submodule 1402 for determining whether the largest of the calculated stroke response intensities satisfies the following two conditions: in channel n, [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold;
a fourth identification submodule 1403 for specifying the estimated stroke polarity of the text based on the polarity of [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)] when the determination by the fourth determination submodule 1402 is affirmative; and
a third trigger submodule 1404 for, when the determination by the fourth determination submodule 1402 is negative, selecting the calculated stroke response intensities in descending order of magnitude and triggering (activating) the fourth determination submodule 1402 until some stroke response intensity satisfies the two conditions, or, when none of the four stroke response intensities satisfies the two conditions, treating pixel point i as a non-stroke pixel point.

図１５は、第２種のテキスト強調装置における筆画極性推定モジュール１１０１を示す他の模式図である。 FIG. 15 is another schematic diagram showing the stroke polarity estimation module 1101 in the second type of text enhancement device.

他のシーンにおいて、極性が筆画領域内部の画素点と、筆画領域外部の画素点との間の色値の大きさの関係を示す場合に、図１５に示されたように、筆画極性推定モジュール１１０１は、
水平方向、垂直方向及び二つの対角線方向の何れかにおいて、上記式（１３）により、原画像における各原画素点の筆画応答強度を算出するための第４の算出サブモジュール１５０１と、
筆画応答強度が以下の二つの条件、即ち、チャネルｎにおいて[f_n(i)−f_n(l)]と[f_n(i)−f_n(k)]の極性が一致し、且つこの筆画応答強度が予め設定された所定の閾値より大きいことを同時に満たしているか否かを判断するための第５の判断サブモジュール１５０２と、
第５の判断サブモジュール１５０２による判断結果が肯定の場合に、[f_n(i)−f_n(l)]又は[f_n(i)−f_n(k)]の極性に基づいて、原画素点iの初期極性を特定するための第５の特定サブモジュール１５０３と、
四つの方向上の筆画応答強度の算出がすべて完了したか否かを判断するための第６の判断サブモジュール１５０４と、
第６の判断サブモジュール１５０４による判断結果が肯定の場合に、四つの方向上の最大の筆画応答強度の対応する初期極性をテキストの推定筆画極性として特定するための第６の特定サブモジュール１５０５と、
第６の判断サブモジュール１５０４による判断結果が否定の場合に、第４の算出サブモジュール１５０１をトリガ（起動）するための第４のトリガサブモジュール１５０６と、を備えることができる。 In another scenario, when the polarity indicates the magnitude relationship of the color values between pixel points inside the stroke area and pixel points outside the stroke area, the stroke polarity estimation module 1101 can include, as shown in FIG. 15:
a fourth calculation submodule 1501 for calculating, by the above formula (13), the stroke response intensity of each original pixel point in the original image in any one of the horizontal direction, the vertical direction, and the two diagonal directions;
a fifth determination submodule 1502 for determining whether the stroke response intensity simultaneously satisfies the following two conditions: in channel n, [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity, and the stroke response intensity is greater than a preset threshold;
a fifth identification submodule 1503 for specifying the initial polarity of the original pixel point i based on the polarity of [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)] when the determination by the fifth determination submodule 1502 is affirmative;
a sixth determination submodule 1504 for determining whether the stroke response intensities have been calculated in all four directions;
a sixth identification submodule 1505 for specifying, when the determination by the sixth determination submodule 1504 is affirmative, the initial polarity corresponding to the maximum stroke response intensity over the four directions as the estimated stroke polarity of the text; and
a fourth trigger submodule 1506 for triggering (activating) the fourth calculation submodule 1501 when the determination by the sixth determination submodule 1504 is negative.

再び図１１を参照する。第２種のテキスト強調装置は、さらに、原画像における任意な原画素点からその近傍集合における各近傍画素点までの直接差異度と間接差異度とに基づいて、各原画素点の原輝度値又は／及び原色値に対して筆画二次元フィルタリングを行って、原画像のフィルタリング後の更新輝度値又は／及び更新色値を取得するためのフィルタモジュール９０２を含むことができる。近傍集合の範囲は、原画素点を中心とし、且つ辺長がwである正方形となり、前記wは原画像の高さより小さい。 Refer to FIG. 11 again. The second type of text enhancement device can further include a filter module 902 for performing stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, to obtain the filtered updated luminance value and/or updated color value of the original image. The neighborhood set is a square centered on the original pixel point with side length w, where w is smaller than the height of the original image.

また、図１１に示すように、第２種のテキスト強調装置は、さらに、フィルタリング後の更新輝度値又は／及び更新色値と筆画極性とが合わせるか否かを判断し、肯定の場合に、置換モジュール９０３をトリガ（起動）するための判断モジュール１１０２を備えることができる。 Further, as shown in FIG. 11, the second type of text enhancement device further determines whether or not the updated luminance value after filtering and / or the updated color value and the stroke polarity are matched. A determination module 1102 for triggering (activating) the replacement module 903 can be provided.

図１６は、第２種のテキスト強調装置における判断モジュール１１０２を示す模式図である。 FIG. 16 is a schematic diagram showing the determination module 1102 in the second type of text enhancement device.

図１６に示されたように、実際の応用において、判断モジュール１１０２は、
フィルタリング後の更新輝度値又は／及び更新色値と、原輝度値又は／及び原色値との第１の大きさの関係を取得するための第３の取得サブモジュール１６０１と、
第１の大きさの関係と、筆画極性が示す第２の大きさの関係とが合わせるか否かを判断するための第７の判断サブモジュール１６０２とを備えることができる。 As shown in FIG. 16, in an actual application, the decision module 1102
can include:
a third acquisition submodule 1601 for acquiring the first magnitude relationship between the filtered updated luminance value and/or updated color value and the original luminance value and/or original color value; and a seventh determination submodule 1602 for determining whether the first magnitude relationship matches the second magnitude relationship indicated by the stroke polarity.

なお、第１の大きさの関係及び第２の大きさの関係については、上述の第２種のテキスト強調方法におけるステップＳ３０４のフローチャートを示す図８に関連する記載に既に説明したため、ここで省略する。 The first magnitude relationship and the second magnitude relationship have already been explained in the description of FIG. 8, which shows the flowchart of step S304 in the above second type of text enhancement method, and are therefore not described again here.

Refer to FIG. 11 again. The second type of text enhancement device may further include a replacement module 903 that replaces the corresponding original luminance values and/or original color values with the filtered updated luminance values and/or updated color values, respectively, to generate a text-enhanced image corresponding to the original image.

With the above device according to the embodiment of the present invention, the filtered updated color value or updated luminance value can additionally be verified by means of stroke polarity estimation. The corresponding original luminance value and/or original color value is replaced only when the filtered updated luminance value and/or updated color value is consistent with the stroke polarity, which makes the resulting text-enhanced image more effective and more accurate.
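The verification described above can be sketched as follows. This is only an illustration under stated assumptions: polarity is encoded per pixel as +1 (stroke brighter than its surroundings), -1 (stroke darker), or 0 (non-stroke), and an updated value is taken to be "consistent" when the sign of its change from the original equals that polarity. The actual matching rule is the one given with FIG. 8, which is not reproduced here, and the name `replace_if_consistent` is hypothetical.

```python
def sign(x):
    """Return -1, 0, or +1 according to the sign of x."""
    return (x > 0) - (x < 0)


def replace_if_consistent(original, updated, polarity):
    """Keep an updated value only where it agrees with the stroke polarity.

    original, updated: flat lists of luminance values of equal length.
    polarity: flat list of +1, -1, or 0 (assumed encoding, see lead-in).
    """
    result = []
    for orig_v, new_v, pol in zip(original, updated, polarity):
        first_relation = sign(new_v - orig_v)  # updated vs. original value
        if pol != 0 and first_relation == pol:
            result.append(new_v)   # consistent: replace the original value
        else:
            result.append(orig_v)  # inconsistent or non-stroke: keep it
    return result
```

With this rule, a filter output that darkens a pixel in bright-on-dark text is discarded and the original value is kept.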

FIG. 17 is a flowchart showing a text extraction method according to an embodiment of the present invention. As shown in FIG. 17, an embodiment of the present invention further provides a text extraction method that is performed after text enhancement. The text extraction method includes the following steps and may be executed, for example, by a computer.

S1701: Acquire an original image including at least one line of text.

S1702: Perform two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, to obtain the filtered updated luminance values and/or updated color values of the original image. The neighborhood set is a square centered on the original pixel point with side length w, where w is smaller than the height of the original image.

S1703: Replace the corresponding original luminance values and/or original color values with the filtered updated luminance values and/or updated color values, respectively, to generate a text-enhanced image corresponding to the original image.

S1704: Extract the text in the text-enhanced image.

With the above text extraction method, text extraction is performed on the text-enhanced image, so the extracted text is more accurate. In addition, because the text has already been enhanced at the time of extraction, the complexity of text extraction is reduced and its efficiency is improved.
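The data flow of steps S1701 to S1704 can be sketched as below. The names `extract_text`, `stroke_filter`, and `recognizer` are hypothetical: the source does not name a concrete filter or extraction routine, so both are passed in as callables, and the image is assumed to be a two-dimensional list of luminance values.

```python
def extract_text(original, stroke_filter, recognizer):
    """Run the four steps on a 2-D list of luminance values.

    stroke_filter and recognizer are caller-supplied stand-ins (assumptions),
    not functions defined by the source.
    """
    # S1701: the caller supplies the original image.
    # S1702: filtering yields one updated value per original pixel.
    updated = stroke_filter(original)
    # S1703: replace each original value with its updated value.
    enhanced = [
        [updated[r][c] for c in range(len(original[r]))]
        for r in range(len(original))
    ]
    # S1704: extract text from the enhanced image.
    return recognizer(enhanced)
```

Keeping the filter and the extractor as parameters mirrors the module split of the device (filter module, replacement module, extraction module) without committing to any particular implementation.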

FIG. 18 is a schematic diagram showing a text extraction device according to an embodiment of the present invention. As shown in FIG. 18, an embodiment of the present invention further provides a text extraction device corresponding to the text extraction method described above. The text extraction device includes:
an acquisition module 901 for acquiring an original image including at least one line of text;
a filter module 902 for performing two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, to obtain the filtered updated luminance values and/or updated color values of the original image, the neighborhood set being a square centered on the original pixel point with side length w, where w is smaller than the height of the original image;
a replacement module 903 for replacing the corresponding original luminance values and/or original color values with the filtered updated luminance values and/or updated color values, respectively, to generate a text-enhanced image corresponding to the original image; and
an extraction module 1801 for extracting the text in the text-enhanced image.

With the above text extraction device, text extraction is performed on the text-enhanced image, so the extracted text is more accurate. In addition, because the text has already been enhanced at the time of extraction, the complexity of text extraction is reduced and its efficiency is improved.

It should also be noted that the series of processes or devices described above may be implemented by software and/or firmware. In that case, the programs constituting the software and/or firmware are installed from a storage medium or a network onto a computer having a dedicated hardware structure, for example the general-purpose computer 1900 shown in FIG. 19, and the computer can then execute various functions.

FIG. 19 is a block diagram showing a schematic structure of a computer serving as an information processing apparatus according to an embodiment of the present invention. In FIG. 19, a central processing unit (CPU) 1901 executes various processes based on programs stored in a read-only memory (ROM) 1902 or programs loaded from a storage unit 1908 into a random access memory (RAM) 1903. The RAM 1903 also stores, as needed, the data that the CPU 1901 requires when executing the various processes.

The CPU 1901, the ROM 1902, and the RAM 1903 are connected to one another via a bus 1904. An input/output interface 1905 is also connected to the bus 1904.

The following are connected to the input/output interface 1905: an input unit 1906 including a keyboard, a mouse, and the like; an output unit 1907 including a display such as a cathode ray tube (CRT) or a liquid crystal display (LCD), a speaker, and the like; a storage unit 1908 including a hard disk and the like; and a communication unit 1909 including a network interface card such as a LAN card, a modem, and the like. The communication unit 1909 performs communication processing via a network such as the Internet.

A drive 1910 is also connected to the input/output interface 1905 as needed. A removable medium 1911, such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, is mounted on the drive 1910 as needed, and a computer program read from it is installed in the storage unit 1908 as needed.

When the series of processes described above is implemented by software, the programs constituting the software are installed from a network (for example, the Internet) or from a storage medium (for example, the removable medium 1911).

Those skilled in the art should understand that such a storage medium is not limited to the removable medium 1911 shown in FIG. 19, which stores the program and is distributed separately from the device in order to provide the program to the user. Examples of the removable medium 1911 include a magnetic disk (including a floppy disk (registered trademark)), an optical disk (including a compact disc read-only memory (CD-ROM) and a digital versatile disc (DVD)), a magneto-optical disk (including a MiniDisc (MD) (registered trademark)), and a semiconductor memory. Alternatively, the storage medium may be the ROM 1902, a hard disk included in the storage unit 1908, or the like, which stores the program and is distributed to the user together with the device that contains it.

It should further be noted that the steps of the series of processes described above can be executed in the chronological order described, but need not be. Some steps may be executed in parallel or independently of one another.

Preferred embodiments of the present invention have been described above. However, the present invention is not limited to these embodiments, and any modification that does not depart from the spirit of the present invention falls within its technical scope.

The following supplementary notes are further disclosed with respect to the embodiments including the above examples.

(Supplementary Note 1) A method by which a computer enhances text in an image, the computer executing: an original image acquisition step of acquiring an original image including at least one line of text; an updated value acquisition step of performing two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the filtered updated luminance values and/or updated color values of the original image; and an enhanced image generation step of replacing the corresponding original luminance values and/or original color values with the filtered updated luminance values and/or updated color values, respectively, to generate a text-enhanced image corresponding to the original image, wherein the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.

(Supplementary Note 2) The method according to Supplementary Note 1, wherein the updated value acquisition step includes: a step of performing algebraic subtraction on the original luminance values and/or original color values of the original pixel point and each neighboring pixel point to acquire the direct difference degree; a step of acquiring the indirect difference degree based on the gradient modulus from the original pixel point to each neighboring pixel point in its neighborhood set; a step of calculating, based on the direct difference degree and the indirect difference degree, a weight value of the luminance value and/or color value of each neighboring pixel point with respect to the original pixel point; and a step of calculating the updated luminance value of the original pixel point using the following two-dimensional stroke filter equation:

where N(i) denotes the neighborhood set of pixel point i, w(i,j) denotes the weight value of the luminance value of neighboring pixel point j in the neighborhood set with respect to pixel point i, and f(j) is the original luminance value of neighboring pixel point j; and/or a step of calculating the updated color value of the original pixel point using the following two-dimensional stroke filter equation:

where w_n(i,j) denotes the weight value, in channel n, of the color value of neighboring pixel point j with respect to original pixel point i, and f_n(j) is the original color value of neighboring pixel point j in channel n.
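The filter equations referred to in Supplementary Note 2 are not reproduced in this text, so the following is only a sketch of one plausible form: a normalized weighted average over the neighborhood set, with a weight that decays as the direct difference and the indirect difference grow. The Gaussian form of the weight, the normalization by the sum of weights, the parameters `sigma_d` and `sigma_g`, and the function name `updated_value` are all assumptions introduced for illustration.

```python
import math


def updated_value(center, neighbor_values, indirect_diffs,
                  sigma_d=20.0, sigma_g=20.0):
    """Weighted average of neighbor values around one pixel.

    center: original value f(i) of the pixel being updated.
    neighbor_values: values f(j) for j in the neighborhood set N(i).
    indirect_diffs: one gradient-based difference per neighbor, supplied
        by the caller (how it is accumulated is not specified here).
    The Gaussian weight and the normalization are assumptions.
    """
    numerator = 0.0
    denominator = 0.0
    for f_j, g_ij in zip(neighbor_values, indirect_diffs):
        direct = f_j - center  # direct difference by algebraic subtraction
        weight = (math.exp(-(direct / sigma_d) ** 2)
                  * math.exp(-(g_ij / sigma_g) ** 2))
        numerator += weight * f_j
        denominator += weight
    # With no usable neighbors, fall back to the original value.
    return numerator / denominator if denominator > 0 else center
```

Under these assumptions, neighbors that differ strongly from the center pixel, or that lie across a strong gradient, contribute almost nothing, so values are smoothed within a stroke but not across its edge. The color case would apply the same computation per channel n.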

(Supplementary Note 3) The method according to Supplementary Note 1, wherein the computer further executes: a polarity estimation step, performed after the original image acquisition step, of estimating the stroke polarity of the text in the original image, the polarity indicating the magnitude relationship of the original luminance values and/or original color values between pixel points located inside a stroke region and pixel points located outside the stroke region; and an updated value determination step, performed after the updated value acquisition step, of determining whether the filtered updated luminance value and/or updated color value is consistent with the stroke polarity and, if so, replacing the original luminance value and/or original color value.

(Supplementary Note 4) The method according to Supplementary Note 3, wherein, when the polarity indicates the magnitude relationship of luminance values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes: a step of calculating a stroke response intensity in each of the horizontal direction, the vertical direction, and the two diagonal directions using the following equation:

where w is one-eighth of the height of the original image and f(i) denotes the luminance value of pixel point i; and a step of determining whether the largest of the four calculated stroke response intensities satisfies the following two conditions, namely that [f(i)−f(l)] and [f(i)−f(k)] have the same polarity and that the largest stroke response intensity is greater than a predetermined threshold, and, if so, specifying the estimated stroke polarity of the text based on the polarity of [f(i)−f(l)] and [f(i)−f(k)]; if not, selecting the calculated stroke response intensities in descending order of magnitude and repeating the determination until one of them satisfies the two conditions; or, when none of the four stroke response intensities satisfies the two conditions, treating pixel point i as a non-stroke pixel point.
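The selection logic of Supplementary Note 4 can be illustrated with the sketch below. The stroke response equation itself is not reproduced in this text, so the code substitutes an assumed response, |f(i)−f(l)| + |f(i)−f(k)| − |f(l)−f(k)|, where l and k are taken to be the pixels `step` positions before and after i along the direction. That formula, the offset `step`, the +1/-1/0 encoding of the result, and the function name `estimate_polarity` are assumptions, not the source's definitions.

```python
# Horizontal, vertical, and the two diagonal directions as (d_row, d_col).
DIRECTIONS = [(0, 1), (1, 0), (1, 1), (1, -1)]


def estimate_polarity(image, row, col, step, threshold):
    """Return +1 (brighter than both sides), -1 (darker), or 0 (non-stroke)."""
    height, width = len(image), len(image[0])
    f_i = image[row][col]
    candidates = []
    for d_row, d_col in DIRECTIONS:
        r_l, c_l = row - d_row * step, col - d_col * step
        r_k, c_k = row + d_row * step, col + d_col * step
        inside = (0 <= r_l < height and 0 <= c_l < width
                  and 0 <= r_k < height and 0 <= c_k < width)
        if not inside:
            continue  # assumption: skip directions that leave the image
        d_l = f_i - image[r_l][c_l]  # f(i) - f(l)
        d_k = f_i - image[r_k][c_k]  # f(i) - f(k)
        # Assumed response; note that f(l) - f(k) equals d_k - d_l.
        response = abs(d_l) + abs(d_k) - abs(d_k - d_l)
        candidates.append((response, d_l, d_k))
    # Try the responses from largest to smallest until one passes both tests.
    for response, d_l, d_k in sorted(candidates, key=lambda t: t[0], reverse=True):
        if d_l * d_k > 0 and response > threshold:
            return 1 if d_l > 0 else -1
    return 0  # no direction qualified: treat as a non-stroke pixel
```

With this assumed response, a direction scores zero whenever the two differences have opposite signs, so only pixels that are brighter or darker than both sides along some direction can qualify.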

(Supplementary Note 5) The method according to Supplementary Note 3, wherein, when the polarity indicates the magnitude relationship of luminance values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes: a step of calculating, in any one of the horizontal direction, the vertical direction, and the two diagonal directions, the stroke response intensity of each original pixel point in the original image using the following equation:

where w is one-eighth of the height of the original image and f(i) denotes the luminance value of pixel point i; a step of determining whether the stroke response intensity simultaneously satisfies the following two conditions, namely that [f(i)−f(l)] and [f(i)−f(k)] have the same polarity and that the stroke response intensity is greater than a predetermined threshold, and, if so, specifying the initial polarity of original pixel point i based on the polarity of [f(i)−f(l)] or [f(i)−f(k)]; and a step of determining whether the stroke response intensities in all four directions have been calculated and, if so, specifying the initial polarity corresponding to the largest stroke response intensity among the four directions as the estimated stroke polarity of the text, and, if not, repeating the step of calculating the stroke response intensity.

(Supplementary Note 6) The method according to Supplementary Note 3, wherein, when the polarity indicates the magnitude relationship of color values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes: a step of calculating a stroke response intensity in each of the horizontal direction, the vertical direction, and the two diagonal directions using the following equation:

where w is one-eighth of the height of the original image and f_n(i) denotes the color value of pixel point i in channel n; and a step of determining whether the largest of the four calculated stroke response intensities satisfies the following two conditions, namely that [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity in channel n and that the stroke response intensity is greater than a predetermined threshold, and, if so, specifying the estimated stroke polarity of the text based on the polarity of [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)]; if not, selecting the calculated stroke response intensities in descending order of magnitude and repeating the determination until one of them satisfies the two conditions; or, when none of the four stroke response intensities satisfies the two conditions, treating pixel point i as a non-stroke pixel point.

(Supplementary Note 7) The method according to Supplementary Note 3, wherein, when the polarity indicates the magnitude relationship of color values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes: a step of calculating, in any one of the horizontal direction, the vertical direction, and the two diagonal directions, the stroke response intensity of each original pixel point in the original image using the following equation:

where w is one-eighth of the height of the original image and f_n(i) denotes the color value of pixel point i in channel n; a step of determining whether the stroke response intensity simultaneously satisfies the following two conditions, namely that [f_n(i)−f_n(l)] and [f_n(i)−f_n(k)] have the same polarity in channel n and that the stroke response intensity is greater than a predetermined threshold, and, if so, specifying the initial polarity of original pixel point i based on the polarity of [f_n(i)−f_n(l)] or [f_n(i)−f_n(k)]; and a step of determining whether the stroke response intensities in all four directions have been calculated and, if so, specifying the initial polarity corresponding to the largest stroke response intensity among the four directions as the estimated stroke polarity of the text, and, if not, repeating the step of calculating the stroke response intensity.

(Supplementary Note 8) The method according to Supplementary Note 3, wherein the updated value determination step includes: a step of acquiring a first magnitude relationship between the filtered updated luminance value and/or updated color value and the original luminance value and/or original color value; and a step of determining whether the first magnitude relationship is consistent with a second magnitude relationship indicated by the stroke polarity.

(Supplementary Note 9) A device for enhancing text in an image, including: an acquisition module that acquires an original image including at least one line of text; a filter module that performs two-dimensional stroke filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from any original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the filtered updated luminance values and/or updated color values of the original image; and a replacement module that replaces the corresponding original luminance values and/or original color values with the filtered updated luminance values and/or updated color values, respectively, to generate a text-enhanced image corresponding to the original image, wherein the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.

(Supplementary Note 10) The device according to Supplementary Note 9, wherein the filter module includes: a first acquisition submodule that performs algebraic subtraction on the original luminance values and/or original color values of the original pixel point and each neighboring pixel point to acquire the direct difference degree; a second acquisition submodule that acquires the indirect difference degree based on the gradient modulus from the original pixel point to each neighboring pixel point in its neighborhood set; a weight calculation submodule that calculates, based on the direct difference degree and the indirect difference degree, a weight value of the luminance value and/or color value of each neighboring pixel point with respect to the original pixel point; and an updated luminance value calculation submodule that calculates the updated luminance value of the original pixel point using the following two-dimensional stroke filter equation:

and/or an updated color value calculation submodule that calculates the updated color value of the original pixel point using the following two-dimensional stroke filter equation:

where N(i) denotes the neighborhood set of pixel point i, w(i,j) denotes the weight value of the luminance value of neighboring pixel point j with respect to original pixel point i, f(j) is the luminance value of neighboring pixel point j, w_n(i,j) denotes the weight value, in channel n, of the color value of neighboring pixel point j with respect to original pixel point i, and f_n(j) is the color value, in channel n, of pixel point j in the neighborhood set.

(Supplementary Note 11) The device according to Supplementary Note 9, further including: a stroke polarity estimation module that estimates the stroke polarity of the text in the original image; and a determination module that determines whether the filtered updated luminance value and/or updated color value is consistent with the stroke polarity and, if so, triggers the replacement module, wherein the polarity indicates the magnitude relationship of the original luminance values and/or original color values between pixel points located inside a stroke region and pixel points located outside the stroke region.

(Supplementary Note 12) The device according to Supplementary Note 11, wherein, when the polarity indicates the magnitude relationship of luminance values between pixel points inside the stroke region and pixel points outside the stroke region, the stroke polarity estimation module includes: a first calculation submodule that calculates a stroke response intensity in each of the horizontal direction, the vertical direction, and the two diagonal directions using the following equation:

where w is one-eighth of the height of the original image and f(i) denotes the luminance value of pixel point i; a first determination submodule that determines whether the largest of the four calculated stroke response intensities satisfies the following two conditions, namely that [f(i)−f(l)] and [f(i)−f(k)] have the same polarity and that the stroke response intensity is greater than a predetermined threshold; a first specifying submodule that, when the result of the first determination submodule is affirmative, specifies the estimated stroke polarity of the text based on the polarity of [f(i)−f(l)] and [f(i)−f(k)]; and a first trigger submodule that, when the result of the first determination submodule is negative, selects the calculated stroke response intensities in descending order of magnitude and triggers the first determination submodule until one of them satisfies the two conditions, or, when none of the four stroke response intensities satisfies the two conditions, treats pixel point i as a non-stroke pixel point.

(Supplementary Note 13) The apparatus according to Supplementary Note 11, wherein, when the polarity indicates the magnitude relationship of luminance values between pixel points inside the stroke region and pixel points outside the stroke region, the stroke polarity estimation module includes a second calculation submodule that calculates the stroke response strength of each original pixel point in the original image in any one of the horizontal direction, the vertical direction, and the two diagonal directions by using the following formula:

Here, w is one-eighth of the height of the original image and f(i) denotes the luminance value of pixel point i. The stroke polarity estimation module further includes: a second determination submodule that determines whether the stroke response strength simultaneously satisfies the following two conditions: [f(i) − f(l)] and [f(i) − f(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold; a second specifying submodule that, when the result of the second determination submodule is affirmative, specifies the initial polarity of the original pixel point i based on the polarity of [f(i) − f(l)] or [f(i) − f(k)]; a third determination submodule that determines whether the calculation of the stroke response strengths in all four directions has been completed; a third specifying submodule that, when the result of the third determination submodule is affirmative, specifies the initial polarity corresponding to the largest stroke response strength among the four directions as the estimated stroke polarity of the text; and a second trigger submodule that, when the result of the third determination submodule is negative, triggers (activates) the second calculation submodule.
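The direction-by-direction variant in this note can be sketched as a loop that evaluates one direction at a time and keeps the initial polarity of the strongest qualifying response. This is a sketch under stated assumptions: `compute_response` is a hypothetical callback standing in for the missing stroke response formula, the direction names are illustrative, and taking the maximum only over directions that satisfy both conditions is one possible reading of the text.

```python
from typing import Callable, Optional, Tuple

# Illustrative names for the horizontal, vertical, and two diagonal directions.
DIRECTIONS = ("horizontal", "vertical", "diagonal_down", "diagonal_up")


def estimate_polarity_per_direction(
    compute_response: Callable[[str], Tuple[float, float, float]],
    threshold: float,
) -> Optional[int]:
    """Loop over the four directions, one calculation per iteration.

    compute_response(direction) is a hypothetical helper returning
    (strength, f(i) - f(l), f(i) - f(k)) for that direction.
    Returns +1 / -1 for the strongest qualifying direction, or None.
    """
    best_strength = float("-inf")
    best_polarity = None
    for direction in DIRECTIONS:  # repeat until all four directions are done
        strength, diff_l, diff_k = compute_response(direction)
        if diff_l * diff_k > 0 and strength > threshold:
            initial_polarity = 1 if diff_l > 0 else -1
            if strength > best_strength:
                best_strength = strength
                best_polarity = initial_polarity
    return best_polarity
```

Compared with sorting all four strengths first, this form only needs to remember the best candidate seen so far.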

(Supplementary Note 14) The apparatus according to Supplementary Note 11, wherein, when the polarity indicates the magnitude relationship of color values between pixel points inside the stroke region and pixel points outside the stroke region, the stroke polarity estimation module includes a third calculation submodule that calculates the stroke response strength in each of the horizontal direction, the vertical direction, and the two diagonal directions by using the following formula:

Here, w is one-eighth of the height of the original image and f_n(i) denotes the color value of pixel point i in channel n. The stroke polarity estimation module further includes: a fourth determination submodule that determines whether the largest of the four calculated stroke response strengths satisfies the following two conditions: in channel n, [f_n(i) − f_n(l)] and [f_n(i) − f_n(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold; a fourth specifying submodule that, when the result of the fourth determination submodule is affirmative, specifies the estimated stroke polarity of the text based on the polarity of [f_n(i) − f_n(l)] or [f_n(i) − f_n(k)]; and a third trigger submodule that, when the result of the fourth determination submodule is negative, selects the calculated stroke response strengths in descending order of magnitude and triggers the fourth determination submodule until some stroke response strength satisfies the two conditions, or, when none of the four stroke response strengths satisfies the two conditions, treats pixel point i as a non-stroke pixel point.

(Supplementary Note 15) The apparatus according to Supplementary Note 11, wherein, when the polarity indicates the magnitude relationship of color values between pixel points inside the stroke region and pixel points outside the stroke region, the stroke polarity estimation module includes a fourth calculation submodule that calculates the stroke response strength of each original pixel point in the original image in any one of the horizontal direction, the vertical direction, and the two diagonal directions by using the following formula:

Here, w is one-eighth of the height of the original image and f_n(i) denotes the color value of pixel point i in channel n. The stroke polarity estimation module further includes: a fifth determination submodule that determines whether the stroke response strength simultaneously satisfies the following two conditions: in channel n, [f_n(i) − f_n(l)] and [f_n(i) − f_n(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold; a fifth specifying submodule that, when the result of the fifth determination submodule is affirmative, specifies the initial polarity of the original pixel point i based on the polarity of [f_n(i) − f_n(l)] or [f_n(i) − f_n(k)]; a sixth determination submodule that determines whether the calculation of the stroke response strengths in all four directions has been completed; a sixth specifying submodule that, when the result of the sixth determination submodule is affirmative, specifies the initial polarity corresponding to the largest stroke response strength among the four directions as the estimated stroke polarity of the text; and a fourth trigger submodule that, when the result of the sixth determination submodule is negative, triggers the fourth calculation submodule.

(Supplementary Note 16) The apparatus according to Supplementary Note 11, wherein the determination module includes: a third acquisition submodule that acquires a first magnitude relationship between the updated luminance value and/or updated color value after filtering and the original luminance value and/or original color value; and a seventh determination submodule that determines whether the first magnitude relationship matches the second magnitude relationship indicated by the stroke polarity.
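The comparison of the two magnitude relationships can be illustrated with a small sketch. It assumes that polarity is encoded as +1 when stroke pixels are brighter than their surroundings and −1 when they are darker, and that an update "matches" when it moves the value in that same direction; both the encoding and the function names are assumptions made for this example.

```python
def matches_polarity(original: float, updated: float, polarity: int) -> bool:
    """Check whether the change from original to updated agrees with the polarity.

    polarity is +1 (stroke brighter than background) or -1 (stroke darker);
    this encoding is an assumption of the sketch. An unchanged value is
    treated as matching, since replacing it has no effect either way.
    """
    return (updated - original) * polarity >= 0


def apply_update(original: float, updated: float, polarity: int) -> float:
    """Keep the filtered value only when it is consistent with the stroke polarity."""
    return updated if matches_polarity(original, updated, polarity) else original
```

With this reading, a filtered value that would darken a pixel of a bright-on-dark text line is rejected and the original value is kept.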

(Supplementary Note 17) A method by which a computer extracts text in an image, wherein the computer executes: a step of acquiring an original image including at least one line of text; a step of performing stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the updated luminance value and/or updated color value of the original image after filtering; a step of replacing the corresponding original luminance value and/or original color value with the filtered updated luminance value and/or updated color value, respectively, to generate a text-enhanced image corresponding to the original image; and a step of extracting the text in the text-enhanced image, wherein the range of the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.

(Supplementary Note 18) An apparatus for extracting text in an image, including: a module that acquires an original image including at least one line of text; a module that performs stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the updated luminance value and/or updated color value of the original image after filtering; a module that replaces the corresponding original luminance value and/or original color value with the filtered updated luminance value and/or updated color value, respectively, to generate a text-enhanced image corresponding to the original image; and a module that extracts the text in the text-enhanced image, wherein the range of the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.

Claims

A method by which a computer emphasizes text in an image,
wherein the computer executes:
an original image acquisition step of acquiring an original image including at least one line of text;
an update value acquisition step of performing stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the updated luminance value and/or updated color value of the original image after filtering; and
an enhanced image generation step of replacing the corresponding original luminance value or original color value with the filtered updated luminance value and/or updated color value, to generate a text-enhanced image corresponding to the original image,
wherein the range of the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.
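The square neighborhood set can be sketched as below. The claim only states that the square is centered on the pixel, has side length w, and that w is smaller than the image height; clipping the square at the image border, and letting an even w span w + 1 pixels so that the pixel stays centered, are assumptions of this sketch, as is the function name.

```python
from typing import List, Tuple


def neighborhood(
    row: int, col: int, w: int, height: int, width: int
) -> List[Tuple[int, int]]:
    """Return the pixel coordinates of the square neighborhood around (row, col).

    The square has half-width w // 2 on each side of the pixel and is clipped
    to the image bounds (border clipping is an assumption, not stated in the claim).
    """
    if not 0 < w < height:
        raise ValueError("w must be positive and smaller than the image height")
    half = w // 2
    rows = range(max(0, row - half), min(height, row + half + 1))
    cols = range(max(0, col - half), min(width, col + half + 1))
    return [(r, c) for r in rows for c in cols]
```

For a 40-pixel-high image, choosing w as one-eighth of the height gives `w = 40 // 8 = 5`, so an interior pixel has a 5 × 5 neighborhood of 25 points.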

The method according to claim 1, wherein the update value acquisition step includes:
a step of performing algebraic subtraction on the original luminance values and/or original color values of the original pixel point and each neighboring pixel point to obtain the direct difference degree;
a step of obtaining the indirect difference degree based on the gradient modulus from the original pixel point to each neighboring pixel point of the neighborhood set;
a step of calculating, based on the direct difference degree and the indirect difference degree, a weight value of the luminance value and/or color value of each neighboring pixel point with respect to the original pixel point; and
a step of calculating the updated luminance value of the original pixel point by using the following stroke two-dimensional filter equation,

where N(i) denotes the neighborhood set of pixel point i, w(i, j) denotes the weight value of the luminance value of neighboring pixel point j with respect to the original pixel point i, and f(j) is the luminance value of neighboring pixel point j, and/or
a step of calculating the updated color value of the original pixel point by using the following stroke two-dimensional filter equation,

where w_n(i, j) denotes the weight value of the color value of neighboring pixel point j with respect to the original pixel point i in channel n, and f_n(j) denotes the color value of neighboring pixel point j in channel n.
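As a rough illustration of how the quantities N(i), w(i, j), and f(j) might combine, the sketch below computes a weighted average of the neighboring values. The filter equation itself is not reproduced in this text, so the normalized weighted sum used here is an assumption, not the claimed equation; the weights are taken as given inputs because the way they are derived from the direct and indirect difference degrees is likewise not shown. Function names are hypothetical.

```python
from typing import Sequence


def direct_difference(f_i: float, f_j: float) -> float:
    """Direct difference degree: algebraic subtraction of the two pixel values."""
    return f_i - f_j


def filtered_value(values: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted average of neighbor values f(j) with weights w(i, j).

    Normalizing by the sum of the weights is an assumption of this sketch.
    values and weights must list the same neighbors in the same order.
    """
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(w * v for w, v in zip(weights, values)) / total
```

For a color image the same function would be applied once per channel n, with f_n(j) and w_n(i, j) in place of f(j) and w(i, j).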

The method according to claim 1, wherein the computer further executes:
a polarity estimation step of estimating, after the original image acquisition step, the stroke polarity of the text in the original image, the polarity indicating the magnitude relationship of the original luminance values and/or original color values between pixel points located inside the stroke region and pixel points located outside the stroke region; and
an update value determination step of determining, after the update value acquisition step, whether the filtered updated luminance value and/or updated color value matches the stroke polarity and, if the result is affirmative, executing the step of replacing the original luminance value and/or original color value.

The method according to claim 3, wherein, when the polarity indicates the magnitude relationship of luminance values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes:
a step of calculating the stroke response strength in each of the width direction, the height direction, and the two diagonal directions of the original image by using the following formula,

where w is one-eighth of the height of the original image and f(i) denotes the luminance value of pixel point i;
a step of determining whether the largest of the four calculated stroke response strengths satisfies the following two conditions: [f(i) − f(l)] and [f(i) − f(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold;
a step of specifying, if the result is affirmative, the estimated stroke polarity of the text based on the polarity of [f(i) − f(l)] and [f(i) − f(k)]; and
a step of, if the result is negative, selecting the four stroke response strengths in descending order of magnitude and executing the determining step until some stroke response strength satisfies the two conditions, or, when none of the four stroke response strengths satisfies the two conditions, treating pixel point i as a non-stroke pixel point.

The method according to claim 3, wherein, when the polarity indicates the magnitude relationship of luminance values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes:
a step of calculating the stroke response strength of each original pixel point in the original image in any one of the width direction, the height direction, and the two diagonal directions of the original image by using the following formula,

where w is one-eighth of the height of the original image and f(i) denotes the luminance value of pixel point i;
a step of determining whether the stroke response strength simultaneously satisfies the following two conditions: [f(i) − f(l)] and [f(i) − f(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold, and, if the result is affirmative, specifying the initial polarity of the original pixel point i based on the polarity of [f(i) − f(l)] or [f(i) − f(k)]; and
a step of determining whether the calculation of the stroke response strengths in all four directions has been completed, and, if the result is affirmative, specifying the initial polarity corresponding to the largest stroke response strength among the four directions as the estimated stroke polarity of the text, or, if the result is negative, repeating the step of calculating the stroke response strength.

The method according to claim 3, wherein, when the polarity indicates the magnitude relationship of color values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes:
a step of calculating the stroke response strength of each original pixel point in the original image in each of the width direction, the height direction, and the two diagonal directions of the original image by using the following formula,

where w is one-eighth of the height of the original image and f_n(i) denotes the color value of pixel point i in channel n;
a step of determining whether the largest of the four calculated stroke response strengths satisfies the following two conditions: in channel n, [f_n(i) − f_n(l)] and [f_n(i) − f_n(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold;
a step of specifying, if the result is affirmative, the estimated stroke polarity of the text based on the polarity of [f_n(i) − f_n(l)] or [f_n(i) − f_n(k)]; and
a step of, if the result is negative, selecting the four stroke response strengths in descending order of magnitude and executing the determining step until some stroke response strength satisfies the two conditions, or, when none of the four stroke response strengths satisfies the two conditions, treating pixel point i as a non-stroke pixel point.

The method according to claim 3, wherein, when the polarity indicates the magnitude relationship of color values between pixel points inside the stroke region and pixel points outside the stroke region, the polarity estimation step includes:
a step of calculating the stroke response strength of each original pixel point in the original image in any one of the width direction, the height direction, and the two diagonal directions of the original image by using the following formula,

where w is one-eighth of the height of the original image and f_n(i) denotes the color value of pixel point i in channel n;
a step of determining whether the stroke response strength simultaneously satisfies the following two conditions: in channel n, [f_n(i) − f_n(l)] and [f_n(i) − f_n(k)] have the same polarity, and the stroke response strength is greater than a predetermined threshold, and, if the result is affirmative, specifying the initial polarity of the original pixel point i based on the polarity of [f_n(i) − f_n(l)] or [f_n(i) − f_n(k)]; and
a step of determining whether the calculation of the stroke response strengths in all four directions has been completed, and, if the result is affirmative, specifying the initial polarity corresponding to the largest stroke response strength among the four directions as the estimated stroke polarity of the text, or, if the result is negative, repeating the step of calculating the stroke response strength.

The method according to claim 3, wherein the update value determination step includes:
a step of acquiring a first magnitude relationship between the updated luminance value and/or updated color value after filtering and the original luminance value and/or original color value; and
a step of determining whether the first magnitude relationship matches the second magnitude relationship indicated by the stroke polarity.

A device for enhancing text in an image, including:
an acquisition module that acquires an original image including at least one line of text;
a filter module that performs stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the updated luminance value and/or updated color value of the original image after filtering; and
a replacement module that replaces the corresponding original luminance value or original color value with the filtered updated luminance value and/or updated color value, respectively, to generate a text-enhanced image corresponding to the original image,
wherein the range of the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.

A method by which a computer extracts text in an image,
wherein the computer executes:
a step of acquiring an original image including at least one line of text;
a step of performing stroke two-dimensional filtering on the original luminance value and/or original color value of each original pixel point, based on the direct difference degree and the indirect difference degree from an arbitrary original pixel point in the original image to each neighboring pixel point in its neighborhood set, to acquire the updated luminance value and/or updated color value of the original image after filtering;
a step of replacing the corresponding original luminance value and/or original color value with the filtered updated luminance value and/or updated color value, respectively, to generate a text-enhanced image corresponding to the original image; and
a step of extracting the text in the text-enhanced image,
wherein the range of the neighborhood set is a square centered on the original pixel point with side length w, and w is smaller than the height of the original image.