CN101030257A - File-image cutting method based on Chinese characteristics - Google Patents

File-image cutting method based on Chinese characteristics

Info

Publication number
CN101030257A
CN101030257A (application CN 200710065408)
Authority
CN
China
Prior art keywords
filtering
transition
pixel
image
row
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200710065408
Other languages
Chinese (zh)
Other versions
CN100428268C (en)
Inventor
黄祥林
杨朝
吕锐
杨占昕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Communication University of China
Original Assignee
Communication University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Communication University of China filed Critical Communication University of China
Priority to CNB2007100654087A priority Critical patent/CN100428268C/en
Publication of CN101030257A publication Critical patent/CN101030257A/en
Application granted granted Critical
Publication of CN100428268C publication Critical patent/CN100428268C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Image Analysis (AREA)

Abstract

A method for segmenting document images based on Chinese character features. A document image is read in and, if it is a color image, converted to grayscale. The image is then recursively layered at the threshold that maximizes the ratio of the maximum between-class distance to the minimum within-class distance. The resulting sub-layer images are sorted, and related sub-layer images are merged; text segmentation is then performed on each merged sub-layer image, and the segmentation results of all sub-layer images are combined into the final result.

Description

Document-image segmentation method based on Chinese character features
Technical field
The present invention is a document-image segmentation method based on Chinese character features. It performs segmentation on color or grayscale scanned document images and belongs to the field of computer digital image processing.
Background art
Document-image segmentation algorithms are widely used in printing, facsimile, OCR (Optical Character Recognition), document-image compression, and other image-processing tasks. They make efficient retrieval and storage of text images in large databases much easier and are a powerful tool for extracting textual data from document images.
Existing document-image segmentation methods can be roughly divided into block-based and layer-based methods. Block-based methods first partition the input image into blocks and then process each sub-image block. Layer-based methods first split the input image into layers according to some criterion and then process each sub-layer image. Layering an image at the maximum of the ratio of the maximum between-class distance to the minimum within-class distance is a common layering method, described as follows:
1) Compute the image histogram and, from it, the ratio J_f(t) of the maximum between-class distance to the minimum within-class distance.
2) Split the image into two layers at the threshold value t_th for which J_f(t) attains its maximum.
Let an image I contain n pixels in total, with gray levels in [0, T-1], and let n_i be the number of pixels with gray value i. A threshold t_th divides I into two sub-layer images A and B, where A contains n_A pixels with gray levels {0, 1, ..., t_th} and B contains n_B pixels with gray levels {t_th+1, t_th+2, ..., T-1}. Then

n_A = \sum_{i=0}^{t_{th}} n_i

n_B = \sum_{i=t_{th}+1}^{T-1} n_i

n = n_A + n_B = \sum_{i=0}^{T-1} n_i

The frequencies h_i^A, h_i^B and h_i^I with which gray level i occurs in the sub-layer images A and B and in the original image I are, respectively,

h_i^A = n_i / n_A,   i = 0, 1, ..., t_{th}

h_i^B = n_i / n_B,   i = t_{th}+1, t_{th}+2, ..., T-1

h_i^I = n_i / n,   i = 0, 1, ..., T-1

The probabilities p_A and p_B of the sub-layer images A and B are

p_A = n_A / n = \sum_{i=0}^{t_{th}} h_i^I

p_B = n_B / n = \sum_{i=t_{th}+1}^{T-1} h_i^I = 1 - p_A

The gray means m_A, m_B and m of the sub-layer images A and B and of the original image I are

m_A = \sum_{i=0}^{t_{th}} i \, h_i^A

m_B = \sum_{i=t_{th}+1}^{T-1} i \, h_i^B

m = \sum_{i=0}^{T-1} i \, h_i^I

Treating the two sub-layer images as two classes, their between-class distance is

s_b^2(t) = p_A (m_A - m)^2 + p_B (m_B - m)^2

and their within-class distance is

s_w^2(t) = p_A \sum_{i=0}^{t_{th}} (i - m_A)^2 h_i^A + p_B \sum_{i=t_{th}+1}^{T-1} (i - m_B)^2 h_i^B = \sum_{i=0}^{t_{th}} (i - m_A)^2 h_i^I + \sum_{i=t_{th}+1}^{T-1} (i - m_B)^2 h_i^I

So, according to the criterion of maximizing the ratio of the maximum between-class distance to the minimum within-class distance, the optimal threshold t_th must satisfy

J_f(t) = s_b^2(t) / s_w^2(t) \big|_{t=t_{th}} \rightarrow \max
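For illustration, the following sketch computes J_f(t) from a gray-level histogram and returns the maximizing threshold. It is not part of the patent text: the Python/NumPy code and the function names jf_ratio and best_threshold are assumptions introduced here.

```python
import numpy as np

def jf_ratio(hist, t):
    """J_f(t) = s_b^2(t) / s_w^2(t) for a split of the histogram at gray level t."""
    levels = np.arange(hist.size)
    n = hist.sum()
    n_a, n_b = hist[:t + 1].sum(), hist[t + 1:].sum()
    if n_a == 0 or n_b == 0:
        return 0.0
    p_a, p_b = n_a / n, n_b / n
    m_a = (levels[:t + 1] * hist[:t + 1]).sum() / n_a        # mean of class A
    m_b = (levels[t + 1:] * hist[t + 1:]).sum() / n_b        # mean of class B
    m = (levels * hist).sum() / n                            # global mean
    s_b2 = p_a * (m_a - m) ** 2 + p_b * (m_b - m) ** 2       # between-class distance
    s_w2 = (((levels[:t + 1] - m_a) ** 2 * hist[:t + 1]).sum()
            + ((levels[t + 1:] - m_b) ** 2 * hist[t + 1:]).sum()) / n  # within-class distance
    return s_b2 / s_w2 if s_w2 > 0 else 0.0

def best_threshold(hist):
    """Return the threshold t_th in [0, T-2] that maximizes J_f(t)."""
    scores = [jf_ratio(hist, t) for t in range(hist.size - 1)]
    return int(np.argmax(scores))
```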
For document images with a complex background (characters overlaid on patterns), there is still no effective segmentation method. Yen-Lin Chen et al. (Yen-Lin Chen, Chung-Cheng Chiu and Bing-Fei Wu, "Complex Document Image Segmentation Using Localized Histogram Analysis with Multi-Layer Matching and Clustering", 2004 IEEE International Conference on Systems, Man and Cybernetics: 3063-3070) proposed a region-based segmentation method. It first partitions the image into uniform blocks and uses the histogram of each block to layer it, then connects the sub-layers using information such as sub-layer edges, joining sub-layers of the same type into one large sub-layer, and finally classifies these sub-layers as text layers to extract the text. This method is computationally complex, and partitioning the image into blocks easily breaks Chinese characters apart.
Summary of the invention
The present invention proposes a segmentation algorithm for complex document images. The method is computationally simple and is unlikely to break Chinese characters apart.
The invention is a layer-based image segmentation method. According to the criterion of maximizing the ratio of the maximum between-class distance to the minimum within-class distance, the input image is layered recursively to obtain a series of layered document images, and the layered images are sorted by the maximum pixel gray value of each sub-layer image. According to the sub-layer merging rules, the sorted sub-layer images are merged to obtain the final set of sub-layer images. Text segmentation is then performed on each merged sub-layer, and the segmentation results of all layers are merged to obtain the final segmented image.
The specific innovations are: the stopping criterion for recursive image layering; the rules for merging related sub-layers; and image segmentation based on Chinese character features. The details are as follows:
1. Stopping criterion for recursive image layering: using the criterion, introduced above, of maximizing the ratio of the maximum between-class distance to the minimum within-class distance, the input image is split into two layers. The invention continues to layer each resulting sub-layer until the recursion stopping criterion is satisfied, and then sorts the resulting series of sub-layers by the maximum pixel gray value of each sub-layer image.
2. Rules for merging related sub-layers: the sorted sub-layer images, taken individually, are not all well suited to text segmentation of the image. The invention applies merging rules to merge related sub-layers among the sorted sub-layer images; the merged sub-layer images are better suited to text segmentation.
3. Image segmentation based on Chinese character features: for each merged sub-layer image, the invention computes information about the connected regions it contains, classifies regions as Chinese character regions or background regions according to their features, and removes the background regions. The segmentation results of the individual sub-layer images are then merged into the final segmentation result.
The technical scheme of the present invention is shown in Fig. 1. In this document-image segmentation method based on Chinese character features, a grayscale or color image in bmp format (or an image in another format converted to bmp) is taken as input, stored on the computer's hard disk or on a removable storage medium, and then processed by the computer. The main procedure is: the computer system receives the input image and processes it with the segmentation program.
The concrete method steps are:
After the document image is read in, it is converted to a grayscale image if it is a color image. The gray-level histogram of the image is then computed and used to layer the grayscale image recursively; the layering results are sorted, related sub-layer images are merged according to the merging criterion, the merged sub-layer images are segmented based on Chinese character features, and the segmentation results of the layered images are merged.
1. The steps of the recursive layering method are as follows:
Suppose the image gray values t lie in the range [a, b], where a and b are integers with 0 ≤ a < 256, 0 ≤ b < 256 and a < b. The first threshold is the value t_th at which J_f(t) attains its maximum, and the image is split into two layers whose gray-value ranges are [a, t_th] and [t_th+1, b]. Next, on the intervals [a, t_th] and [t_th+1, b], the thresholds t_th1 and t_th2 at which J_f(t) attains its maximum are found, and each sub-layer image is layered again. This continues until the following termination condition is satisfied:
Let the gray values of the image to be layered vary over the interval [t_1, t_2] (t_1 < t_2), and let n_t, m_t and δ_t be the pixel count, gray mean and variance of this interval. When δ_t < c × m_t (0.01 < c < 0.3) or n_t > d × n (0.01 < d < 0.5), layering of this image stops. Here, n is the total pixel count of the original document image, i is a pixel gray level, and h_i^t is the frequency with which i occurs in the image to be layered.
m_t = \sum_{i=t_1}^{t_2} i \, h_i^t

δ_t^2 = \sum_{i=t_1}^{t_2} n_i (i - m_t)^2

n_t = \sum_{i=t_1}^{t_2} n_i
After the recursive layering is complete, the sub-layer images are sorted in increasing (or decreasing) order of their maximum pixel gray values. After sorting, the gray-value ranges of the sub-layer images are adjacent but do not overlap.
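A minimal sketch of the recursive layering driver, under the same assumptions as the previous sketch (Python/NumPy, hypothetical names, reusing best_threshold from above). Note that this description states the second stopping condition as n_t > d × n while claim 1 states it as n_t < d × n; the sketch follows the description and uses the embodiment's constants c = 0.1 and d = 0.2 as defaults.

```python
def recursive_layers(hist, a, b, n_total, c=0.1, d=0.2):
    """Recursively split the gray-value interval [a, b] into sub-layer ranges.

    hist: 256-bin gray-level histogram of the whole image; n_total: total pixel
    count. Returns a list of (low, high) gray-value ranges, one per sub-layer.
    """
    levels = np.arange(a, b + 1)
    counts = hist[a:b + 1]
    n_t = counts.sum()
    if n_t == 0 or a == b:
        return [(a, b)]
    m_t = (levels * counts).sum() / n_t                       # gray mean of the interval
    delta_t = np.sqrt(((levels - m_t) ** 2 * counts).sum())   # from the patent's delta_t^2
    if delta_t < c * m_t or n_t > d * n_total:                # stopping criterion (description form)
        return [(a, b)]
    t_th = a + best_threshold(counts)                         # split where J_f(t) is maximal
    return (recursive_layers(hist, a, t_th, n_total, c, d)
            + recursive_layers(hist, t_th + 1, b, n_total, c, d))
```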
2. The steps of sub-layer merging are as follows:
Merging here means adding two sub-layer images together to obtain a new sub-layer image. The sorted sub-layer images, taken individually, are not all well suited to text segmentation of the image, so related sub-layer images need to be merged. First it is judged whether the current layer needs to be merged and, if so, with which adjacent sub-layer it should be merged; if the current layer has only one adjacent layer, it is merged directly. The new sub-layer image obtained after merging is judged again and merged accordingly, until the merging condition is no longer satisfied. This continues until no sub-layer satisfies the merging condition. Merging may proceed in increasing (or decreasing) order.
Suppose there are n sorted sub-layer images in total, and the i-th sub-layer image s_i has the gray-value range [t_i, t_{i+1}-1], i = 0, 1, ..., n-1. For s_i, let n_p denote the number of valid pixels (a pixel whose gray value lies in [t_i, t_{i+1}-1] is a valid pixel; otherwise it is an invalid pixel); n_ph the total number of hole pixels (if, in the same row, there is exactly one invalid pixel between two valid pixels, those two valid pixels are called hole pixels); n_r^0 the total number of connected regions in the sub-image; n_rs the total number of small connected regions (a small connected region is one containing fewer than N valid pixels, where 0 < N < 50); n_ps the total number of valid pixels contained in all small connected regions; n_rmaxp the total number of valid pixels contained in the largest connected region (the connected region containing the most valid pixels); and n_ρs the total number of valid pixels contained in all connected regions whose valid-pixel density is less than R, where 0 < R < 0.5 (the valid-pixel density of a connected region is the ratio of the number of valid pixels it contains to the area of the minimum rectangle enclosing it). If one of the following four conditions is satisfied, the layer s_i needs to be merged:
(1) if n_ph > a × n_p, merge (a > 0.05);
(2) if n_rs > b × n_r^0 and n_rmaxp < c × n_p, merge (b > 0.6, c > 0.1);
(3) if n_ps > d × n_p, merge (d > 0.3);
(4) if n_ρs > e × n_p, merge (e > 0.3).
The method for deciding which layer to merge into is as follows:
Let the layers immediately before and after the sub-layer s_i to be merged be s_{i-1} and s_{i+1}, with gray-value ranges [t_{i-1}, t_i - 1] and [t_{i+1}, t_{i+2} - 1] and connected-region totals n_r^{-1} and n_r^1, respectively. If s_i is merged with s_{i-1}, the new layer is s_{i-1}', with range [t_{i-1}, t_{i+1} - 1] and n_r connected regions. If s_i is merged with s_{i+1}, the new layer is s_{i+1}', with range [t_i, t_{i+2} - 1] and n_r' connected regions.
The merging steps are:
(1) Compute the ratio r_1 of the connected-region counts of s_{i-1} and s_i, r_1 = min(n_r^{-1}, n_r^0) / max(n_r^{-1}, n_r^0), and the ratio r_2 of the connected-region counts of s_{i+1} and s_i, r_2 = min(n_r^1, n_r^0) / max(n_r^1, n_r^0).
(2) Compute the ratio r_1' of the connected-region counts of s_{i-1}' and s_{i-1}, r_1' = n_r / n_r^{-1}, and the ratio r_2' of the connected-region counts of s_{i+1}' and s_{i+1}, r_2' = n_r' / n_r^1.
(3) If (r_1 + r_1') ≤ (r_2 + r_2'), merge s_i with s_{i-1}; if (r_1 + r_1') > (r_2 + r_2'), merge s_i with s_{i+1}.
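As an illustration of this merging logic, the following sketch (again an assumption of this edit: Python, hypothetical function names, with the embodiment's values as defaults for a, b, c, d, e) tests the four merging conditions and chooses the neighbour to merge with:

```python
def needs_merge(n_p, n_ph, n_r0, n_rs, n_ps, n_rmaxp, n_rho_s,
                a=0.1, b=0.9, c=0.15, d=0.5, e=0.6):
    """True if the current sub-layer satisfies one of the four merging conditions."""
    return (n_ph > a * n_p
            or (n_rs > b * n_r0 and n_rmaxp < c * n_p)
            or n_ps > d * n_p
            or n_rho_s > e * n_p)

def choose_merge_target(n_r_prev, n_r_cur, n_r_next, n_r_with_prev, n_r_with_next):
    """Decide whether s_i merges with the previous or the next sub-layer.

    n_r_prev, n_r_cur, n_r_next: connected-region counts of s_{i-1}, s_i, s_{i+1};
    n_r_with_prev, n_r_with_next: counts after merging s_i with s_{i-1} or s_{i+1}.
    """
    r1 = min(n_r_prev, n_r_cur) / max(n_r_prev, n_r_cur)
    r2 = min(n_r_next, n_r_cur) / max(n_r_next, n_r_cur)
    r1p = n_r_with_prev / n_r_prev
    r2p = n_r_with_next / n_r_next
    return "previous" if (r1 + r1p) <= (r2 + r2p) else "next"
```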
3. Text segmentation of the sub-layer images
The concrete steps are as follows:
For a merged sub-image s_i (with width l_w, height l_h and a total of n_p valid pixels), region growing is performed (i.e., all connected regions formed by valid pixels are found), yielding a series of connected regions. For the i-th connected region, its valid-pixel density ρ_i is:
ρ_i = n_i / (w_i × h_i)
where n_i is the number of valid pixels it contains, and w_i and h_i are the width and height (in pixels) of the minimum rectangle enclosing this connected region.
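The region growing and density computation could be sketched as follows. This is an assumption of this edit (Python with NumPy/SciPy, hypothetical function name region_densities), and the "minimum rectangle" is taken to be the axis-aligned bounding box of each region:

```python
import numpy as np
from scipy import ndimage

def region_densities(valid_mask, connectivity=4):
    """Label connected regions of valid pixels and compute rho_i = n_i / (w_i * h_i).

    valid_mask: boolean array, True where a pixel's gray value lies in the
    sub-layer's range; connectivity: 4 or 8, as the patent allows.
    Returns the label image and a list of (n_i, w_i, h_i, rho_i) per region.
    """
    structure = (np.array([[0, 1, 0], [1, 1, 1], [0, 1, 0]]) if connectivity == 4
                 else np.ones((3, 3), dtype=int))
    labels, _ = ndimage.label(valid_mask, structure=structure)
    stats = []
    for idx, box in enumerate(ndimage.find_objects(labels), start=1):
        region = labels[box] == idx            # pixels of this region inside its bounding box
        n_i = int(region.sum())
        h_i, w_i = region.shape                # height/width of the enclosing rectangle
        stats.append((n_i, w_i, h_i, n_i / (w_i * h_i)))
    return labels, stats
```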
For a single connected region and the minimum rectangle enclosing it, the present invention proposes the following definitions:
1) Transition pixel p_v: within a row (or within a column), every valid pixel adjacent to an invalid pixel is a transition pixel; pixels lying on the rectangle boundary, although adjacent to invalid pixels outside the rectangle boundary, are not transition pixels.
2) Hole pixel p_h: if there is exactly one invalid pixel between two transition pixels in the same row, these two transition pixels are hole pixels.
3) Transition row w_v: a row containing transition pixels.
4) Transition column h_v: a column containing transition pixels.
5) Hole row w_h: a row containing hole pixels.
6) Outer transition pixel p_ov: for a row, the first or the last transition pixel of a transition row, where the pixel to the left of the first transition pixel is an invalid pixel and the pixel to the right of the last transition pixel is an invalid pixel.
7) Inner transition pixel p_iv: for a row, a transition pixel of a transition row other than its outer transition pixels.
8) Outer-transition row w_ov: a row containing outer transition pixels.
9) Double-outer-transition row w_bov: a row containing two outer transition pixels.
On the basis of the above definitions, the following Chinese character features are proposed:
1) Rectangle aspect ratio r_wh = min(w_i, h_i) / max(w_i, h_i): the ratio of the smaller to the larger of the rectangle's width and height;
2) Mean transitions per transition row m_wv = n_wpv / n_wv: for rows, the ratio of the total number of transition pixels to the total number of transition rows;
3) Mean transitions per transition column m_hv = n_hpv / n_hv: for columns, the ratio of the total number of transition pixels to the total number of transition columns;
4) Row transition-pixel density ρ_wpv = n_wpv / n_p: for rows, the ratio of the total number of transition pixels to n_p, the number of valid pixels of this sub-layer image;
5) Column transition-pixel density ρ_hpv = n_hpv / n_p: for columns, the ratio of the total number of transition pixels to n_p, the number of valid pixels of this sub-layer image;
6) Transition-row density ρ_wv = n_wv / n_w: the ratio of the total number of transition rows to the total number of rows of the current sub-layer image;
7) Transition-column density ρ_hv = n_hv / n_h: the ratio of the total number of transition columns to the total number of columns of the current sub-layer image;
8) Hole-row density ρ_wh = n_wh / n_w: the ratio of the number of rows containing hole pixels to the total number of rows of the current sub-layer image;
9) Outer-transition-row density ρ_wov = n_wov / n_wv: the ratio of the number of rows containing (one or two) outer transition pixels to the total number of transition rows;
10) Double-outer-transition-row density ρ_wbov = n_wbov / n_wv: the ratio of the number of rows containing two outer transition pixels to the total number of transition rows.
Here, n_wpv is the number of transition pixels contained in all transition rows (i.e., all transition pixels in the horizontal direction), n_hpv is the number of transition pixels contained in all transition columns (i.e., all transition pixels in the vertical direction), n_w is the total number of rows, n_h is the total number of columns, n_wv is the total number of transition rows, n_hv is the total number of transition columns, n_wh is the number of rows containing hole pixels (the total number of hole rows), n_wov is the number of rows containing outer transition pixels (the total number of outer-transition rows), and n_wbov is the number of rows containing two outer transition pixels (the total number of double-outer-transition rows).
Let n_ph denote the total number of hole pixels of the current sub-layer image, and n_piv the total number of inner transition pixels of the current sub-layer image.
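To make the row-wise definitions concrete, here is a sketch (an assumption of this edit: Python/NumPy, hypothetical name region_features; whether the densities are normalized over the region rectangle or over the whole sub-layer image is also an assumption) that computes a few of the features for one region's valid-pixel mask. Column-wise features would be computed the same way on the transposed mask.

```python
def region_features(mask):
    """Compute some row-wise Chinese-character features for one connected region.

    mask: boolean array over the region's enclosing rectangle (True = valid pixel).
    Returns r_wh, m_wv, rho_wv and rho_wh as defined in the text.
    """
    h_i, w_i = mask.shape
    n_wpv = n_wv = n_wh = 0
    for row in mask:
        # transition pixel: a valid pixel with an invalid horizontal neighbour;
        # padding with True keeps rectangle-border pixels from counting
        padded = np.pad(row, 1, constant_values=True)
        trans = row & (~padded[:-2] | ~padded[2:])
        idx = np.flatnonzero(trans)
        n_wpv += int(trans.sum())
        if idx.size:
            n_wv += 1
        # hole pixels: exactly one invalid pixel between two transition pixels
        if any(j - i == 2 and not row[i + 1] for i, j in zip(idx[:-1], idx[1:])):
            n_wh += 1
    return {
        "r_wh": min(w_i, h_i) / max(w_i, h_i),
        "m_wv": n_wpv / n_wv if n_wv else 0.0,
        "rho_wv": n_wv / h_i,
        "rho_wh": n_wh / h_i,
    }
```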
The segmentation procedure has two main stages: the first is coarse segmentation and the second is fine segmentation.
The coarse-segmentation procedure is: first find all connected regions of the sub-layer image, then filter out all non-text connected regions in the sub-layer image according to the following rules:
1) For all connected regions, filter out any region that satisfies one of the following conditions:
A) if max(w_i, h_i) < a_1, filter out;
B) if max(m_wv, m_hv) > b_11 and ρ_wov < b_12, filter out;
C) if n_ph > c_1, filter out;
D) if n_piv < d_11 and ρ_i < d_12, filter out;
E) if n_hp > e_1 × n_p, filter out;
F) if r_wh < f_1, filter out;
G) if ρ_i < g_1, filter out;
H) if ρ_wh > h_1, filter out;
where 0 < a_1 < 30, b_11 > 5, 0.05 < b_12 < 0.2, c_1 > 20, d_11 > 1, d_12 > 0.5, 0.05 < e_1 < 0.3, 0 < f_1 < 0.3, 0 < g_1 < 0.3, h_1 > 0.2.
2) For connected regions with max(w_i, h_i) > k_2 × max(l_w, l_h), filter out any region that satisfies one of the following conditions:
A) if min(ρ_wv, ρ_hv) < a_2, filter out;
B) if r_wh < b_2, filter out;
C) if ρ_i < c_2, filter out;
where k_2 > 0.6, 0.3 < a_2 < 0.8, 0.4 < b_2 < 0.6, 0.2 < c_2 < 0.5.
3) For connected regions with max(w_i, h_i) < k_3, filter out any region that satisfies one of the following conditions:
A) if max(ρ_wpv, ρ_hpv) < a_3, filter out;
B) if n_p < b_3, filter out;
C) if n_hp > c_3, filter out;
where 10 < k_3 < 30, 0.6 < a_3 < 1, 0 < b_3 < 30, c_3 > 10.
After each sub-layer image has been segmented as above, the segmentation results are merged to obtain the text-segmentation image.
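A sketch of the coarse-segmentation filter (an assumption of this edit: Python, hypothetical name coarse_filter, per-region feature dictionaries with the keys shown in the docstring, and parameter defaults taken from the embodiment described later):

```python
def coarse_filter(regions, l_w, l_h,
                  a1=4, b11=12, b12=0.15, c1=50, d11=2, d12=0.8,
                  e1=0.1, f1=0.05, g1=0.2, h1=0.3,
                  k2=0.8, a2=0.5, b2=0.5, c2=0.4,
                  k3=20, a3=0.8, b3=30, c3=20):
    """Keep only regions that survive the three coarse-filtering rule groups.

    regions: list of dicts holding the per-region features named in the patent
    (w, h, n_p, n_ph, n_piv, n_hp, rho, r_wh, m_wv, m_hv, rho_wov, rho_wh,
    rho_wv, rho_hv, rho_wpv, rho_hpv); l_w, l_h: sub-layer image width/height.
    """
    kept = []
    for r in regions:
        size = max(r["w"], r["h"])
        # rule group 1: applies to all connected regions
        if (size < a1
                or (max(r["m_wv"], r["m_hv"]) > b11 and r["rho_wov"] < b12)
                or r["n_ph"] > c1
                or (r["n_piv"] < d11 and r["rho"] < d12)
                or r["n_hp"] > e1 * r["n_p"]
                or r["r_wh"] < f1
                or r["rho"] < g1
                or r["rho_wh"] > h1):
            continue
        # rule group 2: regions that are large relative to the sub-layer image
        if size > k2 * max(l_w, l_h) and (
                min(r["rho_wv"], r["rho_hv"]) < a2
                or r["r_wh"] < b2
                or r["rho"] < c2):
            continue
        # rule group 3: small regions
        if size < k3 and (
                max(r["rho_wpv"], r["rho_hpv"]) < a3
                or r["n_p"] < b3
                or r["n_hp"] > c3):
            continue
        kept.append(r)
    return kept
```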
The text image obtained by the above segmentation method can be segmented further; the steps of this fine segmentation are:
1) For connected regions with ρ_i ≥ k_4: if r_wh > a_41 and max(ρ_wv, ρ_hv) < a_42, filter out;
where k_4 > 0.95, a_41 > 0.1, 0.3 < a_42 < 0.6.
2) For connected regions with k_51 ≤ ρ_i < k_52, filter out any region that satisfies one of the following conditions:
A) if r_wh ≤ a_51, filter out;
B) if r_wh > b_51 and max(ρ_wv, ρ_hv) < b_52, filter out;
C) if c_51 ≤ r_wh ≤ c_52 and max(ρ_wv, ρ_hv) < c_53, filter out;
D) if d_51 ≤ r_wh ≤ d_52, filter out;
where 0.8 < k_51 < 0.95, 0.95 < k_52 < 1, 0 < a_51 < 0.1, b_51 > 0.5, b_52 > 0.5, 0 < c_51 < 0.1, 0.1 < c_52 < 0.3, c_53 > 0.4, 0.1 < d_51 < 0.3, 0.5 < d_52 < 0.8.
3) For connected regions with k_61 ≤ ρ_i < k_62, filter out any region that satisfies one of the following conditions:
A) if r_wh > a_61 and max(ρ_wpv, ρ_hpv) < a_62 and n_piv < a_63, filter out;
B) if b_61 < r_wh < b_62 and max(ρ_wv, ρ_hv) < b_63, filter out;
C) if r_wh < c_6, filter out;
where 0.7 < k_61 < 0.8, 0.8 < k_62 < 0.95, 0.3 < a_61 < 0.5, 0.4 < a_62 < 0.6, a_63 > 1, 0 < b_61 < 0.2, 0.2 < b_62 < 0.4, 0.6 < b_63 < 1, 0.05 < c_6 < 0.2.
4) For connected regions with k_71 ≤ ρ_i < k_72, filter out any region that satisfies one of the following conditions:
A) if ρ_wbov < a_71 and n_ph > a_72, filter out;
B) if ρ_i > b_71 and r_wh > b_72 and max(ρ_wv, ρ_hv) > b_73 and n_piv < b_74, filter out;
C) if r_wh < c_7, filter out;
where 0.4 < k_71 < 0.6, 0.6 < k_72 < 0.8, 0.1 < a_71 < 0.3, a_72 > 15, 0.5 < b_71 < 0.7, 0.6 < b_72 < 0.8, b_73 > 0.7, b_74 > 1, 0.05 < c_7 < 0.2.
5) For connected regions with k_81 ≤ ρ_i < k_82, filter out any region that satisfies one of the following conditions:
A) if r_wh > a_81 and max(ρ_wv, ρ_hv) < a_82, filter out;
B) if r_wh ≤ b_8, filter out;
where 0.1 < k_81 < 0.3, 0.3 < k_82 < 0.6, a_81 > 0.1, 0.2 < a_82 < 0.5, 0.1 < b_8 < 0.3.
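A corresponding sketch of the fine-segmentation pass (same assumptions: Python, hypothetical name fine_filter, feature dictionaries as above, parameter defaults taken from the embodiment). Each region is tested only against the rule of the density band containing its ρ_i:

```python
def fine_filter(regions, k4=0.99, a41=0.2, a42=0.5,
                k51=0.9, k52=0.99, a51=0.05, b51=0.7, b52=0.6,
                c51=0.05, c52=0.2, c53=0.5, d51=0.2, d52=0.7,
                k61=0.75, k62=0.9, a61=0.35, a62=0.5, a63=3,
                b61=0.1, b62=0.1, b63=0.8, c6=0.1,
                k71=0.5, k72=0.75, a71=0.2, a72=25,
                b71=0.6, b72=0.7, b73=0.8, b74=3, c7=0.1,
                k81=0.2, k82=0.5, a81=0.15, a82=0.3, b8=0.15):
    """Fine-segmentation pass: drop regions matching the rule for their density band.

    regions: per-region feature dicts as in coarse_filter, plus rho_wbov.
    """
    kept = []
    for r in regions:
        rho, rwh = r["rho"], r["r_wh"]
        rv = max(r["rho_wv"], r["rho_hv"])
        rp = max(r["rho_wpv"], r["rho_hpv"])
        drop = False
        if rho >= k4:
            drop = rwh > a41 and rv < a42
        elif k51 <= rho < k52:
            drop = (rwh <= a51 or (rwh > b51 and rv < b52)
                    or (c51 <= rwh <= c52 and rv < c53) or d51 <= rwh <= d52)
        elif k61 <= rho < k62:
            drop = ((rwh > a61 and rp < a62 and r["n_piv"] < a63)
                    or (b61 < rwh < b62 and rv < b63) or rwh < c6)
        elif k71 <= rho < k72:
            drop = ((r["rho_wbov"] < a71 and r["n_ph"] > a72)
                    or (rho > b71 and rwh > b72 and rv > b73 and r["n_piv"] < b74)
                    or rwh < c7)
        elif k81 <= rho < k82:
            drop = (rwh > a81 and rv < a82) or rwh <= b8
        if not drop:
            kept.append(r)
    return kept
```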
The connected regions in the present invention may be either 4-connected or 8-connected regions.
The present invention extracts Chinese characters from complex-background document images both clearly and completely, and is not affected by color variation between the characters.
Brief description of the drawings
Fig. 1: flow chart of the overall system
Fig. 2: original test input image
Fig. 3: recursive-layering sub-images of Fig. 2
Fig. 4: sub-layer images after merging the layered images of Fig. 3
Fig. 5: final segmentation result for Fig. 2
Fig. 6: original test input image
Fig. 7: recursive-layering sub-images of Fig. 6
Fig. 8: sub-layer images after merging the layered images of Fig. 7
Fig. 9: final segmentation result for Fig. 6
Embodiment
The embodiment of the invention is configured according to Fig. 1. The computer used in this embodiment is a Tsinghua Tongfang PC with an Intel(R) Celeron(R) CPU at 3.20 GHz, 256 MB of memory and an 80 GB hard disk; the method is implemented in VC++ 6.0.
The specific embodiment is:
1. Color-to-grayscale conversion scheme:
If the input image is a color image, it is converted with the formula
Y = 0.299 × R + 0.587 × G + 0.114 × B
where Y is the gray value after conversion and R, G, B are the three color components of the color image before conversion (R red, G green, B blue), each with values in the range [0, 255].
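A minimal sketch of this conversion (an assumption of this edit: Python/NumPy, hypothetical name to_grayscale, and an RGB channel order; BMP data is often stored as BGR, in which case the channel indices would be swapped):

```python
import numpy as np

def to_grayscale(rgb):
    """Convert an H x W x 3 color image to 8-bit gray via Y = 0.299 R + 0.587 G + 0.114 B."""
    r = rgb[..., 0].astype(np.float64)
    g = rgb[..., 1].astype(np.float64)
    b = rgb[..., 2].astype(np.float64)
    y = 0.299 * r + 0.587 * g + 0.114 * b
    return np.clip(np.round(y), 0, 255).astype(np.uint8)
```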
2. Recursive layering scheme:
Let the gray values of the image to be layered vary over the interval [t_1, t_2] (t_1 < t_2). When δ_t < c × m_t (c = 0.1) or n_t > d × n (d = 0.2), layering of this image stops; otherwise the recursive layering continues. Here n is the total pixel count of the input document image, and n_t, m_t and δ_t are the pixel count, gray mean and variance of this interval. When all the resulting sub-layers satisfy the above stopping condition, layering stops, and the resulting sub-layers are sorted in increasing order.
3. Sub-layer merging scheme:
Starting from the initial sub-layers arranged in increasing order of their boundary values, each sub-layer is examined in turn, from lowest to highest, to decide whether it needs to be merged. A sub-layer s_i to be merged must satisfy one of the following four conditions:
(1) if n_ph > a × n_p, merge (a = 0.1);
(2) if n_rs > b × n_r^0 and n_rmaxp < c × n_p, merge (b = 0.9, c = 0.15);
(3) if n_ps > d × n_p, merge (d = 0.5);
(4) if n_ρs > e × n_p, merge (e = 0.6);
where N = 20 and R = 0.3.
Compute r_1 = min(n_r^{-1}, n_r^0) / max(n_r^{-1}, n_r^0), r_2 = min(n_r^1, n_r^0) / max(n_r^1, n_r^0), r_1' = n_r / n_r^{-1}, r_2' = n_r' / n_r^1.
If (r_1 + r_1') ≤ (r_2 + r_2'), merge s_i with s_{i-1}; if (r_1 + r_1') > (r_2 + r_2'), merge s_i with s_{i+1}.
4. Sub-layer segmentation scheme:
Each sub-layer image is segmented in turn; the segmentation comprises coarse segmentation and fine segmentation. Coarse segmentation comprises steps 1) to 3) below, and fine segmentation comprises steps 4) to 8). The concrete segmentation steps are:
First find all connected regions of the layer image, then filter out non-text regions according to the following rules:
1) For all connected regions, filter out any region that satisfies one of the following conditions:
A) if max(w_i, h_i) < a_1, filter out;
B) if max(m_wv, m_hv) > b_11 and ρ_wov < b_12, filter out;
C) if n_ph > c_1, filter out;
D) if n_piv < d_11 and ρ_i < d_12, filter out;
E) if n_hp > e_1 × n_p, filter out;
F) if r_wh < f_1, filter out;
G) if ρ_i < g_1, filter out;
H) if ρ_wh > h_1, filter out;
where a_1 = 4, b_11 = 12, b_12 = 0.15, c_1 = 50, d_11 = 2, d_12 = 0.8, e_1 = 0.1, f_1 = 0.05, g_1 = 0.2, h_1 = 0.3.
2) For connected regions with max(w_i, h_i) > k_2 × max(l_w, l_h), filter out any region that satisfies one of the following conditions:
A) if min(ρ_wv, ρ_hv) < a_2, filter out;
B) if r_wh < b_2, filter out;
C) if ρ_i < c_2, filter out;
where k_2 = 0.8, a_2 = 0.5, b_2 = 0.5, c_2 = 0.4.
3) For connected regions with max(w_i, h_i) < k_3, filter out any region that satisfies one of the following conditions:
A) if max(ρ_wpv, ρ_hpv) < a_3, filter out;
B) if n_p < b_3, filter out;
C) if n_hp > c_3, filter out;
where k_3 = 20, a_3 = 0.8, b_3 = 30, c_3 = 20.
4) For connected regions with ρ_i ≥ k_4: if r_wh > a_41 and max(ρ_wv, ρ_hv) < a_42, filter out;
where k_4 = 0.99, a_41 = 0.2, a_42 = 0.5.
5) For connected regions with k_51 ≤ ρ_i < k_52, filter out any region that satisfies one of the following conditions:
A) if r_wh ≤ a_51, filter out;
B) if r_wh > b_51 and max(ρ_wv, ρ_hv) < b_52, filter out;
C) if c_51 ≤ r_wh ≤ c_52 and max(ρ_wv, ρ_hv) < c_53, filter out;
D) if d_51 ≤ r_wh ≤ d_52, filter out;
where k_51 = 0.9, k_52 = 0.99, a_51 = 0.05, b_51 = 0.7, b_52 = 0.6, c_51 = 0.05, c_52 = 0.2, c_53 = 0.5, d_51 = 0.2, d_52 = 0.7.
6) For connected regions with k_61 ≤ ρ_i < k_62, filter out any region that satisfies one of the following conditions:
A) if r_wh > a_61 and max(ρ_wpv, ρ_hpv) < a_62 and n_piv < a_63, filter out;
B) if b_61 < r_wh < b_62 and max(ρ_wv, ρ_hv) < b_63, filter out;
C) if r_wh < c_6, filter out;
where k_61 = 0.75, k_62 = 0.9, a_61 = 0.35, a_62 = 0.5, a_63 = 3, b_61 = 0.1, b_62 = 0.1, b_63 = 0.8, c_6 = 0.1.
7) For connected regions with k_71 ≤ ρ_i < k_72, filter out any region that satisfies one of the following conditions:
A) if ρ_wbov < a_71 and n_ph > a_72, filter out;
B) if ρ_i > b_71 and r_wh > b_72 and max(ρ_wv, ρ_hv) > b_73 and n_piv < b_74, filter out;
C) if r_wh < c_7, filter out;
where k_71 = 0.5, k_72 = 0.75, a_71 = 0.2, a_72 = 25, b_71 = 0.6, b_72 = 0.7, b_73 = 0.8, b_74 = 3, c_7 = 0.1.
8) For connected regions with k_81 ≤ ρ_i < k_82, filter out any region that satisfies one of the following conditions:
A) if r_wh > a_81 and max(ρ_wv, ρ_hv) < a_82, filter out;
B) if r_wh ≤ b_8, filter out;
where k_81 = 0.2, k_82 = 0.5, a_81 = 0.15, a_82 = 0.3, b_8 = 0.15.
After every layer image has been segmented, the results are merged (added together) to obtain the final segmented image.
5. Summary:
The input document image is processed according to the steps above. First, during recursive layering, a document image is divided into a series of sub-images of the same size, which are sorted in increasing order of their maximum pixel gray values. Second, the sorted sub-images are merged according to the merging criterion, yielding several layered images that are well suited to text segmentation. Then these sub-layer images are segmented using the Chinese character features specified by the present invention, and the segmentation results are merged to obtain the final text-segmentation image.
Using the method of this embodiment, the original images shown in Fig. 2 and Fig. 6 were segmented. Fig. 2 is a 24-bit color document image, 224 pixels wide and 129 pixels high. The recursively layered images of Fig. 2, the sub-layer images after merging, and the segmentation result are shown in Figs. 3, 4 and 5, respectively. For clarity of display, the pixel colors in the sub-layer images and in the final segmentation-result image are all changed to black. In the processing of Fig. 2 in this embodiment, all connected regions are 4-connected regions.
The original image shown in Fig. 6 is a 24-bit color document image, 498 pixels wide and 291 pixels high. The recursively layered images of Fig. 6, the sub-layer images after merging, and the segmentation result are shown in Figs. 7, 8 and 9, respectively. For clarity of display, the pixel colors in the sub-layer images and in the final segmentation-result image are all changed to black. In the processing of Fig. 6, all connected regions are 8-connected regions.
The experimental results show that the present invention extracts Chinese characters from complex document images both clearly and completely.

Claims (2)

1. A document-image segmentation method based on Chinese character features, which takes a grayscale or color document image as input, stores the image on a computer hard disk or a removable storage device, and reads it into memory by a program for processing; if the image read in is a color image, it is first converted to a grayscale image; if it is a grayscale image, no conversion is needed; characterized in that the concrete processing steps are as follows:
(1) Recursive layering of the original image:
The image is layered recursively using the criterion of maximizing the ratio of the maximum between-class distance to the minimum within-class distance: once a segmentation threshold has been obtained for an image according to the above distance-ratio criterion, the image is divided by this threshold into two sub-layer images, and each of these two sub-layer images is then layered in the same way, until one of the following two conditions is satisfied:
1) δ_t < c × m_t (0.01 < c < 0.3);
2) n_t < d × n (0.01 < d < 0.5);

m_t = \sum_{i=t_1}^{t_2} i \, h_i^t

δ_t^2 = \sum_{i=t_1}^{t_2} n_i (i - m_t)^2

n_t = \sum_{i=t_1}^{t_2} n_i

where t_1 and t_2 are the lower and upper bounds of the pixel gray values of the image to be layered (t_1 the minimum, t_2 the maximum), n is the total pixel count of the input document image, n_t, m_t and δ_t are respectively the pixel count, gray mean and variance of the image to be layered, i is a pixel gray level, and h_i^t is the frequency with which i occurs in the image to be layered;
After the recursive layering is complete, the sub-layer images are sorted in increasing (or decreasing) order of their maximum pixel gray values;
(2) Merging of sub-layers:
The sorted sub-layer images are merged according to a criterion; the criterion is: if a sub-layer image satisfies one of the following four conditions, it needs to be merged:
1) if n_ph > a × n_p, merge (a > 0.05);
2) if n_rs > b × n_r^0 and n_rmaxp < c × n_p, merge (b > 0.6, c > 0.1);
3) if n_ps > d × n_p, merge (d > 0.3);
4) if n_ρs > e × n_p, merge (e > 0.3);
where, for the current layer, n_p denotes the number of valid pixels, n_ph the total number of hole pixels, and n_r^0 the total number of connected regions in the sub-image; n_rs denotes the total number of small connected regions, a small connected region being one containing fewer than N valid pixels, where 0 < N < 50; n_ps denotes the total number of valid pixels contained in all small connected regions, n_rmaxp the total number of valid pixels contained in the largest connected region, and n_ρs the total number of valid pixels contained in all connected regions whose valid-pixel density is less than R (0 < R < 0.5);
If the current layer is judged to need merging, the method for deciding which layer to merge into is: if (r_1 + r_1') ≤ (r_2 + r_2'), merge s_i with s_{i-1}; if (r_1 + r_1') > (r_2 + r_2'), merge s_i with s_{i+1};
where r_1 = min(n_r^{-1}, n_r^0) / max(n_r^{-1}, n_r^0), r_2 = min(n_r^1, n_r^0) / max(n_r^1, n_r^0), r_1' = n_r / n_r^{-1}, r_2' = n_r' / n_r^1;
Here s_i denotes the current layer, s_{i-1} and s_{i+1} are the layers immediately before and immediately after the current layer, n_r^0, n_r^{-1} and n_r^1 denote the total numbers of connected regions contained in s_i, s_{i-1} and s_{i+1} respectively, n_r denotes the number of connected regions after merging s_i with s_{i-1}, and n_r' the number of connected regions after merging s_i with s_{i+1};
If the current layer to be merged has only one adjacent layer, it is merged directly into that adjacent layer;
(3) Text segmentation of the sub-layer images
For a merged sub-image s_i, with width l_w, height l_h and n_p valid pixels in total, region growing is performed to obtain a series of connected regions. For the i-th connected region, its valid-pixel density ρ_i is:
ρ_i = n_i / (w_i × h_i)
where n_i is the number of valid pixels it contains, and w_i and h_i are the width and height (in pixels) of the minimum rectangle enclosing this connected region.
For a single connected region and the minimum rectangle enclosing it, the present invention proposes the following definitions:
1) Transition pixel p_v: within a row (or within a column), every valid pixel adjacent to an invalid pixel is a transition pixel; pixels lying on the rectangle boundary, although adjacent to invalid pixels outside the rectangle boundary, are not transition pixels;
2) Hole pixel p_h: if there is exactly one invalid pixel between two transition pixels in the same row, these two transition pixels are hole pixels;
3) Transition row w_v: a row containing transition pixels;
4) Transition column h_v: a column containing transition pixels;
5) Hole row w_h: a row containing hole pixels;
6) Outer transition pixel p_ov: for a row, the first or the last transition pixel of a transition row, where the pixel to the left of the first transition pixel is an invalid pixel and the pixel to the right of the last transition pixel is an invalid pixel;
7) Inner transition pixel p_iv: for a row, a transition pixel of a transition row other than its outer transition pixels;
8) Outer-transition row w_ov: a row containing outer transition pixels;
9) Double-outer-transition row w_bov: a row containing two outer transition pixels;
On the basis of the above definitions, the following Chinese character features are proposed:
1) rectangle aspect ratio r_wh = min(w_i, h_i) / max(w_i, h_i);
2) mean transitions per transition row m_wv = n_wpv / n_wv;
3) mean transitions per transition column m_hv = n_hpv / n_hv;
4) row transition-pixel density ρ_wpv = n_wpv / n_p;
5) column transition-pixel density ρ_hpv = n_hpv / n_p;
6) transition-row density ρ_wv = n_wv / n_w;
7) transition-column density ρ_hv = n_hv / n_h;
8) hole-row density ρ_wh = n_wh / n_w;
9) outer-transition-row density ρ_wov = n_wov / n_wv;
10) double-outer-transition-row density ρ_wbov = n_wbov / n_wv;
Here, n_wpv is the number of transition pixels contained in all transition rows (i.e., all transition pixels in the horizontal direction), n_hpv is the number of transition pixels contained in all transition columns (i.e., all transition pixels in the vertical direction), n_w is the total number of rows, n_h is the total number of columns, n_wv is the total number of transition rows, n_hv is the total number of transition columns, n_wh is the number of rows containing hole pixels (the total number of hole rows), n_wov is the number of rows containing outer transition pixels (the total number of outer-transition rows), and n_wbov is the number of rows containing two outer transition pixels (the total number of double-outer-transition rows);
Let n_ph denote the total number of hole pixels of the current sub-layer image, and n_piv the total number of inner transition pixels of the current sub-layer image;
The segmentation procedure for a sub-layer image is: find all connected regions of the sub-layer image, then filter out all non-text connected regions in the sub-layer image according to the following rules:
1) For all connected regions, filter out any region that satisfies one of the following conditions:
A) if max(w_i, h_i) < a_1, filter out;
B) if max(m_wv, m_hv) > b_11 and ρ_wov < b_12, filter out;
C) if n_ph > c_1, filter out;
D) if n_piv < d_11 and ρ_i < d_12, filter out;
E) if n_hp > e_1 × n_p, filter out;
F) if r_wh < f_1, filter out;
G) if ρ_i < g_1, filter out;
H) if ρ_wh > h_1, filter out;
where 0 < a_1 < 30, b_11 > 5, 0.05 < b_12 < 0.2, c_1 > 20, d_11 > 1, d_12 > 0.5, 0.05 < e_1 < 0.3, 0 < f_1 < 0.3, 0 < g_1 < 0.3, h_1 > 0.2;
2) For connected regions with max(w_i, h_i) > k_2 × max(l_w, l_h), filter out any region that satisfies one of the following conditions:
A) if min(ρ_wv, ρ_hv) < a_2, filter out;
B) if r_wh < b_2, filter out;
C) if ρ_i < c_2, filter out;
where k_2 > 0.6, 0.3 < a_2 < 0.8, 0.4 < b_2 < 0.6, 0.2 < c_2 < 0.5;
3) For connected regions with max(w_i, h_i) < k_3, filter out any region that satisfies one of the following conditions:
A) if max(ρ_wpv, ρ_hpv) < a_3, filter out;
B) if n_p < b_3, filter out;
C) if n_hp > c_3, filter out;
where 10 < k_3 < 30, 0.6 < a_3 < 1, 0 < b_3 < 30, c_3 > 10;
After each sub-layer image has been segmented as above, the segmentation results are merged to obtain the text-segmentation image.
2. The document-image segmentation method based on Chinese character features according to claim 1, characterized in that the text segmentation of the sub-layer images further comprises the following filtering conditions:
1) For connected regions with ρ_i ≥ k_4: if r_wh > a_41 and max(ρ_wv, ρ_hv) < a_42, filter out;
where k_4 > 0.95, a_41 > 0.1, 0.3 < a_42 < 0.6;
2) For connected regions with k_51 ≤ ρ_i < k_52, filter out any region that satisfies one of the following conditions:
A) if r_wh ≤ a_51, filter out;
B) if r_wh > b_51 and max(ρ_wv, ρ_hv) < b_52, filter out;
C) if c_51 ≤ r_wh ≤ c_52 and max(ρ_wv, ρ_hv) < c_53, filter out;
D) if d_51 ≤ r_wh ≤ d_52, filter out;
where 0.8 < k_51 < 0.95, 0.95 < k_52 < 1, 0 < a_51 < 0.1, b_51 > 0.5, b_52 > 0.5, 0 < c_51 < 0.1, 0.1 < c_52 < 0.3, c_53 > 0.4, 0.1 < d_51 < 0.3, 0.5 < d_52 < 0.8;
3) For connected regions with k_61 ≤ ρ_i < k_62, filter out any region that satisfies one of the following conditions:
A) if r_wh > a_61 and max(ρ_wpv, ρ_hpv) < a_62 and n_piv < a_63, filter out;
B) if b_61 < r_wh < b_62 and max(ρ_wv, ρ_hv) < b_63, filter out;
C) if r_wh < c_6, filter out;
where 0.7 < k_61 < 0.8, 0.8 < k_62 < 0.95, 0.3 < a_61 < 0.5, 0.4 < a_62 < 0.6, a_63 > 1, 0 < b_61 < 0.2, 0.2 < b_62 < 0.4, 0.6 < b_63 < 1, 0.05 < c_6 < 0.2;
4) For connected regions with k_71 ≤ ρ_i < k_72, filter out any region that satisfies one of the following conditions:
A) if ρ_wbov < a_71 and n_ph > a_72, filter out;
B) if ρ_i > b_71 and r_wh > b_72 and max(ρ_wv, ρ_hv) > b_73 and n_piv < b_74, filter out;
C) if r_wh < c_7, filter out;
where 0.4 < k_71 < 0.6, 0.6 < k_72 < 0.8, 0.1 < a_71 < 0.3, a_72 > 15, 0.5 < b_71 < 0.7, 0.6 < b_72 < 0.8, b_73 > 0.7, b_74 > 1, 0.05 < c_7 < 0.2;
5) For connected regions with k_81 ≤ ρ_i < k_82, filter out any region that satisfies one of the following conditions:
A) if r_wh > a_81 and max(ρ_wv, ρ_hv) < a_82, filter out;
B) if r_wh ≤ b_8, filter out;
where 0.1 < k_81 < 0.3, 0.3 < k_82 < 0.6, a_81 > 0.1, 0.2 < a_82 < 0.5, 0.1 < b_8 < 0.3.
CNB2007100654087A 2007-04-13 2007-04-13 File-image cutting method based on Chinese characteristics Expired - Fee Related CN100428268C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007100654087A CN100428268C (en) 2007-04-13 2007-04-13 File-image cutting method based on Chinese characteristics

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100654087A CN100428268C (en) 2007-04-13 2007-04-13 File-image cutting method based on Chinese characteristics

Publications (2)

Publication Number Publication Date
CN101030257A true CN101030257A (en) 2007-09-05
CN100428268C CN100428268C (en) 2008-10-22

Family

ID=38715585

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100654087A Expired - Fee Related CN100428268C (en) 2007-04-13 2007-04-13 File-image cutting method based on Chinese characteristics

Country Status (1)

Country Link
CN (1) CN100428268C (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102089785A (en) * 2008-07-11 2011-06-08 佳能株式会社 Document managing apparatus, document managing method, and storage medium
CN101520845B (en) * 2008-02-29 2011-11-30 富士通株式会社 Layering method of color document images and device thereof
CN101706970B (en) * 2009-11-25 2011-12-21 广东威创视讯科技股份有限公司 Method and application for layered processing of operation objects
CN102332097A (en) * 2011-10-21 2012-01-25 中国科学院自动化研究所 Method for segmenting complex background text images based on image segmentation
CN104243987A (en) * 2014-09-29 2014-12-24 刘鹏 Self-adaptive sampling rate based image sampling method
CN104268506A (en) * 2014-09-15 2015-01-07 郑州天迈科技股份有限公司 Passenger flow counting detection method based on depth images
CN104484876A (en) * 2014-12-05 2015-04-01 中国海洋大学 Aquatic product parasite ultraviolet fluorescence imaging detection method based on automatic threshold segmentation
CN107770554A (en) * 2017-10-26 2018-03-06 胡明建 A kind of parallel displacement wavelet method is to design method that is image layered and compressing
CN109919146A (en) * 2019-02-02 2019-06-21 上海兑观信息科技技术有限公司 Picture character recognition methods, device and platform
CN111783383A (en) * 2019-04-02 2020-10-16 珠海金山办公软件有限公司 Configuration method and device for visual effect of document
CN114934467A (en) * 2022-07-08 2022-08-23 江苏永达电力金具有限公司 Parking space barrier gate control method, parking space barrier gate system and medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100416597C (en) * 2004-12-23 2008-09-03 佳能株式会社 Method and device for self-adaptive binary state of text, and storage medium
JP4386281B2 (en) * 2005-01-31 2009-12-16 キヤノン株式会社 Image processing method, image processing apparatus, and program
JP4756870B2 (en) * 2005-02-03 2011-08-24 キヤノン株式会社 Document processing apparatus, document processing method, and program

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101520845B (en) * 2008-02-29 2011-11-30 富士通株式会社 Layering method of color document images and device thereof
CN102089785B (en) * 2008-07-11 2014-01-08 佳能株式会社 Document managing apparatus, document managing method, and storage medium
US8650473B2 (en) 2008-07-11 2014-02-11 Canon Kabushiki Kaisha Document managing apparatus, document managing method, and storage medium
CN102089785A (en) * 2008-07-11 2011-06-08 佳能株式会社 Document managing apparatus, document managing method, and storage medium
CN101706970B (en) * 2009-11-25 2011-12-21 广东威创视讯科技股份有限公司 Method and application for layered processing of operation objects
CN102332097A (en) * 2011-10-21 2012-01-25 中国科学院自动化研究所 Method for segmenting complex background text images based on image segmentation
CN102332097B (en) * 2011-10-21 2013-06-26 中国科学院自动化研究所 Method for segmenting complex background text images based on image segmentation
CN104268506A (en) * 2014-09-15 2015-01-07 郑州天迈科技股份有限公司 Passenger flow counting detection method based on depth images
CN104268506B (en) * 2014-09-15 2017-12-15 郑州天迈科技股份有限公司 Passenger flow counting detection method based on depth image
CN104243987B (en) * 2014-09-29 2017-10-03 刘鹏 Image sampling method based on adaptive sample rate
CN104243987A (en) * 2014-09-29 2014-12-24 刘鹏 Self-adaptive sampling rate based image sampling method
CN104484876B (en) * 2014-12-05 2017-07-11 中国海洋大学 Aquatic products parasite Ultraluminescence imaging detection method based on automatic threshold segmentation
CN104484876A (en) * 2014-12-05 2015-04-01 中国海洋大学 Aquatic product parasite ultraviolet fluorescence imaging detection method based on automatic threshold segmentation
CN107770554A (en) * 2017-10-26 2018-03-06 胡明建 A kind of parallel displacement wavelet method is to design method that is image layered and compressing
CN107770554B (en) * 2017-10-26 2020-08-18 胡明建 Design method for layering and compressing image by parallel displacement wavelet method
CN109919146A (en) * 2019-02-02 2019-06-21 上海兑观信息科技技术有限公司 Picture character recognition methods, device and platform
CN111783383A (en) * 2019-04-02 2020-10-16 珠海金山办公软件有限公司 Configuration method and device for visual effect of document
CN111783383B (en) * 2019-04-02 2024-05-07 珠海金山办公软件有限公司 Configuration method and device for visual effect of document
CN114934467A (en) * 2022-07-08 2022-08-23 江苏永达电力金具有限公司 Parking space barrier gate control method, parking space barrier gate system and medium
CN114934467B (en) * 2022-07-08 2024-04-30 江苏永达电力金具有限公司 Parking space barrier control method, parking space barrier system and medium

Also Published As

Publication number Publication date
CN100428268C (en) 2008-10-22

Similar Documents

Publication Publication Date Title
CN101030257A (en) File-image cutting method based on Chinese characteristics
CN1258907C (en) Image processing equipment, image processing method and storage medium of image processing program
CN1311394C (en) Appts. and method for binary image
CN1184796C (en) Image processing method and equipment, image processing system and storage medium
CN1658227A (en) Method and apparatus for detecting text of video
CN1140878C (en) Character identifying/correcting mode
CN1588431A (en) Character extracting method from complecate background color image based on run-length adjacent map
CN1573742A (en) Image retrieving system, image classifying system, image retrieving program, image classifying program, image retrieving method and image classifying method
CN1818927A (en) Fingerprint identifying method and system
CN1452388A (en) Picture compression method and device, and picture coding device and method
CN1696959A (en) Detector for special shooted objects
CN1917578A (en) Data processing apparatus,data processing method and program
CN1684492A (en) Image dictionary creating apparatus, coding apparatus, image dictionary creating method
CN1735907A (en) Image analysis
CN1945599A (en) Image processing device, image processing method, and computer program product
CN1945602A (en) Characteristic selecting method based on artificial nerve network
CN1519757A (en) Image searching device, key word providing method and program of same
CN1794266A (en) Biocharacteristics fusioned identity distinguishing and identification method
CN1940967A (en) Method, apparatus, and program for dividing images
CN1664846A (en) On-line hand-written Chinese characters recognition method based on statistic structural features
CN1251130C (en) Method for identifying multi-font multi-character size print form Tibetan character
CN1973757A (en) Computerized disease sign analysis system based on tongue picture characteristics
CN1041773C (en) Character recognition method and apparatus based on 0-1 pattern representation of histogram of character image
CN1128463A (en) Object-by shape information compression apparatus and method thereof and coding method between motion picture compensation...
CN100346339C (en) Image search program

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20081022

Termination date: 20150413

EXPY Termination of patent right or utility model