CN102521799B - Construction method of structural sparse dictionary for video image recovery enhancement - Google Patents
Construction method of structural sparse dictionary for video image recovery enhancement Download PDFInfo
- Publication number
- CN102521799B CN102521799B CN2011103715055A CN201110371505A CN102521799B CN 102521799 B CN102521799 B CN 102521799B CN 2011103715055 A CN2011103715055 A CN 2011103715055A CN 201110371505 A CN201110371505 A CN 201110371505A CN 102521799 B CN102521799 B CN 102521799B
- Authority
- CN
- China
- Prior art keywords
- image
- dct
- sparse
- cluster
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Abstract
The invention discloses a construction method of a structural sparse dictionary for video image recovery enhancement. The construction method comprises the following steps of: carrying out DCT (Discrete Cosine Transform) on image pieces respectively selected from a natural image library and a fitting image with a clear edge, mapping the image pieces to a radio frequency space, carrying out an initial clustering process based on characteristics of a normal frequency domain of the image piece, then carrying out a secondary flustering process on each cluster based on the high-frequency information characteristics of the cluster, finally extracting first m main element components from the obtained secondary cluster to obtain a sparse sub-dictionary subDi-j of the secondary cluster, combining all obtained sparse sub-dictionaries subDi-j into a final structural sparse dictionary The established two-level structural sparse dictionary library is different from a traditional tediously long low-efficiency linear over-complete dictionary, can be used for quickly and effectively solving sparse expression of input image video signals, sparse coefficient vectors obtained by carrying out synergic and hierarchical sparse modeling on any image video signals are exact and effective, and an anti-noise characteristic with a high degree is provided.
Description
Technical field
The invention belongs to the video image enhancement processing technology field, more specifically say, relate at image/video and strengthen the building method that recovers the structural sparse dictionary of enhancing in processing for video image.
Background technology
Because image/video acquisition system self intrinsic defect or limitation, the Digital Image Data collected from reality scene by camera system is through the result after all image quality degradation effects.That is to say, because of various reasons, there are obvious gap in picture and real scene that the digital picture from gathering is seen at visual quality.Most typical, there is the point spread function (PSF) of camera to bring blurring effect, the resolution limit of camera CMOS or CCD induction chip is brought the down-sampling effect, the reason that the three major types image qualities such as obscures effect after air, the stack of camera system noise are degenerated.
In typical digital television system, be implied with the blocking effect flaw of degree varies through the video council of all kinds of video compression technology codings, and from traditional radio and television simulating signal except its resolution be confined to PAL/NTSC upper, also have the simulating signal decoding to transfer digital signal to, go in the process such as interlacing processing because the noise that error is brought or image quality are degenerated.Here it is we in the goodish reason of the image quality of seeing from televisor today.When playing the HD video film source as blue light disc, pleasing picture effect has just in time illustrated how serious our ordinary video image quality degeneration has.
Because the existence in the non-HD video of the tradition of magnanimity source, broadcast television signal still be take the pal mode standard as main at present, use the low image quality video of various low-cost mobile imaging equipment shootings in continuous increase, these all show to have the great demand of the new video image picture quality enhancement treatment technology of innovation.
The compressed sensing technology thinks under meeting some requirements, and the signal after stained by various degenerations can accurately be rebuild to a certain extent and recover.Specifically, in image processing field, modeling is as follows:
I
LR=UI
HR+w (1)
I
HRUntainted desirable high-definition image signal, I
LRBe viewed low clear picture signal after various degenerations, w is the noise variable of additivity, and the degeneracy operator of U linearity can be fuzzy operator, down-sampling operator, the stained operator of additivity etc.The image enhancement processing task is exactly according to known low clear image I
LRData recover unknown desirable high-definition image I
HRData.In order to obtain the high-definition image I approached of euphorosia
HR, need to integrate more natural image priori, with constraint high-definition image I
HRRejuvenation.The general modeling of this process is as follows:
The priori of this natural image is the sparse property of picture signal in the specific definitions territory namely, that is to say high-definition image I
HRCan be at transform domain D with sparse coefficient vector
Rationally express, wherein the threshold value of ε for setting.And the coefficient vector sparse coefficient vector
Middle most elements is all close to zero.As a rule, transform domain D is the complete sparse dictionary of mistake that a base element forms.Traditional sparse dictionary acquisition methods has two kinds: the one, and predefine transform-based well-known transform domains as various as Fourier basis, wavelet basis etc. form; Another kind is to obtain the complete dictionary of such mistake from a large amount of training data learnings.In a word, the structure of sparse dictionary has determined sparse coefficient vector in formula (3)
Sparse property, determined speed of convergence and the stability of oval protruding optimization problem in formula (2) simultaneously, also just finally affected the performance that high-definition signal high-definition image IHR recovers.
Sparse dictionary and the defect thereof of current prior art
The complete sparse dictionary methods of traditional mistake, implying assumed conditions and be between each sparse dictionary element, be discontinuous mutually independently, thereby also just suppose sparse coefficient vector
In the nonzero coefficient position be equally distributed at random.On the other hand in order to express high-definition image I
HRCompleteness, sparse dictionary, the element number N in transform domain D is with respect to sparse coefficient vector
In nonzero value number M very large, that is to say need to meet N>>condition of M.Therefore when solving the optimization problem of formula (2), solution space size in theory, degree of freedom is:
This has caused, and speed of convergence is slow, calculated amount is large, and final accuracy and stability of separating is weakened.
There is a kind of method of preliminary structural sparse dictionary to obtain sparse dictionary.Be to extract T angle from 0 to π direction is evenly unified specifically, each angle gauge is calculated a PCA base, will have subsequently T the additional DCT base of PCA base to form together this structurized sparse dictionary, as shown in Figure 1.In solution procedure, by each PCA base regard as one independently dictionary carry out the sparse signal recovery, selecting the wherein base of trueness error item and sparse constraint item sum minimum is that this signal recovers the transform domain dictionary calculated.Yet its defect of the building method of this structural sparse dictionary is, at first to calculate initialized oriented PCA base from the image at comprehensive T black and white line angle edge out, use subsequently EM (expectation maximization) algorithm iteration to upgrade the content in the PCA base.
Structurized its advantage of sparse dictionary is significantly, and solution space size (degree of freedom) exists
Basis on greatly reduce.If if the element number of each PCA base is identical, the degree of freedom solved on this structuring dictionary so is down to
Secondly, can be according to the low clear image I of difference
LRSelect different PCA bases, namely sub-dictionary, improved low clear image I like this
LRAnd the coupling adaptability between structurized sparse dictionary, thereby obtain sparse property constraint more accurately.
Sparse dictionary carries out the value that the structuring constraint has two aspects, can obtain more healthy and strong sparse expression on the one hand, and in signal interpretation, the former subitem collection of active dictionary has shown certain signal physical attribute on the other hand.
Yet, use the local sheet of expressing natural image from the PCA base of comprehensive image calculation, himself intrinsic defect is arranged.Because various natural image sheets and regular image depart from and can only carry out modeling by the variance parameter in Gauss model without exception, so just lost greatly the accuracy of sparse expression.
Summary of the invention
The object of the invention is to overcome the deficiencies in the prior art, a kind of building method that recovers the structural sparse dictionary of enhancing for video image is provided, make sparse expression more effective and accurate.
For achieving the above object, the present invention recovers the building method of the structural sparse dictionary of enhancing for video image, it is characterized in that, comprises the following steps:
(1), choosing respectively some levels image from the clearly demarcated fitted figure picture in natural image storehouse and edge generates and is of a size of
Whole image sheet S set, the pixel number that wherein n is image sheet;
(2), the image sheet in the image sheet S set is carried out to dct transform, the DCT coefficient formation DCT coefficient sets S obtained
dct
(3), to DCT coefficient sets S
dctIn image sheet DCT coefficient carry out K
1The one-level clustering processing at individual center, thus put respectively image sheet DCT coefficient under corresponding K
1Individual cluster
(4), for each cluster, it is done to structuring operation again: to cluster S
Dct_i(1≤i≤K
1) in image sheet DCT coefficient extract its high fdrequency component, obtain the cluster that high fdrequency component forms
(5), to cluster
In high fdrequency component carry out K
2_i(1≤i≤K
i) clustering processing at individual center, obtain K thereby divide
2_iIndividual secondary cluster
(6), to each secondary cluster
(1≤i≤K
1, 1≤j≤K
2_i) carry out principal component decomposition, extract its front m major component component, form sparse sub-dictionary subD that should cluster
i_j(1≤i≤K
1, 1≤j≤K
2_i); Whole sparse sub-dictionary subD
i_jForm final structural sparse dictionary.
Goal of the invention of the present invention is achieved in that
The present invention recovers the building method of the structural sparse dictionary of enhancing for video image, carry out dct transform by the image sheet to choosing respectively from the clearly demarcated fitted figure picture in natural image storehouse and edge, be mapped to the frequency domain space, general frequency domain character based on image sheet carries out preliminary clustering processing, subsequently each cluster is further carried out to the secondary clustering processing based on its high-frequency information feature, finally, the secondary cluster obtained is extracted to front m major component component, obtain the sparse sub-dictionary subD of this cluster
i_j, whole sparse sub-dictionary subD
i_jForm final structural sparse dictionary, the sparse dictionary of the two-layer configurationization of setting up like this, the linearity that is different from traditional tediously long poor efficiency is crossed complete dictionary.
The present invention has following beneficial effect:
1, under the condition of the sparse dictionary of secondary hierarchical structuring of the present invention, can solve fast and effectively the sparse expression of input picture vision signal, and avoided traditional tediously long calculating that sparse expression solves of carrying out in a complete dictionary of mistake, its reason is the invention provides the two-stage hierarchical structure, thereby has effectively removed the information redundancy between each former subitem in sparse dictionary.
2,, on the basis of the sparse dictionary of secondary hierarchical structuring of the present invention, the arbitrary image vision signal is worked in coordination with to the resulting sparse coefficient vector of the sparse modeling of level
Be accurate and effective, possess the noiseproof feature of certain degree.
The accompanying drawing explanation
Fig. 1 is that the sparse dictionary of prior art preliminary structure forms structural drawing;
Fig. 2 is the particular flow sheet of the present invention for the building method of the structural sparse dictionary of video image recovery enhancing;
Fig. 3 is the schematic diagram of the sparse dictionary of hierarchical structuring;
Fig. 4 is sparse sub-dictionary configuration schematic diagram.
Embodiment
Below in conjunction with accompanying drawing, the specific embodiment of the present invention is described, so that those skilled in the art understands the present invention better.Requiring particular attention is that, in the following description, when perhaps the detailed description of known function and design can desalinate main contents of the present invention, these are described in here and will be left in the basket.
Fig. 2 is the particular flow sheet of the present invention for the building method of the structural sparse dictionary of video image recovery enhancing.
In this enforcement, as shown in Figure 2, specific implementation process of the present invention is as follows:
(1), the image of some levels is provided in provided natural image storehouse 201 and the clearly demarcated matching image library 202 in edge that provides, the ratio of the two can depend on the circumstances, for example 3: 1.Extract length and width and be of a size of all choose on image
Image sheet, wherein the interval of n is generally [25,100].
In Fig. 2, can be from the outside disclosed high-quality natural image in natural image storehouse 201 is collected, its required image order of magnitude generally 100 with interior just enough.The image that in Fig. 2, the image of matching image library 202 is the pixel value black and white boundary graph line that is 0 or 255, its separatrix angle is 18 the even decile angles of direction from 0 to π.Step 203 is extracted
Image sheet pie graph photo S set.
(2) step 204 signal is carried out pre-service to the image sheet S set, and it is projected to the frequency domain space, thereby provides basis for these image sheet samples are carried out to cluster based on features such as texture, edges.Specifically, the image sheet in the image sheet S set is carried out to dct transform, the DCT coefficient obtained forms DCT coefficient sets S
dct, that is:
s
dct=DCT(s),s∈S,s
dct∈S
dct (4)
In (4), s represents an image sheet, s
dctRepresent the DCT coefficient after its dct transform.It should be noted that dct transform can have multiple replacement method, only need effectively the image sheet S set to be projected in the frequency domain space and to get final product.
As in Fig. 2 in step 205, image sheet is carried out to clustering processing based on the significant characteristic of the frequency domain characters such as texture, edge, be exactly to DCT coefficient sets S specifically
dctIn image sheet DCT coefficient carry out K
1The clustering processing at individual center, thereby by DCT coefficient sets S
dctBe divided into K
1Individual cluster
The clustering processing method here of it should be noted that can have several different methods, a kind of classics and effective method is exactly the K means clustering method, also:
That is to say the DCT coefficient sets S expressed on frequency domain here
dctAll be classified into K after the kmean conversion
1In individual one-level cluster, namely
According to this one-level cluster, expressing epigraph sheet S set in spatial domain has the classification of unique correspondence to express
In Fig. 2, step 205 is done structuring for each cluster to it again and is processed, and need to extract its high fdrequency component to each image sheet DCT coefficient in each one-level cluster, is specially cluster S
Dct_i(1≤i≤K
1) in each DCT coefficient extract its high fdrequency component and be designated as
Also:
Conversion that it should be noted that the extraction high fdrequency component in formula (6) can be by accomplished in many ways, and a kind of is directly in the expression by the DCT coefficient, to extract high fdrequency component.Another kind method is to use classical Laplace operator to extract its high fdrequency component in the spatial domain expression before DCT frequency domain projection conversion, also:
As shown in Figure 2, in step 207, after the high fdrequency component of step 206 is extracted pre-service, to cluster
Carry out K
2-i(1<i<K
1) clustering processing at individual center, carry out K
2_i(1≤i≤K
i) clustering processing at individual center, obtain K thereby divide
2_iIndividual secondary cluster
Also:
In Fig. 2, in step 208 and 209, to each secondary cluster
(1≤i≤K
1, 1≤j≤K
2_i) construct its corresponding sparse sub-dictionary subD
i_j, that is to say each secondary cluster
Carry out principal component decomposition, extract its front m major component component V
1~V
mThereby form sparse sub-dictionary subD that should cluster
i_j.But a kind of implementation method of carrying out principal component decomposition is exactly classical PCA method, also:
If use the PCA method to carry out principal component decomposition, before extracting, m important proper vector sequentially arranged and formed together sparse sub-dictionary subD as the former subitem of sub-dictionary
i_j, as schematically shown in Figure 4.
In step 210, through formula (10), process and obtain the corresponding sparse sub-dictionary subD of each secondary cluster
i_jAfter, their complete or collected works anticipate as shown in Figure 3 and have jointly formed the structural sparse dictionary with secondary level.
That is to say, this sparse dictionary has two-stage retrieval dimension, search index i, and the definition space of j is 1≤j≤K
2_i, 1≤i≤K
1, and each sparse sub-dictionary subD
i_jBy m major component component, as former subitem, sequentially rearranged again.
It should be noted that the building method that the present invention recovers the structural sparse dictionary of enhancing for video image can processed offline complete, finally obtain a structural sparse dictionary with secondary level, for the image/video of online processing input is subsequently prepared.
After obtaining structural sparse dictionary, need on its basis, the arbitrary image vision signal is worked in coordination with the sparse modeling of level, thereby tries to achieve its sparse coefficient vector
.
Although the above is described the illustrative embodiment of the present invention; so that those skilled in the art understand the present invention; but should be clear; the invention is not restricted to the scope of embodiment; to those skilled in the art; as long as various variations appended claim limit and definite the spirit and scope of the present invention in, these variations are apparent, all innovation and creation that utilize the present invention to conceive are all at the row of protection.
Claims (1)
1. a building method that recovers the structural sparse dictionary of enhancing for video image, is characterized in that, comprises the following steps:
(1), choosing respectively some levels image from the clearly demarcated fitted figure picture in natural image storehouse and edge generates and is of a size of the whole image sheet S set of n * n, the pixel number that wherein n is image sheet;
(2), the image sheet in the image sheet S set is carried out to dct transform, the DCT coefficient formation DCT coefficient sets S obtained
dct
(3), to DCT coefficient sets S
dctIn image sheet DCT coefficient carry out K
1The one-level clustering processing at individual center, thus put respectively image sheet DCT coefficient under corresponding K
1Individual cluster
(4), for each cluster, it is done to structuring operation again: to cluster S
Dct_i, 1≤i≤K
1In image sheet DCT coefficient extract its high fdrequency component, obtain the cluster that high fdrequency component forms
(5), to cluster
In high fdrequency component carry out K
2_iThe clustering processing at individual center, obtain K thereby divide
2_iIndividual secondary cluster
(6), to each secondary cluster
1≤i≤K
1, 1≤j≤K
2_iCarry out principal component decomposition, extract its front m major component component, form sparse sub-dictionary subD that should cluster
i_j, 1≤i≤K
1, 1≤j≤K
2_iWhole sparse sub-dictionary subD
i_jForm final structural sparse dictionary;
Described one-level cluster is based on texture, frequency domain character significant characteristic in edge is carried out clustering processing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011103715055A CN102521799B (en) | 2011-11-21 | 2011-11-21 | Construction method of structural sparse dictionary for video image recovery enhancement |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011103715055A CN102521799B (en) | 2011-11-21 | 2011-11-21 | Construction method of structural sparse dictionary for video image recovery enhancement |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102521799A CN102521799A (en) | 2012-06-27 |
CN102521799B true CN102521799B (en) | 2013-12-04 |
Family
ID=46292704
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011103715055A Expired - Fee Related CN102521799B (en) | 2011-11-21 | 2011-11-21 | Construction method of structural sparse dictionary for video image recovery enhancement |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102521799B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103177125B (en) * | 2013-04-17 | 2016-04-27 | 镇江诺尼基智能技术有限公司 | One short text double focusing fast class methods |
CN104463798A (en) * | 2014-11-28 | 2015-03-25 | 河海大学常州校区 | Image day and night scene change method and system based on sparse representation algorithm |
CN108470195B (en) * | 2018-05-11 | 2021-01-26 | 东方网力科技股份有限公司 | Video identity management method and device |
CN110414538B (en) * | 2019-07-24 | 2022-05-27 | 京东方科技集团股份有限公司 | Defect classification method, defect classification training method and device thereof |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8842891B2 (en) * | 2009-06-09 | 2014-09-23 | Arizona Board Of Regents On Behalf Of Arizona State University | Ultra-low dimensional representation for face recognition under varying expressions |
CN101950365B (en) * | 2010-08-30 | 2012-07-04 | 西安电子科技大学 | Multi-task super-resolution image reconstruction method based on KSVD dictionary learning |
CN101957993A (en) * | 2010-10-11 | 2011-01-26 | 上海交通大学 | Adaptive infrared small object detection method |
-
2011
- 2011-11-21 CN CN2011103715055A patent/CN102521799B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN102521799A (en) | 2012-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103020897B (en) | Based on device, the system and method for the super-resolution rebuilding of the single-frame images of multi-tiling | |
CN102006425B (en) | Method for splicing video in real time based on multiple cameras | |
WO2018023734A1 (en) | Significance testing method for 3d image | |
Roa'a et al. | Generation of high dynamic range for enhancing the panorama environment | |
CN102510438B (en) | Acquisition method of sparse coefficient vector for recovering and enhancing video image | |
CN108632625A (en) | A kind of method for video coding, video encoding/decoding method and relevant device | |
CN102521799B (en) | Construction method of structural sparse dictionary for video image recovery enhancement | |
CN104994375A (en) | Three-dimensional image quality objective evaluation method based on three-dimensional visual saliency | |
Nedović et al. | Stages as models of scene geometry | |
Richardson et al. | Learning convolutional filters for interest point detection | |
CN105654493B (en) | A kind of affine constant binocular solid Matching power flow of improved optics and parallax optimization method | |
CN102842115A (en) | Compressed sensing image super-resolution reconstruction method based on double dictionary learning | |
CN104616026A (en) | Monitor scene type identification method for intelligent video monitor | |
CN102930518A (en) | Improved sparse representation based image super-resolution method | |
Höschl IV et al. | Robust histogram-based image retrieval | |
CN104268520A (en) | Human motion recognition method based on depth movement trail | |
CN104036280A (en) | Video fingerprinting method based on region of interest and cluster combination | |
CN104036468A (en) | Super-resolution reconstruction method for single-frame images on basis of pre-amplification non-negative neighbor embedding | |
CN108829711A (en) | A kind of image search method based on multi-feature fusion | |
CN103095996A (en) | Multi-sensor video fusion method based on space-time conspicuousness detection | |
CN109982071A (en) | The bis- compression video detecting methods of HEVC based on time space complexity measurement and local prediction residual distribution | |
Hadfield et al. | Hollywood 3d: what are the best 3d features for action recognition? | |
CN104778447B (en) | A kind of crowd massing behavioral value method based on grid cell feature | |
CN102222321A (en) | Blind reconstruction method for video sequence | |
CN102306383B (en) | Descriptor constructing method suitable for dense matching of wide baseline image |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20131204 Termination date: 20161121 |