CN101751671B - Visual attention computing method and system based on bit entropy rate - Google Patents

Visual attention computing method and system based on bit entropy rate Download PDF

Info

Publication number
CN101751671B
CN101751671B CN200910243706XA CN200910243706A CN101751671B CN 101751671 B CN101751671 B CN 101751671B CN 200910243706X A CN200910243706X A CN 200910243706XA CN 200910243706 A CN200910243706 A CN 200910243706A CN 101751671 B CN101751671 B CN 101751671B
Authority
CN
China
Prior art keywords
entropy rate
subband
bit entropy
kij
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200910243706XA
Other languages
Chinese (zh)
Other versions
CN101751671A (en
Inventor
王亦洲
王威
黄庆明
高文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN200910243706XA priority Critical patent/CN101751671B/en
Publication of CN101751671A publication Critical patent/CN101751671A/en
Application granted granted Critical
Publication of CN101751671B publication Critical patent/CN101751671B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a visual attention computing method and a system based on bit entropy rate. The method includes that: a sparse codebook is learned to obtain a sparse codebook primary function; the sparse codebook primary function is adopted to filter the image or video data and obtain a plurality of sub-band feature graphs; a corresponding fully-connected graph is established for each sub-band feature graph; the information transfer method of each fully-connected graph adopts the random walk method, during the random walk process, the significance measurement is performed according to the bit entropy rate to accordingly obtain a bit entropy rate graph corresponding to each sub-band feature graph; the bit entropy rate graphs are added together to obtain the significance graph of the image or video data. As indicated in large amount of experiments, and compared with other methods in prior art, the analysis result of image significance analysis or video significance analysis obtained through the invention is more accurate and is supported by the basis of physiology and psychology.

Description

Visual attention computing method and system based on bit entropy rate
Technical field
The present invention relates to image and video processing technique, a plurality of fields such as computer vision and visually-perceptible relate in particular to a kind of visual attention computing method and system based on bit entropy rate.
Background technology
Selective attention is meant that psychological resource selectively is assigned to some cognitive course, make these cognitive processes to information Processing more quick and precisely.Attention is extremely important for coordinating various cognitive course.Human each moment all receives a large amount of external informations, is in by in the state of " INFORMATION BOMB ".Psychological resource that we are limited and neural resource can not be handled much more so information simultaneously, can only optionally handle the information with high priority and ignore the information of low priority, and the effect of attention is embodied in this just.
Selective attention is a very complicated cognitive process, and it at every moment affects the process of information processing of brain.Research to selective attention has been the hot fields of cognitive science since the eighties of last century the eighties always, and this point can obtain proof from the quality and quantity about the article noted that deliver every year.Theoretical numerous and complicated about attention mechanism, but the researcher relatively admits following classification: divide from noting the direction that produces, selective attention comprises from bottom to top, the process of data-driven (bottom-up and data-driven process) and from top to bottom, the process (top-down and goal-directed process) that target is guided; Divide from the target of note selecting, selective attention comprises the attention (space-basedattention) based on the space, based on the attention (feature-based attention) of feature with based on the attention (object-based attention) of object.
Selective attention generally comprises static noted analyzing and dynamically notes analyzing, respectively at the detection of marking area in still image and the dynamic video.In static state is noted analyzing two kinds of models are arranged, a kind of is the irrelevant model of task from bottom to top, and being also referred to as stimulates the model that drives; Another kind is the relevant model of task from top to bottom, is also referred to as the model of task-driven.Method from bottom to top mainly comes from some achievements in research of visual cognition psychology aspect.Koch and Ullman have just proposed a method simply from bottom to top in 1985, as input and be encoded into significantly figure of an explicit two dimension, the order that conspicuousness descends in this figure provides a kind of effective attention scan mode the stimulation conspicuousness of each position in visual scene response for it.Method from bottom to top need not to consider any knowledge information, yet we are constantly in the influence that is subjected to its knowledge, culture background.In other words, we at every moment are task-driven, observe surroundings with top-down method.Therefore, existing a few thing is applied to priori to improve the degree of agreement of analysis result and eye movement experimental result in the vision attention analysis.In recent years, increasing work begins to be devoted to dynamically to note analyzing.In video, the significance of each frame distributes and not only is subjected to himself content influence, is subjected to the influence of time context relation simultaneously, so motion feature plays crucial effects in video.
There is the problem of two aspects in existing method: on the one hand, existing method has plenty of and has plenty of based on the information maximization principle based on the model of center-on every side, seldom has method to explain from this two aspect simultaneously; On the other hand, existing method is not accurate especially to the prediction of vision attention point, and there is a big difference with real eye movement data.
Summary of the invention
The objective of the invention is to, a kind of visual attention computing method and system are provided,, can obtain the significance analysis result of accurate more image or video, meet physiology and psychologic foundation more based on the present invention.
The invention provides a kind of visual attention computing method based on bit entropy rate, comprise the steps: filter step, adopt sparse code book basis function that image or video data are carried out filtering, obtain a plurality of subband feature figure, wherein said sparse code book basis function is based on the sparse code book of study and obtains; Full connection layout establishment step, each the subband feature figure that is respectively described a plurality of subband feature figure sets up corresponding full connection layout; Bit entropy rate figure obtaining step adopts the method for random walk to carry out the transmission of information on each described full connection layout, in the random walk process, carries out significance tolerance according to bit entropy rate, and then obtains the bit entropy rate figure of each described subband feature figure correspondence; The saliency map obtaining step is added up described a plurality of bit entropy rate figure, obtains the saliency map of described image or video data.Above-mentioned
Above-mentioned visual attention computing method based on bit entropy rate, in the preferred described bit entropy rate figure obtaining step, described bit entropy rate is used for determining the full connection layout of described each subband feature figure correspondence, each node i is to the average information DER of other nodes iBe expressed as:
SER i = π i Σ j - P ij log P ij
Wherein, π iBe the static probability of random walk process, P IjBe the transition probability of node i to node j, i, j are natural number.
Above-mentioned visual attention computing method based on bit entropy rate in the preferred described saliency map obtaining step, is integrated theory based on feature, and described a plurality of bit entropy rate figure are added up, and obtains the saliency map of described image or video data.
Above-mentioned visual attention computing method based on bit entropy rate, in the preferred described filter step, employing independent component analysis method is respectively gray level image and coloured image is learnt sparse code book.
Above-mentioned visual attention computing method based on bit entropy rate, in the preferred described filter step, adopt described sparse code book basis function that each frame of video is carried out filtering, obtain a plurality of subband feature figure of each frame, j subband characteristic pattern of t frame, upgrade according to following formula:
f ′ j ( x , y , t ) = | f j ( x , y , t ) - Σ τ = 1 k exp ( - τ σ ) f j ( x , y , t - τ ) |
Wherein, f j(x, y t) are j subband characteristic pattern of the t frame before upgrading, f ' j(x, y are j subband characteristic patterns of the t frame after upgrading t), and σ is the characteristic decay rate, and σ is set to 1.5, utilize t frame k frame before to come the t frame is upgraded.
On the other hand, the present invention also provides a kind of vision attention computing system based on bit entropy rate, comprising: filtration module, full connection layout are set up module, bit entropy rate figure acquisition module and saliency map acquisition module.Wherein, filtration module is used to adopt sparse code book basis function that image or video data are carried out filtering, obtains a plurality of subband feature figure, and wherein said sparse code book basis function is based on the sparse code book of study and obtains; Full connection layout is set up each subband feature figure that module is used to be respectively described a plurality of subband feature figure and is set up corresponding full connection layout; Bit entropy rate figure acquisition module is used for adopting on each described full connection layout the method for random walk to carry out the transmission of information, in the random walk process, carries out significance tolerance according to bit entropy rate, and then obtains the bit entropy rate figure of each described subband feature figure correspondence; The saliency map acquisition module is used for described a plurality of bit entropy rate figure are added up, and obtains the saliency map of described image or video data.
Above-mentioned vision attention computing system based on bit entropy rate, in the preferred described bit entropy rate figure acquisition module, described bit entropy rate is used for determining the full connection layout of described each subband feature figure correspondence, each node i is to the average information of other nodes, described bit entropy rate SER iBe expressed as:
SE R i = π i Σ j - P ij log P ij
Wherein, π iBe the static probability of random walk process, P IjBe the transition probability of node i to node j.
Above-mentioned vision attention computing system based on bit entropy rate in the preferred described saliency map acquisition module, is integrated theory based on feature, and described a plurality of bit entropy rate figure are added up, and obtains the saliency map of described image or video data.
Above-mentioned vision attention computing system based on bit entropy rate, in the preferred described filtration module, employing independent component analysis method is respectively gray level image and coloured image is learnt sparse code book.
Above-mentioned vision attention computing system based on bit entropy rate in the preferred described filtration module, adopts described sparse code book basis function that video data is carried out filtering, obtain a plurality of subband feature figure after, j subband characteristic pattern of t frame, upgrade according to following formula:
f ′ j ( x , y , t ) = | f j ( x , y , t ) - Σ τ = 1 k exp ( - τ σ ) f j ( x , y , t - τ ) |
Wherein, f j(x, y t) are j subband characteristic pattern of the t frame before upgrading, f ' j(x, y are j subband characteristic patterns of the t frame after upgrading t), and σ is the characteristic decay rate, and σ is set to 1.5, utilize t frame k frame before to come the t frame is upgraded.
In prior art, the present invention has following several advantage:
The first, no matter passed through a large amount of experiment showed, is in analysis of image significance or the analysis of video significance, all more accurate than existing additive method based on the analysis result that the present invention obtains.
The second, the computing method of Ti Chuing have the foundation of physiology, psychology aspect to support: the sparse coding characteristic of simple cell instructs this model to adopt sparse code book to generate the subband feature figure of image in the initial stage visual cortex, the regeneration that exists in the visual cortex neural network is local to be connected and the horizontal connected mode of long scope instructs this model to adopt a kind of full graph model that is connected as basic expression on each subband feature figure, and neuronic behavior is driven by total cynapse input of peripheral nerve unit instructs us to propose bit entropy rate.
Three, simultaneously can be from this model of model around two kinds of type of drive-information maximization of vision attention and the center-explain.
Description of drawings
Fig. 1 is the flow chart of steps that the present invention is based on the visual attention computing method embodiment of bit entropy rate;
Fig. 2 is the processing procedure synoptic diagram that the present invention is based on the visual attention computing method of bit entropy rate;
Fig. 3 is the sparse code book synoptic diagram about gray level image that extracts among the present invention;
Fig. 4 A predicting the outcome that be the visual attention computing method that the present invention is based on bit entropy rate to the vision attention point of 3 width of cloth coloured images;
Fig. 4 B predicting the outcome that be the visual attention computing method that the present invention is based on bit entropy rate to the vision attention point of other 3 width of cloth coloured images;
Fig. 5 is the structured flowchart that the present invention is based on the vision attention computing system embodiment of bit entropy rate.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, the present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
With reference to Fig. 1, Fig. 1 is the flow chart of steps that the present invention is based on the visual attention computing method embodiment of bit entropy rate, comprises the steps:
Filter step 110 is learnt sparse code book, obtains sparse code book basis function; Adopt described sparse code book basis function that image or video data are carried out filtering, obtain a plurality of subband feature figure.
Full connection layout establishment step 120 is respectively described each subband feature figure and sets up corresponding full connection layout.
Bit entropy rate figure obtaining step 130 adopts the method for random walk to carry out the transmission of information on described each full connection layout, in the random walk process, makes to carry out significance tolerance according to bit entropy rate, and then obtains the bit entropy rate figure of each described subband feature figure correspondence.
Saliency map obtaining step 140 is added up described a plurality of bit entropy rate figure, obtains the saliency map of described image or video data.
With reference to Fig. 2, Fig. 2 is the processing procedure synoptic diagram that the present invention is based on the visual attention computing method of bit entropy rate.Among Fig. 2, a is the image or the video data of input.At first the sparse code book basis function b that obtains with study carries out filtering to input picture or video data a, the corresponding sparse code book of each the subband feature figure c that obtains; For having set up a full connection layout d, each subband feature figure c represents that this full connection layout d represents to describe the relation of pixel far away in the image then; For the transmission of signal (information) between each neuron in the simulative neural network, this method has adopted the method for random walk on the full connection layout d of each correspondence, entropy rate (Entropy Rate) has been described the average information of random walk process, this total average information is distributed to each figure node to get on, a new significance tolerance-bit entropy rate has been proposed, it has also described the average information of each node to every other node simultaneously, just can obtain a bit entropy rate figure e corresponding to each subband feature figure like this; At last, all bit entropy rate figure are added up just obtained saliency map f.
Below in conjunction with Fig. 3 and Fig. 4 above-mentioned each step is elaborated.
In the filter step 110, after the sparse code book study, obtain sparse code book basis function, adopt this sparse code book basis function that target image or video are carried out filtering, below sparse code book is described.
A large amount of evidences show to have only a spot of early vision neuron to be activated when stimulation appears at the cell receptive field, in order to simulate this specific character, the sparse coding theory is suggested the immanent structure of the expression natural image of making a return journey.Sparse code book basis function is V k, k is the index to basis function position, direction and yardstick, then image I can be expressed as
I=∑ kα kB k
α kBe the coefficient of basis function, this method adopts α kAs the early vision feature, it can be by the filter function G corresponding with basis function kCalculate:
α k=∑ x,yG k(x,y)I(x,y)
Present embodiment adopts independent component analysis method (Independent ComponentAnalysis, ICA) be respectively gray level image and coloured image and learnt the sparse code book of a cover, the ICA here is a kind of method that mixed signal is resolved into subsignal separate on the statistical significance.As shown in Figure 3, the sparse code book synoptic diagram for extracting among the present invention about gray level image.
Full connection layout establishment step 120 is set up corresponding full connection layout for being respectively described each subband feature figure, specifies as follows:
Corresponding to each subband feature figure F kSet up a full connection layout G K={ V k, E k, V wherein k={ v K1..., v KnBe node corresponding to the image pixel place, v Ki=(x i, y i, f k(x i, y i)) position and two attributes of characteristic response, E arranged k={ e Kij, i, j=1 ... n} is the weighting limit between the node, wherein e Kij=(i, j, w Kij).Weight w KijComprise feature difference degree Φ KijWith space length d IjThese two
w kij=Φ kij*d ij
Φ wherein KijAnd d IjCan be expressed as
Φ kij=exp{|f k(x i,y i)-f k(x j,y j)|/M k}
d ij = exp { - λ ( x i - x j ) 2 + ( y i - y j ) 2 / D }
M in this model kBe that maximum characteristic response among each subband feature figure is poor, D is the maximum dimension (being the maximal value in width and the height) of image, and λ is used for regulating this importance of two, is traditionally arranged to be 5.
Bit entropy rate figure obtaining step 130 illustrates the bit entropy rate (SiteEntropy Rate) that is used to measure significance.
For the transmission of signal between the neuron in the simulative neural network, on the pairing full connection layout of each subband feature figure, implemented a random walk process (Random Walk), transition probability from node i to node j in this random walk process be (below in order to describe the index k that simply dispenses subband feature figure, as w KijBe w Ij)
P ij = w ij Σ j w ij
The entropy rate of random walk is the total average information that is used for describing this stochastic process, it has also described total quantity of information that all figure nodes (neuron) transmit simultaneously, the entropy rate is decomposed each node to get on, this model just proposed a new notion-bit entropy rate (Site Entropy Rate, SER)-go to describe the average information of each node to other all nodes
SER i = π i Σ j - P ij log P ij
π wherein iIt is the static probability of random walk process.Obtain maximized principle according to human visual system's information, this model proposes to describe with SER the visual saliency of each node.
At last, integrate theoretical (Feature-IntegrationTheory based on the feature that Treisman proposes, be that vision system at first can extract initial stage visual signature formation characteristic pattern, characteristic pattern is integrated into the focus that a remarkable figure goes to instruct people then), significantly figure obtains (recovering subband feature index of the picture k here again) by the pairing SER figure of each characteristic pattern phase Calais:
S i = Σ k SER ki
S wherein iIt is the remarkable value of node i.
With reference to Fig. 4 A and Fig. 4 B, Fig. 4 A is the visual attention computing method that the present invention is based on bit entropy rate to the predicting the outcome of the vision attention point of 3 width of cloth coloured images, Fig. 4 B predicting the outcome to the vision attention point of other 3 width of cloth coloured images that be the visual attention computing method that the present invention is based on bit entropy rate.In Fig. 4 A and Fig. 4 B, one of the leftmost side is classified the original image of input as, middle one classifies the saliency map that adopts method provided by the present invention to obtain as, and rightmost one classifies the lime light data (eye movement data) that people are recorded by the eye movement instrument as when the observation original image.As can be seen, saliency map and eye movement data are very close, and it is more accurate that this method that has illustrated that the present invention proposes is predicted the human eye lime light.
Saliency map obtaining step 140 evidence show in Neuscience: in signal transduction process, have only that unpredictable signal just can be delivered to next stage in present stage; Simultaneously electrophysiology evidence show: neural response can along with cellular exposure under same stimulation time span and sharply descend.Based on these facts, this model thinks and if the node place produces the signal at unpredictable signal or node place variation has taken place that this node is exactly relatively more significant.In video is noted analyzing, when calculating t remarkable figure constantly, should balance out t constantly before the influence of frame of video, specifically, the weighting subband feature figure of number frame of need deducting over upgrades the subband feature figure of present frame.f j(x, y t) are j subband characteristic pattern of t frame, upgrade it with following formula:
f ′ j ( x , y , t ) = | f j ( x , y , t ) - Σ τ = 1 k exp ( - τ σ ) f j ( x , y , t - τ ) |
Wherein, f ' j(x, y are j subband characteristic patterns of the t frame after upgrading t), and σ is the characteristic decay rate, and σ is set to 1.5, utilize t frame k frame before to come the t frame is upgraded.The full connection layout of follow-up foundation is noted analyzing consistent with calculating bit entropy rate and last saliency map with image.
With reference to Fig. 5, Fig. 5 is the structured flowchart that the present invention is based on the vision attention computing system embodiment of bit entropy rate, comprises the steps:
Filtration module 52 is used to learn sparse code book, obtains sparse code book basis function; Adopt described sparse code book basis function that image or video data are carried out filtering, obtain a plurality of subband feature figure.
Full connection layout is set up module 54, is used to be respectively described each subband feature figure and sets up corresponding full connection layout.
Bit entropy rate figure acquisition module 56, be used on described each full connection layout, adopting the method for random walk to carry out the transmission of information, in the random walk process, make to carry out significance tolerance according to bit entropy rate, and then obtain the bit entropy rate figure of each described subband feature figure correspondence.
Saliency map acquisition module 58 is used for described a plurality of bit entropy rate figure are added up, and obtains the saliency map of described image or video data.
Wherein, in the bit entropy rate figure acquisition module 54, bit entropy rate is used for determining the full connection layout of described each subband feature figure correspondence, and each node i is to the average information of other nodes, and described bit entropy rate is expressed as:
SER i = π i Σ j - P ij log P ij
π iBe the static probability of random walk process, P IjBe the transition probability of node i to node j.
In saliency map acquisition module 58, can integrate theory based on feature, described a plurality of bit entropy rate figure are added up, obtain the saliency map of described image or video data.
In described filtration module 52, employing independent component analysis method is respectively gray level image and coloured image is learnt sparse code book.
In addition, native system has also proposed the update strategy of the subband feature figure of present frame during video is noted analyzing.Specify as follows:
In the filtration module, adopt described sparse code book basis function that video data is carried out filtering, obtain a plurality of subband feature figure after, j subband characteristic pattern of t frame, upgrade according to following formula:
f ′ j ( x , y , t ) = | f j ( x , y , t ) - Σ τ = 1 k exp ( - τ σ ) f j ( x , y , t - τ ) |
Wherein, f j(x, y t) are j subband characteristic pattern of the t frame before upgrading, f ' j(x, y are j subband characteristic patterns of the t frame after upgrading t), and σ is the characteristic decay rate, and σ is set to 1.5, utilize t frame k frame before to come the t frame is upgraded.
Need to prove that the foregoing description is similar based on the vision attention computing system of bit entropy rate with visual attention computing method principle based on bit entropy rate, relevant part can be with reference to the explanation for method embodiment.Do not repeat them here.
More than a kind of visual attention computing method and system based on bit entropy rate provided by the present invention is described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (6)

1. the visual attention computing method based on bit entropy rate is characterized in that, comprises the steps:
Filter step adopts sparse code book basis function that image or video data are carried out filtering, obtains a plurality of subband feature figure, and wherein said sparse code book basis function is based on the sparse code book of study and obtains;
Full connection layout establishment step, each the subband feature figure that is respectively described a plurality of subband feature figure sets up corresponding full connection layout;
Bit entropy rate figure obtaining step adopts the method for random walk to carry out the transmission of information on each described full connection layout, in the random walk process, carries out significance tolerance according to bit entropy rate, and then obtains the bit entropy rate figure of each described subband feature figure correspondence;
The saliency map obtaining step is added up described a plurality of bit entropy rate figure, obtains the saliency map of described image or video data; And
In the described filter step, adopt described sparse code book basis function that each frame of video is carried out filtering, obtain a plurality of subband feature figure of each frame, j subband characteristic pattern of t frame, upgrade according to following formula:
Figure FSB00000553992600011
Wherein, f j(x, y t) are j subband characteristic pattern of the t frame before upgrading, f ' j(x, y are j subband characteristic patterns of the t frame after upgrading t), and σ is the characteristic decay rate, and σ is set to 1.5, utilize t frame k frame before to come the t frame is upgraded;
In the described full connection layout establishment step, the full connection layout of each subband feature figure correspondence is set up according to following mode:
Set up a full connection layout G corresponding to each subband feature figure Fk K={ V k, E k, V wherein k={ v K1..., v KnBe node corresponding to the image pixel place, v Ki=(x i, y i, f k(x i, y i)) position and two attributes of characteristic response, E arranged k={ e Kij, i, j=1 ... n} is the weighting limit between the node, wherein e Kij=(i, j, w Kij); Weight w KijComprise feature difference degree Φ KijWith space length d IjThese two,
w kij=Φ kij*d ij
Φ wherein KijAnd d IjBe expressed as
Φ kij=exp{|f k(x i,y i)-f k(x j,y j)|/M k}
M kBe that maximum characteristic response among each subband feature figure is poor, D is the maximum dimension of image, and λ is used for regulating this importance of two, is set to 5;
In the described bit entropy rate figure obtaining step, described bit entropy rate is used for determining the full connection layout of described each subband feature figure correspondence, and each node i is to the average information SER of other nodes iBe expressed as:
Figure FSB00000553992600022
Wherein, π iBe the static probability of random walk process, P IjBe the transition probability that node i arrives node j, i, j are natural number.
2. the visual attention computing method based on bit entropy rate according to claim 1, it is characterized in that, in the described saliency map obtaining step, integrate theoretical based on feature, described a plurality of bit entropy rate figure are added up, obtain the saliency map of described image or video data.
3. the visual attention computing method based on bit entropy rate according to claim 2 is characterized in that, in the described filter step, employing independent component analysis method is respectively gray level image and coloured image is learnt sparse code book.
4. the vision attention computing system based on bit entropy rate is characterized in that, comprising:
Filtration module is used to adopt sparse code book basis function that image or video data are carried out filtering, obtains a plurality of subband feature figure, and wherein said sparse code book basis function is based on the sparse code book of study and obtains;
Full connection layout is set up module, and each the subband feature figure that is used to be respectively described a plurality of subband feature figure sets up corresponding full connection layout;
Bit entropy rate figure acquisition module is used for adopting on each described full connection layout the method for random walk to carry out the transmission of information, in the random walk process, carries out significance tolerance according to bit entropy rate, and then obtains the bit entropy rate figure of each described subband feature figure correspondence;
The saliency map acquisition module is used for described a plurality of bit entropy rate figure are added up, and obtains the saliency map of described image or video data; Wherein
In the described filtration module, adopt described sparse code book basis function that each frame of video is carried out filtering, obtain a plurality of subband feature figure of each frame, j subband characteristic pattern of t frame, upgrade according to following formula:
Figure FSB00000553992600031
Wherein, f j(x, y t) are j subband characteristic pattern of the t frame before upgrading, f ' j(x, y are j subband characteristic patterns of the t frame after upgrading t), and σ is the characteristic decay rate, and σ is set to 1.5, utilize t frame k frame before to come the t frame is upgraded;
Described full connection layout is set up in the module, and the full connection layout of each subband feature figure correspondence is set up according to following mode:
Corresponding to each subband feature figure F kSet up a full connection layout G K={ V k, E k, V wherein k={ v K1..., v KnBe node corresponding to the image pixel place, v Ki=(x i, y i, f k(x i, y i)) position and two attributes of characteristic response, E arranged k={ e Kij, i, j=1 ... n} is the weighting limit between the node, wherein e Kij=(i, j, w Kij); Weight w KijComprise feature difference degree Φ KijWith space length d IjThese two,
w kij=Φ kij*d ij
Φ wherein KijAnd d IjBe expressed as
Φ kij=exp{|f k(x i,y i)-f k(x j,y j)|/M k}
Figure FSB00000553992600041
M kBe that maximum characteristic response among each subband feature figure is poor, D is the maximum dimension of image, and λ is used for regulating this importance of two, is set to 5;
In the described bit entropy rate figure acquisition module, described bit entropy rate is used for determining the full connection layout of described each subband feature figure correspondence, and each node i is to the average information SER of other nodes iBe expressed as:
Figure FSB00000553992600042
Wherein, π iBe the static probability of random walk process, P IjBe the transition probability that node i arrives node j, i, j are natural number.
5. the vision attention computing system based on bit entropy rate according to claim 4, it is characterized in that, in the described saliency map acquisition module, integrate theoretical based on feature, described a plurality of bit entropy rate figure are added up, obtain the saliency map of described image or video data.
6. the vision attention computing system based on bit entropy rate according to claim 5 is characterized in that, in the described filtration module, employing independent component analysis method is respectively gray level image and coloured image is learnt sparse code book.
CN200910243706XA 2009-12-23 2009-12-23 Visual attention computing method and system based on bit entropy rate Expired - Fee Related CN101751671B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200910243706XA CN101751671B (en) 2009-12-23 2009-12-23 Visual attention computing method and system based on bit entropy rate

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200910243706XA CN101751671B (en) 2009-12-23 2009-12-23 Visual attention computing method and system based on bit entropy rate

Publications (2)

Publication Number Publication Date
CN101751671A CN101751671A (en) 2010-06-23
CN101751671B true CN101751671B (en) 2011-09-28

Family

ID=42478615

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910243706XA Expired - Fee Related CN101751671B (en) 2009-12-23 2009-12-23 Visual attention computing method and system based on bit entropy rate

Country Status (1)

Country Link
CN (1) CN101751671B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102831621B (en) * 2012-08-09 2014-11-05 西北工业大学 Video significance processing method based on spectral analysis
CN107392211B (en) * 2017-07-19 2021-01-15 苏州闻捷传感技术有限公司 Salient target detection method based on visual sparse cognition

Also Published As

Publication number Publication date
CN101751671A (en) 2010-06-23

Similar Documents

Publication Publication Date Title
CN109978893B (en) Training method, device, equipment and storage medium of image semantic segmentation network
CN111126258B (en) Image recognition method and related device
CN109558942B (en) Neural network migration method based on shallow learning
CN110059620A (en) Bone Activity recognition method based on space-time attention
CN110569886A (en) Image classification method for bidirectional channel attention element learning
CN110135562B (en) Distillation learning method, system and device based on characteristic space change
CN103679191B (en) An automatic fake-licensed vehicle detection method based on static state pictures
CN110889450B (en) Super-parameter tuning and model construction method and device
Wang et al. Learning performance prediction via convolutional GRU and explainable neural networks in e-learning environments
CN104103033A (en) Image real-time processing method
CN110490136A (en) A kind of human body behavior prediction method of knowledge based distillation
Cardoso et al. Classifying the content of social media images to support cultural ecosystem service assessments using deep learning models
Gaihua et al. A serial-parallel self-attention network joint with multi-scale dilated convolution
CN108510126A (en) A kind of Predictive Methods of Road Accidents based on PCA and BP neural network
Kulkarni et al. Knowledge distillation using unlabeled mismatched images
CN114549470A (en) Method for acquiring critical region of hand bone based on convolutional neural network and multi-granularity attention
CN101751671B (en) Visual attention computing method and system based on bit entropy rate
CN108073978A (en) A kind of constructive method of the ultra-deep learning model of artificial intelligence
Mermillod et al. The coarse-to-fine hypothesis revisited: Evidence from neuro-computational modeling
CN109284752A (en) A kind of rapid detection method of vehicle
CN113762331A (en) Relational self-distillation method, apparatus and system, and storage medium
CN108304915A (en) A kind of decomposition of deep learning neural network and synthetic method and system
CN108073985A (en) A kind of importing ultra-deep study method for voice recognition of artificial intelligence
CN106709480A (en) Partitioning human face recognition method based on weighted intensity PCNN model
CN108073979A (en) A kind of ultra-deep study of importing artificial intelligence knows method for distinguishing for image

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110928

Termination date: 20191223

CF01 Termination of patent right due to non-payment of annual fee