CN110517328A - Application method of correlated dual autoencoders in zero-shot learning - Google Patents
Application method of correlated dual autoencoders in zero-shot learning
- Publication number: CN110517328A
- Application number: CN201910629751.2A
- Authority
- CN
- China
- Prior art keywords
- semantic feature
- visual feature
- autoencoder
- feature
- model training
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G06F18/24 — Pattern recognition; Classification techniques
- G06T9/002 — Image coding using neural networks
- G06V10/48 — Extraction of image or video features by mapping characteristic values of the pattern into a parameter space, e.g. Hough transformation
- G06T2207/20081 — Training; Learning
- G06T2207/20084 — Artificial neural networks [ANN]
Abstract
The invention discloses an application method of correlated dual autoencoders in zero-shot learning. The invention builds one autoencoder for the visual features and another for the semantic features. The two autoencoders are not independent but correlated: the feature obtained by encoding the visual features is added to the encoded semantic features, and the resulting sum is then decoded. Finally, the decoded semantic features are added to the original semantic features to obtain improved, more complete semantic features. These optimized semantic features are then mapped back onto the visual features for classification and recognition. By optimizing the semantic features with the correlated dual-autoencoder model, the invention obtains more discriminative, finer-grained semantic features. Mapping the optimized semantic features back into the visual feature space yields better classification accuracy.
Description
Technical field
The present invention applies correlated dual autoencoders to zero-shot learning and belongs to the field of zero-shot learning technology. Specifically, it concerns an application method of correlated dual autoencoders in zero-shot learning.
Background technique
Existing research on zero-shot learning focuses mostly on the mapping between the visual features of images and the semantic features of each class. However, the features themselves also have an important influence on the final classification result. This is especially true of semantic features: for similar classes, the feature representations are very close and the discrimination between classes is small. In addition, if the span between classes is large, feature redundancy can also occur. Therefore, constructing more discriminative, finer-grained semantic features is of great significance.
Summary of the invention
To construct more discriminative, finer-grained semantic features, the present invention proposes an application method of correlated dual autoencoders in zero-shot learning: a method for optimizing semantic features based on a correlated dual-autoencoder model framework.
To achieve the above technical purpose, the present invention adopts a technical solution comprising the following steps:
1. An application method of correlated dual autoencoders in zero-shot learning, characterized by comprising the following steps:
Step (1): obtain the encoded visual features;
Step (2): obtain the optimized semantic features;
Step (3): map the optimized semantic features to the visual features;
wherein step (2) is realized by the following process:
Step A: build an autoencoder for the semantic features, add the encoded visual features obtained in step (1) to the encoded semantic features, and then decode the resulting sum;
Step B: add the decoded semantic features to the original semantic features to obtain improved, more complete semantic features.
Step (1) is implemented as follows:
The visual features are processed by an autoencoder to obtain the encoded feature V_e, expressed by the formula:
V_e = σ(V·W_1 + b_1)
where V_e denotes the encoded visual feature, V denotes the original visual feature, and σ denotes the ReLU activation function; W_1 is a parameter matrix the model needs to train; b_1 is a bias parameter the model needs to train.
The decoding process is expressed by the following formula:
V_d = σ(V_e·W_2 + b_2)
where V_d denotes the visual feature reconstructed by decoding V_e, W_2 is a parameter matrix the model needs to train, and b_2 is a bias parameter the model needs to train.
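As an illustrative sketch (not part of the patent), the encode/decode pass of step (1) can be written in NumPy. The feature dimensions (2048-d visual features, 512-d code), batch size, and random weights are assumptions for demonstration only:

```python
import numpy as np

def relu(x):
    """The ReLU activation sigma used throughout the patent's formulas."""
    return np.maximum(0.0, x)

rng = np.random.default_rng(0)

# Assumed dimensions for illustration: 2048-d visual features, 512-d code.
n, d_v, d_e = 4, 2048, 512
V = rng.normal(size=(n, d_v))                 # original visual features V
W1 = rng.normal(scale=0.01, size=(d_v, d_e))  # trainable parameter matrix W_1
b1 = np.zeros(d_e)                            # trainable bias b_1
W2 = rng.normal(scale=0.01, size=(d_e, d_v))  # trainable parameter matrix W_2
b2 = np.zeros(d_v)                            # trainable bias b_2

V_e = relu(V @ W1 + b1)    # encoding: V_e = sigma(V W_1 + b_1)
V_d = relu(V_e @ W2 + b2)  # decoding: V_d = sigma(V_e W_2 + b_2)

print(V_e.shape, V_d.shape)  # (4, 512) (4, 2048)
```

In a trained model, W_1, b_1, W_2, b_2 would be fitted under the constraint min ||V − V_d||² described later.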
Step A is specifically as follows:
First, the semantic features are encoded:
A_e = σ(A·W_3 + b_3)
where A_e denotes the attribute feature after encoding the semantic features, A denotes the original semantic feature, and σ denotes the ReLU activation function; W_3 is a parameter matrix the model needs to train; b_3 is a bias parameter the model needs to train.
Then the encoded visual feature V_e is mapped to obtain V_ae, whose dimension is the same as that of the encoded attribute feature A_e. Two fully connected layer + ReLU activation blocks are used in the mapping process, formulated as follows:
B = σ(V_e·W_4 + b_4)
V_ae = σ(B·W_5 + b_5)
where B is the value obtained from the first ReLU+FC block applied to V_e, and V_ae is the final value obtained by mapping V_e; W_4 and W_5 are parameter matrices the model needs to train; b_4 and b_5 are bias parameters the model needs to train. The resulting V_ae is then added to the encoded attribute feature A_e to obtain a new attribute feature A′_e. This new attribute feature A′_e has been influenced by the visual features and contains more information:
A′_e = A_e + V_ae
The decoding process is expressed by the following formula:
A_d = σ(A′_e·W_6 + b_6)
where A_d denotes the reconstructed attribute feature, W_6 is a parameter matrix the model needs to train, and b_6 is a bias parameter the model needs to train.
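Step A can be sketched under similar illustrative assumptions (85-d semantic/attribute features, as in common attribute datasets; a 512-d code matching V_e; a 256-d hidden layer; random weights — all hypothetical choices, not specified by the patent):

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

rng = np.random.default_rng(1)

# Assumed dimensions: 85-d semantic features, 512-d codes, 256-d hidden layer.
n, d_a, d_e, d_h = 4, 85, 512, 256
A = rng.normal(size=(n, d_a))          # original semantic features A
V_e = relu(rng.normal(size=(n, d_e)))  # encoded visual features from step (1)

W3 = rng.normal(scale=0.01, size=(d_a, d_e)); b3 = np.zeros(d_e)
W4 = rng.normal(scale=0.01, size=(d_e, d_h)); b4 = np.zeros(d_h)
W5 = rng.normal(scale=0.01, size=(d_h, d_e)); b5 = np.zeros(d_e)
W6 = rng.normal(scale=0.01, size=(d_e, d_a)); b6 = np.zeros(d_a)

A_e = relu(A @ W3 + b3)          # encode semantics: A_e = sigma(A W_3 + b_3)
B = relu(V_e @ W4 + b4)          # first FC+ReLU block applied to V_e
V_ae = relu(B @ W5 + b5)         # second block; V_ae matches A_e's dimension
A_e_prime = A_e + V_ae           # correlated addition: A'_e = A_e + V_ae
A_d = relu(A_e_prime @ W6 + b6)  # decode: A_d = sigma(A'_e W_6 + b_6)

print(A_e.shape, V_ae.shape, A_d.shape)  # (4, 512) (4, 512) (4, 85)
```

The addition A_e + V_ae is the "correlated" coupling between the two autoencoders: the semantic decoder sees a code that carries visual information.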
Step B is specifically as follows: the original semantic features are added to the decoded semantic features to obtain the optimized semantic features:
A_all = A + A_d
where A_all denotes the optimized semantic feature.
Step (3) is specifically as follows: the optimized semantic features obtained in step (2) are mapped to the visual feature space through two fully connected layer + ReLU activation blocks, with the mapping formulated as follows:
C = σ(A_all·W_7 + b_7)
V_all = σ(C·W_8 + b_8)
where C is the value obtained from the first ReLU+FC block applied to A_all, and V_all is the final value obtained by mapping A_all; W_7 and W_8 are parameter matrices the model needs to train; b_7 and b_8 are bias parameters the model needs to train.
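The two-block mapping of step (3), sketched under the same illustrative assumptions (85-d optimized semantic features mapped back to a 2048-d visual space; the 256-d hidden width is an arbitrary choice for demonstration):

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

rng = np.random.default_rng(2)

# Assumed dimensions: 85-d optimized semantic features -> 2048-d visual space.
n, d_a, d_h, d_v = 4, 85, 256, 2048
A_all = rng.normal(size=(n, d_a))  # optimized semantic features from step (2)

W7 = rng.normal(scale=0.01, size=(d_a, d_h)); b7 = np.zeros(d_h)
W8 = rng.normal(scale=0.01, size=(d_h, d_v)); b8 = np.zeros(d_v)

C = relu(A_all @ W7 + b7)   # C = sigma(A_all W_7 + b_7)
V_all = relu(C @ W8 + b8)   # V_all = sigma(C W_8 + b_8)

print(V_all.shape)  # (4, 2048)
```

At test time, classification can proceed by comparing V_all for each candidate class against the image's visual feature, e.g. by nearest neighbor; the patent does not fix a particular distance measure.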
The model of the present invention has three constraint conditions. First, two reconstruction constraints must be established, one for each of the two autoencoders, namely:
min ||V − V_d||²
min ||A − A_d||²
The last constraint is that the finally obtained optimized semantic features, after mapping, should match the original visual features, namely:
min ||V_all − V||²
Under these constraint conditions, a loss function is established over the parameters of the entire model, and continued training yields the optimal training result.
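The three constraints combine naturally into a single training loss. A minimal sketch (the equal weighting of the three terms is an assumption; the patent does not specify weights):

```python
import numpy as np

def total_loss(V, V_d, A, A_d, V_all):
    """Sum of the three squared-error constraints (equal weights assumed):
    ||V - V_d||^2 + ||A - A_d||^2 + ||V_all - V||^2"""
    return (np.sum((V - V_d) ** 2)
            + np.sum((A - A_d) ** 2)
            + np.sum((V_all - V) ** 2))

# Perfect reconstruction and perfect mapping give zero loss.
V = np.ones((2, 3))
A = np.ones((2, 5))
print(total_loss(V, V, A, A, V))  # 0.0
```

In practice this scalar would be minimized by gradient descent over all the W_i and b_i parameters jointly.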
The beneficial effects of the present invention are as follows:
The present invention optimizes the semantic features with a correlated dual-autoencoder model, obtaining more discriminative, finer-grained semantic features. Mapping the optimized semantic features back into the visual feature space yields better classification accuracy.
Detailed description of the drawings
Fig. 1 is a flow chart of the present invention.
Fig. 2 is a diagram of the visual-feature autoencoder built by the present invention.
Fig. 3 is a diagram of the optimized semantic features of the present invention.
Fig. 4 is a diagram of the mapping relations of the present invention.
Specific embodiment
The present invention is further explained below with reference to the attached drawings and examples.
As shown in Fig. 1, an application method of correlated dual autoencoders in zero-shot learning comprises the following steps:
Step (1): obtain the encoded visual features;
Step (2): obtain the optimized semantic features;
Step (3): map the optimized semantic features to the visual features.
Step (2) is obtained as follows. As shown in Fig. 2, step A builds an autoencoder for the semantic features, adds the encoded visual features obtained in step (1) to the encoded semantic features, and then decodes the resulting sum; step B adds the decoded semantic features to the original semantic features to obtain improved, more complete semantic features.
The technical solution further defined by the present invention is as follows:
Step (1) specifically: the visual features are processed by an autoencoder to obtain the encoded feature V_e. This process can be expressed simply by the following formula:
V_e = σ(V·W_1 + b_1)
where V_e denotes the result of encoding the visual features and V denotes the original visual features. The decoding process can be expressed by the following formula:
V_d = σ(V_e·W_2 + b_2)
where V_d denotes the visual feature reconstructed by decoding V_e.
Further, step A specifically:
First, the semantic features are encoded:
A_e = σ(A·W_3 + b_3)
where A_e denotes the result of encoding the semantic features and A denotes the original semantic features.
Next, the encoded visual feature V_e obtained in the previous step is mapped to obtain V_ae, whose dimension is the same as that of the encoded attribute feature A_e. Two fully connected layer + ReLU activation blocks were used during the experiments. This process can be expressed by the following simple formulas:
B = σ(V_e·W_4 + b_4)
V_ae = σ(B·W_5 + b_5)
where B is the value obtained from the first ReLU+FC block applied to V_e, and V_ae is the final value obtained by mapping V_e. The resulting V_ae is then added to the encoded attribute feature A_e to obtain a new attribute feature A′_e. This A′_e has been influenced by the visual features and contains more information:
A′_e = A_e + V_ae
The decoding process is expressed by the following formula:
A_d = σ(A′_e·W_6 + b_6)
where A_d denotes the reconstructed attribute feature, W_6 is a parameter matrix the model needs to train, and b_6 is a bias parameter the model needs to train.
Further, as shown in Fig. 3, step B specifically: the original semantic features are added to the decoded semantic features to obtain the optimized semantic features:
A_all = A + A_d
where A_all denotes the optimized semantic feature.
Further, as shown in Fig. 4, step (3) specifically: the optimized semantic features obtained in step (2) are mapped to the visual feature space through two fully connected layer + ReLU activation blocks. The mapping relation can be expressed by the following formulas:
C = σ(A_all·W_7 + b_7)
V_all = σ(C·W_8 + b_8)
where C is the value obtained from the first ReLU+FC block applied to A_all, and V_all is the final value obtained by mapping A_all.
The model has three constraint conditions, and we want all three to be satisfied as far as possible; under their joint effect, the entire model is optimized. First, since autoencoders are applied to both the visual features and the semantic features, we want the reconstructed features to be as close as possible to the original features, so we need two reconstruction constraints, one for each autoencoder, namely:
min ||V − V_d||²
min ||A − A_d||²
The last constraint is that the finally obtained optimized semantic features, after mapping, should match the original visual features, namely:
min ||V_all − V||²
Under these constraint conditions, a loss function is established over the parameters of the entire model, and continued training yields the optimal training result.
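Putting the embodiment's steps together, a minimal end-to-end forward pass of the correlated dual-autoencoder model can be sketched as follows. All dimensions, the class structure, and the random initialization are illustrative assumptions; a real implementation would train the W_i and b_i against the three constraints above:

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

class CorrelatedDualAutoencoder:
    """Forward pass only: visual autoencoder, semantic autoencoder
    correlated through V_ae, and the final mapping of A_all back to
    the visual feature space."""

    def __init__(self, d_v=2048, d_a=85, d_e=512, d_h=256, seed=0):
        rng = np.random.default_rng(seed)
        mk = lambda i, o: (rng.normal(scale=0.01, size=(i, o)), np.zeros(o))
        self.W1, self.b1 = mk(d_v, d_e)  # visual encoder
        self.W2, self.b2 = mk(d_e, d_v)  # visual decoder
        self.W3, self.b3 = mk(d_a, d_e)  # semantic encoder
        self.W4, self.b4 = mk(d_e, d_h)  # V_e -> V_ae, block 1
        self.W5, self.b5 = mk(d_h, d_e)  # V_e -> V_ae, block 2
        self.W6, self.b6 = mk(d_e, d_a)  # semantic decoder
        self.W7, self.b7 = mk(d_a, d_h)  # A_all -> visual space, block 1
        self.W8, self.b8 = mk(d_h, d_v)  # A_all -> visual space, block 2

    def forward(self, V, A):
        V_e = relu(V @ self.W1 + self.b1)
        V_d = relu(V_e @ self.W2 + self.b2)
        A_e = relu(A @ self.W3 + self.b3)
        V_ae = relu(relu(V_e @ self.W4 + self.b4) @ self.W5 + self.b5)
        A_d = relu((A_e + V_ae) @ self.W6 + self.b6)  # decode A'_e = A_e + V_ae
        A_all = A + A_d                               # optimized semantic feature
        C = relu(A_all @ self.W7 + self.b7)
        V_all = relu(C @ self.W8 + self.b8)
        return V_d, A_d, V_all

rng = np.random.default_rng(3)
model = CorrelatedDualAutoencoder()
V, A = rng.normal(size=(4, 2048)), rng.normal(size=(4, 85))
V_d, A_d, V_all = model.forward(V, A)
print(V_d.shape, A_d.shape, V_all.shape)  # (4, 2048) (4, 85) (4, 2048)
```

The three returned tensors are exactly the quantities that enter the three constraints, so a training loop would compute the summed squared-error loss from them and update the parameters by gradient descent (e.g. with an automatic-differentiation framework).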
Claims (6)
1. An application method of correlated dual autoencoders in zero-shot learning, characterized by comprising the following steps:
Step (1): obtain the encoded visual features;
Step (2): obtain the optimized semantic features;
Step (3): map the optimized semantic features to the visual features;
wherein step (2) is realized by the following process:
Step A: build an autoencoder for the semantic features, add the encoded visual features obtained in step (1) to the encoded semantic features, and then decode the resulting sum; Step B: add the decoded semantic features to the original semantic features to obtain improved, more complete semantic features.
2. The application method of correlated dual autoencoders in zero-shot learning according to claim 1, characterized in that step (1) is implemented as follows:
The visual features are processed by an autoencoder to obtain the encoded feature V_e, expressed by the formula:
V_e = σ(V·W_1 + b_1)
where V_e denotes the encoded visual feature, V denotes the original visual feature, and σ denotes the ReLU activation function; W_1 is a parameter matrix the model needs to train; b_1 is a bias parameter the model needs to train.
The decoding process is expressed by the following formula:
V_d = σ(V_e·W_2 + b_2)
where V_d denotes the visual feature reconstructed by decoding V_e, W_2 is a parameter matrix the model needs to train, and b_2 is a bias parameter the model needs to train.
3. The application method of correlated dual autoencoders in zero-shot learning according to claim 2, characterized in that step A is specifically:
First, the semantic features are encoded:
A_e = σ(A·W_3 + b_3)
where A_e denotes the attribute feature after encoding the semantic features, A denotes the original semantic feature, and σ denotes the ReLU activation function; W_3 is a parameter matrix the model needs to train; b_3 is a bias parameter the model needs to train.
Then the encoded visual feature V_e is mapped to obtain V_ae, whose dimension is the same as that of the encoded attribute feature A_e. Two fully connected layer + ReLU activation blocks are used in the mapping process, formulated as follows:
B = σ(V_e·W_4 + b_4)
V_ae = σ(B·W_5 + b_5)
where B is the value obtained from the first ReLU+FC block applied to V_e, and V_ae is the final value obtained by mapping V_e; W_4 and W_5 are parameter matrices the model needs to train; b_4 and b_5 are bias parameters the model needs to train. The resulting V_ae is then added to the encoded attribute feature A_e to obtain a new attribute feature A′_e, which has been influenced by the visual features and contains more information:
A′_e = A_e + V_ae
The decoding process is expressed by the following formula:
A_d = σ(A′_e·W_6 + b_6)
where A_d denotes the reconstructed attribute feature, W_6 is a parameter matrix the model needs to train, and b_6 is a bias parameter the model needs to train.
4. The application method of correlated dual autoencoders in zero-shot learning according to claim 3, characterized in that step B is specifically: the original semantic features are added to the decoded semantic features to obtain the optimized semantic features:
A_all = A + A_d
where A_all denotes the optimized semantic feature.
5. The application method of correlated dual autoencoders in zero-shot learning according to claim 4, characterized in that step (3) is specifically: the optimized semantic features obtained in step (2) are mapped to the visual feature space through two fully connected layer + ReLU activation blocks, with the mapping formulated as follows:
C = σ(A_all·W_7 + b_7)
V_all = σ(C·W_8 + b_8)
where C is the value obtained from the first ReLU+FC block applied to A_all, and V_all is the final value obtained by mapping A_all; W_7 and W_8 are parameter matrices the model needs to train; b_7 and b_8 are bias parameters the model needs to train.
6. The application method of correlated dual autoencoders in zero-shot learning according to claim 4 or 5, characterized in that the model has three constraint conditions: first, two reconstruction constraints must be established, one for each of the two autoencoders, namely:
min ||V − V_d||²
min ||A − A_d||²
The last constraint is that the finally obtained optimized semantic features, after mapping, should match the original visual features, namely:
min ||V_all − V||²
Under these constraint conditions, a loss function is established over the parameters of the entire model, and continued training yields the optimal training result.
Priority Applications (1)
- CN201910629751.2A (granted as CN110517328B), priority date 2019-07-12, filing date 2019-07-12: "Application method based on relevant double-self-encoder in zero-time learning"
Publications (2)
- CN110517328A (application), published 2019-11-29
- CN110517328B (grant), published 2020-08-25
Family
- ID=68622880
Family Applications (1)
- CN201910629751.2A, filed 2019-07-12, patent CN110517328B, status Active
Country Status (1)
- CN: CN110517328B
Cited By (2)
- CN113378851A (Alibaba Group Holding Ltd.), priority 2020-02-25, published 2021-09-10: Visual recognition method and device for image data, storage medium and processor
- WO2023071123A1 (Guangdong Jianmei Aluminum Profile Factory (Group) Co., Ltd.), priority 2021-10-29, published 2023-05-04: Self-learning method for semantic feature with maximum gap, and computer device and storage medium
Citations (5)
- CN104166982A (Fudan University), priority 2014-06-30, published 2014-11-26: Image optimization clustering method based on canonical correlation analysis
- CN107679556A (Tianjin University), priority 2017-09-18, published 2018-02-09: Zero-shot image classification method based on a variational autoencoder
- CN108537257A (Tianjin University), priority 2018-03-26, published 2018-09-14: Zero-shot classification method based on discriminative dictionary matrix pairs
- CN109597998A (University of Electronic Science and Technology of China), priority 2018-12-20, published 2019-04-09: Image feature construction method with joint embedding of visual and semantic features
- CN109598279A (Tianjin University), priority 2018-09-27, published 2019-04-09: Zero-shot learning method based on autoencoding adversarial generative networks
Non-Patent Citations (2)
- Lin Kezheng et al., "Zero-shot image recognition algorithm combining semantic autoencoders with relation networks", Pattern Recognition and Artificial Intelligence
- Chen Xiangfeng et al., "Zero-shot classification algorithm with semantic autoencoder improved by metric learning", Journal of Beijing University of Posts and Telecommunications
Also Published As
- CN110517328B, published 2020-08-25
Legal Events
- PB01: Publication
- SE01: Entry into force of request for substantive examination
- GR01: Patent grant