CN107145539A

CN107145539A - A method for handling unreasonable data in negative surveys

Info

Publication number: CN107145539A
Application number: CN201710267513.2A
Authority: CN
Inventors: 赵冬冬; 方舒; 向剑文
Original assignee: Wuhan University of Technology WUT
Current assignee: Wuhan University of Technology WUT
Priority date: 2017-04-21
Filing date: 2017-04-21
Publication date: 2017-09-08
Anticipated expiration: 2037-04-21
Also published as: CN107145539B

Abstract

The invention discloses a method for handling unreasonable data in negative surveys, comprising four steps: computing the positive-survey values; adjusting the unreasonable data; for the adjusted data, computing the proportions in which the difference caused by the adjustment is allocated to the other options; and, for the negative-survey values obtained after adjustment, recomputing the positive-survey values. When positive-survey data are reconstructed from a negative survey, the invention can handle both unreasonable data in the form of negative values and unreasonable data that contradict background knowledge, and it achieves higher reconstruction accuracy.

Description

A method for handling unreasonable data in negative surveys

Technical field

The invention belongs to the technical field of privacy protection and relates to methods for reconstructing positive-survey data from a negative survey, and specifically to a method for handling the cases in which a traditional reconstruction algorithm yields unreasonable data.

Background art

In the age of the information explosion, people have gradually come to recognize the importance of personal privacy. To meet the growing demand for privacy protection, more and more privacy-protection methods have been proposed, and the negative survey is one of them. The negative survey is an application of the negative representation of information; compared with a conventional survey, it effectively protects participants' privacy and is particularly suited to collecting sensitive data. In a traditional positive survey, whether or not sensitive data are involved, each participant must select the option that matches his or her actual situation. A negative survey is the opposite: each participant is asked to select one of the options that do not match his or her actual situation.

The data collected by a negative survey are not the data actually needed; what is needed is the positive-survey data reconstructed from the negative survey. Most current reconstruction algorithms are designed for the uniform negative survey, in which the algorithm assumes that a participant selects each negative option with equal probability. NStoPS is the most basic reconstruction algorithm for the uniform negative survey, but its reconstructed results may contain negative values; negative values do not correspond to any real situation and are therefore unreasonable data. To address the negative-value problem, two improved algorithms based on NStoPS were proposed, called NStoPS-I and NStoPS-II. Although both handle negative values well, NStoPS-I iterates slowly and is inefficient, and NStoPS-II is not applicable to negative surveys whose selection probabilities follow an arbitrary distribution.
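The collection rule of a uniform negative survey can be illustrated with a short simulation. This is a minimal sketch and not part of the patent: the function names, the fixed seed and the example population are assumptions chosen for illustration.

```python
import random
from collections import Counter


def negative_answer(true_option: int, c: int, rng: random.Random) -> int:
    """Pick, uniformly at random, one of the c-1 options that is NOT the true option."""
    choice = rng.randrange(c - 1)  # a value in 0 .. c-2
    # Shift past the true option so that it can never be selected.
    return choice if choice < true_option else choice + 1


def collect_negative_survey(true_options, c, seed=0):
    """Return the proportion r_i of participants who selected each option."""
    rng = random.Random(seed)
    counts = Counter(negative_answer(t, c, rng) for t in true_options)
    n = len(true_options)
    return [counts[i] / n for i in range(c)]


# Hypothetical population: 4 options with true proportions 0.4, 0.3, 0.2, 0.1.
population = [0] * 400 + [1] * 300 + [2] * 200 + [3] * 100
r = collect_negative_survey(population, c=4)
```

Only the proportions `r` leave the survey; no participant ever reports the option that actually applies to them.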

Real questionnaires often come with background knowledge; for a survey on a particular disease, for example, hospitals often know the disease's incidence rate. If background knowledge is brought into a negative survey, a traditional reconstruction algorithm can produce unreasonable data that contradict it. For this kind of unreasonable data the NStoPS-BK algorithm was proposed, which demonstrated that making reasonable use of background knowledge can effectively improve the accuracy of data reconstruction.

A negative survey requires participants to choose at random among the options that do not match their own situation, but human preferences may also enter the survey process; both can cause unreasonable data to appear when positive-survey data are reconstructed from the negative survey. Such unreasonable data clearly reduce the accuracy of the reconstructed positive-survey data, so handling them properly in the reconstruction algorithm is very important for improving reconstruction accuracy.

Summary of the invention

To solve the above technical problem, the invention provides a method for handling unreasonable data in negative surveys.

The technical solution adopted by the invention is a method for handling unreasonable data in negative surveys, characterized in that it comprises the following steps:

Step 1: Compute the positive-survey values;

Step 2: Adjust the unreasonable data;

Step 3: For the adjusted data, compute the proportions in which the difference caused by the adjustment is allocated to the other options;

Step 4: For the negative-survey values obtained after adjustment, compute the positive-survey values.

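Preferably, in step 1, the traditional NStoPS reconstruction algorithm is used to compute the positive-survey values:

$$\hat{t}_i = 1 - (c-1)\,r_i$$

where $\hat{t}_i$ is the estimated proportion of participants belonging to option $i$ in the positive survey, $r_i$ is the proportion of participants who selected option $i$ in the negative survey, and $i = 1, 2, \ldots, c$;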
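The NStoPS estimate for proportions, and the way a negative value can arise from it, can be sketched as follows. The function name and the sample proportions are hypothetical; the formula is the uniform-survey estimate, one minus (c-1) times r_i, that step 4 of this description also applies.

```python
def nstops(r):
    """NStoPS estimate for a uniform negative survey: t_hat_i = 1 - (c - 1) * r_i."""
    c = len(r)
    return [1 - (c - 1) * r_i for r_i in r]


# Hypothetical negative-survey proportions for c = 4 options (they sum to 1).
r = [0.18, 0.22, 0.25, 0.35]
t_hat = nstops(r)

# Options whose estimate is negative are unreasonable and need adjusting.
unreasonable = [i for i, t in enumerate(t_hat) if t < 0]
```

Here the last option was selected by more than 1/(c-1) of the participants, so its estimate falls below zero even though the estimates still sum to 1.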

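Preferably, in step 2, the unreasonable data are adjusted; after adjustment, the relation between the negative-survey data and the positive-survey data is:

$$\left\{\begin{array}{l}
\sum_{i=1}^{c} x_i\,p_{i1} = r_1 + \Delta r_1 \\
\sum_{i=1}^{c} x_i\,p_{i2} = r_2 + \Delta r_2 \\
\quad\vdots \\
\sum_{i=1}^{c} x_i\,p_{ic} = r_c + \Delta r_c
\end{array}\right.$$

where $r_i$ is the proportion of participants selecting each option in the negative survey, $x_i$ is the positive-survey proportion after adjustment, $r_1 + \cdots + r_c = 1$ and $x_1 + \cdots + x_c = 1$, $\Delta r_i$ is the difference introduced by adjusting the unreasonable data, and $p_{ij}$ is the probability that a participant who belongs to option $i$ in the positive survey selects option $j$ in the negative survey, with $i = 1, 2, \ldots, c$ and $j = 1, 2, \ldots, c$;

For a uniform negative survey:

$$p_{ij} = \begin{cases} \dfrac{1}{c-1}, & i \neq j \\ 0, & i = j \end{cases}$$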
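As a worked step, the system of step 2 can be reduced to one equation per option. This is a sketch under the assumption that the uniform scheme means $p_{ij} = 1/(c-1)$ for $i \neq j$ and $p_{ii} = 0$, together with $x_1 + \cdots + x_c = 1$:

```latex
\sum_{i=1}^{c} x_i\,p_{ij}
  = \frac{1}{c-1}\sum_{i \neq j} x_i
  = \frac{1 - x_j}{c-1}
  = r_j + \Delta r_j
\quad\Longrightarrow\quad
x_j = 1 - (c-1)\,(r_j + \Delta r_j), \qquad j = 1, \ldots, c.
```

With every $\Delta r_j = 0$ this is the plain NStoPS estimate, so the adjustment acts only through the $\Delta r_j$ terms.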

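Preferably, in step 3, suppose that $n$ options have unreasonable results, and that the estimates $\hat{t}_c, \hat{t}_{c-1}, \ldots, \hat{t}_{c-n+1}$ of these $n$ options are adjusted to the reasonable values $x_c, x_{c-1}, \ldots, x_{c-n+1}$. The proportions in which the difference caused by the adjustment is allocated to the other options are then computed. Let $d_{x_i}$ denote the share for option $i$ in the positive survey; then:

$$d_{x_i} = \frac{x_i}{\sum_{j=1}^{c-n} x_j}\,(-\Delta r_c) + \cdots + \frac{x_i}{\sum_{j=1}^{c-n} x_j}\,(-\Delta r_{c-n+1}), \quad i = 1, \ldots, c-n;$$

Let $\Delta r_i$ denote the share for option $i$ in the negative survey; according to the rules of the negative survey:

$$\Delta r_i = \sum_{j=1,\, j \neq i}^{c-n} d_{x_j}\,\frac{1}{c-n-1}.$$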
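The two allocation formulas of step 3 can be sketched directly in code. The sketch assumes the adjusted proportions of the reasonable options and the differences of the adjusted options are already known; the function name, argument layout and sample values are hypothetical.

```python
def allocate_difference(x_reasonable, delta_unreasonable):
    """
    x_reasonable:       adjusted proportions x_1 .. x_{c-n} of the reasonable options
    delta_unreasonable: differences of the n adjusted options (delta r_c, ..., delta r_{c-n+1})
    Returns (d, delta_r) for the reasonable options.
    """
    m = len(x_reasonable)  # m = c - n
    if m < 2:
        raise ValueError("need at least two reasonable options")
    total_x = sum(x_reasonable)
    total_delta = sum(delta_unreasonable)
    # d_{x_i}: share of the (negated) differences, proportional to x_i.
    d = [x_i / total_x * (-total_delta) for x_i in x_reasonable]
    # delta r_i: every other option j passes d_{x_j} / (c - n - 1) to option i.
    d_sum = sum(d)
    delta_r = [(d_sum - d_i) / (m - 1) for d_i in d]
    return d, delta_r


# Hypothetical values for c = 4 options with one adjusted option.
d, delta_r = allocate_difference([0.45, 0.32, 0.23], [-0.0167])
```

The allocated shares add up to the negated total difference, so the adjusted negative-survey proportions still sum to 1.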

Preferably, in step 4, for the negative-survey values obtained after adjustment, the NStoPS algorithm is applied again to compute the positive-survey values $x_i$:

$$x_i = 1 - (c-1)\,(r_i + \Delta r_i)$$

This further yields the formula for computing $x_i$:

where

$$x_s = x_c + x_{c-1} + \cdots + x_{c-n+1}, \qquad r_s = r_c + r_{c-1} + \cdots + r_{c-n+1}.$$
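The closed-form expression for $x_i$ does not survive in this text. One way to obtain an expression in $x_s$ and $r_s$ is to solve the relations of steps 2 to 4 under the uniform scheme; the following is a hedged reconstruction, and the shorthand $E$ is introduced here rather than taken from the patent.

```latex
% Adjusted options k = c-n+1, ..., c:  x_k = 1 - (c-1)(r_k + \Delta r_k)
\Delta r_k = \frac{1 - x_k}{c-1} - r_k
\;\Longrightarrow\;
E := -(c-1)\sum_{k=c-n+1}^{c} \Delta r_k = (c-1)\,r_s - n + x_s .

% Reasonable options i = 1, ..., c-n, using \sum_{j=1}^{c-n} x_j = 1 - x_s:
d_{x_i} = \frac{x_i}{1 - x_s}\cdot\frac{E}{c-1}, \qquad
\Delta r_i = \frac{1}{c-n-1}\left(\frac{E}{c-1} - d_{x_i}\right).

% Substituting into x_i = 1 - (c-1)(r_i + \Delta r_i) and solving for x_i:
x_i = \frac{(1 - x_s)\,\bigl[(c-n-1)\bigl(1 - (c-1)\,r_i\bigr) - E\bigr]}
           {(c-n-1)(1 - x_s) - E}, \qquad i = 1, \ldots, c-n .
```

Summing this expression over the reasonable options gives $1 - x_s$, so the adjusted proportions still sum to 1.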

The beneficial effects of the invention are as follows: when positive-survey data are reconstructed from a negative survey, the invention can handle both unreasonable data in the form of negative values and unreasonable data that contradict background knowledge, and it achieves higher reconstruction accuracy.

Brief description of the drawings

Fig. 1 is a flow chart of the method of an embodiment of the present invention.

Embodiment

To help those of ordinary skill in the art understand and implement the present invention, it is described in further detail below with reference to the accompanying drawing and an embodiment. It should be understood that the embodiment described here serves only to illustrate and explain the invention and is not intended to limit it.

Referring to Fig. 1, the method for handling unreasonable data in negative surveys provided by the present invention comprises the following steps:

Step 1: Compute the positive-survey values using the traditional NStoPS reconstruction algorithm;

Suppose the total number of survey participants is $N$ and the question has $c$ options. Let the proportions of participants selecting each option in the negative survey be $R = (r_1, r_2, \ldots, r_c)$, and let the corresponding proportions for each option in the positive survey be $T = (t_1, t_2, \ldots, t_c)$. Let $p_{ij}$ denote the probability that a participant who belongs to option $i$ in the positive survey selects option $j$ in the negative survey; the $p_{ij}$ form the $c \times c$ matrix $P = (p_{ij})$.

Both the existing methods for reconstructing positive-survey data and the algorithm described here take the uniform negative survey as their setting, that is:

$$p_{ij} = \begin{cases} \dfrac{1}{c-1}, & i \neq j \\ 0, & i = j \end{cases}$$

In the traditional NStoPS reconstruction method, $R$, $T$ and $P$ satisfy the relation:

$$R = TP$$

$T$ can therefore be computed with the following formula:

$$T = RP^{-1}$$

From the above, the positive-survey estimates can be computed with the following formula:

$$\hat{t}_i = 1 - (c-1)\,r_i$$

where $\hat{t}_i$ is the estimated proportion of participants belonging to option $i$ in the positive survey, $i = 1, 2, \ldots, c$;
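That the matrix relation and the per-option formula agree can be checked with a round trip: apply R = TP to known positive proportions, then recover them with the per-option estimate. This is a minimal sketch assuming the uniform matrix with entries 1/(c-1) off the diagonal and 0 on it; all names and sample values are hypothetical.

```python
def uniform_matrix(c):
    """P for a uniform negative survey: p_ij = 1/(c-1) if i != j, else 0."""
    return [[0.0 if i == j else 1.0 / (c - 1) for j in range(c)] for i in range(c)]


def forward(t, P):
    """R = T P, i.e. r_j = sum_i t_i * p_ij."""
    c = len(t)
    return [sum(t[i] * P[i][j] for i in range(c)) for j in range(c)]


def nstops(r):
    """Per-option form of T = R P^{-1} for the uniform matrix."""
    c = len(r)
    return [1 - (c - 1) * r_j for r_j in r]


t = [0.4, 0.3, 0.2, 0.1]              # hypothetical true positive proportions
r = forward(t, uniform_matrix(len(t)))  # noise-free negative proportions
t_back = nstops(r)                     # recovers t up to rounding
```

With noise-free data the round trip is exact up to rounding; unreasonable values appear only when the observed negative proportions deviate from R = TP, for example through sampling noise or participant bias.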

Step 2: Adjust the unreasonable data;

This embodiment takes a single option as an example; that is, it assumes that the reconstructed result of one option is unreasonable. The traditional NStoPS method is used to compute the positive-survey estimates $\hat{t}_1, \hat{t}_2, \ldots, \hat{t}_c$ of the options. Suppose the result for the $c$-th option is unreasonable and should be adjusted to the reasonable value $x_c$. Because some participants made unreasonable selections, there is a difference $\Delta r_c$ between the positive-survey estimate $\hat{t}_c$ of option $c$ and the known $x_c$. The algorithm first adjusts the unreasonable estimate $\hat{t}_c$ of option $c$ to $x_c$, and then allocates the resulting difference $\Delta r_c$ proportionally among the other $c-1$ options. Denoting the shares allocated to those options by $\Delta r_1, \Delta r_2, \ldots, \Delta r_{c-1}$, the relation among $x_i$, $p_{ij}$, $r_i$ and $\Delta r_i$ is as follows:

$$\left\{\begin{array}{l}
\sum_{i=1}^{c} x_i\,p_{i1} = r_1 + \Delta r_1 \\
\sum_{i=1}^{c} x_i\,p_{i2} = r_2 + \Delta r_2 \\
\quad\vdots \\
\sum_{i=1}^{c} x_i\,p_{ic} = r_c + \Delta r_c
\end{array}\right.$$

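Step 3: For the adjusted data, compute the proportions in which the difference caused by the adjustment is allocated to the other options;

To allocate the difference $\Delta r_c$ proportionally among the other options, first compute the share of $\Delta r_c$ that falls to each of the other options in the positive survey. Let $d_{x_i}$ denote the share for option $i$ in the positive survey, computed as follows:

$$d_{x_i} = \frac{x_i}{\sum_{j=1}^{c-1} x_j}\,(-\Delta r_c), \quad i = 1, \ldots, c-1$$

According to the generation rule of the uniform negative survey, the share $d_{x_i}$ of each of the other $c-1$ options in the positive survey is distributed with equal probability over $c-2$ options of the negative survey, the probability being $1/(c-2)$, so $\Delta r_i$ is computed as follows:

$$\Delta r_i = \sum_{j=1,\, j \neq i}^{c-1} d_{x_j}\,\frac{1}{c-2}$$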

Step 4: For the negative-survey values obtained after adjustment, apply the NStoPS algorithm again to compute the positive-survey values.

Using the adjusted negative-survey values, the positive-survey estimates are computed again with NStoPS:

$$x_i = 1 - (c-1)\,(r_i + \Delta r_i)$$

In summary, for every option other than option $c$, the reconstructed result is computed as follows:

The above walks through the algorithm for the case of a single unreasonable value; the invention also applies to the handling of multiple unreasonable values and can achieve higher accuracy.
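The four steps can be put together in a short end-to-end sketch that handles one or several adjusted options. It rests on assumptions that are not quoted from the patent: a uniform negative survey, allocation shares proportional to the adjusted values x_i, and the observation that the step 4 equation is then linear in x_i and can be solved directly. The function name and the `known` input format are hypothetical.

```python
def reconstruct(r, known):
    """
    r:     negative-survey proportions r_1 .. r_c (summing to 1)
    known: {option index: reasonable value x_k} for the options whose NStoPS
           estimate was judged unreasonable (hypothetical input format)
    Returns the adjusted positive-survey proportions x_1 .. x_c.
    """
    c, n = len(r), len(known)
    m = c - n  # number of reasonable options
    if m < 2:
        raise ValueError("need at least two reasonable options")
    x_s = sum(known.values())
    r_s = sum(r[k] for k in known)
    # e = -(c-1) * (sum of delta r_k over the adjusted options), where
    # delta r_k = (1 - x_k)/(c-1) - r_k follows from x_k = 1 - (c-1)(r_k + delta r_k).
    e = (c - 1) * r_s - n + x_s
    x = [0.0] * c
    for i in range(c):
        if i in known:
            x[i] = known[i]
        else:
            # x_i = a_i + k * x_i is linear in x_i, so solve it directly.
            a_i = 1 - (c - 1) * r[i] - e / (m - 1)
            k = e / ((m - 1) * (1 - x_s))
            x[i] = a_i / (1 - k)
    return x


r = [0.18, 0.22, 0.25, 0.35]        # hypothetical data; NStoPS gives -0.05 for the last option
x = reconstruct(r, known={3: 0.0})  # adjust the negative estimate to 0
```

For this input the result is roughly (0.446, 0.323, 0.231, 0), which sums to 1. The same call accepts a value taken from background knowledge in place of 0.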

It should be understood that the parts of this specification not described in detail belong to the prior art.

It should be understood that the above description of the preferred embodiment is relatively detailed and should not therefore be regarded as limiting the scope of patent protection of the invention. Under the teaching of the invention, and without departing from the scope protected by its claims, those of ordinary skill in the art may make substitutions or modifications, all of which fall within the scope of protection of the invention; the scope of protection claimed shall be determined by the appended claims.

Claims

1. A method for handling unreasonable data in negative surveys, characterized in that it comprises the following steps:

Step 1: Compute the positive-survey values;

Step 2: Adjust the unreasonable data;

2. The method for handling unreasonable data in negative surveys according to claim 1, characterized in that, in step 2, the unreasonable data are adjusted, and after adjustment the relation between the negative-survey data and the positive-survey data is:

$$\left\{\begin{array}{l}
\sum_{i=1}^{c} x_i\,p_{i1} = r_1 + \Delta r_1 \\
\sum_{i=1}^{c} x_i\,p_{i2} = r_2 + \Delta r_2 \\
\quad\vdots \\
\sum_{i=1}^{c} x_i\,p_{ic} = r_c + \Delta r_c
\end{array}\right.$$

where $r_i$ is the proportion of participants selecting each option in the negative survey, $x_i$ is the positive-survey proportion after adjustment, $r_1 + \cdots + r_c = 1$ and $x_1 + \cdots + x_c = 1$, $\Delta r_i$ is the difference introduced by adjusting the unreasonable data, and $p_{ij}$ is the probability that a participant who belongs to option $i$ in the positive survey selects option $j$ in the negative survey, with $i = 1, 2, \ldots, c$ and $j = 1, 2, \ldots, c$.

3. The method for handling unreasonable data in negative surveys according to claim 2, characterized in that, in step 3, it is assumed that $n$ options have unreasonable results, the estimates $\hat{t}_c, \hat{t}_{c-1}, \ldots, \hat{t}_{c-n+1}$ of these $n$ options are adjusted to the reasonable values $x_c, x_{c-1}, \ldots, x_{c-n+1}$, and the proportions in which the difference caused by the adjustment is allocated to the other options are computed; letting $d_{x_i}$ denote the share for option $i$ in the positive survey, then:

$$d_{x_i} = \frac{x_i}{\sum_{j=1}^{c-n} x_j}\,(-\Delta r_c) + \cdots + \frac{x_i}{\sum_{j=1}^{c-n} x_j}\,(-\Delta r_{c-n+1}), \quad (i = 1, \ldots, c-n);$$

$$\Delta r_i = \sum_{j=1,\, j \neq i}^{c-n} d_{x_j}\,\frac{1}{c-n-1}.$$

4. The method for handling unreasonable data in negative surveys according to claim 3, characterized in that, in step 5, for the negative-survey values obtained after adjustment, the positive-survey values $x_i$ are recomputed:

where

$$x_s = x_c + x_{c-1} + \cdots + x_{c-n+1}, \qquad r_s = r_c + r_{c-1} + \cdots + r_{c-n+1}.$$