CN109377218A

CN109377218A - A kind of method, server and the mobile terminal of the false perception attack of containment

Info

Publication number: CN109377218A
Application number: CN201811101427.5A
Authority: CN
Inventors: 刘杨; 张珍杰; 关建峰; 许长桥
Original assignee: Beijing University of Posts and Telecommunications
Current assignee: Beijing University of Posts and Telecommunications
Priority date: 2018-09-20
Filing date: 2018-09-20
Publication date: 2019-02-22
Anticipated expiration: 2038-09-20
Also published as: CN109377218B

Abstract

The embodiment of the invention provides method, server and the mobile terminals of a kind of false perception attack of containment, this method comprises: obtaining the first perception task, formulate the first payment rule according to the first perception task；The first perception task and the first payment rule are sent to multiple mobile terminals, so that mobile terminal chooses whether to participate in the first perception task according to the first payment rule；It obtains and participates in the perception data that multiple mobile terminals of the first perception task are sent, EM algorithm evaluation is carried out to perception data, obtains the corresponding perception accuracy of each perception data；According to the first payment rule, it is based on the corresponding perception accuracy of each perception data, to the corresponding remuneration of each mobile terminal payment；The second payment rule is obtained, the second payment rule is obtained according to the perception accuracy of the first perception task based on Q-learning algorithm or DQN algorithm, and the second payment rule is for perception task next time.The embodiment of the present invention is provided with the effect that containment user sends false perception attack.

Description

A kind of method, server and the mobile terminal of the false perception attack of containment

Technical field

The present embodiments relate to the methods of intelligent perception field more particularly to a kind of false perception attack of containment, service Device and mobile terminal.

Background technique

It is more and more with the swift and violent growth of the mobile devices such as smart phone, tablet computer, smartwatch and Intelligent bracelet Mobile device be equipped with the sensors of various functions, such as accelerometer, gyroscope, global positioning system and thermometer, with These mobile devices be perception basic unit, gradually formed mobile gunz sensing network (Mobile Crowdsesning, with Lower abbreviation MCS), by cooperating with mobile Internet, distributed awareness task simultaneously collects the perception data of mobile device upload, complete At large-scale perception task.Therefore, environment, network and in terms of, MCS platform or server are moved by recruiting The situation that family is monitored ambient enviroment is employed, to provide numerous services.It is quick with Intelligent programmable wireless device Development, user can be to their wireless device be controlled, for example, user is by manipulating some distinctive built-in sensings freely Device can accurately determine the effort to be paid for completing perception task, further will affect the quality of data.As one Private smart phone user can select perception to make great efforts to maximize personal income, and intelligent perception system must stimulate user to mention For accurately sensing report, and inhibit to forge the attack of sensing data.Otherwise, if user learns sends falseness in MCS task Perception data will not pay for or even certain smart phone users upload false perception data by excitation and attacked, it will MCS server is caused to receive the perception report of a large amount of low forgery.

To solve the above-mentioned problems, game theory is the important means for formulating MCS process, such as auctions, is based on price or base In mechanism such as prestige, it is that MCS task is made contributions that user is motivated using these mechanism.Wherein, the MCS based on auction is proposed The price in user's auction that solution payment is bid minimum is to save cost.It was noted that effectiveness MCS server not The payment user to service is depended only on, their position is additionally depended on, incudes the quality of dynamics and sensor.Therefore, MCS server, which can improve its sensing capabilities by assessment sensing quality and only recruit, provides accurate smart phone report.It moves Innervation answers server application data mining and learning algorithm to assess false sensing report and can inhibit the motivation of deception.But by In estimation error, the accuracy that server excites user to provide report in the case where not knowing the sensing model of user is still obtained Less than guarantee.

Due to the mistake that assessment aspect occurs, how server excites use in the unwitting situation of user's sensing model Family, which provides accurately report, becomes current institute's facing challenges.Therefore, a kind of method of false perception attack of containment is needed now.

Summary of the invention

The embodiment of the present invention is to solve that perception that the user in the prior art in MCS provides is reported lower to be lacked It falls into, provides method, server and the mobile terminal of a kind of false perception attack of containment.

In a first aspect, the embodiment of the invention provides a kind of methods of the false perception attack of containment, comprising:

101, the first perception task is obtained, the first payment rule is formulated according to first perception task；

102, first perception task and first payment rule are sent to multiple mobile terminals, for mobile terminal It is chosen whether to participate in first perception task according to first payment rule；

103, it obtains and participates in the perception data that multiple mobile terminals of first perception task are sent, to the perception number According to EM algorithm evaluation is carried out, the corresponding perception accuracy of each perception data is obtained；

104, according to first payment rule, it is based on the corresponding perception accuracy of each perception data, to each movement Terminal pays corresponding remuneration；

105, the second payment rule is obtained, second payment rule is accurate according to the perception of first perception task Degree, is obtained based on Q-learning algorithm or DQN algorithm, and second payment rule is for perception task next time.

Second aspect, the embodiment of the invention provides a kind of methods of the false perception attack of containment, comprising:

Obtain perception task and payment rule；

Perception task remuneration is estimated in acquisition, described to estimate perception task remuneration according to the perception task and payment rule Then, it estimates to obtain based on perceived quality；

It estimates task remuneration according to described and chooses whether to receive the perception task, if receiving, complete the perception After task, server is sent by the perception data of the perception task；

Receive the corresponding remuneration of the perception task, the corresponding remuneration of the perception task is quasi- according to the perception of the perception data Exactness and the payment rule obtain.

The third aspect, the embodiment of the invention provides a kind of servers of the false perception attack of containment, comprising:

First processing module is formulated the first payment according to first perception task and is advised for obtaining the first perception task Then；

First sending module, it is whole to multiple movements for sending first perception task and first payment rule End, so that mobile terminal chooses whether to participate in first perception task according to first payment rule；

Second processing module, for obtaining the perception number for participating in multiple mobile terminals of first perception task and sending According to, to the perception data carry out EM algorithm evaluation, obtain the corresponding perception accuracy of each perception data；

First payment module, for it is accurate to be based on the corresponding perception of each perception data according to first payment rule Degree, to the corresponding remuneration of each mobile terminal payment；

Third processing module, for obtaining the second payment rule, second payment rule is appointed according to first perception The perception accuracy of business, is obtained based on Q-learning algorithm or DQN algorithm, and second payment rule is for sense next time Know task.

Fourth aspect, the embodiment of the invention provides a kind of mobile terminals of the false perception attack of containment, comprising:

First obtains module, for obtaining perception task and payment rule；

Second obtains module, estimates perception task remuneration for obtaining, described to estimate perception task remuneration according to the sense Know task and the payment rule, estimates to obtain based on perceived quality；

Selecting module chooses whether to receive the perception task for estimating task remuneration according to, if receiving, After completing the perception task, server is sent by the perception data of the perception task；

Remuneration receiving module, for receiving the perception task remuneration, the perception task remuneration is according to the perception number According to perception accuracy and the payment rule obtain.

5th aspect the embodiment of the invention provides a kind of computer equipment, including memory, processor and is stored in On reservoir and the computer program that can run on a processor, the processor realize when executing described program such as first aspect or The method of the false perception attack of containment described in second aspect.

6th aspect, the embodiment of the invention provides a kind of non-transient computer readable storage medium, the non-transient meter Calculation machine readable storage medium storing program for executing stores computer instruction, and the computer instruction makes the computer execute such as first aspect or second The method of the false number perception attack of containment described in aspect.

The method and device of the false perception attack of a kind of containment provided in an embodiment of the present invention, by EM algorithm to each sense Know that the perception data of task is assessed, and according to perception data, using Q-learning algorithm or the study of DQN algorithm to most Good payment rule, excitation user send most accurate perception data, achieve the effect that contain that user sends false perception data.

Detailed description of the invention

In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is this hair Bright some embodiments for those of ordinary skill in the art without creative efforts, can be with root Other attached drawings are obtained according to these attached drawings.

Fig. 1 is the method flow schematic diagram of the false perception attack of containment provided in an embodiment of the present invention；

Fig. 2 is the method flow schematic diagram of the false perception attack of another containment provided in an embodiment of the present invention；

Fig. 3 is the server architecture schematic diagram of the false perception attack of containment provided in an embodiment of the present invention；

Fig. 4 is the mobile terminal structure schematic diagram of the false perception attack of containment provided in an embodiment of the present invention；

Fig. 5 is computer equipment structural schematic diagram provided in an embodiment of the present invention.

Specific embodiment

In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art Every other embodiment obtained without creative efforts, shall fall within the protection scope of the present invention.

MCS is a kind of using mobile devices such as the smart phones, tablet computer or wearable device of user as basic sense Know node, the wireless network of perception task and collection perception data is issued by moving internet.Game theory is current research gunz The important means of sensing network, such as Game Theory, Option Pricing and credit system are applied in intelligent perception network, For motivating user to participate in perception task.However, interests of some malicious users for itself, false perception data is sent To MCS, on the one hand reduce network trap, on the other hand, but also the quality decline of network aware report, or even cause network Blocking.

Fig. 1 is the method flow schematic diagram of the false perception attack of containment provided in an embodiment of the present invention, as shown in Figure 1, this Inventive embodiments provide a kind of method of false perception attack of containment, comprising:

Step 101 obtains the first perception task, formulates the first payment rule according to first perception task；

MCS server is responsible for data collection, processing and application, and is made of multiple aware platform servers, and server is first First perception task is classified, currently, with intelligent perception network research development and deeply, perception task is divided into environment Monitoring, infrastructure monitoring, Social behaviors and social medical information etc., while according to sorted mission requirements, it collects specific The perception information of user, in order to which the perception behavior for paying user obtains reasonable return, server formulates branch according to perception task Pay rule.It should be noted that the first perception task and the first payment rule in the embodiment of the present invention are intelligent perception networks Establish initial stage setting, wherein the first payment rule according in perception task and network historical data, artificial experience or Classified reference data is formulated in internet, and the second payment rule and later described after the embodiment of the present invention It is to learn to obtain by algorithm iteration, therefore, the first payment rule is at the beginning of MCS by the new payment rule for learning to obtain Phase may show distribution principle it is unreasonable or distribution the lower defect of accuracy rate, pass through the iteration of payment rule new below Study, new payment rule will gradually tend to be optimal.

Step 102 sends first perception task and first payment rule to multiple mobile terminals, for movement Terminal chooses whether to participate in first perception task according to first payment rule；

Firstly, MCS server broadcast recruit information, recruit information in include the first perception task and the first payment rule, Wherein, the remuneration information for completing corresponding first perception task is carried in the first payment rule, so that user be motivated to play an active part in Perception task.After user receives recruitment information by mobile terminal, the resource paid according to oneself, for example, when perception consumption Between, terminal power consumption or CPU occupancy situation, consider after paying these resources, whether the remuneration acquired meets expectation, from And decides whether selection and participate in perception task.

Step 103 obtains the perception data for participating in multiple mobile terminals transmission of first perception task, to the sense Primary data carries out EM algorithm evaluation, obtains the corresponding perception accuracy of each perception data；

In MCS, any user for carrying mobile terminal can receive perception task, the perception for causing user to upload The accuracy of data cannot be guaranteed, and especially upload as user to mobile terminal misoperation or certain malicious users false Perception attack, therefore, the reliability of perception data is assessed, so that the accuracy of entire MCS could be improved.In this hair In bright embodiment, user submit perception data accuracy be it is unknown, need to carry out Accuracy evaluation, since there are hidden changes Amount, i.e., the exact value (section) of each perception task, so cannot directly be estimated using maximum likelihood, so calculated using EM Method.

It in embodiments of the present invention, is each user a_kSet an effort matrix e^k, e here^kIt is the square of a m*m Battle array, each element of the insideWherein, i=1,2 ... m, j=1,2...m indicate user a_kThe perception of submission Data are in section d_jIn, but accurately perception data is in section d_iIn, specifically,Include The possible situation of the m kind for the correct perception data that user submits, wherein the value in matrix meets following formula:

Then, the interval probability collection that defining each perception task may be distributed is combined intoAccording to receipts The data set S of collection initializes P, then executes the E step in EM algorithm, the i.e. value of estimation effort square matrix E, then executes in EM algorithm M step, the correct section P of estimation task is gone according to the value of obtained square matrix E in turn, later constantly execute E step and M step, until Convergence, wherein t indicates t-th of perception task.

Step 104, according to first payment rule, the corresponding perception accuracy of each perception data is based on, to each The corresponding remuneration of mobile terminal payment；

It is set as according to the effort that the user j that EM algorithm is estimated is paid in period kSo server is available The scale of precision of estimation is that the total number of the report of i can be indicated with following formula:

Wherein, I function is a Knowledge Function, whenWhen equation is set up, I=1, otherwise I=0.

The benefit of server end refers to that the obtained interests of server subtract expense to be paid, formula are as follows:

Wherein, G⁽ⁱ⁾Indicate the income that server obtains the data that a grade is i, in embodiments of the present invention, due to The factors such as sensing location and submission time, the data that user submits are affected to the contribution of server, can set in the formula Setting impact factor is λ, then the data of grade i is submitted for user j, the income that server obtains is λ_jG⁽ⁱ⁾.Y indicates payment rule Then, in embodiments of the present invention, the corresponding perception accuracy of payment rule y is divided into H rank, be expressed as Wherein P_iInclude H different payment rules.In this way, server is in period k according to the accuracy and payment rule that are estimated to It can be indicated to the remuneration of user are as follows:

The remuneration paid to all users of the task of participation can indicate are as follows:

By above-mentioned formula, according to the perception accuracy for the perception data that EM algorithm evaluation arrives, in conjunction with payment rule, to The user for participating in perception task pays corresponding remuneration, while obtaining the total benefit of intelligent perception network under current state.

Step 105 obtains the second payment rule, and second payment rule is quasi- according to the perception of first perception task Exactness is obtained based on Q-learning algorithm or DQN algorithm, and second payment rule is for perception task next time.

It is exerted since the payment rule in MCS will affect mobile terminal user what the perception task that will be participated in be paid Power, therefore, in embodiments of the present invention, payment process can be described as limited markov decision process (MDP).Due in work In the dynamic environment that the sensing model of jump user hardly results in, MCS can apply Q-learning algorithm, a kind of model-free reinforcing Learning method tests to obtain optimal payment strategy by limited Markovian decision.More specifically statement is based on Q- The intelligent perception network of learning can arrive according to the observation before perception report quality, payment strategy and mass function (the Q function for having discount to long-term reward) to determine payment strategy for each perception task.For example, in Q-learning algorithm In, the ambient condition that period k as movement, can be arranged in payment rule is s^(k), each accurate etc. comprising preceding state Number of stages set and payment rule, formula are as follows:

Q (s, y) is set by the Q equation that the MCS payment strategy based on Q-learning is wherein relied on, then state is dynamic (s, y) long-term expected utility of opposing is updated according to Bellman equation, formula are as follows:

Q(s,y)←(1-α)Q(s,y)+α(u_s(s,y)+γV(s'))；

Wherein s' is next state after state s implementation strategy y, and value equation V provides the maximum value of Q equation, γ For discount factor, indicate that the time is longer, the reward that future obtains is lower, and α ∈ [0,1] indicates the learning efficiency of s-y-s'.

According to the state value s of current system^(k)The value for the Q equation being calculated with action value, MCS server use ε- Greedy algorithm removes selection action value, in this way can be to avoid resting on local optimum.It is specifically exactly to be acted in selection When, the optimal policy predicted under current state is selected with the high probability of 1- ε:

Other strategies are randomly choosed with the probability of ε.

When state space reaches certain radix, due to needing a large amount of calculating, the operation effect of the MCS based on Q-learning Rate will be very low, and this problem can pass through depth Q network (DQN) very good solution.More specifically, using applying instantly The depth convolutional neural networks (CNN) and Q-learning algorithm of every field combine, and have both reached the safety of perception task, Study statespace is had compressed again, improves operational efficiency.

A kind of method and device of the false perception attack of containment provided in an embodiment of the present invention, issues when MCS server and feels After knowing task, initial payment rule is formulated first, excitation user uploads perception data, by EM algorithm to each perception task Perception data assessed, and according to perception data, utilize Q-learning algorithm or the study of DQN algorithm to optimal branch Rule is paid, further user is motivated to send most accurate perception data, achievees the effect that contain that user sends false perception data.

On the basis of the above embodiments, step 105, in the second payment rule of the acquisition, second payment rule It according to the perception accuracy of first perception task, is obtained based on Q-learning algorithm or DQN algorithm, second payment After rule is for perception task next time, comprising:

Based on next time perception task and second payment rule, repeat step 102 to 105, update current state Benefit value, until MCS total benefit restrain.

Q-learning algorithm or DQN algorithm be one kind in dynamic environment by trial and error experimental learning behavior artificial intelligence Technology, the action after being allowed to by study automatically selects ideal behavior in certain circumstances, thus the state being optimal. In embodiments of the present invention, when being in the initial stage due to MCS, to the perception number of external web environment and mobile terminal upload According to priori knowledge is lacked, the payment rule formulated at this time is simultaneously not perfect, and general mobile terminal user can select according to payment rule The expected maximum sensed activation of benefit can be obtained by selecting.

However, the mobile terminal user of certain malice may find certain payment rule according to the not perfect of payment rule Loophole then uploads false perception data with lesser cost and carrys out payt to cheat MCS, therefore, MCS at this time easily by To falseness perception attack.By Q-learning algorithm or DQN algorithm to MCS iterative learning, server can sense to observing The last perception data knowing that report quality and payment rule etc. become familiar with grasp, and arriving according to the observation, is gradually increased sense Know the quotation of task, adjust price catalog, final server obtains optimal payment rule, so that malicious user uploads false perception The probability of attack is preferably minimized to reach a stationary value.

In embodiments of the present invention, it is iterated based on Q-learning algorithm or DQN algorithm, has obtained payment rule, On the one hand excitation user avoids uploading false perception attack, on the other hand as far as possible using the perception data for uploading high quality Reduce the perception data that MCS is user's upload high quality and the remuneration paid.

On the basis of the above embodiments, step 105, the second payment rule of the acquisition, the second payment rule root According to the perception accuracy of first perception task, obtained based on Q-learning algorithm or DQN algorithm, comprising:

According to the perception accuracy of first perception task, obtained based on Q-learning algorithm or DQN algorithm multiple Payment rule；

Target payment rule is chosen from multiple payment rules according to ε-greedy algorithm as the second payment rule.

In the learning process of Q-learning algorithm or DQN algorithm, to avoid learning outcome false convergence, according to ε- Greedy algorithm determines optimal payment rule, and in current perception task, server is gone to select most effective by the probability of ε Payment rule randomly chooses other payment rules with the probability of 1- ε, wherein 0 < ε < 1, and levels off to 1.

In embodiments of the present invention, optimal payment rule is determined by ε-greedy algorithm, had both guaranteed to select every time in this way The payment rule taken makes MCS obtain greatest benefit as far as possible, and prevents from falling into local maximum, to obtain global optimum Payment rule.

On the basis of the above embodiments, step 101, the first perception task of the acquisition is appointed according to first perception The first payment rule is formulated in business, comprising:

Using classified perception data as initial perception data；

The first perception task is obtained, the first payment is formulated according to first perception task and corresponding initial perception data Rule.

In embodiments of the present invention, since there are the insufficient defects of priori by the MCS of initialization, to improve study effect Rate, by having classified existing and perception data that accuracy rate has been assessed is input in MCS, these classified perception data categories It in history perception data, is acquired according to the prior art, details are not described herein again for concrete mode.When server receives first When perception task, the first perception task is classified, it should be noted that classified perception data is only used as payment rule The reference sample of formulation establishes initial stage in MCS, initial payment rule is formulated by existing perception data and perception task, Namely the first payment rule.In addition, also including that malice perception attack or perceived quality are lower in classified perception data And undesirable data information.

By inputting existing initial perception data in the MCS of initialization, the efficiency of e-learning is improved, to make It obtains entire MCS and reaches benefit convergence in a relatively short period of time, achieved the effect that the false perception of containment is attacked.

On the basis of the above embodiments, step 104, described according to first payment rule, it is based on each perception number According to corresponding perception accuracy, to the corresponding remuneration of each mobile terminal payment, comprising:

If the perception accuracy is less than or equal to first threshold, determine that the perception data is false data attack；

If the perception accuracy is greater than first threshold and is less than or equal to second threshold, determine that the perception data is target Perception data；

If the perception data is greater than the second threshold, determine that the perception data is excess perception data；

The corresponding remuneration of mobile terminal payment to the target apperception data and the excess perception data are determined as；

Wherein, the second threshold is greater than the first threshold.

In embodiments of the present invention, after the perception data of this perception task being assessed, to perception data by accurate Grade is classified, it is assumed that each perception data can be attributed to one of them in scale of precision, setting first threshold and Second threshold, it should be noted that second threshold schedule be this perception task best quality and satisfactory perception data, That is, if the quality for the perception data that certain perception task needs need to only reach medium level, if it is more than to appoint that user, which submits, Business requires the perception data of quality, also judges not meet perception task requirement.It should be noted that can be according to actual needs The numerical value of first threshold and second threshold is set, and the present invention is not especially limit this.

For example, setting 0 for first threshold, second threshold is set as 1, and the perception by perception data less than or equal to 0 is reported Be defined as false perception attack, wherein in actual assessment, can also the perception Report Definition by perception data less than 0 be to use Family receives an assignment, but the case where cannot participate in perception task, is still defined as false perception at this point, perception data is equal to 0 and attacks It hits, any remuneration will not be sent to the mobile terminal of such perception data at this time.When perception accuracy is greater than 0 and is less than or equal to When 1, then defining perception data is target apperception data, pays corresponding remuneration to mobile terminal according to payment rule.Equally, work as sense When knowing that accuracy is greater than 1, corresponding remuneration can be also paid according to payment rule at this time to mobile terminal, the difference is that, at this time Perception data quality is higher, also more deviates the requirement of perception task, and the corresponding remuneration of payment rule is typically provided to perception number It is fewer according to the higher payt of quality, so that the user of higher perceived equipment weighs the resource consumption situation of itself, to select Whether selection receives perception task.

The embodiment of the present invention has contained that user initiates the general of false perception attack by setting first threshold and second threshold Rate, user higher for perceived quality, makes such user keep silent under budget limit, reduces the redundancy of perception data And the loss of transimission power.

Fig. 2 is the method for the false perception attack of another containment provided in an embodiment of the present invention, as shown in Fig. 2, the present invention is real It applies example and provides a kind of method of false perception attack of containment, comprising:

Step 201 obtains perception task and payment rule；

Perception task remuneration is estimated in step 202, acquisition, described to estimate perception task remuneration according to the perception task and institute Payment rule is stated, estimates to obtain based on perceived quality；

Step 203 estimates task remuneration according to and chooses whether to receive the perception task, if receiving, completes After the perception task, server is sent by the perception data of the perception task；

Step 204 receives the corresponding remuneration of the perception task, and the corresponding remuneration of the perception task is according to the perception data Perception accuracy and the payment rule obtain.

In embodiments of the present invention, firstly, MCS server broadcast recruits information, recruiting information includes perception task and branch Rule is paid, wherein payment rule can motivate user to play an active part in perception task, when each user is received by mobile terminal After recruiting information, the perceptual strategy of oneself is determined.Such as, if agreement receives perception task, if agreeing to, can think deeply this for sense Know that task distribution how many resource are handled, since the quality of perception data depends on the perception of the sensor of mobile terminal Degree, such as time, the electricity of perception consumption, mobile terminal can be according to the estimated distribution resource combination mobile terminal sheet of user at this time The perception dynamics of body estimates a perception task remuneration, if the perception task remuneration estimated has reached the ideal expectation value of user, Then user will receive this perception task, and send MCS server by mobile terminal for perception data after the completion, Server assesses the perception accuracy to perception data, and according to payment rule, sends corresponding report to mobile terminal Reward.

In embodiments of the present invention, mobile terminal is according to payment rule, in conjunction with the perception dynamics of itself, estimates out this sense Know task remuneration, user can analyze the ideal expectation value of itself early stage receiving perception task, select next Corresponding actions have contained the probability of false perception attack to improve the accuracy of perception data.

Fig. 3 is the server architecture schematic diagram of the false perception attack of containment provided in an embodiment of the present invention, as shown in figure 3, The embodiment of the invention provides a kind of servers of the false perception attack of containment, comprising: first processing module 301, sending module 302, Second processing module 303, payment module 304 and third processing module 305, wherein first processing module 301 is for obtaining First perception task formulates the first payment rule according to first perception task；Sending module 302 is for sending described first Perception task and first payment rule are to multiple mobile terminals, so that mobile terminal is selected according to first payment rule Whether first perception task is participated in；Second processing module 303 is for obtaining the multiple shiftings for participating in first perception task The perception data that dynamic terminal is sent carries out EM algorithm evaluation to the perception data, obtains the corresponding perception of each perception data Accuracy；Payment module 304 is used to be based on the corresponding perception accuracy of each perception data according to first payment rule, To the corresponding remuneration of each mobile terminal payment；Third processing module 305 is for obtaining the second payment rule, second payment Rule is obtained according to the perception accuracy of first perception task based on Q-learning algorithm or DQN algorithm, and described second Payment rule is for perception task next time.

In embodiments of the present invention, third processing module 305 is based on Q-learning algorithm or DQN algorithm to this perception The perception data of task is iterated, and is obtained the payment rule for perception task next time, is on the one hand motivated user using upper The perception data of biography high quality avoids uploading falseness perception attack, and on the other hand reduction MCS as far as possible is uploaded high-quality for user The perception data of amount and the remuneration paid.

Fig. 4 is the mobile terminal structure schematic diagram of the false perception attack of containment provided in an embodiment of the present invention, such as Fig. 4 institute Show, the embodiment of the invention provides a kind of mobile terminals of the false perception attack of containment, comprising: first obtains module 401, second Obtain module 402, selecting module 403 and remuneration receiving module 404, wherein the first acquisition module 401 is for obtaining perception task And payment rule；Second acquisition module 402 estimates perception task remuneration for obtaining, described to estimate perception task remuneration according to institute Perception task and the payment rule are stated, estimates to obtain based on perceived quality；Selecting module 403 according to for estimating task Remuneration chooses whether to receive the perception task, if receiving, after completing the perception task, by the sense of the perception task Primary data is sent to server；Remuneration receiving module 404 is for receiving the perception task remuneration, the perception task remuneration root It is obtained according to the perception accuracy and the payment rule of the perception data.

In embodiments of the present invention, second module 402 is obtained according to payment rule, in conjunction with the perception dynamics of mobile terminal, This perception task remuneration is estimated out, user can analyze the ideal expectation value of itself early stage receiving perception task, Next corresponding actions are selected, to improve the accuracy of perception data, have contained the probability of false perception attack.

Device provided in an embodiment of the present invention is for executing above-mentioned each method embodiment, detailed process and detailed content Above-described embodiment is please referred to, details are not described herein again.

Fig. 5 is computer equipment structural schematic diagram provided in an embodiment of the present invention, as shown in figure 5, the computer equipment can To include: processor (processor) 501,502, memory communication interface (Communications Interface) (memory) 503 and communication bus 504, wherein processor 501, communication interface 502, memory 503 pass through communication bus 504 Complete mutual communication.Processor 501 can call the logical order in memory 503, to execute following method: obtain the One perception task formulates the first payment rule according to first perception task；Send first perception task and described One payment rule is to multiple mobile terminals, so that mobile terminal chooses whether to participate in described first according to first payment rule Perception task；It obtains and participates in the perception data that multiple mobile terminals of first perception task are sent, to the perception data EM algorithm evaluation is carried out, the corresponding perception accuracy of each perception data is obtained；According to first payment rule, based on each The corresponding perception accuracy of perception data, to the corresponding remuneration of each mobile terminal payment；Obtain the second payment rule, described Two payment rules are obtained according to the perception accuracy of first perception task based on Q-learning algorithm or DQN algorithm, institute The second payment rule is stated for perception task next time；

Or, obtaining perception task and payment rule；Perception task remuneration is estimated in acquisition, described to estimate perception task remuneration root According to the perception task and the payment rule, estimate to obtain based on perceived quality；According to it is described estimate task remuneration selection be It is no to receive the perception task, if receiving, after completing the perception task, the perception data of the perception task is sent To server；Receive the corresponding remuneration of the perception task, the corresponding remuneration of the perception task is according to the perception of the perception data Accuracy and the payment rule obtain.

In addition, the logical order in above-mentioned memory 503 can be realized by way of SFU software functional unit and conduct Independent product when selling or using, can store in a computer readable storage medium.Based on this understanding, originally Substantially the part of the part that contributes to existing technology or the technical solution can be in other words for the technical solution of invention The form of software product embodies, which is stored in a storage medium, including some instructions to So that a computer equipment (can be personal computer, server or the network equipment etc.) executes each implementation of the present invention The all or part of the steps of example the method.And storage medium above-mentioned include: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. it is various It can store the medium of program code.

The embodiment of the present invention discloses a kind of computer program product, and the computer program product is non-transient including being stored in Computer program on computer readable storage medium, the computer program include program instruction, when described program instructs quilt When computer executes, computer is able to carry out method provided by above-mentioned each method embodiment, for example, obtains the first perception Task formulates the first payment rule according to first perception task；Send first perception task and first payment Rule arrives multiple mobile terminals, so that mobile terminal chooses whether to participate in first perception times according to first payment rule Business；It obtains and participates in the perception data that multiple mobile terminals of first perception task are sent, EM is carried out to the perception data Algorithm evaluation obtains the corresponding perception accuracy of each perception data；According to first payment rule, it is based on each perception number According to corresponding perception accuracy, to the corresponding remuneration of each mobile terminal payment；Obtain the second payment rule, second payment Rule is obtained according to the perception accuracy of first perception task based on Q-learning algorithm or DQN algorithm, and described second Payment rule is for perception task next time；

The embodiment of the present invention provides a kind of non-transient computer readable storage medium, the non-transient computer readable storage medium The instruction of matter storage server, the computer instruction make computer execute the false perception attack of containment provided by above-described embodiment Method, for example, obtain the first perception task, the first payment rule is formulated according to first perception task；Described in transmission First perception task and first payment rule are to multiple mobile terminals, so that mobile terminal is according to first payment rule It chooses whether to participate in first perception task；It obtains and participates in the perception that multiple mobile terminals of first perception task are sent Data carry out EM algorithm evaluation to the perception data, obtain the corresponding perception accuracy of each perception data；According to described One payment rule is based on the corresponding perception accuracy of each perception data, to the corresponding remuneration of each mobile terminal payment；It obtains Second payment rule, second payment rule are based on Q-learning according to the perception accuracy of first perception task Algorithm or DQN algorithm obtain, and second payment rule is for perception task next time；

Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations；Although Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used To modify the technical solutions described in the foregoing embodiments or equivalent replacement of some of the technical features； And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution spirit and Range.

Claims

1. a kind of method of the false perception attack of containment characterized by comprising

102, first perception task and first payment rule are sent to multiple mobile terminals, for mobile terminal according to First payment rule chooses whether to participate in first perception task；

103, obtain and participate in the perception data that multiple mobile terminals of first perception task are sent, to the perception data into Row EM algorithm evaluation obtains the corresponding perception accuracy of each perception data；

104, according to first payment rule, it is based on the corresponding perception accuracy of each perception data, to each mobile terminal Pay corresponding remuneration；

105, the second payment rule, perception accuracy of second payment rule according to first perception task, base are obtained It is obtained in Q-learning algorithm or DQN algorithm, second payment rule is for perception task next time.

2. the method according to claim 1, wherein described second pays in the second payment rule of the acquisition Rule is obtained according to the perception accuracy of first perception task based on Q-learning algorithm or DQN algorithm, and described second After payment rule is for perception task next time, comprising:

Based on next time perception task and second payment rule, repeat step 102 to 105, update the effect of current state Benefit value, until the total benefit of intelligent perception network is restrained.

3. method according to claim 1 or 2, which is characterized in that the second payment rule of the acquisition, second payment Rule is obtained according to the perception accuracy of first perception task based on Q-learning algorithm or DQN algorithm, comprising:

According to the perception accuracy of first perception task, multiple payments are obtained based on Q-learning algorithm or DQN algorithm Rule；

4. the method according to claim 1, wherein the first perception task of the acquisition, feels according to described first Know that task formulates the first payment rule, comprising:

Using classified perception data as initial perception data；

The first perception task is obtained, the first payment is formulated according to first perception task and corresponding initial perception data and is advised Then.

5. being based on each sense the method according to claim 1, wherein described according to first payment rule The corresponding perception accuracy of primary data, to the corresponding remuneration of each mobile terminal payment, comprising:

If the perception accuracy is greater than first threshold and is less than or equal to second threshold, determine that the perception data is target apperception Data；

Wherein, the second threshold is greater than the first threshold.

6. a kind of method of the false perception attack of containment characterized by comprising

Obtain perception task and payment rule；

Perception task remuneration is estimated in acquisition, described to estimate perception task remuneration according to the perception task and the payment rule, It estimates to obtain based on perceived quality；

It estimates task remuneration according to described and chooses whether to receive the perception task, if receiving, complete the perception task Afterwards, server is sent by the perception data of the perception task；

Receive the corresponding remuneration of the perception task, the corresponding remuneration of the perception task is according to the perception accuracy of the perception data It is obtained with the payment rule.

7. a kind of server of the false perception attack of containment characterized by comprising

First processing module formulates the first payment rule according to first perception task for obtaining the first perception task；

Sending module, for sending first perception task and first payment rule to multiple mobile terminals, for moving Dynamic terminal chooses whether to participate in first perception task according to first payment rule；

Second processing module is right for obtaining the perception data for participating in multiple mobile terminals of first perception task and sending The perception data carries out EM algorithm evaluation, obtains the corresponding perception accuracy of each perception data；

Payment module, for being based on the corresponding perception accuracy of each perception data according to first payment rule, to each The corresponding remuneration of mobile terminal payment；

Third processing module, for obtaining the second payment rule, second payment rule is according to first perception task Accuracy is perceived, is obtained based on Q-learning algorithm or DQN algorithm, second payment rule is appointed for perception next time Business.

8. a kind of mobile terminal of the false perception attack of containment characterized by comprising

First obtains module, for obtaining perception task and payment rule；

Second obtains module, estimates perception task remuneration for obtaining, described to estimate perception task remuneration according to the perception times Business and the payment rule, estimate to obtain based on perceived quality；

Selecting module chooses whether to receive the perception task for estimating task remuneration according to, if receiving, completes After the perception task, server is sent by the perception data of the perception task；

Remuneration receiving module, for receiving the perception task remuneration, the perception task remuneration is according to the perception data Perception accuracy and the payment rule obtain.

9. a kind of computer equipment including memory, processor and stores the meter that can be run on a memory and on a processor Calculation machine program, which is characterized in that the processor realizes the containment as described in any one of claim 1 to 6 when executing described program The method of false perception data.

10. a kind of non-transient computer readable storage medium, which is characterized in that the non-transient computer readable storage medium is deposited Computer instruction is stored up, the computer instruction executes the computer and contains false sense as described in any one of claim 1 to 6 The method of primary data.