CN110992951A

CN110992951A - Method for protecting personal privacy based on countermeasure sample

Info

Publication number: CN110992951A
Application number: CN201911228334.3A
Authority: CN
Inventors: 付强; 郭九麟; 彭凝多
Original assignee: Sichuan Hongwei Technology Co Ltd
Current assignee: Sichuan Hongwei Technology Co Ltd
Priority date: 2019-12-04
Filing date: 2019-12-04
Publication date: 2020-04-10
Anticipated expiration: 2039-12-04
Also published as: CN110992951B

Abstract

The invention discloses a method for protecting personal privacy based on a confrontation sample, intelligent equipment comprises a microphone, a loudspeaker, a voice recognition module and an interference generation module, and the method comprises the following steps: the user starts an interference generating module, and the interference generating module generates an interference signal and plays the interference signal through a loudspeaker; the microphone collects voice signals and sends the voice signals to the voice recognition module for recognition, and the voice signals comprise user voices and interference signals; the interference generating module is turned off when a voice assistant is needed to use the smart device normally. The method determines the voice space needing interference based on the basic pronunciation, greatly reduces the space and reduces the time for generating the confrontation sample compared with the space of the speaking content; from the acoustic perspective, the voice signal outside the human hearing range is used as a search space, and the search is not perceived by the user; and finding out the universal interference m by adopting a particle swarm optimization algorithm in a search space to obtain a countermeasure sample and protect the privacy of a user.

Description

Method for protecting personal privacy based on countermeasure sample

Technical Field

The invention relates to the technical field of information security, in particular to a method for protecting personal privacy based on a confrontation sample.

Background

With the increasing maturity of artificial intelligence theory and technology, the application field is continuously expanded, and the life style of people is greatly improved and facilitated. The voice recognition is one of the enabling directions of the artificial intelligence technology, and the accuracy of the voice recognition is also continuously improved. The characteristics of the voice interaction mode enable the voice interaction mode to have great potential in the environment of the Internet of things. For example, aiming at the task of setting an alarm clock for a smart phone in daily life, a voice instruction of a few seconds is needed to complete. In the process, the hands are released, low-cost devices such as a microphone, a processor and a loudspeaker of the smart phone are used, and the task is completed through simple voice instructions. From this we can see that voice interaction has the advantages of simplicity, rapidity, low equipment cost, context comprehension, etc. Based on the advantages, the voice interaction also has absolute advantages in a plurality of scenes such as families, vehicles, hiking and the like. While the artificial intelligence technology brings convenience and convenience to people, problems are also associated, for example, when a user does not turn on a voice function of the intelligent device, a voice assistant and the like of the intelligent device always waits for a wake-up word of the user to activate the voice function, namely, the voice assistant and the like listen to the voice content of the user and do not recognize the voice content of the user in a voice recognition system, and the recognized content is used for recommendation of other applications, which is obviously not desired by the user and infringes the privacy of the user.

Disclosure of Invention

The invention aims to provide a method for protecting personal privacy based on an antagonistic sample, which is used for solving the problem that the intelligent equipment identifies the chat content of a user to damage the privacy of the user in the prior art.

The invention protects the voice content of the user from being correctly identified by the intelligent equipment through the following technical scheme:

a method of protecting personal privacy based on an antagonistic sample, comprising a smart device comprising a microphone, a speaker, a voice recognition module, the smart device further comprising an interference generation module, the method comprising:

step S1: the user starts an interference generating module, and the interference generating module generates an interference signal and plays the interference signal through a loudspeaker;

step S2: a microphone collects voice signals, wherein the voice signals comprise user voices and interference signals and are sent to a voice recognition module for recognition, and the voice recognition module cannot recognize original contents of the user voices;

step S3: and when the voice assistant of the intelligent equipment needs to be used normally, the interference generation module is closed so as to achieve the purpose of normal use.

The method is characterized in that small interference (interference signals) which cannot be perceived by some people are intentionally added to an input sample (user voice), so that a voice recognition module of the intelligent equipment gives an erroneous output with high confidence level, namely, an antagonistic sample is introduced into the voice recognition field, and different purposes can be achieved by making different limits on the interference signals.

Further, the method for generating the interference signal by the interference generation module is as follows:

step A: the interference generation module acquires the calling right of the voice recognition module;

and B: determining a voice space needing to be interfered, and recording the voice space as X;

and C: recording voice signals outside the audible range of a person as a search space S according to the acoustic angle;

step D: searching general interference m by adopting a particle swarm optimization algorithm, so that m meets the following requirements: when the universal interference m and any element in the voice space X are played together, the voice recognition module can make a mistake, and the universal interference m is an interference signal.

Further, the particle swarm optimization algorithm in the step D includes:

step D1: initializing optimal values of an optimal particle sequence and an optimization function, wherein the optimization function is defined according to the alignment error of a voice recognition result obtained after all members in a voice space X are added with a universal interference m;

step D2: randomly generating 99 random particle sequences, and adding the optimal particle sequences to obtain 100 particles in total;

step D3: calculating an optimization function value of each of the 99 random particle sequences and the optimal particle sequence, comparing the minimum optimization function value with the optimal value of the optimization function, if the minimum optimization function value is smaller than the optimal value of the optimization function, setting the particle sequence corresponding to the minimum optimization function value as the optimal particle sequence, and setting the minimum optimization function value as the optimal value of the optimization function;

step D4: judging whether the optimal value of the optimization function is smaller than a preset value or not, if so, determining the optimal particle sequence as the universal interference m, and ending; otherwise, the positions and velocities of the 99 random particle sequences are updated, and the procedure returns to step D3.

Further, the method for acquiring the call right of the speech recognition module by the interference generation module in the step a includes: and accessing through a developer account of a voice recognition module manufacturer, or directly using a voice recognition module of the intelligent equipment, and acquiring a voice recognition interface in a reverse mode.

Compared with the prior art, the invention has the following advantages and beneficial effects:

the method determines the voice space needing interference based on the basic pronunciation, greatly reduces the space and reduces the time for generating the confrontation sample compared with the space of the speaking content; from the acoustic perspective, the voice signal outside the human hearing range is used as a search space, and the search is not perceived by the user; the particle swarm optimization algorithm is adopted in the search space, the interference m is found, one interference is aimed at the elements of the pronunciation space, the elements of the pronunciation spaces can make mistakes and are universal interference, a countermeasure sample is obtained, and the privacy of a user is protected.

Drawings

FIG. 1 is a functional block diagram of the present invention;

fig. 2 is a threshold curve of the present invention.

Detailed Description

The present invention will be described in further detail with reference to examples, but the embodiments of the present invention are not limited thereto.

Example 1:

with reference to fig. 1, a method for protecting personal privacy based on a confrontation sample includes a smart device, the smart device includes a microphone, a speaker, and a voice recognition module, the smart device further includes an interference generation module, and the method includes:

step S3: the interference generating module is turned off when a voice assistant is needed to use the smart device normally.

step A: the interference generation module acquires the calling right of the voice recognition module; the access can be realized through a developer account of a voice recognition module manufacturer, and the developer account is registered only by cloud service provision;

or the voice recognition module of the intelligent equipment is directly used, only equipment merchants need to be called normally, and a voice recognition interface (a third party) can be obtained in a reverse mode;

and B: determining a voice space needing to be interfered, and recording the voice space as X; the space of the basic pronunciation is much smaller than the space of the human speech content, such as: the combination of initial consonants and final consonants in Chinese removes the total 400 multi-syllables which can not pronounce, therefore, collect these 400 multi-syllables and obtain the speech space needing to interfere and note as X;

and C: recording voice signals outside the audible range of a person as a search space S according to the acoustic angle; the search is not perceived by the user, and it needs to be considered from the acoustic point of view, and the human ear hearing can not sense the sound of all frequencies and all sound intensities, but only sense the sound of a certain sound pressure and frequency range. The frequency range of normal human audible sound pressure is 20 Hz-20 kHz. Generally, young people can hear 20kHz sounds, while old people can hear high frequency sounds reduced to 10 kHz. Furthermore, the audible sound intensity range of normal people is 0-120 dB SPL (sound pressure level). The pure tone threshold is also called absolute threshold and silent threshold, which reflects the minimum sound pressure level that the human ear can just hear the sound in a quiet environment without any other sound interference, and the unit of the sound pressure level is dB, and the sound pressure level is related to the frequency. The threshold of hearing refers to the lowest sound pressure level that can be heard. There is a corresponding relationship between absolute hearing threshold and frequency. As shown in fig. 2, which is the "absolute hearing threshold" curve obtained according to the formula. Below the threshold curve is our search space.

Defining an optimization function in the particle swarm optimization algorithm according to the errors of the voice recognition results after all members in the voice space X are added with the universal interference m; for example: the recognition is correct to 1 (recognition structure is consistent with the collection), and the error is 0 (recognition result is inconsistent with the collection). The optimized search space is S in step C, and by defining an optimization function, we can know that each member in the speech space X can be correctly identified after m is added, and the final value of the optimization function is the total number of pronunciations, namely the number of members in X; if m is added, errors are identified, when the final value of the optimization function is 0. When m is obtained, most of m can be made wrong, namely the minimum value of the optimization function value reaches a preset value;

the solving step comprises the following steps:

step D1: initializing optimal values of the optimal particle sequence and the optimization function;

Through the steps, a universal interference m is found, namely the result to be found by the user.

The interference generation module obtains the calling right of the voice recognition module and is used for verifying and determining whether the added interference achieves the purpose of making voice error.

Although the present invention has been described herein with reference to the illustrated embodiments thereof, which are intended to be preferred embodiments of the present invention, it is to be understood that the invention is not limited thereto, and that numerous other modifications and embodiments can be devised by those skilled in the art that will fall within the spirit and scope of the principles of this disclosure.

Claims

1. A method for protecting personal privacy based on confrontation samples, comprising a smart device, the smart device comprising a microphone, a speaker, a voice recognition module, wherein the smart device further comprises an interference generation module, the method comprising:

step S2: a microphone collects voice signals and sends the voice signals to a voice recognition module for recognition, wherein the voice signals comprise user voice and interference signals;

2. The method of claim 1, wherein the method for generating the interference signal by the interference generation module comprises:

3. The method according to claim 2, wherein the particle swarm optimization algorithm in step D comprises:

4. The method for protecting personal privacy based on the countermeasure sample according to claim 2 or 3, wherein the method for the interference generation module to obtain the call right of the voice recognition module in step A comprises: and accessing through a developer account of a voice recognition module manufacturer, or directly using a voice recognition module of the intelligent equipment, and acquiring a voice recognition interface in a reverse mode.