CN113470666A

CN113470666A - Reversible robust medical audio method based on two-stage embedding

Info

Publication number: CN113470666A
Application number: CN202110687228.2A
Authority: CN
Inventors: 张小瑞; 孙逊; 孙星明; 宋爱国
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2021-06-21
Filing date: 2021-06-21
Publication date: 2021-10-01
Anticipated expiration: 2041-06-21
Also published as: CN113470666B

Abstract

The invention discloses a reversible robust medical audio method based on two-stage embedding, which comprises the following steps: (1) the original audio is converted into two independent embedded domains by a frequency domain transform function: a low frequency embedded domain and a high frequency embedded domain; (2) embedding a watermark in a low-frequency embedded domain by adopting a robust watermark algorithm to form a new low-frequency embedded domain; (3) embedding the watermark error between the new low-frequency embedded domain and the low-frequency embedded domain into the high-frequency embedded domain to form a new high-frequency embedded domain; (4) generating the watermark-containing audio by inverse transformation of the frequency domain transformation function for the new low-frequency embedded domain and the new high-frequency embedded domain; (5) and extracting watermark information from the audio containing the watermark. The invention can effectively extract the watermark information and restore the complete audio frequency under the condition that the audio frequency is not attacked, and is used for the integrity and authenticity authentication of the medical data; the robustness of the watermark is enhanced by using the continuity of the audio and the correlation between sampling points, a small number of sampling points are modified in the reversible watermark, and the audio distortion is reduced.

Description

Reversible robust medical audio method based on two-stage embedding

Technical Field

The invention relates to the field of digital watermarking, in particular to a reversible robust medical audio method based on two-stage embedding.

Background

With the progress of internet and communication technology, remote medical treatment and non-contact medical treatment are widely regarded in the medical field, however, in the current complicated network era, medical data may be maliciously tampered in the transmission process of remote medical treatment, so that data distortion or illegal acquisition causes privacy disclosure of patients. Even small levels of distortion in medical data can affect a physician's diagnosis of a patient's condition, with serious consequences. When the traditional robust watermark and the fragile or semi-fragile watermark extract the watermark, the carrier data can be permanently distorted, so that a doctor cannot effectively diagnose.

Disclosure of Invention

The purpose of the invention is as follows: in view of the above problems, it is an object of the present invention to provide a reversible robust medical audio method based on two-stage embedding.

The technical scheme is as follows: the reversible robust medical audio method based on the two-stage embedding comprises the following steps:

(1) converting original audio into two independent embedding domains including a low frequency embedding domain and A by a frequency domain transformation function F_lHigh frequency embedded domain A_h；

(2) Embedding a domain A in a low frequency by adopting a robust watermarking algorithm_lA robust watermark w1 is embedded in the medium to form a new low-frequency embedded domain

(3) Will new low frequency audio

And A of the original audio_lEmbedding the watermark error E as reversible watermark information w2 into the high-frequency embedding domain A_hIn, forming a new high frequency embedded domain

(4) Embedding new low frequency into domain

And a new high frequency embedded domain

By frequency domain transformation functionsIs inverse transformation F^-1Generating watermark-containing Audio A^w；

(5) For watermark-containing audio A^wAnd extracting watermark information.

Further, the audio conversion process in step 1 is as follows:

the original audio A is represented as a set of sample points, denoted as { a }₁，a₂，a₃,a₄,...,a_2g-1,a_2gDecomposing original audio A into low-frequency embedded domain A by wavelet transform_lAnd a high frequency embedded domain A_hWherein, the sampling point pair is processed by a frequency domain transformation function F, and the expression is as follows:

where g represents half the number of sampling points, g being an integer, a_lRepresenting a low frequency embedded domain signal, a_hRepresenting a high frequency embedded domain signal; low frequency embedded domain A_lAnd a high frequency embedded domain A_hThe lengths are the same.

Further, step 2 includes the following steps in the process of embedding the robust watermark:

(201) embedding low frequencies into domain A_lUniformly dividing the audio into sub-audios which are not overlapped and have the same length, wherein each sub-audio has 2n sampling points, n represents half of the number of the sampling points of one sub-audio, n is an integer, and the sampling point of each sub-audio is marked as a_l(1),a_l(2),......,a_l(2n), in order to ensure information security, defining a random mapping relation:

the positions of the original sampling points are orderly disturbed,

a sequence of positions representing the sample points after mapping;

definition of

To representLow frequency embedded domain A_lIn the sub-audio of

I is more than or equal to 1 and less than or equal to 2 n;

(202) embedding a 1-bit watermark w1 for each sub-audio of the low-frequency embedded domain, wherein w1 belongs to {0,1}, and the embedding formula is as follows:

wherein

Representing the low-frequency embedded domain A after embedding a watermark_lIn the sub-audio of

A sampling point value of

The composed set is represented as a new low frequency embedded domain

μ is watermark embedding strength, d_lRepresenting the low frequency embedded domain A_lThe difference value of the sampling point sets of the front part and the rear part of the sub audio is expressed as follows:

after embedding the robust watermark, the new low frequency embedded domain

Difference value of sampling point sets of front and back parts of sub-audio

The modification is as follows:

(203) the sampling point value is changed to generate overflow, the sampling point mark sequence which overflows after being changed is restored to the original sampling point value, a new sampling point value is reserved for sampling points which do not overflow, and the extraction of the watermark is not influenced because the sampling point value of the overflow point has small change amount and the sampling points which overflow are generated little.

Further, step 3, the watermark error is denoted by E, and the factors for generating the watermark error include the watermark embedding strength mu and the low frequency embedding area A_lThe difference value set D ═ D of sampling point sets of the front part and the back part of the sub-audio_l(1),d_l(2) ,.. }, wherein d_l(1),d_l(2) Respectively representing the low frequency embedded domain A_lD of the first and second sub-audio_lThe expression is:

wherein

Representing an additive binary operation.

Further, the step 3 embedding process includes:

(301) embedding high frequencies into domain A_hUniformly dividing the audio into sub-audio frequencies which are not overlapped and have equal length, wherein sampling points in each sub-audio frequency are divided into M groups, and M represents the number of groups in one sub-audio frequency;

each set comprising 2 sampling points per group,

a qth sampling point representing the kth sampling point group in the sub-audio of the pth high frequency embedding domain, q ∈ {1,2 };

represents the pth high frequency embedded domain A_hOf the k-th group of samples of the sub-audio, k being an integerThe expression is:

the difference statistic of the sub-audio of the pth high-frequency embedded domain is called S (p), and the expression:

(302) by changing

And (3) realizing the change of S (p), so that the watermark is embedded reversibly, wherein the embedding expression is as follows:

wherein B represents the amount of change in S (p),

means not exceeding

Is the largest integer of (a) to (b),

to represent

The changed value; s (p)' represents the changed value of s (p), and the expression is:

selecting secret key T > | S_maxWhere S represents a consonant of the high frequency embedded domainThe overall difference statistic for frequency, change B of s (p), is calculated as:

(303) reversible embedding of watermark sequences into high frequency embedding domain A by changing S (p) histogram of moving difference statistics_hIn the method, 1 bit watermark information is embedded in the sub-audio of each high-frequency embedded domain, and w (p) represents A_hThe watermark embedded in the sub audio of the pth high-frequency embedding domain, w (p) is E, and E represents a watermark error; when the watermark is 0, s (p) is unchanged, and when the watermark is 1, s (p)' ═ s (p) + B, i.e.

The difference statistic S ═ { S (p) |1 ≦ p ≦ N }, where N is the high-frequency embedding region A_hThe number of neutron audios; order to

Wherein α (k) represents

By the q-th sampling point of the k-th sampling point group in the sub-audio of the p-th high-frequency embedded domain

Implementing integer transforms

To effect a change in s (p); the integer transform expression is:

wherein

Is that

The value after integer transformation is composed of

The constructed set represents the new high frequency embedded domain

(304) After transformation

The audio may not be in the range of the original sampling point value, and the audio needs to be preprocessed before the watermark is embedded, so as to prevent the overflow phenomenon from being generated, which is obtained by the following formula:

since k is more than or equal to 1 and less than or equal to M,

therefore, the method comprises the following steps:

order to

Wherein σ represents

The maximum value can be obtained by firstly passing through the sampling point with the embedded watermark and marking the sampling value which is not in the range of the original sampling point value,and then, the marked sampling point values are adjusted into the original sampling point values, the values lower than the lower limit of the original sampling point values are adjusted to the lower limit, the values higher than the upper limit of the original sampling point values are adjusted to the upper limit, the range of the sampling point values is large, sigma is small, overflowed sampling points are few, and the influence on the audio quality is small. The range of sample point values is large and σ is small, the overflow sample points are few, and the influence on the audio quality is small.

Further, the process of generating the watermark-containing audio in the step 4 comprises the following steps: embedding the new low frequency in step 2 into the domain

And the new high frequency embedded domain in step 3

Inverse transformation F by a transformation function^-1Reconstructing watermark-containing Audio A^w。

Further, the step 5 of extracting comprises the following steps:

(501) watermark-containing audio A through frequency domain function F^wDecomposition into two independent embedded domains, including a new low frequency embedded domain

And a new high frequency embedded domain

(502) Reversible watermark information w2 is extracted by shifting the differential histogram and stored,

is recovered to be A_h；

(503) From

The robust watermark w1 is extracted, and the reversible watermark information w2 is used for converting the robust watermark w1 into the reversible watermark information

Is recovered to be A_l；

(504) Inverse transformation F by a transformation function^-1A is to be_hAnd A_lReverting to the original audio a.

Has the advantages that: compared with the prior art, the invention has the following remarkable advantages:

1. the invention is a reversible robust watermarking algorithm, can effectively extract watermarking information and restore complete audio under the condition that the audio is not attacked, and is used for the integrity and authenticity authentication of medical data;

2. the robustness of the watermark is enhanced by using the continuity of the audio and the correlation between sampling points, a small number of sampling points are modified in the reversible watermark, and the audio distortion is reduced;

3. the robust watermark and the reversible watermark are respectively embedded by utilizing two independent embedding domains, so that the influence of the reversible watermark on the robust watermark is effectively reduced, and the robustness is improved.

Drawings

Fig. 1 is a diagram of a watermark embedding framework of the present invention;

fig. 2 is a diagram of a watermark extraction framework of the present invention.

Detailed Description

The reversible robust medical audio method based on two-stage embedding described in this embodiment includes:

(1) converting the audio into two independent embedding domains including a low frequency embedding domain and a high frequency embedding domain by a frequency domain transformation function;

the original audio A is represented as a set of sample points, denoted as { a }₁,a₂,a₃,a₄,...,a_2g-1,a_2gDecomposing original audio A into low-frequency embedded domain A by wavelet transform_lAnd a high frequency embedded domain A_hWherein the sampling point pairs are processed by a transformation function F, and the expression is:

where g represents half the number of sampling points, g being an integer, a_lRepresenting a low frequency embedded domain signal, a_hRepresenting high frequenciesAn embedded domain signal;

low frequency embedded domain A_lAnd a high frequency embedded domain A_hThe lengths are the same.

(2) And embedding the robust watermark in the low-frequency embedded domain by adopting a robust watermark algorithm to form a new low-frequency embedded domain. Fig. 1 is a diagram of a watermark embedding framework.

(201) Embedding low frequencies into domain A, as shown in Table 1_lUniformly dividing the audio into sub-audios which are not overlapped and have the same length, wherein each sub-audio has 2n sampling points, n represents half of the number of sampling points of a character audio, n is an integer, and the sampling point of each sub-audio is marked as a_l(1),a_l(2)，......，a_l(2n), defining a random mapping relation:

the positions of the original sampling points are orderly disturbed,

a sequence of positions representing the sample points after mapping;

definition of

Representing the low frequency embedded domain A_lIn the sub-audio of

I is more than or equal to 1 and less than or equal to 2 n;

TABLE 1 p sub-Audio Frames for the Low frequency Embedded Domain

(202) Embedding a 1-bit watermark w1, w 1E {0,1} for each sub-audio, the embedding formula is as follows:

wherein

A sampling point value of

The composed set is represented as a new low frequency embedded domain

μ is the embedding strength of the watermark, d_lRepresenting the low frequency embedded domain A_lThe difference value of the sampling point sets of the front part and the rear part of the sub audio is expressed as follows:

after embedding the robust watermark, the new low frequency embedded domain

The difference between the sampling points of the front and the rear parts of the sub-audio

The modification is as follows:

(3) And embedding the watermark error between the new low-frequency audio and the original audio into a high-frequency embedded domain to form a new high-frequency embedded domain, wherein the watermark error serves as reversible watermark information.

The watermark error is denoted by E, and the factors for generating the watermark error include the watermark embedding strength mu and the low frequency embedding area A_lThe difference value set D ═ D of sampling point sets of the front part and the back part of the sub-audio_l(1),d_l(2) ,.. }, wherein d_l(1),d_l(2) Respectively representing the low frequency embedded domain A_lD of the first and second sub-audio_lThe expression is:

wherein

Representing an additive binary operation.

The specific embedding process comprises the following steps:

(301) embedding high frequency into Domain A, as shown in Table 2_hUniformly dividing the audio into sub-audios which are not overlapped and have equal length, wherein sampling points in each sub-audio are divided into M groups, and M represents the number of groups in the sub-audio of a high-frequency embedded domain;

each set comprising 2 sampling points per group,

representing the difference of the sampling points of the kth sampling point group of the sub audio of the pth high-frequency embedded domain, and the expression is as follows:

TABLE 2 p sub-Audio Frames for high frequency Embedded Domain

(302) By changing

wherein B represents the amount of change in S (p),

means not exceeding

Is the largest integer of (a) to (b),

to represent

selecting secret key T > | S_maxWherein S represents the overall difference statistics of the sub-audio of the high frequency embedded domain, and the amount of change B of S (p) is calculated as:

Wherein α (k) represents

Implementing integer transforms

To effect a change in s (p); the integer transform expression is:

wherein

Is that

The value after integer transformation is composed of

The constructed set represents the new high frequency embedded domain

(304) After transformation

since k is more than or equal to 1 and less than or equal to M,

therefore, the method comprises the following steps:

order to

Wherein σ represents

The maximum value can be obtained by firstly traversing the sampling points with embedded watermarks, marking sampling values which are not in the range of the original sampling point values, then adjusting the marked sampling point values into the original sampling point values, adjusting the values which are lower than the lower limit of the original sampling point values into the lower limit, adjusting the values which are higher than the upper limit of the original sampling point values into the upper limit, and adjusting the values which are lower than the lower limit of the original sampling point values into the upper limitThe range is large and σ is small, the overflow sampling points are few, and the impact on audio quality is small. The range of sample point values is large and σ is small, the overflow sample points are few, and the influence on the audio quality is small.

(4) Embedding new low frequency into domain

And a new high frequency embedded domain

Inverse transformation F by frequency domain transformation function^-1Generating watermark-containing Audio A^w: embedding the new low frequency in step 2 into the domain

And the new high frequency embedded domain in step 3

(5) For watermark-containing audio A^wExtraction of watermark information is performed as shown in fig. 2.

And a new high frequency embedded domain

is recovered to be A_h；

(503) From

Mid-extraction of robust watermark w1, reversible watermark information w2

Is recovered to be A_l；

(504) Inverse transformation F by frequency domain function^-1A is to be_hAnd A_lReverting to the original audio a.

Claims

1. The reversible robust medical audio method based on the two-stage embedding is characterized by comprising the following steps:

(1) converting original audio into two independent embedded domains including a low-frequency embedded domain and a high-frequency embedded domain through a frequency domain transformation function;

(2) embedding a robust watermark in the low-frequency embedded domain by adopting a robust watermark algorithm to form a new low-frequency embedded domain;

(3) embedding the watermark error between the new low-frequency embedded domain and the low-frequency embedded domain into the high-frequency embedded domain to form a new high-frequency embedded domain, wherein the watermark error is used as reversible watermark information;

(4) generating the watermark-containing audio by inverse transformation of the frequency domain transformation function for the new low-frequency embedded domain and the new high-frequency embedded domain;

(5) and extracting watermark information from the audio containing the watermark.

2. The reversible robust medical audio method based on two-stage embedding of claim 1, wherein the step 1 audio conversion process is:

the original audio A is represented as a set of sample points, denoted as { a }₁,a₂,a₃,a₄,...,a_2g-1,a_2gDecomposing original audio A into low-frequency embedded domain A by wavelet transform_lAnd a high frequency embedded domain A_hAnd processing the sampling point pairs through a frequency domain transformation function F, wherein the expression is as follows:

wherein g represents the number of sampling pointsHalf, g is an integer, a_lRepresenting a low frequency embedded domain signal, a_hRepresenting a high frequency embedded domain signal; low frequency embedded domain A_lAnd a high frequency embedded domain A_hThe lengths are the same.

3. The reversible robust medical audio method based on two-stage embedding of claim 2, wherein the step 2 comprises the following steps in the process of embedding the robust watermark:

the positions of the original sampling points are orderly disturbed,

a sequence of positions representing the sample points after mapping;

definition of

Representing the low frequency embedded domain A_lIn the sub-audio of

I is more than or equal to 1 and less than or equal to 2 n;

wherein

A sampling point value of

The composed set is represented as a new low frequency embedded domain

after embedding the robust watermark, the new low frequency embedded domain

The modification is as follows:

(203) and (4) marking the sequence of the sampling points which overflow after the change because the sampling point values overflow due to the change, restoring the sequence to the original sampling point values, and reserving new sampling point values for the sampling points which do not overflow.

4. The reversible robust medical audio method based on two-stage embedding of claim 3, wherein the watermark error in step 3 is denoted by E, and the watermark error is generatedFactors include watermark embedding strength mu and low frequency embedding domain A_lThe difference value set D ═ D of sampling point sets of the front part and the back part of the sub-audio_l(1),d_l(2) ,.. }, wherein d_l(1),d_l(2) Respectively representing the low frequency embedded domain A_lD of the first and second sub-audio_lThe expression is:

wherein

Representing an additive binary operation.

5. The reversible robust medical audio method based on two-stage embedding of claim 4, wherein the step 3 embedding process comprises:

(301) embedding high frequencies into domain A_hUniformly dividing the audio into sub-audios which are not overlapped and have equal length, wherein sampling points in each sub-audio are divided into M groups, and M represents the number of groups in one sub-audio in a high-frequency embedded domain;

each set comprising 2 sampling points per group,

representing the difference of the sampling points of the kth sampling point group of the sub-audio of the pth high-frequency embedded domain, k is an integer, and the expression is as follows:

(302) by changing

wherein B represents the amount of change in S (p),

means not exceeding

Is the largest integer of (a) to (b),

to represent

Wherein α (k) represents

Implementing integer transforms

To effect a change in s (p); the integer transform expression is:

wherein

Is that

The value after integer transformation is composed of

The constructed set represents the new high frequency embedded domain

(304) After transformation

since k is more than or equal to 1 and less than or equal to M,

therefore, the method comprises the following steps:

order to

Wherein σ represents

The maximum value can be obtained by firstly passing through the sampling point with the embedded watermark and marking the sampling point value which is not at the original sampling point valueAnd (4) adjusting the marked sampling point values to be in the original sampling point values, adjusting the values which are lower than the lower limit of the original sampling point values to be in the lower limit, and adjusting the values which are higher than the upper limit of the original sampling point values to be in the upper limit.

6. The reversible robust medical audio method based on two-stage embedding of claim 5, wherein the step 4 of generating the watermarked audio is as follows: embedding the new low frequency in step 2 into the domain

And the new high frequency embedded domain in step 3

7. The reversible robust medical audio method based on two-stage embedding of claim 6, wherein the step 5 extraction process comprises:

And a new high frequency embedded domain

is recovered to be A_h；

(503) From

Extracting robust watermark w1 from the reversible watermark informationw2 will

Is recovered to be A_l；