CN112017675B

CN112017675B - Method for detecting single sound in broadcast audio signal based on audio characteristics

Info

Publication number: CN112017675B
Application number: CN202010772907.5A
Authority: CN
Inventors: 陈永泽; 彭小光; 吕连新; 盛国林; 顾江标
Original assignee: Hangzhou Linker Technology Co ltd
Current assignee: Hangzhou Linker Technology Co ltd
Priority date: 2020-08-04
Filing date: 2020-08-04
Publication date: 2023-06-27
Anticipated expiration: 2040-08-04
Also published as: CN112017675A

Abstract

The invention discloses a method for detecting a single tone in a broadcast audio signal based on audio characteristics, comprising the following steps: S1, preprocessing an input signal and converting it into PCM data; S2, performing DFT processing on the preprocessed signal and outputting spectrum data P; S3, converting the spectrum data to obtain converted data $P_S$; S4, transposing the converted data $P_S$ to obtain transposed data $P_S^T$; S5, searching $P_S^T$ for a value greater than a first threshold; if such a value exists, proceeding to step S6; otherwise, determining that no single-tone component is present in the input signal and ending the detection process; S6, calculating the median absolute deviation (MAD) of $P_S^T$; S7, searching the MAD sequence for a value less than or equal to a second threshold; if such a value exists, determining that a single-tone component is present in the input signal; otherwise, determining that no single-tone component is present in the input signal. The scheme is suitable for the field of broadcast audio signal detection.

Description

Method for detecting single sound in broadcast audio signal based on audio characteristics

Technical Field

The invention relates to the technical field of audio signal analysis, and in particular to a method for detecting a single tone in a broadcast audio signal based on audio characteristics.

Background

Between production and broadcast, a broadcast program passes through a series of software and hardware processing stages, during which a single-tone signal may become mixed into the normal program signal. A single-tone signal is a conventional wireless communication signal. Because it carries no modulation information, it offers few features for detection and identification, so rapid identification of single-tone signals is both a common and a relatively difficult problem in the field of signal spectrum monitoring.

Disclosure of Invention

The invention mainly addresses the technical problem that the prior art lacks a means of detecting single-tone signals rapidly and accurately, and provides a method for detecting a single tone in a broadcast audio signal based on audio characteristics.

The invention solves the above technical problem mainly through the following technical scheme: a method for detecting a single tone in a broadcast audio signal based on audio characteristics, comprising the following steps:

S1, preprocessing an input signal and converting it into PCM (Pulse Code Modulation) data;

S2, performing DFT (Discrete Fourier Transform) processing on the preprocessed signal and outputting spectrum data P;

S3, converting the spectrum data according to the following formula:

$P_S = 10 \cdot \lg(P)$

to obtain converted data $P_S$, where $P_S$ is a matrix in which each element is the signal strength of one signal component; this step improves the precision of the feature analysis and reduces computational complexity;

S4, transposing the converted data $P_S$ to obtain transposed data $P_S^T$;

S5, searching $P_S^T$ for a value greater than a first threshold; if such a value exists, proceeding to step S6; otherwise, determining that no single-tone component is present in the input signal and ending the detection process;

S6, calculating the median absolute deviation (MAD) of $P_S^T$ according to the following formula:

$\mathrm{MAD} = \mathrm{median}(|X_i - \mathrm{median}(X)|)$

where $X_i$ is the i-th scalar in the full set of scalars $X$, and median is the operation that returns the median value;

S7, searching the MAD sequence for a value less than or equal to a second threshold; if such a value exists, determining that a single-tone component is present in the input signal; otherwise, determining that no single-tone component is present in the input signal.

The scheme analyzes the features of single-tone signals, calculates the single-tone feature component, and thereby detects whether a single-tone signal is mixed into the input signal.
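As an illustration only, steps S2 to S7 could be sketched in Python with NumPy as follows. The text does not specify framing, windowing, how P is scaled, or along which axis the MAD is taken, so the following are all assumptions of this sketch and not part of the disclosed method: non-overlapping Hann-windowed frames of a hypothetical length `frame_len`, power normalised to the global maximum so that the strongest component sits at 0 dB, and a MAD computed per frequency bin across time for those bins that pass the first threshold. The function name `detect_single_tone` is likewise hypothetical.

```python
import numpy as np

FIRST_THRESHOLD = -0.2   # preferred first threshold given in the text
SECOND_THRESHOLD = 0.8   # preferred second threshold given in the text


def detect_single_tone(pcm, frame_len=1024):
    """Return True if a single-tone component is judged present (S2 to S7).

    Assumed, not stated in the source: non-overlapping Hann-windowed frames,
    power normalised to the global maximum, MAD taken per frequency bin.
    """
    pcm = np.asarray(pcm, dtype=np.float64)
    n_frames = len(pcm) // frame_len
    if n_frames < 2:
        return False  # a deviation over time needs more than one frame
    frames = pcm[: n_frames * frame_len].reshape(n_frames, frame_len)

    # S2: DFT of each frame; P holds the power of every spectral component.
    spectrum = np.fft.rfft(frames * np.hanning(frame_len), axis=1)
    p = np.abs(spectrum) ** 2
    peak = p.max()
    if peak == 0.0:
        return False  # digital silence
    p = p / peak  # assumed normalisation: strongest component at 0 dB

    # S3: P_S = 10 * lg(P); a small floor avoids log10(0).
    p_s = 10.0 * np.log10(np.maximum(p, 1e-12))

    # S4: transpose so that each row is one frequency bin over time.
    p_s_t = p_s.T

    # S5: keep only rows containing a value above the first threshold.
    candidates = p_s_t[np.any(p_s_t > FIRST_THRESHOLD, axis=1)]
    if candidates.shape[0] == 0:
        return False

    # S6: median absolute deviation of each candidate row.
    med = np.median(candidates, axis=1, keepdims=True)
    mad = np.median(np.abs(candidates - med), axis=1)

    # S7: a row whose level barely changes over time indicates a tone.
    return bool(np.any(mad <= SECOND_THRESHOLD))
```

Under these assumptions a steady sinusoid keeps its strongest bin at an almost constant level, so that bin's MAD is close to zero, whereas noise-like content fluctuates by several decibels from frame to frame.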

Preferably, in step S5 the first threshold is -0.2, and in step S7 the second threshold is 0.8.

Screening with these two thresholds accurately reflects the presence of noise components.

Preferably, in step S2, the DFT is computed as follows:

$X(k) = \sum_{n=0}^{N-1} x(n)\, e^{-j 2\pi k n / N}, \quad k = 0, 1, \ldots, N-1$

where $X(k)$ is the k-th DFT output value, $x(n)$ is the input signal at time point $n$, and $N$ is the number of samples in the time-domain input sequence, which equals the number of DFT output frequency points.
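The variable descriptions above match the standard DFT definition. Assuming that definition is the one intended, a minimal sketch evaluates it directly as a matrix product. The function name `dft` and the test signal are illustrative only. In practice a fast transform such as `numpy.fft.fft` would be used, since direct evaluation costs on the order of N² operations.

```python
import numpy as np


def dft(x):
    """Directly evaluate X(k) = sum_n x(n) * exp(-j*2*pi*k*n/N)."""
    x = np.asarray(x, dtype=np.complex128)
    n_samples = x.shape[0]
    n = np.arange(n_samples)      # time index n
    k = n.reshape(-1, 1)          # frequency index k, as a column
    twiddle = np.exp(-2j * np.pi * k * n / n_samples)
    return twiddle @ x


# Illustrative input: a cosine with exactly 3 cycles in 16 samples.
signal = np.cos(2 * np.pi * 3 * np.arange(16) / 16)
spectrum = dft(signal)
```

For this input the energy is concentrated in frequency point k = 3 and its mirror image k = 13, which is the kind of isolated spectral peak the later steps look for.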

Preferably, step S1 specifically comprises:

S101, judging whether the input signal is an analog signal; if so, proceeding to step S102; otherwise, jumping to step S103;

S102, quantizing the analog signal into a digital signal, outputting PCM data, and proceeding to step S105;

S103, judging whether the digital signal is PCM data; if so, proceeding to step S105; otherwise, proceeding to step S104;

S104, decoding the digital signal into a PCM signal, and then proceeding to step S105;

S105, outputting the preprocessed PCM signal.
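The branching in steps S101 to S105 can be traced with a small sketch. It only records which steps are visited for a given kind of input and performs no actual conversion or decoding. The function name `preprocessing_path` and its two boolean parameters are hypothetical and are not taken from the text.

```python
def preprocessing_path(is_analog, is_pcm=False):
    """Return the step labels visited in S1 for a given kind of input."""
    path = ["S101"]
    if is_analog:
        path.append("S102")      # A/D conversion outputs PCM directly
    else:
        path.append("S103")
        if not is_pcm:
            path.append("S104")  # decode other digital formats to PCM
    path.append("S105")          # every branch ends by outputting PCM
    return path
```

For example, an analog input visits S101, S102 and S105, while a non-PCM digital input visits S101, S103, S104 and S105.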

Preferably, the preprocessed PCM signal has a uniform sampling rate, bit depth and channel count.

Typically, a sampling rate of 48 kHz, a bit depth of 16 bits and stereo may be used.
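As an illustration of what the uniform format means for later processing, the sketch below turns raw 16-bit stereo PCM bytes into a floating-point array. Little-endian byte order and interleaved left/right samples are assumptions of this sketch, since the text fixes only the sampling rate, bit depth and channel count. The names `SAMPLE_RATE`, `CHANNELS` and `pcm16_to_float` are hypothetical.

```python
import numpy as np

SAMPLE_RATE = 48_000  # Hz, the typical value given in the text
CHANNELS = 2          # stereo


def pcm16_to_float(raw):
    """Convert interleaved little-endian 16-bit PCM bytes to floats.

    Returns an array of shape (n_samples, CHANNELS) scaled to [-1.0, 1.0).
    The byte length must be a multiple of 2 * CHANNELS.
    """
    samples = np.frombuffer(raw, dtype="<i2").reshape(-1, CHANNELS)
    return samples.astype(np.float64) / 32768.0
```

With every input reduced to the same layout, the duration in seconds is simply the number of rows divided by `SAMPLE_RATE`, and thresholds tuned for one source stay comparable across sources.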

The substantial effect of the invention is that it can rapidly and accurately detect whether a single-tone signal is mixed into the input signal, with simple steps, a small amount of calculation and good real-time performance.

Drawings

Fig. 1 is a flow chart of the present invention.

Detailed Description

The technical scheme of the invention is further specifically described below through examples and with reference to the accompanying drawings.

Example: the method of this embodiment for detecting a single tone in a broadcast audio signal based on audio characteristics, as shown in Fig. 1, comprises the following steps:

Stage one, signal preprocessing:

1. If the incoming signal is an analog signal, go to step 2; otherwise, go to step 3.

2. A/D conversion: quantize the analog signal into a digital signal, output PCM, and go to step 5.

3. If the digital signal is not PCM, go to step 4; otherwise, go to step 5.

4. Decode to PCM and go to step 5.

5. The preprocessed signal is PCM data with uniform parameters: sampling rate, bit depth and channel count (here a 48 kHz sampling rate, 16-bit depth, stereo).

Stage two, feature analysis:

6. DFT processing is performed on the input signal, and spectrum data P (a discrete linear table) is output:

$X(k) = \sum_{n=0}^{N-1} x(n)\, e^{-j 2\pi k n / N}, \quad k = 0, 1, \ldots, N-1$

where $X(k)$ is the k-th DFT output value, $x(n)$ is the input signal at time point $n$, and $N$ is the number of samples in the time-domain input sequence, which equals the number of DFT output frequency points.

7. After DFT processing, the spectrum data is first converted according to the following formula:

$P_S = 10 \cdot \lg(P)$

where $P_S$ is the spectrum data after conversion.

This step improves the precision of the feature analysis and reduces computational complexity.
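A short worked example of the conversion is sketched below. The small floor that guards against taking the logarithm of zero is an assumption of this sketch and does not appear in the text. The name `to_decibels` and the input values are illustrative.

```python
import numpy as np


def to_decibels(p, floor=1e-12):
    """Apply P_S = 10 * lg(P) element-wise; `floor` avoids log10(0)."""
    p = np.asarray(p, dtype=np.float64)
    return 10.0 * np.log10(np.maximum(p, floor))


# Illustrative values: each factor of 10 in P becomes a step of 10 in P_S.
levels = to_decibels([1.0, 0.1, 0.001, 0.0])
```

The logarithm compresses values spanning many orders of magnitude into a narrow range, which is why fixed thresholds can then be applied directly to the converted data.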

Stage three, calculating the single-tone component and detecting whether a single-tone signal is mixed in:

8. Transpose $P_S$ to obtain $P_S^T$. If $P_S^T$ contains a value greater than -0.2, calculate the median absolute deviation (also called the median absolute difference) of $P_S^T$ according to the formula:

$\mathrm{MAD} = \mathrm{median}(|X_i - \mathrm{median}(X)|)$

If the MAD sequence then contains a value less than or equal to 0.8, it is determined that a single-tone component is present in the audio signal. If $P_S^T$ contains no value greater than -0.2, or the MAD sequence contains no value less than or equal to 0.8, it is determined that no single-tone component is present in the audio signal.
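To make the MAD formula concrete, the following sketch evaluates it on two short sequences of levels. Both sequences are made-up illustrative numbers, not measurements from the embodiment, and the name `median_absolute_deviation` is hypothetical.

```python
import numpy as np


def median_absolute_deviation(x):
    """MAD = median(|X_i - median(X)|) for a one-dimensional sequence."""
    x = np.asarray(x, dtype=np.float64)
    return float(np.median(np.abs(x - np.median(x))))


# Made-up levels: one nearly constant sequence, one that fluctuates widely.
steady = [-0.1, 0.0, -0.1, -0.2, 0.0]
varying = [-0.1, -12.0, -3.0, -25.0, -7.0]

mad_steady = median_absolute_deviation(steady)
mad_varying = median_absolute_deviation(varying)
```

The nearly constant sequence yields a MAD well below the 0.8 threshold and the fluctuating one a MAD well above it. Because the MAD is built from medians, a few outlying values move it far less than they would move a standard deviation.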

The specific embodiments described herein are offered by way of example only to illustrate the spirit of the invention. Those skilled in the art may make various modifications or additions to the described embodiments or substitutions thereof without departing from the spirit of the invention or exceeding the scope of the invention as defined in the accompanying claims.

Although terms such as PCM, transpose and median absolute deviation are used frequently herein, the possibility of using other terms is not excluded. These terms are used merely for convenience in describing and explaining the nature of the invention; construing them as any additional limitation would be contrary to the spirit of the invention.

Claims

1. A method for detecting a single tone in a broadcast audio signal based on audio characteristics, comprising the following steps:

S1, preprocessing an input signal and converting it into PCM data;

S2, performing DFT processing on the preprocessed signal and outputting spectrum data P;

S3, converting the spectrum data according to the following formula:

$P_S = 10 \cdot \lg(P)$

to obtain converted data $P_S$;

S6, calculating the median absolute deviation of $P_S^T$ according to the following formula:

$\mathrm{MAD} = \mathrm{median}(|X_i - \mathrm{median}(X)|)$

2. The method according to claim 1, wherein in step S5 the first threshold is -0.2, and in step S7 the second threshold is 0.8.

3. The method for detecting a single tone in a broadcast audio signal based on audio characteristics according to claim 1 or 2, wherein in step S2 the DFT is computed as follows:

$X(k) = \sum_{n=0}^{N-1} x(n)\, e^{-j 2\pi k n / N}, \quad k = 0, 1, \ldots, N-1$

where $X(k)$ is the k-th DFT output value, $x(n)$ is the input signal at time point $n$, and $N$ is the number of samples in the time-domain input sequence, which equals the number of DFT output frequency points.

4. The method for detecting a single tone in a broadcast audio signal based on audio characteristics according to claim 1 or 2, wherein step S1 specifically comprises:

S105, outputting the preprocessed PCM signal.

5. The method of claim 4, wherein the pre-processed PCM signal has a uniform sampling rate, bit depth and channel number.