CN113380274A

CN113380274A - Smart broadcast audio quality evaluation noise detection method

Info

Publication number: CN113380274A
Application number: CN202110653954.2A
Authority: CN
Inventors: 程国斌
Original assignee: Shenzhen Fengjiao Intelligent Technology Co ltd
Current assignee: Shenzhen Fengjiao Intelligent Technology Co ltd
Priority date: 2021-06-11
Filing date: 2021-06-11
Publication date: 2021-09-10

Abstract

The invention discloses a method for detecting noise in intelligent broadcast audio quality evaluation, which comprises the steps of firstly forming audio data of each channel of equipment to be detected into an audio group to be detected, then collecting audio data of a reference channel of reference equipment, and carrying out time domain framing and feature extraction processing on the audio data in the audio group to be detected and the audio data of the reference channel to obtain a plurality of frame feature vectors of each channel of the equipment to be detected. By adopting the detection method designed by the invention, the broadcast audio can be comprehensively detected, not only can the manual inspection work be greatly compressed, the labor cost is saved, the efficiency is improved, but also the accuracy of detecting the audio quality of the multi-channel synchronous audio acquisition equipment is greatly improved, and the audio abnormity detection result in each channel equipment is obtained by calculating the correlation of the audio data acquired by each channel; the same problems of all channels of the equipment to be detected and detection abnormity caused by various noises are avoided.

Description

Smart broadcast audio quality evaluation noise detection method

Technical Field

The invention relates to the technical field of audio detection, in particular to an intelligent broadcast audio quality evaluation noise detection method.

Background

The broadcast is a news spreading tool for transmitting sound through radio waves or wires, and is produced in the 20 th century, wherein the broadcast is called wireless broadcast for transmitting programs through radio waves and is called wired broadcast for transmitting programs through wires, and has the advantages of wide objects, quick transmission, various functions and strong infectivity; the short spot is one-time passing, sequential listening cannot be selected, and listening is difficult if the language is not communicated, at present, broadcasting further realizes a strategic target mainly comprising frequency modulation and medium wave and secondarily comprising short wave, which is a significant transformation in the history of international broadcasting, and audio quality evaluation noise detection needs to be carried out on broadcasting audio before broadcasting playing, but the existing detection mode generally adopts manual sampling detection, so that not only is the efficiency low, but also the accuracy of a detection result cannot be ensured.

Disclosure of Invention

The invention aims to provide an intelligent broadcast audio quality evaluation noise detection method, which is used for solving the problems in the background technology.

In order to achieve the purpose, the invention provides the following technical scheme: a noise detection method for intelligent broadcast audio quality evaluation comprises the following steps:

firstly, the collected audio data of each channel of the equipment to be detected form an audio group to be detected, then the audio data of a reference channel of reference equipment is collected, time domain framing and feature extraction processing are carried out on the audio data in the audio group to be detected and the audio data of the reference channel, and a plurality of frame feature vectors of each channel of the equipment to be detected and a plurality of frame feature vectors of the reference channel are obtained;

performing correlation calculation and frame correlation judgment by using the frame characteristic vector of each channel of the equipment to be detected to obtain a plurality of judgment results of each channel of the equipment to be detected, and performing correlation calculation and frame correlation judgment by using the frame characteristic vector of the reference channel and the frame characteristic vector of the target channel of the equipment to be detected to obtain a plurality of inter-equipment judgment results of the target channel;

thirdly, judging whether each channel of the equipment to be detected is abnormal or not according to the plurality of judgment results of each channel of the equipment to be detected, and obtaining the detection result of the audio abnormity in the equipment of each channel of the equipment to be detected; judging whether the target channel of the equipment to be detected is abnormal or not according to the judgment results among the plurality of pieces of equipment of the target channel, and obtaining the detection result of the audio abnormality among the pieces of equipment of the target channel;

and (IV) generating an audio detection report according to the in-equipment audio abnormality detection result of each channel of the equipment to be detected and the inter-equipment audio abnormality detection result of the target channel.

Preferably, in the first step, when time domain framing and feature extraction processing are performed on the audio data in the audio group to be detected and the audio data of the reference channel, a differential sequence of the audio signal to be detected is obtained through a plurality of audio frames, and the differential sequence is used for indicating the difference degree between the plurality of audio frames; performing autocorrelation operation on the differential sequence, and acquiring a plurality of correlation values from an autocorrelation operation result; combining the plurality of correlation values into the sequence of correlation values.

Preferably, the correlation values of the audio to be detected include: dividing the correlation value sequence into a plurality of sequence vectors according to a preset vector dimension, obtaining correlation coefficients of two adjacent sequence vectors in the plurality of sequence vectors, and determining an average value of the correlation coefficients of the two adjacent sequence vectors in the plurality of sequence vectors as an average correlation coefficient of the correlation value sequence.

Preferably, in the first step, a voice activity detection algorithm is used to remove non-voice segments in the audio data of the reference channel and non-voice segments in the audio data of each channel of the device to be detected.

Preferably, the standard threshold values of the plurality of frame feature vectors of the reference channel in the first step include a first threshold value and a second threshold value.

Preferably, the target channel in the second step is any one of a plurality of channels of the device to be detected.

Preferably, in the second step, frame correlation coefficients of every two channels of the device to be detected are calculated by using frame feature vectors of every two channels of the device to be detected, the frame correlation coefficients of every two channels of the device to be detected form a frame correlation coefficient matrix of the device to be detected, whether the frame correlation coefficient in the frame correlation coefficient matrix of the device to be detected is greater than a first preset threshold value or not is judged, and if yes, 1 is used as a judgment result of the channel corresponding to the frame correlation coefficient; and if not, taking 0 as the judgment result of the channel corresponding to the frame correlation coefficient.

Preferably, in the third step, the sum of the judgment results of each channel and other channels of the device to be detected is counted, whether the sum of the judgment results of each channel and other channels of the device to be detected is greater than a preset value or not is judged, if yes, 1 is used as the in-device detection result of the channel of the device to be detected, and if not, 0 is used as the in-device detection result of the channel of the device to be detected; and when the number of the in-device detection results of the detection device channel which are continuously 1 exceeds N, judging the channel to be abnormal, setting the in-device audio abnormality detection result of the channel to be 1, and otherwise, setting the in-device audio abnormality detection result of the channel to be 0.

Preferably, in the fourth step, the frame feature vector of the reference channel and the frame feature vector of the target channel of the device to be detected are used to calculate the frame correlation coefficient between the reference channel and the target channel of the device to be detected.

Preferably, in the fourth step, it is determined whether a frame correlation coefficient between the reference channel and the target channel of the device to be detected is greater than a second preset threshold, and if so, 1 is taken as an inter-device decision result of the target channel; and if not, taking 0 as the inter-device judgment result of the target channel.

Compared with the prior art, the invention has the following beneficial effects:

by adopting the detection method designed by the invention, the broadcast audio can be comprehensively detected, not only can the manual inspection work be greatly compressed, the labor cost is saved, the efficiency is improved, but also the accuracy of detecting the audio quality of the multi-channel synchronous audio acquisition equipment is greatly improved, and the audio abnormity detection result in each channel equipment is obtained by calculating the correlation of the audio data acquired by each channel; the same problems of all channels of the equipment to be detected and detection abnormity caused by various noises are avoided.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The invention provides a technical scheme that:

a noise detection method for intelligent broadcast audio quality evaluation comprises the following steps:

The first embodiment is as follows:

firstly, the collected audio data of each channel of the equipment to be detected form an audio group to be detected, then the audio data of a reference channel of reference equipment is collected, time domain framing and feature extraction processing are carried out on the audio data in the audio group to be detected and the audio data of the reference channel, and a plurality of frame feature vectors of each channel of the equipment to be detected and a plurality of frame feature vectors of the reference channel are obtained; performing correlation calculation and frame correlation judgment by using the frame characteristic vector of each channel of the equipment to be detected to obtain a plurality of judgment results of each channel of the equipment to be detected, and performing correlation calculation and frame correlation judgment by using the frame characteristic vector of the reference channel and the frame characteristic vector of the target channel of the equipment to be detected to obtain a plurality of inter-equipment judgment results of the target channel; judging whether each channel of the equipment to be detected is abnormal according to a plurality of judgment results of each channel of the equipment to be detected, and obtaining the detection result of the audio abnormality in the equipment of each channel of the equipment to be detected; judging whether the target channel of the equipment to be detected is abnormal or not according to the judgment results among the plurality of pieces of equipment of the target channel, and obtaining the detection result of the audio abnormality among the pieces of equipment of the target channel; and generating an audio detection report according to the in-equipment audio abnormality detection result of each channel of the equipment to be detected and the inter-equipment audio abnormality detection result of the target channel.

Example two:

in the first embodiment, the following steps are added:

step one, when time domain framing and feature extraction processing are carried out on audio data in an audio group to be detected and audio data of a reference channel, a differential sequence of an audio signal to be detected is obtained through a plurality of audio frames, and the differential sequence is used for indicating the difference degree among the plurality of audio frames; performing autocorrelation operation on the differential sequence, and acquiring a plurality of correlation values from an autocorrelation operation result; combining the plurality of correlation values into the sequence of correlation values.

Example three:

in the second embodiment, the following steps are added:

the correlation values of the audio to be detected include: dividing the correlation value sequence into a plurality of sequence vectors according to a preset vector dimension, obtaining correlation coefficients of two adjacent sequence vectors in the plurality of sequence vectors, and determining an average value of the correlation coefficients of the two adjacent sequence vectors in the plurality of sequence vectors as an average correlation coefficient of the correlation value sequence.

Example four:

in the third embodiment, the following steps are added:

in the first step, a voice activity detection algorithm is used for removing non-voice sections in the audio data of the reference channel and non-voice sections in the audio data of each channel of the equipment to be detected.

Example five:

in the fourth example, the following steps were added:

the standard threshold values of the plurality of frame feature vectors of the reference channel in the first step comprise a first threshold value and a second threshold value.

Example six:

in the fifth example, the following steps were added:

and the target channel in the step two is any one of a plurality of channels of the equipment to be detected.

Example seven:

in example six, the following steps were added:

calculating frame correlation coefficients of every two channels of the equipment to be detected by using frame feature vectors of every two channels of the equipment to be detected, wherein the frame correlation coefficients of every two channels of the equipment to be detected form a frame correlation coefficient matrix of the equipment to be detected, judging whether the frame correlation coefficients in the frame correlation coefficient matrix of the equipment to be detected are larger than a first preset threshold value or not, and if so, taking 1 as a judgment result of the channel corresponding to the frame correlation coefficients; and if not, taking 0 as the judgment result of the channel corresponding to the frame correlation coefficient.

Example eight:

in example seven, the following steps were added:

counting the sum of judgment results of each channel and other channels of the equipment to be detected, judging whether the sum of the judgment results of each channel and other channels of the equipment to be detected is larger than a preset value, if so, taking 1 as an in-equipment detection result of the channel of the equipment to be detected, and if not, taking 0 as an in-equipment detection result of the channel of the equipment to be detected; and when the number of the in-device detection results of the detection device channel which are continuously 1 exceeds N, judging the channel to be abnormal, setting the in-device audio abnormality detection result of the channel to be 1, and otherwise, setting the in-device audio abnormality detection result of the channel to be 0.

Example nine:

in example eight, the following steps were added:

and in the fourth step, the frame characteristic vector of the reference channel and the frame characteristic vector of the target channel of the equipment to be detected are used for calculating the frame correlation coefficient of the reference channel and the target channel of the equipment to be detected.

Example ten:

in example nine, the following procedure was added:

judging whether the frame correlation coefficient of the reference channel and the target channel of the device to be detected is greater than a second preset threshold value or not, if so, taking 1 as an inter-device judgment result of the target channel; and if not, taking 0 as the inter-device judgment result of the target channel.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. A noise detection method for intelligent broadcast audio quality assessment is characterized in that: the method comprises the following steps:

2. The method of claim 1, wherein the method comprises: in the first step, when time domain framing and feature extraction processing are carried out on the audio data in the audio group to be detected and the audio data of the reference channel, a differential sequence of the audio signal to be detected is obtained through a plurality of audio frames, and the differential sequence is used for indicating the difference degree among the plurality of audio frames; performing autocorrelation operation on the differential sequence, and acquiring a plurality of correlation values from an autocorrelation operation result; combining the plurality of correlation values into the sequence of correlation values.

3. The method of claim 2, wherein the method comprises: the correlation values of the audio to be detected comprise: dividing the correlation value sequence into a plurality of sequence vectors according to a preset vector dimension, obtaining correlation coefficients of two adjacent sequence vectors in the plurality of sequence vectors, and determining an average value of the correlation coefficients of the two adjacent sequence vectors in the plurality of sequence vectors as an average correlation coefficient of the correlation value sequence.

4. The method of claim 1, wherein the method comprises: in the first step, a voice activity detection algorithm is used for removing non-voice sections in the audio data of the reference channel and non-voice sections in the audio data of each channel of the equipment to be detected.

5. The method of claim 1, wherein the method comprises: the standard threshold values of the plurality of frame feature vectors of the reference channel in the first step comprise a first threshold value and a second threshold value.

6. The method of claim 1, wherein the method comprises: and the target channel in the second step is any one of a plurality of channels of the equipment to be detected.

7. The method of claim 1, wherein the method comprises: in the second step, frame correlation coefficients of every two channels of the equipment to be detected are calculated by using frame feature vectors of every two channels of the equipment to be detected, the frame correlation coefficients of every two channels of the equipment to be detected form a frame correlation coefficient matrix of the equipment to be detected, whether the frame correlation coefficients in the frame correlation coefficient matrix of the equipment to be detected are larger than a first preset threshold value or not is judged, and if yes, 1 is used as a judgment result of the channel corresponding to the frame correlation coefficients; and if not, taking 0 as the judgment result of the channel corresponding to the frame correlation coefficient.

8. The method of claim 1, wherein the method comprises: counting the sum of the judgment results of each channel and other channels of the equipment to be detected, judging whether the sum of the judgment results of each channel and other channels of the equipment to be detected is larger than a preset value, if so, taking 1 as the in-equipment detection result of the channel of the equipment to be detected, and if not, taking 0 as the in-equipment detection result of the channel of the equipment to be detected; and when the number of the in-device detection results of the detection device channel which are continuously 1 exceeds N, judging the channel to be abnormal, setting the in-device audio abnormality detection result of the channel to be 1, and otherwise, setting the in-device audio abnormality detection result of the channel to be 0.

9. The method of claim 1, wherein the method comprises: and in the fourth step, the frame characteristic vector of the reference channel and the frame characteristic vector of the target channel of the equipment to be detected are used for calculating the frame correlation coefficient of the reference channel and the target channel of the equipment to be detected.

10. The method of claim 9, wherein the method comprises: judging whether the frame correlation coefficient of the reference channel and the target channel of the device to be detected is greater than a second preset threshold value or not in the fourth step, and if so, taking 1 as an inter-device judgment result of the target channel; and if not, taking 0 as the inter-device judgment result of the target channel.