CN112017675B - Method for detecting single sound in broadcast audio signal based on audio characteristics - Google Patents

Method for detecting single sound in broadcast audio signal based on audio characteristics Download PDF

Info

Publication number
CN112017675B
CN112017675B CN202010772907.5A CN202010772907A CN112017675B CN 112017675 B CN112017675 B CN 112017675B CN 202010772907 A CN202010772907 A CN 202010772907A CN 112017675 B CN112017675 B CN 112017675B
Authority
CN
China
Prior art keywords
signal
input signal
data
pcm
entering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010772907.5A
Other languages
Chinese (zh)
Other versions
CN112017675A (en
Inventor
陈永泽
彭小光
吕连新
盛国林
顾江标
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Linker Technology Co ltd
Original Assignee
Hangzhou Linker Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Linker Technology Co ltd filed Critical Hangzhou Linker Technology Co ltd
Priority to CN202010772907.5A priority Critical patent/CN112017675B/en
Publication of CN112017675A publication Critical patent/CN112017675A/en
Application granted granted Critical
Publication of CN112017675B publication Critical patent/CN112017675B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Abstract

The invention discloses a method for detecting single sound in a broadcast audio signal based on audio characteristics, which comprises the following steps: s1, preprocessing an input signal and converting the input signal into PCM data; s2, performing DFT processing on the preprocessed signals, and outputting frequency spectrum data P; s3, converting the frequency spectrum data to obtain converted data P S The method comprises the steps of carrying out a first treatment on the surface of the S4, converting the data P S Transposed data P is obtained by transposed S T The method comprises the steps of carrying out a first treatment on the surface of the S5, searching for P S T If there is a value greater than the first threshold, then go to step S6, if there is no, then determine that there is no monophonic component in the input signal, and the detection process ends; s6, calculating P S T Is the median absolute deviation of (2); s7, searching whether a value smaller than or equal to a second threshold value exists in the MAD sequence, if so, judging that a single-tone component exists in the input signal, otherwise, judging that the single-tone component does not exist in the input signal. The scheme is suitable for the field of broadcast audio signal detection.

Description

Method for detecting single sound in broadcast audio signal based on audio characteristics
Technical Field
The invention relates to the technical field of audio signal analysis, in particular to a method for detecting single sound in a broadcast audio signal based on audio characteristics.
Background
During the process from production to broadcasting of the broadcast program, a single-tone signal may be mixed into a normal program signal through a series of software and hardware processing procedures. The single-tone signal is a conventional wireless communication signal, and because the signal does not carry any modulation information, the available detection and identification characteristics of the single-tone signal are few, so that the problem of rapid identification of the single-tone signal is not only a common problem in the field of signal spectrum monitoring, but also a relatively troublesome problem.
Disclosure of Invention
The invention mainly solves the technical problem that the prior art lacks a means for rapidly and accurately detecting the single sound signal, and provides a method for detecting the single sound in a broadcast audio signal based on audio characteristics.
The invention aims at the technical problems and is mainly solved by the following technical scheme: a method of detecting a tone in a broadcast audio signal based on an audio feature, comprising the steps of:
s1, preprocessing an input signal, and converting the input signal into PCM (Pulse Code Modulation ) data;
s2, performing DFT (Discrete Fourier Transform ) processing on the preprocessed signals, and outputting frequency spectrum data P;
s3, converting the frequency spectrum data according to the following formula:
P S =10·lg(P)
obtaining conversion data P S ,P S Is a matrix, wherein each element is the signal strength of a signal component; the step can improve the special diagnosis analysis precision and reduce the calculation complexity;
s4, converting the data P S Transposed data P is obtained by transposed S T
S5, searching for P S T If there is a value greater than the first threshold, then go to step S6, if there is no, then determine that there is no monophonic component in the input signal, and the detection process ends;
s6, calculating P S T The calculation formula is as follows:
MAD=median(|X i -median(X)|)
wherein X is i For the ith scalar in the entire set of scalars X, the medium operates to find intermediate values;
s7, searching whether a value smaller than or equal to a second threshold value exists in the MAD sequence, if so, judging that a single-tone component exists in the input signal, otherwise, judging that the single-tone component does not exist in the input signal.
The scheme realizes the characteristic analysis of the single-tone signal, calculates the single-tone characteristic component, and detects whether the single-tone signal is mixed in the input signal.
Preferably, in the step S5, the first threshold is-0.2; in step S7, the second threshold is 0.8.
By screening the two thresholds, the existence of noise components can be accurately reflected.
Preferably, in the step S2, the formula of the DFT process is as follows:
Figure BDA0002617315680000021
where X (k) is the kth DFT output value, X (N) is the input signal at a time point N, and N is the number of samples of the time-domain input sequence and DFT output frequency points.
Preferably, the step S1 specifically includes:
s101, judging whether an input signal is an analog signal, if so, entering a step S102, and if so, jumping to a step S103;
s102, quantizing the analog signals into digital signals, outputting PCM data, and entering step S105;
s103, judging whether the digital signal is PCM data, if so, entering a step S105, otherwise, entering a step S104;
s104, decoding the digital signal into a PCM signal, and then entering step S105;
s105, outputting the preprocessed PCM signal.
Preferably, the PCM signal after the preprocessing has a uniform sampling rate, bit depth and channel number.
Typically, a sampling rate of 48KHz, bit depth of 16 bits, and stereo may be used.
The invention has the substantial effects of being capable of rapidly and accurately detecting whether the input signal is mixed with the single-tone signal, and having simple steps, small calculated amount and high instantaneity.
Drawings
Fig. 1 is a flow chart of the present invention.
Detailed Description
The technical scheme of the invention is further specifically described below through examples and with reference to the accompanying drawings.
Examples: a method for detecting a single tone in a broadcast audio signal based on an audio feature of the present embodiment, as shown in fig. 1, includes the following steps:
stage one, signal preprocessing:
1. if the accessed signal is an analog signal, entering a step 2, otherwise, entering a step 3;
2. A/D conversion, namely, quantizing an analog signal into a digital signal, outputting a PCM, and entering a step 5;
3. if the digital signal is not PCM, entering a step 4, otherwise, entering a step 5;
4. decoding to PCM, and entering step 5;
5. the preprocessed signals are PCM data and have uniform parameters: sampling rate, bit depth. (the sample rate is 48KHz, bit depth is 16bit, stereo).
Stage two, feature analysis:
6. the input signal is DFT-processed and spectrum data (discrete linear table) "P" is output.
Figure BDA0002617315680000031
Wherein X (k) is the kth DFT output value, X (N) is an input signal when the time point is N, and N is the sample value of the time domain input sequence and the number of DFT output frequency points;
7. after DFT processing is performed on an input signal, conversion is performed first, and a conversion formula is as follows:
P S =10·lg (P)
P s is the spectrum data after the conversion processing.
The step can improve the feature analysis precision and reduce the calculation complexity.
Step three, calculating a single-tone component, and detecting whether a single-tone signal is mixed or not:
8. will P s Transposed to obtain P s T If P s T If there is a value of > -0.2, then calculate P s T The median absolute deviation (also called median absolute difference) of (a) is calculated as:
MAD=median(|X i -median(X)|)
and the MAD sequence has a value less than or equal to 0.8, then the single-tone component in the audio signal can be judged; if P s T If the value of > -0.2 is not present or the value of 0.8 or less is not present in the MAD sequence, it is determined that no monophonic component is present in the audio signal.
The specific embodiments described herein are offered by way of example only to illustrate the spirit of the invention. Those skilled in the art may make various modifications or additions to the described embodiments or substitutions thereof without departing from the spirit of the invention or exceeding the scope of the invention as defined in the accompanying claims.
Although terms PCM, transpose, median absolute deviation, etc. are used more herein, the possibility of using other terms is not precluded. These terms are used merely for convenience in describing and explaining the nature of the invention; they are to be interpreted as any additional limitation that is not inconsistent with the spirit of the present invention.

Claims (5)

1. A method for detecting a tone in a broadcast audio signal based on an audio feature, comprising the steps of:
s1, preprocessing an input signal and converting the input signal into PCM data;
s2, performing DFT processing on the preprocessed signals, and outputting frequency spectrum data P;
s3, converting the frequency spectrum data according to the following formula:
P S =10·lg(P)
obtaining the conversion numberAccording to P S
S4, converting the data P S Transposed data P is obtained by transposed S T
S5, searching for P S T If there is a value greater than the first threshold, then go to step S6, if there is no, then determine that there is no monophonic component in the input signal, and the detection process ends;
s6, calculating P S T The calculation formula is as follows:
MAD=median(|X i -median(X)|)
wherein X is i For the ith scalar in the entire set of scalars X, the medium operates to find intermediate values;
s7, searching whether a value smaller than or equal to a second threshold value exists in the MAD sequence, if so, judging that a single-tone component exists in the input signal, otherwise, judging that the single-tone component does not exist in the input signal.
2. The method according to claim 1, wherein in the step S5, the first threshold is-0.2; in step S7, the second threshold is 0.8.
3. A method for detecting single tones in a broadcast audio signal based on audio characteristics according to claim 1 or 2, wherein in step S2, the formula of the DFT process is as follows:
Figure QLYQS_1
in the method, in the process of the invention,X(k)is the firstkThe number of DFT output values is set,x(n)for the time point ofnAn input signal at the time of the start of the process,Nthe number of frequency bins is output for samples of the time domain input sequence and DFT.
4. A method for detecting single tones in a broadcast audio signal based on audio characteristics according to claim 1 or 2, wherein said step S1 is specifically:
s101, judging whether an input signal is an analog signal, if so, entering a step S102, and if so, jumping to a step S103;
s102, quantizing the analog signals into digital signals, outputting PCM data, and entering step S105;
s103, judging whether the digital signal is PCM data, if so, entering a step S105, otherwise, entering a step S104;
s104, decoding the digital signal into a PCM signal, and then entering step S105;
s105, outputting the preprocessed PCM signal.
5. The method of claim 4, wherein the pre-processed PCM signal has a uniform sampling rate, bit depth and channel number.
CN202010772907.5A 2020-08-04 2020-08-04 Method for detecting single sound in broadcast audio signal based on audio characteristics Active CN112017675B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010772907.5A CN112017675B (en) 2020-08-04 2020-08-04 Method for detecting single sound in broadcast audio signal based on audio characteristics

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010772907.5A CN112017675B (en) 2020-08-04 2020-08-04 Method for detecting single sound in broadcast audio signal based on audio characteristics

Publications (2)

Publication Number Publication Date
CN112017675A CN112017675A (en) 2020-12-01
CN112017675B true CN112017675B (en) 2023-06-27

Family

ID=73500138

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010772907.5A Active CN112017675B (en) 2020-08-04 2020-08-04 Method for detecting single sound in broadcast audio signal based on audio characteristics

Country Status (1)

Country Link
CN (1) CN112017675B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR920014304A (en) * 1990-12-31 1992-07-30 경상현 CCT Receiving Method Using Digital Signal Processor
JPH0535266A (en) * 1991-07-30 1993-02-12 Casio Comput Co Ltd Musical sound generation device
CN101587707A (en) * 2009-06-26 2009-11-25 上海大学 Single audio frequency testing method based on embedded system
CN102006252A (en) * 2010-11-30 2011-04-06 中国工程物理研究院电子工程研究所 Single-tone signal identification method
CN110277087A (en) * 2019-07-03 2019-09-24 四川大学 A kind of broadcast singal anticipation preprocess method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3291234B1 (en) * 2016-08-31 2019-10-09 Digithep GmbH Method for evaluation of a quality of the voice usage of a speaker

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR920014304A (en) * 1990-12-31 1992-07-30 경상현 CCT Receiving Method Using Digital Signal Processor
JPH0535266A (en) * 1991-07-30 1993-02-12 Casio Comput Co Ltd Musical sound generation device
CN101587707A (en) * 2009-06-26 2009-11-25 上海大学 Single audio frequency testing method based on embedded system
CN102006252A (en) * 2010-11-30 2011-04-06 中国工程物理研究院电子工程研究所 Single-tone signal identification method
CN110277087A (en) * 2019-07-03 2019-09-24 四川大学 A kind of broadcast singal anticipation preprocess method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
一种基于迭代的多音干扰消除方案;王鑫;张晓林;曹晏波;;北京航空航天大学学报(03);全文 *
一种新的单音信号快速识别算法;岳新宇;张润生;;无线电通信技术(03);全文 *
带幅频反馈的单音信号相位翻转检测;田宇,李式巨,金向东;电声技术(01);全文 *
音乐信号框架逼近的快速算法和SVM识别单音旋律;李星达;优秀硕士学位论文;全文 *

Also Published As

Publication number Publication date
CN112017675A (en) 2020-12-01

Similar Documents

Publication Publication Date Title
JP4401173B2 (en) Signal synthesis method
CN101120398B (en) Method for concatenating frames in communication system
RU2325046C2 (en) Audio coding
RU2526745C2 (en) Sbr bitstream parameter downmix
CN1926608B (en) Device and method for processing a multi-channel signal
KR101327895B1 (en) Method and device for audio signal classification
CN1938758B (en) Method and apparatus for determining an estimate
CN101010725A (en) Multichannel signal coding equipment and multichannel signal decoding equipment
US8892430B2 (en) Noise detecting device and noise detecting method
CN101971249A (en) Device and method for calculating a fingerprint of an audio signal, device and method for synchronizing and device and method for characterizing a test audio signal
CN102467910A (en) Encoding apparatus, encoding method, and program
US20230178092A1 (en) Data communication system
US8903730B2 (en) Content feature-preserving and complexity-scalable system and method to modify time scaling of digital audio signals
CN102119413B (en) Acoustic signal processing system, acoustic signal decoding device, and processing method therein
JP2012226106A (en) Music-piece section detection device and method, program, recording medium, and music-piece signal detection device
CN112017675B (en) Method for detecting single sound in broadcast audio signal based on audio characteristics
US8155327B2 (en) Tone detector and method used in a robot for detecting a tone
EP1436805B1 (en) 2-phase pitch detection method and appartus
CN1318187A (en) Speech coding with background noise reproduction
CN112017674B (en) Method for detecting noise in broadcast audio signal based on audio characteristics
CN113542977B (en) Method, device and related assembly for synthesizing sound card and microphone sound
KR100329875B1 (en) A transmission system for continuously transmitting signal samples, transmitters, receivers, coders, decoders, and interpolation means for reconstructing signal samples, how to add redundancy to signal samples, and how to reconstruct signal samples
CN115243183A (en) Audio detection method, device and storage medium
CN112995425B (en) Equal loudness sound mixing method and device
CN112992160A (en) Audio error concealment method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant