CN113556605A - Illegal advertisement determination method and device, electronic equipment and storage medium - Google Patents

Illegal advertisement determination method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN113556605A
CN113556605A CN202110824317.7A CN202110824317A CN113556605A CN 113556605 A CN113556605 A CN 113556605A CN 202110824317 A CN202110824317 A CN 202110824317A CN 113556605 A CN113556605 A CN 113556605A
Authority
CN
China
Prior art keywords
audio
screen projection
target audio
target
specified
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110824317.7A
Other languages
Chinese (zh)
Inventor
李腾飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing QIYI Century Science and Technology Co Ltd
Original Assignee
Beijing QIYI Century Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing QIYI Century Science and Technology Co Ltd filed Critical Beijing QIYI Century Science and Technology Co Ltd
Priority to CN202110824317.7A priority Critical patent/CN113556605A/en
Publication of CN113556605A publication Critical patent/CN113556605A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14Digital output to display device ; Cooperation and interconnection of the display device with other functional units
    • G06F3/1407General aspects irrespective of display type, e.g. determination of decimal point position, display with fixed or driving decimal point, suppression of non-significant zeros
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0248Avoiding fraud
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/4104Peripherals receiving signals from specially adapted client devices
    • H04N21/4122Peripherals receiving signals from specially adapted client devices additional display device, e.g. video projector
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/436Interfacing a local distribution network, e.g. communicating with another STB or one or more peripheral devices inside the home
    • H04N21/4363Adapting the video or multiplex stream to a specific local network, e.g. a IEEE 1394 or Bluetooth® network

Abstract

The embodiment of the invention provides an illegal advertisement determination method, an illegal advertisement determination device, electronic equipment and a storage medium, which are applied to the technical field of screen projection and comprise the following steps: obtaining target audio to be processed, wherein the target audio is audio collected by a screen projection device within a preset time period for starting to play screen projection content; extracting specified audio features of the target audio; wherein, the specified audio features are feature information for uniquely identifying the target audio; judging whether the specified audio features are matched with preset audio features, wherein the preset audio features are audio features in advertisements which are associated with the target audio in advance, or the preset audio features are audio features in the target audio; and if the specified audio characteristics are matched with the preset audio characteristics, determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played. Therefore, the scheme can identify whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played.

Description

Illegal advertisement determination method and device, electronic equipment and storage medium
Technical Field
The invention relates to the technical field of screen projection, in particular to an illegal advertisement determination method and device, electronic equipment and a storage medium.
Background
Video screen projection playing is a video projection mode which is commonly used at present. Specifically, the screen projection device serving as the screen projection party projects the screen projection content to the screen projection device serving as the screen projection receiving party and having the display function, and the screen projection device plays the received screen projection content.
In the prior art, before playing received screen-casting contents, screen-casting equipment is often inserted with illegal advertisements which are not allowed to be played by a video party, so that the screen-casting contents or the legal advertisements which are cast by a user cannot be played in time, and the screen-casting use viscosity of the user to the screen-casting party is reduced.
Disclosure of Invention
The embodiment of the invention aims to provide an illegal advertisement determination method, an illegal advertisement determination device, electronic equipment and a storage medium, so as to identify whether a screen projection device is inserted with an illegal advertisement before screen projection content is played. The specific technical scheme is as follows:
in a first aspect of the present invention, there is provided an illegal advertisement determination method, applied to a screen projection device, the method including:
obtaining target audio to be processed, wherein the target audio is audio collected by the screen projection equipment within a preset time period for starting to play screen projection content;
extracting specified audio features of the target audio; wherein the specified audio features are feature information for uniquely identifying the target audio;
judging whether the specified audio features are matched with preset audio features, wherein the preset audio features are audio features in advertisements pre-associated with the target audio, or the preset audio features are audio features in the target audio;
and if the specified audio features are matched with preset audio features, determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played.
Optionally, the specified audio features comprise mel-frequency cepstral coefficients.
Optionally, the extracting the specified audio feature of the target audio includes:
preprocessing the target audio to obtain a preprocessed target audio; wherein the pre-processing comprises at least one of pre-emphasis processing, framing processing, and windowing processing;
calculating spectral line energy of the preprocessed target audio;
and calculating a Mel frequency cepstrum coefficient of the preprocessed target audio based on the spectral line energy of the preprocessed target audio to obtain the designated audio characteristics of the target audio.
Optionally, the obtaining target audio to be processed includes:
and after the screen projection equipment plays the screen projection content, utilizing an audio acquisition unit to acquire environmental audio to obtain target audio to be processed.
Optionally, after the acquiring of the environmental audio by using the audio acquiring unit to obtain the target audio to be processed and before the extracting of the specified audio feature of the target audio, the method further includes:
judging whether the audio data of the target audio is mute data, if not, executing the extraction of the specified audio characteristics of the target audio; if so, ending.
Optionally, before the determining whether the specified audio feature matches a preset audio feature, the method further includes:
and sending the identification information of the screen projection equipment or the identification information of the screen projection content to a background server, and acquiring the audio characteristics in the advertisement which is fed back by the background server and is searched based on the identification information and is associated with the screen projection equipment in advance as preset audio characteristics.
In a second aspect of the present invention, there is also provided an illegal advertisement determination apparatus, including:
the audio acquisition module is used for acquiring target audio to be processed, wherein the target audio is audio collected by the screen projection equipment within a preset time period for starting to play screen projection content;
the characteristic extraction module is used for extracting the specified audio characteristic of the target audio; wherein the specified audio features are feature information for uniquely identifying the target audio;
a feature matching module, configured to determine whether the specified audio feature matches a preset audio feature, where the preset audio feature is an audio feature in an advertisement associated with the target audio in advance, or the preset audio feature is an audio feature in the target audio
And the advertisement determining module is used for determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played if the specified audio characteristics are matched with preset audio characteristics.
Optionally, the specified audio features comprise mel-frequency cepstral coefficients.
Optionally, the feature extraction module is specifically configured to pre-process the target audio to obtain a pre-processed target audio; wherein the pre-processing comprises at least one of pre-emphasis processing, framing processing, and windowing processing; calculating spectral line energy of the preprocessed target audio; and calculating a Mel frequency cepstrum coefficient of the preprocessed target audio based on the spectral line energy of the preprocessed target audio to obtain the designated audio characteristics of the target audio.
Optionally, the audio obtaining module includes: and the audio acquisition submodule is used for acquiring audio of the environment where the screen projection equipment is located by using the audio acquisition unit after the screen projection equipment plays the screen projection content, so as to obtain target audio to be processed.
Optionally, the apparatus further comprises: a mute judgment module, configured to judge whether audio data of the target audio is mute data after the audio acquisition sub-module and before the feature extraction module, and if not, execute the extraction of the specified audio feature of the target audio; if so, ending.
Optionally, the apparatus further comprises: and the information sending module is used for sending the identification information of the screen projection equipment or the identification information of the screen projection content to a background server before the characteristic matching module judges whether the specified audio characteristic is matched with a preset audio characteristic, acquiring the audio characteristic in the advertisement which is searched for based on the identification information and is associated with the screen projection equipment in advance and fed back by the background server, and taking the audio characteristic as the preset audio characteristic.
In a third aspect of the present invention, there is also provided an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete communication with each other through the communication bus;
a memory for storing a computer program;
and a processor for implementing any of the above illegal advertisement determination method steps when executing the program stored in the memory.
In a fourth aspect implemented by the present invention, there is further provided a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, implements any of the above illegal advertisement determination methods.
In a fifth aspect of the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform any of the above described illegal advertisement determination methods.
According to the scheme provided by the embodiment of the invention, the target audio is the audio collected by the screen projection equipment in the preset time period for starting to play the screen projection content, if the screen projection equipment is inserted with an illegal advertisement, the target audio should contain the audio inserted with the illegal advertisement for playing, otherwise, if the voting equipment is not inserted with the illegal advertisement, the target audio contains the audio of the advertisement associated in advance or the audio of the screen projection content for playing. Further, after the specified audio features of the target audio are extracted, it may be determined whether the specified audio features are matched with the preset audio features, and if the specified audio features are matched with the preset audio features, it is indicated that the target audio is an audio for playing an associated advertisement or an audio for playing screen-shot content, that is, the screen-shot device plays the associated advertisement or the screen-shot content instead of the non-advertisement within a preset time period for starting playing the screen-shot content. Therefore, the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below.
Fig. 1 is a schematic flow chart of an illegal advertisement determination method according to an embodiment of the present invention;
fig. 2 is a schematic view of another flowchart of an illegal advertisement determination method according to an embodiment of the present invention;
fig. 3 is a schematic diagram of an illegal advertisement determination process according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an illegal advertisement determination device according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.
In order to realize identification of whether an illegal advertisement is inserted into a screen projection device before screen projection content is played, the embodiment of the invention provides an illegal advertisement determination method, an illegal advertisement determination device, electronic equipment and a storage medium.
It should be noted that the illegal advertisement determination method provided by the embodiment of the present invention is applied to a screen projection device, and the screen projection device may be a device capable of implementing a screen projection function, such as a display, an intelligent television, a projector, a television set-top box, and the like. The screen projection function may include, but is not limited to, a DLAN (Digital Living Network alliance) function. Specifically, an execution subject of the illegal advertisement determination method provided by the embodiment of the invention can be an illegal advertisement determination device running in screen projection equipment. Illustratively, the illegal advertisement determination device may be application software with a video playing function running in a screen projection device, or a web client.
The illegal advertisement determination method provided by the embodiment of the invention can comprise the following steps:
obtaining target audio to be processed, wherein the target audio is audio acquired by a screen projection device serving as a screen projection party to an environment where the screen projection device is located by using an audio acquisition unit after the screen projection device plays screen projection contents;
extracting specified audio features of the target audio; wherein, the specified audio features are feature information for uniquely identifying the target audio;
judging whether the specified audio features are matched with preset audio features, wherein the preset audio features are audio features in advertisements which are associated with screen projection equipment in advance, or the preset audio features are audio features in screen projection contents;
and if the specified audio characteristics are matched with the preset audio characteristics, determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played.
According to the scheme provided by the embodiment of the invention, the target audio is the audio collected by the screen projection equipment in the preset time period for starting to play the screen projection content, if the screen projection equipment is inserted with an illegal advertisement, the target audio should contain the audio inserted with the illegal advertisement for playing, otherwise, if the voting equipment is not inserted with the illegal advertisement, the target audio contains the audio of the advertisement associated in advance or the audio of the screen projection content for playing. Further, after the specified audio features of the target audio are extracted, it may be determined whether the specified audio features are matched with the preset audio features, and if the specified audio features are matched with the preset audio features, it is indicated that the target audio is an audio for playing an associated advertisement or an audio for playing screen-shot content, that is, the screen-shot device plays the associated advertisement or the screen-shot content instead of the non-advertisement within a preset time period for starting playing the screen-shot content. Therefore, the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played.
An illegal advertisement determination method provided by the embodiment of the present invention is described below with reference to the accompanying drawings.
As shown in fig. 1, an illegal advertisement determination method provided in an embodiment of the present invention may include the following steps:
s101, obtaining target audio to be processed, wherein the target audio is audio collected by a screen projection device within a preset time period for starting to play screen projection content;
the audio acquisition unit can be an audio acquisition unit of the screen projection device, and can also be an audio acquisition unit associated with the screen projection device, which is reasonable.
In addition, there may be a variety of implementations of obtaining the target audio to be processed.
In one implementation, after the screen projection device plays the screen projection content, an audio acquisition unit is used for acquiring environmental audio to obtain target audio to be processed.
It can be understood that, in order to ensure that the content to be projected is projected to the target device, the screen projection device may establish a communication connection with the target device before the screen projection, and may project the content to be projected to the screen projection device after the communication connection is established. The screen projection device and the screen projection device can establish communication connection through communication modes such as a local area network and a wireless fidelity (WIFI).
In order to identify whether illegal advertisements are inserted into the screen projection device when the target audio is played, the screen projection device can call the audio acquisition unit when screen projection content is received, so that the audio acquisition unit is used for acquiring audio data of the current environment. When the voting equipment is not inserted with illegal advertisements, the screen projection equipment receives the voting content, and may play the advertisement associated in advance, or may play the screen projection content directly without playing any advertisement. When the voting device is inserted with an illegal advertisement, the screen projection device may play the inserted illegal advertisement when playing the received voting content.
Therefore, the collected environmental audio may be the audio of the target audio containing the pre-associated advertisement, or the audio when the screen shot content is played, or the audio when the illegal advertisement is inserted for playing.
In order to further analyze the target audio, after the target audio is acquired, the audio acquisition unit of the screen projection device may store the target audio in any file format that can be parsed by the prior art, for example, a file in a wave format, that is, a wav file.
In one embodiment, to relieve the storage pressure of the target audio, the audio may be collected only for a preset time period when the audio is collected. Namely, the target audio is the audio collected by the screen projection device within the preset time period for starting to play the screen projection content. The preset time period may be a time period in which the starting time is the time when the screen projection device receives the screen projection content, and the duration is a specified duration. The setting of the specified time period can be manually set by a user, and can also be set by the screen projection device by default, such as 30s, 2 minutes or 3 minutes. Illustratively, the time duration refers to 30s, the time is counted and the audio acquisition is started when the screen projection content is received from the screen projection device, and when the time duration reaches 30s, the time is counted and the audio acquisition is stopped.
The audio acquisition unit may be a microphone prefabricated in the screen projection device, or may also be an audio acquisition plug-in embedded in the screen projection device, and the like.
Optionally, there are various ways to trigger the audio capturing unit to capture the environmental audio, for example: after the screen projection equipment receives screen projection content, a display interface of the screen projection equipment pops up a prompt box so that a user can permit the screen projection equipment to carry out audio acquisition by clicking the prompt box, and after the screen projection equipment acquires permission, the audio acquisition unit is used for carrying out environmental audio acquisition, or after the screen projection equipment puts the screen projection content into the screen projection equipment, the audio acquisition unit is directly used for carrying out environmental audio acquisition.
S102, extracting specified audio features of the target audio; wherein the specified audio feature is feature information for uniquely identifying the target audio.
There may be various designated audio features, and for example, the designated audio features may be Mel-Frequency Cepstral Coefficients (MFCCs) of the target audio, or spectral features of the target audio, and so on, wherein the Mel-Frequency Cepstral Coefficients may be used to characterize the voice frequencies that can be recognized by the human ear.
It can be understood that, by analyzing any audio, the spectral feature corresponding to the audio can be determined, and by analyzing any spectral feature, the audio corresponding to the spectral feature can be obtained. For example, in one implementation, extracting the specified audio feature of the target audio may include: and carrying out Fourier transform on the target audio to obtain the spectral characteristics of the target audio, and taking the spectral characteristics as the designated audio characteristics.
For clarity of the scheme and clarity of layout, the implementation manner for determining the mel-frequency cepstrum coefficients will be described in detail with reference to another embodiment.
S103, judging whether the specified audio features are matched with preset audio features or not; .
In one implementation, the preset audio features may be audio features in an advertisement that are pre-associated with the screen-casting device. Among them, the advertisement previously associated with the screen projecting device is also referred to as a legitimate advertisement. In this case, when the screen projection device receives the screen projection content, the pre-associated advertisement will be played first, and after the advertisement is played, the received voting content will be played again. At this time, the target audio acquired by the voting device is the audio of the voting device playing the pre-associated advertisement. The retrieved specified audio features should match the audio features in the pre-associated advertisement, via step S102.
In another implementation, the preset audio feature may be an audio feature in the screen-shot content. In this case, the screen projection device directly starts to play the screen projection content after acquiring the screen projection content, and at this time, the target audio obtained by the screen projection device is the audio when the screen projection content is played. The acquired specified audio features should match the audio features in the screen-shot content, via step S102.
Before this step is performed, an audio feature of a pre-associated advertisement or an audio feature of a screen shot content may be acquired in advance as a preset audio feature.
After the specified audio feature of the target audio is obtained through step S102, the feature difference between the specified audio feature and the preset audio feature may be compared. In one implementation, when the feature difference is smaller than a preset threshold, it is determined that the specified audio feature matches the preset audio feature, and conversely, when the feature difference is not smaller than the preset threshold, it is determined that the specified audio feature does not match the preset audio feature.
And S104, if the specified audio features are matched with the preset audio features, determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played.
If the specified audio characteristics are matched with the preset audio characteristics, the target audio is the audio when the preset associated advertisement is played or the audio when the video content is played, and both the audio and the video content belong to legal playing contents, namely, the screen projection equipment does not insert illegal advertisements when the target audio is played.
If the specified audio characteristics are not matched with the preset audio characteristics, the target audio is neither the audio in the playing process of the preset associated advertisement nor the audio in the playing process of the video content, at the moment, it can be judged that the illegal advertisement is inserted into the screen projection equipment in the playing process of the target audio, and further measures can be taken.
According to the scheme provided by the embodiment of the invention, the target audio is the audio collected by the screen projection equipment in the preset time period for starting to play the screen projection content, if the screen projection equipment is inserted with an illegal advertisement, the target audio should contain the audio inserted with the illegal advertisement for playing, otherwise, if the voting equipment is not inserted with the illegal advertisement, the target audio contains the audio of the advertisement associated in advance or the audio of the screen projection content for playing. Further, after the specified audio features of the target audio are extracted, it may be determined whether the specified audio features are matched with the preset audio features, and if the specified audio features are matched with the preset audio features, it is indicated that the target audio is an audio for playing an associated advertisement or an audio for playing screen-shot content, that is, the screen-shot device plays the associated advertisement or the screen-shot content instead of the non-advertisement within a preset time period for starting playing the screen-shot content. Therefore, the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played.
Optionally, as shown in fig. 2, in the embodiment of the present invention, the step S102 may include the following steps S102A-S102C:
S102A, preprocessing the target audio to obtain a preprocessed target audio;
wherein the preprocessing includes at least one of pre-emphasis processing, framing processing, and windowing processing.
It is to be understood that, in order to determine the specified audio characteristics of the target audio, the target audio may be preprocessed, and the preprocessing is performed by one or more of pre-emphasis processing, framing, or windowing, and each preprocessing may be implemented by the method of implementing the preprocessing in the prior art. For example, when the preprocessing manner includes pre-emphasis processing, framing and windowing, an implementation manner for preprocessing the target audio to obtain the preprocessed target audio may include the following steps (1) to (3):
(1) carrying out pre-emphasis processing on a target audio by adopting a high-pass filter to obtain a first audio;
it will be appreciated that the high pass filter used as described above may be any of the known art. Moreover, when the target audio is pre-emphasized, the adopted processing formula may be: h (z) ═ 1-uz-1
(2) Framing the reference first audio to obtain a plurality of second audio;
to facilitate framing of the reference audio, sample point N for the framing may be set to 256 or 512, such that the acquisition time of the encompassed target audio is 20-30 ms.
(3) And windowing the plurality of second audios to obtain the preprocessed target audio.
Illustratively, the window function used for windowing the plurality of second audios may be a hamming window, a hanning window, or the like. When the hamming window is used for windowing, assuming that the second audio after framing is s0(N), N is 0,1, …, N-1, where N is the sampling point of the framing, and the signal after windowing is s1(N), then s1(N) is s0(N) w (N), and w (N) has the following form:
W(n,a)=(1-a)-a*cos(2πn/(N-1)),
wherein N is more than or equal to 0 and less than or equal to N-1, different Hamming windows can be generated by different values of a, and a is 0.46 in general.
S102B, calculating spectral line energy of the preprocessed target audio;
for example, in one implementation, calculating the spectral line energy of the preprocessed target audio may include: carrying out Fourier transform on the preprocessed target audio to obtain a frequency spectrum of the preprocessed target audio; and performing modular squaring on the frequency spectrum of the preprocessed target audio to obtain spectral line energy of the preprocessed target audio.
It can be understood that the fourier transform of the audio and the modulo square of the frequency spectrum can be implemented by the prior art, and are not described herein.
S102C, calculating a Mel frequency cepstrum coefficient of the preprocessed target audio frequency based on the spectral line energy of the preprocessed target audio frequency, and obtaining the designated audio frequency characteristic of the target audio frequency.
For example, in one implementation, calculating mel-frequency cepstrum coefficients of the preprocessed target audio based on spectral line energies of the preprocessed target audio may include:
obtaining the logarithmic energy of the target audio by adopting a preset first calculation formula based on the spectral line energy of the preprocessed target audio;
obtaining a Mel frequency cepstrum coefficient of the target audio as a designated audio characteristic of the target audio by adopting a preset second calculation formula based on the logarithmic energy of the target audio;
wherein, the preset first calculation formula comprises:
Figure BDA0003173095820000101
the preset second calculation formula includes:
Figure BDA0003173095820000102
wherein the content of the first and second substances,
Figure BDA0003173095820000103
k is more than or equal to 0 and less than or equal to N, x (N) is input target audio, N is points of Fourier transform, s (m) is used for representing logarithmic energy of the target audio, C (N) is a Mel frequency cepstrum coefficient, Hm(k) The frequency response function of the filter is characterized, and M is the number of filters.
The scheme provided by the embodiment of the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played. Furthermore, the target audio frequency can be read firstly for processing, and the specified audio frequency characteristics of the target audio frequency are extracted, so that the influence of noise on the specified audio frequency characteristics can be avoided, and the accuracy rate of identifying illegal advertisements is improved.
The screen projection equipment provides a foundation for identifying whether illegal advertisements are inserted into the screen projection equipment before screen projection content is played by the screen projection equipment in a mode of acquiring the preset audio characteristics in advance.
Optionally, based on the above embodiment, in the embodiment of the present invention, before determining whether the specified audio feature matches the preset audio feature, the preset audio feature may be obtained from the background server in advance.
Optionally, in an implementation, the method includes:
and sending the identification information of the screen projection equipment or the identification information of the screen projection content to the background server, and acquiring the audio characteristics in the advertisement which is fed back by the background server and is pre-associated with the screen projection equipment on the basis of the identification information, wherein the audio characteristics are used as preset audio characteristics.
When the screen projection device needs to play a pre-associated advertisement, the screen projection device can send identification information of the screen projection device, such as a device identification, to the background server. If the screen projection device needs to play the pre-associated advertisement, the identification information of the screen projection content, such as a content identification, may be sent to the background server.
After the background server receives the identification information sent by the screen projection device, the audio characteristics of the pre-associated advertisement corresponding to the identification information can be searched based on the identification information, or the audio characteristics of the screen projection content corresponding to the identification information can be searched, and the searched audio characteristics are fed back to the screen projection device.
The scheme provided by the embodiment of the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played. Furthermore, the screen projection equipment provides a basis for identifying whether the screen projection equipment is inserted with illegal advertisements before playing the screen projection content in a mode of acquiring the preset audio characteristics in advance.
Optionally, in an embodiment of the present invention, after the audio acquisition unit is used to perform audio acquisition on an environment where the screen projection device is located, and obtain a target audio to be processed, before extracting a specified audio feature of the target audio, the method further includes:
judging whether the audio data of the target audio is mute data, if not, executing the extraction of the specified audio characteristics of the target audio; if so, ending.
It can be understood that, in order to identify whether the screen projection device is inserted with an illegal advertisement before playing the screen projection content, it is necessary to acquire the audio of the screen projection device within a preset time period when the screen projection device starts playing the screen projection content. If the audio data of the target audio is the mute data, the specified audio characteristics of the target audio cannot be determined from the mute data. However, if the user adjusts the volume to be mute in the screen projection process of the screen projection device, or after the screen projection, the volume of the screen projection device is small, the acquired audio data may be mute data, and thus the specified audio feature of the target audio cannot be extracted.
For example, the determining whether the audio data of the target audio is mute data may include:
and analyzing the reference audio by using a Voice Activity Detection (VAD) algorithm to determine whether the audio data of the reference audio is mute data. Wherein analyzing the reference audio using the VAD may include: and analyzing the reference audio by adopting a Gaussian Mixture Model (GMM for short) to obtain whether the audio data of the reference audio is mute data.
The scheme provided by the embodiment of the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played. Further, after the audio data of the target audio is judged to be the mute data, the process can be ended, so that the extraction of the specified features and the process of feature matching are not needed, and the operation resources can be saved.
As shown in fig. 3, an embodiment of the present invention further provides a schematic diagram of an illegal advertisement determination process. In fig. 3, the process of extracting mel-frequency cepstrum coefficients from the preset associated advertisement and screen-shot content may be performed in advance and stored in the audio feature library in advance. In the audio feature library, the Mel frequency cepstrum coefficient of each advertisement is associated with an advertisement identifier of the advertisement, and the advertisement identifier is associated with a device identifier of a voting device to which the advertisement is put. The mel-frequency cepstrum coefficient of each projected content is associated with the projected content. After the screen projection equipment acquires the target audio acquired by audio data acquisition, the Mel frequency cepstrum coefficient of the target audio can be extracted, further, according to the Mel frequency cepstrum coefficient of the target audio, feature matching is carried out on the Mel frequency cepstrum coefficient of a preset associated advertisement or the Mel frequency cepstrum coefficient of voting content, and further according to a matching result, whether illegal advertisements are inserted into the screen projection equipment when the target audio is played or not is determined.
Corresponding to the above method embodiment, as shown in fig. 4, an embodiment of the present invention further provides an illegal advertisement determination apparatus, which is applied to a screen projection device, where the apparatus includes:
the audio obtaining module 401 is configured to obtain a target audio to be processed, where the target audio is an audio collected by the screen projection device within a preset time period when the screen projection device starts to play the screen projection content;
a feature extraction module 402, configured to extract a specified audio feature of the target audio; wherein, the specified audio features are feature information for uniquely identifying the target audio;
a feature matching module 403, configured to determine whether the specified audio feature matches a preset audio feature, where the preset audio feature is an audio feature in an advertisement associated with the target audio in advance, or the preset audio feature is an audio feature in the target audio
And the advertisement determining module 404 is configured to determine that an illegal advertisement is not inserted into the screen projection device when the target audio is played if the specified audio characteristics are matched with the preset audio characteristics.
Optionally, the specified audio features comprise mel-frequency cepstral coefficients.
Optionally, the feature extraction module is specifically configured to pre-process the target audio to obtain a pre-processed target audio; wherein the preprocessing comprises at least one of pre-emphasis processing, framing processing and windowing processing; calculating spectral line energy of the preprocessed target audio; and calculating a Mel frequency cepstrum coefficient of the preprocessed target audio based on spectral line energy of the preprocessed target audio to obtain the designated audio characteristics of the target audio.
Optionally, the audio obtaining module comprises: and the audio acquisition submodule is used for acquiring audio of the environment where the screen projection equipment is located by using the audio acquisition unit after the screen projection equipment plays the screen projection content, so as to obtain target audio to be processed.
Optionally, the apparatus further comprises: the mute judgment module is used for judging whether the audio data of the target audio is the mute data or not after the audio acquisition submodule and before the feature extraction module, and if not, extracting the specified audio feature of the target audio; if so, ending.
Optionally, the apparatus further comprises: and the information sending module is used for sending the identification information of the screen projection equipment or the identification information of the screen projection content to the background server before the characteristic matching module judges whether the specified audio characteristic is matched with the preset audio characteristic, acquiring the audio characteristic in the advertisement which is searched for based on the identification information and is pre-associated with the screen projection equipment and fed back by the background server, and taking the audio characteristic as the preset audio characteristic.
According to the scheme provided by the embodiment of the invention, the target audio is the audio collected by the screen projection equipment in the preset time period for starting to play the screen projection content, if the screen projection equipment is inserted with an illegal advertisement, the target audio should contain the audio inserted with the illegal advertisement for playing, otherwise, if the voting equipment is not inserted with the illegal advertisement, the target audio contains the audio of the advertisement associated in advance or the audio of the screen projection content for playing. Further, after the specified audio features of the target audio are extracted, it may be determined whether the specified audio features are matched with the preset audio features, and if the specified audio features are matched with the preset audio features, it is indicated that the target audio is an audio for playing an associated advertisement or an audio for playing screen-shot content, that is, the screen-shot device plays the associated advertisement or the screen-shot content instead of the non-advertisement within a preset time period for starting playing the screen-shot content. Therefore, the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played.
An embodiment of the present invention further provides an electronic device, as shown in fig. 5, which includes a processor 501, a communication interface 502, a memory 503 and a communication bus 504, where the processor 501, the communication interface 502 and the memory 503 complete mutual communication through the communication bus 504,
a memory 503 for storing a computer program;
the processor 501 is configured to implement the steps of any illegal advertisement specifying method when executing the program stored in the memory 503.
According to the electronic device provided by the embodiment of the invention, as the target audio is the audio collected by the screen projection device in the preset time period for starting to play the screen projection content, if the screen projection device is inserted with an illegal advertisement, the target audio should contain the audio inserted with the illegal advertisement for playing, otherwise, if the voting device is not inserted with the illegal advertisement, the target audio contains the audio of the advertisement associated in advance or the audio of the screen projection content for playing. Further, after the specified audio features of the target audio are extracted, it may be determined whether the specified audio features are matched with the preset audio features, and if the specified audio features are matched with the preset audio features, the target audio is an audio for playing the associated advertisement or an audio for playing the screen-shot content, that is, the screen-shot device plays the associated advertisement or the screen-shot content instead of the non-advertisement within the preset time period for starting playing the screen-shot content. Therefore, the invention can realize the identification of whether the screen projection equipment is inserted with illegal advertisements before the screen projection content is played.
The communication bus mentioned in the above terminal may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.
The communication interface is used for communication between the terminal and other equipment.
The Memory may include a Random Access Memory (RAM) or a non-volatile Memory (non-volatile Memory), such as at least one disk Memory. Alternatively, the memory may be at least one memory device located remotely from the processor.
The Processor may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the Integrated Circuit may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, or a discrete hardware component.
In still another embodiment of the present invention, a computer-readable storage medium is further provided, in which a computer program is stored, and the computer program, when executed by a processor, implements the illegal advertisement determination method described in any of the above embodiments.
In yet another embodiment of the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform the illegal advertisement determination method described in any of the above embodiments.
In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When loaded and executed on a computer, cause the processes or functions described in accordance with the embodiments of the invention to occur, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, from one website site, computer, server, or data center to another website site, computer, server, or data center via wired (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that incorporates one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for embodiments such as the apparatus, the electronic device, and the storage medium, since they are substantially similar to the method embodiments, the description is relatively simple, and for the relevant points, reference may be made to the partial description of the method embodiments.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (10)

1. An illegal advertisement determination method is applied to screen projection equipment, and comprises the following steps:
obtaining target audio to be processed, wherein the target audio is audio collected by the screen projection equipment within a preset time period for starting to play screen projection content;
extracting specified audio features of the target audio; wherein the specified audio features are feature information for uniquely identifying the target audio;
judging whether the specified audio features are matched with preset audio features, wherein the preset audio features are audio features in advertisements which are associated with the screen projection equipment in advance, or the preset audio features are audio features in the screen projection contents;
and if the specified audio features are matched with preset audio features, determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played.
2. The method of claim 1, wherein the specified audio features comprise mel-frequency cepstral coefficients.
3. The method of claim 2, wherein the extracting the specified audio feature of the target audio comprises:
preprocessing the target audio to obtain a preprocessed target audio; wherein the pre-processing comprises at least one of pre-emphasis processing, framing processing, and windowing processing;
calculating spectral line energy of the preprocessed target audio;
and calculating a Mel frequency cepstrum coefficient of the preprocessed target audio based on the spectral line energy of the preprocessed target audio to obtain the designated audio characteristics of the target audio.
4. The method according to any one of claims 1-3, wherein the obtaining target audio to be processed comprises:
and after the screen projection equipment plays the screen projection content, utilizing an audio acquisition unit to acquire environmental audio to obtain target audio to be processed.
5. The method of claim 4, wherein after the capturing of the environmental audio by the audio capturing unit to obtain the target audio to be processed and before the extracting of the specified audio feature of the target audio, the method further comprises:
judging whether the audio data of the target audio is mute data, if not, executing the extraction of the specified audio characteristics of the target audio; if so, ending.
6. The method of claim 4, wherein prior to said determining whether the specified audio feature matches a preset audio feature, the method further comprises:
and sending the identification information of the screen projection equipment or the identification information of the screen projection content to a background server, and acquiring the audio characteristics in the advertisement which is fed back by the background server and is searched based on the identification information and is associated with the screen projection equipment in advance as preset audio characteristics.
7. An illegal advertisement determination device applied to screen projection equipment, the device comprising:
the audio acquisition module is used for acquiring target audio to be processed, wherein the target audio is audio collected by the screen projection equipment within a preset time period for starting to play screen projection content;
the characteristic extraction module is used for extracting the specified audio characteristic of the target audio; wherein the specified audio features are feature information for uniquely identifying the target audio;
a feature matching module, configured to determine whether the specified audio feature matches a preset audio feature, where the preset audio feature is an audio feature in an advertisement associated with the target audio in advance, or the preset audio feature is an audio feature in the target audio
And the advertisement determining module is used for determining that illegal advertisements are not inserted into the screen projection equipment when the target audio is played if the specified audio characteristics are matched with preset audio characteristics.
8. The apparatus of claim 7, wherein the specified audio features comprise mel-frequency cepstral coefficients.
9. An electronic device is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor and the communication interface are used for realizing mutual communication by the memory through the communication bus;
a memory for storing a computer program;
a processor for implementing the method steps of any of claims 1-6 when executing a program stored in the memory.
10. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of claims 1 to 6.
CN202110824317.7A 2021-07-21 2021-07-21 Illegal advertisement determination method and device, electronic equipment and storage medium Pending CN113556605A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110824317.7A CN113556605A (en) 2021-07-21 2021-07-21 Illegal advertisement determination method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110824317.7A CN113556605A (en) 2021-07-21 2021-07-21 Illegal advertisement determination method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113556605A true CN113556605A (en) 2021-10-26

Family

ID=78103839

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110824317.7A Pending CN113556605A (en) 2021-07-21 2021-07-21 Illegal advertisement determination method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113556605A (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140033049A1 (en) * 2007-10-30 2014-01-30 Adobe Systems Incorporated Context recognition through screensharing
US9215239B1 (en) * 2012-09-28 2015-12-15 Palo Alto Networks, Inc. Malware detection based on traffic analysis
WO2016172328A1 (en) * 2015-04-24 2016-10-27 Vid Scale, Inc. Content protection and modification detection in adaptive streaming and transport streams
CN206226634U (en) * 2016-10-13 2017-06-06 上海分众软件技术有限公司 It is wireless to throw screen system in real time
CN108920937A (en) * 2018-07-03 2018-11-30 广州视源电子科技股份有限公司 It throws screen system, throw screen method and apparatus
CN109982322A (en) * 2019-03-26 2019-07-05 连尚(新昌)网络科技有限公司 A kind of throwing screen method, equipment, system and storage medium
US10418065B1 (en) * 2006-01-21 2019-09-17 Advanced Anti-Terror Technologies, Inc. Intellimark customizations for media content streaming and sharing
CN110458591A (en) * 2019-06-14 2019-11-15 深圳壹账通智能科技有限公司 Advertising information detection method, device and computer equipment
CN110933484A (en) * 2019-11-25 2020-03-27 泰康保险集团股份有限公司 Management method and device of wireless screen projection equipment
CN111147953A (en) * 2020-01-03 2020-05-12 北京勾正数据科技有限公司 Method and device for detecting television advertisement delivery flow
CN111459433A (en) * 2020-03-30 2020-07-28 广州视源电子科技股份有限公司 Screen transmission method, equipment and storage medium
CN112116394A (en) * 2020-09-24 2020-12-22 北京明略昭辉科技有限公司 Method and computer-readable storage medium for advertisement placement monitoring
US20210097181A1 (en) * 2019-09-26 2021-04-01 At&T Intellectual Property I, L.P. Ransomware detection and mitigation

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10418065B1 (en) * 2006-01-21 2019-09-17 Advanced Anti-Terror Technologies, Inc. Intellimark customizations for media content streaming and sharing
US20140033049A1 (en) * 2007-10-30 2014-01-30 Adobe Systems Incorporated Context recognition through screensharing
US9215239B1 (en) * 2012-09-28 2015-12-15 Palo Alto Networks, Inc. Malware detection based on traffic analysis
WO2016172328A1 (en) * 2015-04-24 2016-10-27 Vid Scale, Inc. Content protection and modification detection in adaptive streaming and transport streams
CN206226634U (en) * 2016-10-13 2017-06-06 上海分众软件技术有限公司 It is wireless to throw screen system in real time
CN108920937A (en) * 2018-07-03 2018-11-30 广州视源电子科技股份有限公司 It throws screen system, throw screen method and apparatus
CN109982322A (en) * 2019-03-26 2019-07-05 连尚(新昌)网络科技有限公司 A kind of throwing screen method, equipment, system and storage medium
CN110458591A (en) * 2019-06-14 2019-11-15 深圳壹账通智能科技有限公司 Advertising information detection method, device and computer equipment
US20210097181A1 (en) * 2019-09-26 2021-04-01 At&T Intellectual Property I, L.P. Ransomware detection and mitigation
CN110933484A (en) * 2019-11-25 2020-03-27 泰康保险集团股份有限公司 Management method and device of wireless screen projection equipment
CN111147953A (en) * 2020-01-03 2020-05-12 北京勾正数据科技有限公司 Method and device for detecting television advertisement delivery flow
CN111459433A (en) * 2020-03-30 2020-07-28 广州视源电子科技股份有限公司 Screen transmission method, equipment and storage medium
CN112116394A (en) * 2020-09-24 2020-12-22 北京明略昭辉科技有限公司 Method and computer-readable storage medium for advertisement placement monitoring

Similar Documents

Publication Publication Date Title
JP6855527B2 (en) Methods and devices for outputting information
US9832523B2 (en) Commercial detection based on audio fingerprinting
WO2020181824A1 (en) Voiceprint recognition method, apparatus and device, and computer-readable storage medium
CN110782920B (en) Audio recognition method and device and data processing equipment
CN107612815B (en) Information sending method, device and equipment
CN107507626B (en) Mobile phone source identification method based on voice frequency spectrum fusion characteristics
CN111640411B (en) Audio synthesis method, device and computer readable storage medium
CN108364656B (en) Feature extraction method and device for voice playback detection
WO2021042537A1 (en) Voice recognition authentication method and system
CN113010139B (en) Screen projection method and device and electronic equipment
CN110428835B (en) Voice equipment adjusting method and device, storage medium and voice equipment
US9058384B2 (en) System and method for identification of highly-variable vocalizations
CN104091596A (en) Music identifying method, system and device
US11520806B1 (en) Tokenized voice authenticated narrated video descriptions
CN107197404B (en) Automatic sound effect adjusting method and device and recording and broadcasting system
CN111737515B (en) Audio fingerprint extraction method and device, computer equipment and readable storage medium
CN113556605A (en) Illegal advertisement determination method and device, electronic equipment and storage medium
CN111294642B (en) Video stream playing method and device
CN111627416A (en) Audio noise elimination method, device, equipment and storage medium
CN108630208B (en) Server, voiceprint-based identity authentication method and storage medium
CN114125368B (en) Conference audio participant association method and device and electronic equipment
CN112634942B (en) Method for identifying originality of mobile phone recording, storage medium and equipment
CN111145769A (en) Audio processing method and device
WO2024082928A1 (en) Voice processing method and apparatus, and device and medium
CN113707183A (en) Audio processing method and device in video

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20211026