CN108205550A - The generation method and device of audio-frequency fingerprint - Google Patents

The generation method and device of audio-frequency fingerprint Download PDF

Info

Publication number
CN108205550A
CN108205550A CN201611173755.7A CN201611173755A CN108205550A CN 108205550 A CN108205550 A CN 108205550A CN 201611173755 A CN201611173755 A CN 201611173755A CN 108205550 A CN108205550 A CN 108205550A
Authority
CN
China
Prior art keywords
audio
audio file
fingerprint
frequency fingerprint
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611173755.7A
Other languages
Chinese (zh)
Other versions
CN108205550B (en
Inventor
吴岩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kuwo Technology Co Ltd
Original Assignee
Beijing Kuwo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kuwo Technology Co Ltd filed Critical Beijing Kuwo Technology Co Ltd
Priority to CN201611173755.7A priority Critical patent/CN108205550B/en
Publication of CN108205550A publication Critical patent/CN108205550A/en
Application granted granted Critical
Publication of CN108205550B publication Critical patent/CN108205550B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/635Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present embodiments relate to the generation methods and device of a kind of audio-frequency fingerprint.Including:The second audio file based on pcm encoder, audio file of second audio file for the interception at the first audio file first time are intercepted according to the first audio file;Multiple sub fingerprints are obtained according to second audio file;Audio-frequency fingerprint of the setting quantity sub fingerprint as first audio file in the multiple sub fingerprint is intercepted at the second time.Audio-frequency fingerprint of various formatted audio files a string of the identifiers extracting and be calculated as audio file can be directed to, song is identified with this, even if it is also that will not change to change information, the audio-frequency fingerprints such as singer's name of song, album name.

Description

The generation method and device of audio-frequency fingerprint
Technical field
The present invention relates to audio data processing technology field more particularly to the generation methods and device of a kind of audio-frequency fingerprint.
Background technology
Audio file is generally comprised to store the identification informations such as singer, title, album name, age and style Data segment, for example, for the audio file of MP3 format, the storage mark letter generally in the ID3 information of the MP3 audio files Breath.When playing audio file, usually by reading the identification information being stored in the data segment of identification information, which is believed Breath is shown in broadcast interface, is supplied to user.
But being constantly progressive with technology, it, can be easily in audio file in order to evade copyright and other reasons The data segment of storage identification information is modified or is deleted.For this kind of audio file, when playing out, it will nothing occur Method correctly identifies the situation of song, this will certainly influence the appreciation experience of song.
Invention content
An embodiment of the present invention provides the generation methods and device of a kind of audio-frequency fingerprint.By extracting taking in audio file Audio-frequency fingerprint of a string of the identifiers for going out and being calculated as audio file identifies song with this, can change in ID3 information etc. After change, song still can not can be correctly identified.
On the one hand, an embodiment of the present invention provides a kind of generation method of audio-frequency fingerprint, including:
It is encoded according to the interception of the first audio file based on pulse code modulation (Pulse Code Modulation, PCM) Second audio file, audio file of second audio file for the interception at the first audio file first time;
Multiple sub fingerprints are obtained according to second audio file;
The setting quantity sub fingerprint in the multiple sub fingerprint is intercepted at the second time as first sound The audio-frequency fingerprint of frequency file.
Optionally, it further includes:
It determines source audio file, the source audio file is converted into first audio file.
Optionally, the first time is 45 seconds.
Optionally, second time is more than 32 seconds, and less than the first time.
Optionally, the quantity that sets is 512.
On the other hand, an embodiment of the present invention provides a kind of methods that audio-frequency fingerprint is added in audio file data library. The audio file data library includes multiple audio files, the method includes:
Determine at least one audio file for not including audio-frequency fingerprint in the multiple audio file;
Calculate each corresponding multiple sub fingerprints at least one audio file;
It generates at least one audio file and refers to more than the audio-frequency fingerprint of the audio file of first time, the audio Line is the setting quantity sub fingerprint that intercepts at the first time more than the audio file of first time;
Database statement is generated, and by audio-frequency fingerprint addition in the database according to the audio-frequency fingerprint.
Another aspect, an embodiment of the present invention provides a kind of generating means of audio-frequency fingerprint.Including:
Interception unit, for intercepting the second audio file based on pcm encoder, second sound according to the first audio file Audio file of the frequency file for the interception at the first audio file first time;
Sub fingerprint generation unit, for obtaining multiple sub fingerprints according to second audio file;
Audio-frequency fingerprint generation unit, for intercepting the setting quantity in the multiple sub fingerprint at the second time Audio-frequency fingerprint of the sub fingerprint as first audio file.
Optionally, it further includes:
The source audio file for determining source audio file, is converted to first audio file by determination unit.
Optionally, second time is more than 32 seconds, and less than the first time.
In another aspect, an embodiment of the present invention provides a kind of devices that audio-frequency fingerprint is added in audio file data library. The audio file data library includes multiple audio files, and described device includes:
Determination unit, for determining not include at least one audio file of audio-frequency fingerprint in the multiple audio file;
Sub fingerprint generation unit, for calculating each corresponding multiple sub fingerprints at least one audio file;
Audio-frequency fingerprint generation unit, for generating the audio file for being more than first time at least one audio file Audio-frequency fingerprint, the audio-frequency fingerprint is the setting number that intercepts at the first time more than the audio file of first time Amount sub fingerprint;
Adding device for generating database statement according to the audio-frequency fingerprint, and the audio-frequency fingerprint is added in institute It states in database.
Through the embodiment of the present invention, it can be extracted for various formatted audio files and a string of identifier conducts are calculated The audio-frequency fingerprint of audio file identifies song with this, even if the information such as the singer name of change song, album name, audio-frequency fingerprint And it will not change.
Description of the drawings
Fig. 1 is a kind of flow chart of the generation method of audio-frequency fingerprint provided in an embodiment of the present invention;
Fig. 2 is a kind of method flow that audio-frequency fingerprint is added in audio file data library provided in an embodiment of the present invention Figure;
Fig. 3 is a kind of generating means structure diagram of audio-frequency fingerprint provided in an embodiment of the present invention;
Fig. 4 is that a kind of apparatus structure that audio-frequency fingerprint is added in audio file data library provided in an embodiment of the present invention shows It is intended to.
Specific embodiment
Purpose, technical scheme and advantage to make the embodiment of the present invention are clearer, below in conjunction with the embodiment of the present invention In attached drawing, the technical solution in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is Part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art All other embodiments obtained without making creative work shall fall within the protection scope of the present invention.
The embodiment of the present invention is converted into reference format by the audio file to different arbitrary versions, according to the reticle The audio file of formula goes to extract, so being not in since standard for manual sampling caused by the multi version problem of audio file is skimble-scamble Problem, meanwhile, during fingerprint is calculated, by down-sampled, Fourier transformation mode has sampled song portions fingerprint, both The uniqueness of the fingerprint generation of various audio files is met, also identifies that the fingerprint efficiency of the audio file provides base to improve Plinth.
For ease of the understanding to the embodiment of the present invention, it is further explained below in conjunction with attached drawing with specific embodiment Bright, embodiment does not form the restriction to the embodiment of the present invention.
Fig. 1 is a kind of flow chart of the generation method of audio-frequency fingerprint provided in an embodiment of the present invention.As shown in Figure 1, the party Method specifically includes:
S110, according to the first audio file intercept the second audio file based on pcm encoder, second audio file be The audio file intercepted at first audio file first time.
First audio file is the audio file of reference format, and the form of first audio file can be that WMA etc. is general Audio file form.
Source audio file, that is, need the audio file being identified using audio-frequency fingerprint, it is understood that there may be multiple versions Source audio file is converted to the audio file of reference format by multiple format first so that when audio-frequency fingerprint generates, the system of sampling One, improve the accuracy of audio-frequency fingerprint.
When according to the first audio file generation audio-frequency fingerprint, a part for the audio file, a part of sound can be intercepted Frequency fingerprint is the data fingerprint for being regarded as the audio source file.The part is the audio file based on pcm encoder.
Specifically, first audio file is played using Mplayer, first audio file was intercepted at 45 seconds from the beginning of The second audio file being played at this 45 seconds, second audio file are the WAV audio format files based on pcm encoder, with Analog signal ratio, it is not easy to be influenced by the clutter of conveyer system and distortion, and wide dynamic range can obtain the fairly good shadow of sound quality Ring effect.It should be appreciated that the duration of the second audio file is longer, recognition accuracy is higher, and 45 seconds are only the one of the present invention A example is not formed and is limited.
S120 obtains multiple sub fingerprints according to the second audio file.
Wherein, the generating process of multiple sub fingerprints is described in detail below:
The second audio file is carried out for various sound channels and sample rate down-sampled.It is handled by Hanning window, eliminates high frequency Interference and leakage energy, carry out Fourier transformation.Energy is calculated in frequency domain by calculating frequency domain amplitude and each frequency band.It calculates Go out energy differences, difference WRT is more than to 0 typing fingerprint, obtains sub fingerprint.
It is actually also frequency information that audio, which is realized, each sampled point record is amplitude of the waveform in the point, for For one audio file, he is characterized on frequency information.
In one example, the generation of sub fingerprint specifically comprises the following steps:
1st, a frame audio-frequency information of the second down-sampled audio file is passed through in extraction.
2nd, it is handled by Hanning window, eliminates High-frequency Interference and leakage energy, carry out Fourier transformation.
3rd, according to the second audio file after Fourier transformation, amplitude information is changed into energy information.
4th, the result of energy information is taken absolute value.
5th, frequency is mapped to 9 frequency bands, calculates each frequency band energy in 300---2000.
According to 300---2000HZ frequency bark values, be divided into 9 frequency band, calculate each frequency band energy and.
6. the generation sub fingerprint compared with previous frame energy value.
We obtain 9 energy informations, E [1....9], E_ [i]=[i+1]-E [i];F [n, M] represents n-th frame, E_'s [M] Value.
F if [n, M]-F [n-1, M]>0 sub fingerprint M is 1, is otherwise 0, in this way can be according to the comparison of two frames Generate the sub fingerprint of 8 bytes.
S130, at the second time intercept setting quantity sub fingerprint in multiple sub fingerprints as the first audio text The audio-frequency fingerprint of part.
It can determine that the second audio file is corresponding with multiple sub fingerprints according to aforementioned S110, S120, multiple son can be intercepted and referred to A part for line, the combination of a part of sub fingerprint are the audio-frequency fingerprint of the first audio file or source audio file.
Specifically, it may be determined that it is corresponding more to intercept second audio file since at the second time for the second audio file The sub fingerprint of quantity is set in a sub fingerprint as audio-frequency fingerprint.Wherein, when which may be greater than 32 seconds less than first Between random time, such as at the first time for 45 seconds, the second time can be 32 seconds or 35 seconds etc., can avoid audio text in this way The prelude of part enhances different song fingerprints othernesses.It can be 512 sub fingerprints (corresponding son of general 6 seconds audios to set quantity Fingerprint).
The data line data example of generation:5939cd89,5d39dd8b, 5d39dda3 ... ... (omit 508 sons to refer to Line), a96a76ab.
It should be noted that the initial value of multiple sub fingerprints of the second audio file of interception is the second time, this second when Between for 32 seconds be only an example provided in an embodiment of the present invention, form limit.
It should also be noted that, the corresponding fingerprint of 6 seconds audios of interception is only an example provided in an embodiment of the present invention, and Restriction is not formed.The bigger the time span for calculating fingerprint the more accurate, and the smaller efficiency of time span is higher.6 seconds fingerprints are only calculated to know It is not efficient, and recognition effect can reach 95%.
Through the embodiment of the present invention, the extraction of various formatted audio files can be directed to and a string of identifiers is calculated as sound The audio-frequency fingerprint of frequency file, a string of character strings are corresponding with audio file, and the probability for identical audio-frequency fingerprint occur is very small, Song is identified with this, even if information, the audio-frequency fingerprint such as singer's name of change song, album name are also that will not change.
Fig. 2 is a kind of method flow that audio-frequency fingerprint is added in audio file data library provided in an embodiment of the present invention Figure.As shown in Fig. 2, audio file data library includes multiple audio files, this method specifically includes:
S210 determines at least one audio file for not including audio-frequency fingerprint in multiple audio files.
Audio file data library generally comprises multiple audio files, which a part of may possess audio and refer to Line, a part do not have.It can be examined in, determine whether each audio file has been computed audio-frequency fingerprint, will not count The audio file of calculation adds in miss and (misses) list.
The miss lists generally comprise at least one audio file, which does not all calculate audio and refer to Line.
S220 calculates each corresponding multiple sub fingerprints at least one audio file.
Audio-frequency fingerprint is calculated respectively at least one audio file that miss lists include.
First, the corresponding multiple sub fingerprints of each audio file in miss lists are calculated, the calculation of the sub fingerprint can Referring to the description in S120 in aforementioned embodiment shown in FIG. 1, repeat no more.
S230 is generated at least one audio file and is referred to more than the audio-frequency fingerprint of the audio file of first time, the audio Line is the setting quantity sub fingerprint that intercepts at the first time more than the audio file of first time.
Wherein, the generation of audio-frequency fingerprint can be found in the description in embodiment shown in FIG. 1 in S130.
When in embodiments of the present invention, due to generation audio-frequency fingerprint, need to intercept sub fingerprint since at the first time, for Audio file in miss lists may include the audio file that a part is less than first time length, further include a part of big In the audio file of first time length.Wherein, the second time in aforementioned embodiment illustrated in fig. 1, example be can be found at the first time It such as can be 32 seconds.
It needs to calculate audio-frequency fingerprint to the audio file for being more than first time length.
For being less than the audio file of first time length when calculating audio-frequency fingerprint, it may appear that the situation of failure is calculated, The mark of all audio files for calculating failure of merger.
S240 generates database statement, and the audio-frequency fingerprint is added in the database according to the audio-frequency fingerprint In.
For properly generating the audio file of audio-frequency fingerprint, the audio file is identified using the audio-frequency fingerprint, according to the sound Frequency fingerprint creation MYSQL sentences, the operations such as to be inquired the audio file, deleted according to the MYSQL sentences.By the sound Frequency fingerprint is according to its correspondence with audio file, and addition is in the database.
Song fingerprints can be added, and count addition successfully and do not add into each audio file in database with this The song files of work(.
Fig. 3 is a kind of generating means structure diagram of audio-frequency fingerprint provided in an embodiment of the present invention.It as shown in figure 3, should Device includes:
Interception unit 301, for intercepting the second audio file based on pcm encoder according to the first audio file, described the Audio file of two audio files for the interception at the first audio file first time;
Sub fingerprint generation unit 302, for obtaining multiple sub fingerprints according to second audio file;
Audio-frequency fingerprint generation unit 303, for intercepting the setting number in the multiple sub fingerprint at the second time Audio-frequency fingerprint of the amount sub fingerprint as first audio file.
Optionally, it further includes:
The source audio file for determining source audio file, is converted to first audio file by determination unit.
Optionally, the first time is 45 seconds.
Optionally, second time is more than 32 seconds, and less than the first time.
Optionally, the quantity that sets is 512.
Fig. 4 is that a kind of apparatus structure that audio-frequency fingerprint is added in audio file data library provided in an embodiment of the present invention shows It is intended to.The audio file data library includes multiple audio files, and as described in Figure 4, which includes:
Determination unit 401, for determining at least one audio text for not including audio-frequency fingerprint in the multiple audio file Part;
Sub fingerprint generation unit 402, for calculating each corresponding multiple sub fingerprints at least one audio file;
Audio-frequency fingerprint generation unit 403, for generating the audio for being more than first time at least one audio file The audio-frequency fingerprint of file, the audio-frequency fingerprint are set for what is intercepted at the first time more than the audio file of first time Fixed number amount sub fingerprint;
Adding device 404 for generating database statement according to the audio-frequency fingerprint, and audio-frequency fingerprint addition is existed In the database.
Professional should further appreciate that, be described with reference to the embodiments described herein each exemplary Unit and algorithm steps can be realized with the combination of electronic hardware, computer software or the two, hard in order to clearly demonstrate The interchangeability of part and software generally describes each exemplary composition and step according to function in the above description. These functions are performed actually with hardware or software mode, specific application and design constraint depending on technical solution. Professional technician can realize described function to each specific application using distinct methods, but this realization It is it is not considered that beyond the scope of this invention.
The step of method or algorithm for being described with reference to the embodiments described herein, can use hardware, processor to perform The combination of software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only memory (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technical field In any other form of storage medium well known to interior.
Above-described specific embodiment has carried out the purpose of the present invention, technical solution and advantageous effect further It is described in detail, it should be understood that the foregoing is merely the specific embodiment of the present invention, is not intended to limit the present invention Protection domain, all any modification, equivalent substitution, improvement and etc. within the scope of the present invention, done should be included in this hair Within bright protection domain.

Claims (10)

1. a kind of generation method of audio-frequency fingerprint, which is characterized in that including:
The second audio file based on pcm encoder is intercepted according to the first audio file, second audio file is described the The audio file intercepted at one audio file first time;
Multiple sub fingerprints are obtained according to second audio file;
The setting quantity sub fingerprint in the multiple sub fingerprint is intercepted at the second time as first audio text The audio-frequency fingerprint of part.
2. it according to the method described in claim 1, it is characterized in that, further includes:
It determines source audio file, the source audio file is converted into first audio file.
3. method according to claim 1 or 2, which is characterized in that the first time is 45 seconds.
4. method according to claim 1 or 2, which is characterized in that second time is more than 32 seconds, and less than described the One time.
5. according to the method described in claim 4, it is characterized in that, the quantity that sets is 512.
A kind of 6. method that audio-frequency fingerprint is added in audio file data library, which is characterized in that the audio file data library Including multiple audio files, the method includes:
Determine at least one audio file for not including audio-frequency fingerprint in the multiple audio file;
Calculate each corresponding multiple sub fingerprints at least one audio file;
It generates at least one audio file and is more than the audio-frequency fingerprint of the audio file of first time, the audio-frequency fingerprint The setting quantity sub fingerprint intercepted at the first time more than the audio file of first time;
Database statement is generated, and by audio-frequency fingerprint addition in the database according to the audio-frequency fingerprint.
7. a kind of generating means of audio-frequency fingerprint, which is characterized in that including:
Interception unit, for intercepting the second audio file based on pcm encoder, the second audio text according to the first audio file Audio file of the part for the interception at the first audio file first time;
Sub fingerprint generation unit, for obtaining multiple sub fingerprints according to second audio file;
Audio-frequency fingerprint generation unit refers to for intercepting the setting quantity height in the multiple sub fingerprint at the second time Audio-frequency fingerprint of the line as first audio file.
8. device according to claim 7, which is characterized in that further include:
The source audio file for determining source audio file, is converted to first audio file by determination unit.
9. device according to claim 7 or 8, which is characterized in that second time is more than 32 seconds, and less than described the One time.
A kind of 10. device that audio-frequency fingerprint is added in audio file data library, which is characterized in that the audio file data library Including multiple audio files, described device includes:
Determination unit, for determining not include at least one audio file of audio-frequency fingerprint in the multiple audio file;
Sub fingerprint generation unit, for calculating each corresponding multiple sub fingerprints at least one audio file;
Audio-frequency fingerprint generation unit, for generating at least one audio file more than the sound of the audio file of first time Frequency fingerprint, the audio-frequency fingerprint are the setting quantity that intercepts at the first time more than the audio file of first time Sub fingerprint;
Adding device for generating database statement according to the audio-frequency fingerprint, and the audio-frequency fingerprint is added in the number According in library.
CN201611173755.7A 2016-12-16 2016-12-16 Audio fingerprint generation method and device Active CN108205550B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611173755.7A CN108205550B (en) 2016-12-16 2016-12-16 Audio fingerprint generation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611173755.7A CN108205550B (en) 2016-12-16 2016-12-16 Audio fingerprint generation method and device

Publications (2)

Publication Number Publication Date
CN108205550A true CN108205550A (en) 2018-06-26
CN108205550B CN108205550B (en) 2021-03-12

Family

ID=62601719

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611173755.7A Active CN108205550B (en) 2016-12-16 2016-12-16 Audio fingerprint generation method and device

Country Status (1)

Country Link
CN (1) CN108205550B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113641423A (en) * 2021-08-31 2021-11-12 青岛海信传媒网络技术有限公司 Display device and system starting method

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101334822A (en) * 2007-06-28 2008-12-31 汤姆森许可贸易公司 Method and device for implementing video processing copyright
EP2067508A1 (en) * 2007-11-29 2009-06-10 AMBX UK Limited A method for providing a sensory effect to augment an experience provided by a video game
CN101651694A (en) * 2009-09-18 2010-02-17 北京亮点时间科技有限公司 Method, system, client and server for providing related audio information
CN101853262A (en) * 2009-12-07 2010-10-06 清华大学 Voice frequency fingerprint rapid searching method based on cross entropy
US20110085781A1 (en) * 2009-10-13 2011-04-14 Rovi Technologies Corporation Content recorder timing alignment
US20110282471A1 (en) * 2004-09-27 2011-11-17 Juergen Herre Apparatus and Method for Synchronizing Additional Data and Base Data
CN102270200A (en) * 2010-06-03 2011-12-07 盛乐信息技术(上海)有限公司 Music abstract automatic generation method
CN102289518A (en) * 2011-09-13 2011-12-21 盛乐信息技术(上海)有限公司 Method and system for updating audio fingerprint search library
CN102314875A (en) * 2011-08-01 2012-01-11 北京百度网讯科技有限公司 Audio file identification method and device
CN103621106A (en) * 2011-06-20 2014-03-05 微软公司 Providing video presentation commentary
CN105224581A (en) * 2014-07-03 2016-01-06 北京三星通信技术研究有限公司 The method and apparatus of picture is presented when playing music
CN105825850A (en) * 2016-04-29 2016-08-03 腾讯科技(深圳)有限公司 Audio processing method and device
CN105975568A (en) * 2016-04-29 2016-09-28 腾讯科技(深圳)有限公司 Audio processing method and apparatus

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110282471A1 (en) * 2004-09-27 2011-11-17 Juergen Herre Apparatus and Method for Synchronizing Additional Data and Base Data
CN101334822A (en) * 2007-06-28 2008-12-31 汤姆森许可贸易公司 Method and device for implementing video processing copyright
EP2067508A1 (en) * 2007-11-29 2009-06-10 AMBX UK Limited A method for providing a sensory effect to augment an experience provided by a video game
CN101651694A (en) * 2009-09-18 2010-02-17 北京亮点时间科技有限公司 Method, system, client and server for providing related audio information
US20110085781A1 (en) * 2009-10-13 2011-04-14 Rovi Technologies Corporation Content recorder timing alignment
CN101853262A (en) * 2009-12-07 2010-10-06 清华大学 Voice frequency fingerprint rapid searching method based on cross entropy
CN102270200A (en) * 2010-06-03 2011-12-07 盛乐信息技术(上海)有限公司 Music abstract automatic generation method
CN103621106A (en) * 2011-06-20 2014-03-05 微软公司 Providing video presentation commentary
CN102314875A (en) * 2011-08-01 2012-01-11 北京百度网讯科技有限公司 Audio file identification method and device
CN102289518A (en) * 2011-09-13 2011-12-21 盛乐信息技术(上海)有限公司 Method and system for updating audio fingerprint search library
CN105224581A (en) * 2014-07-03 2016-01-06 北京三星通信技术研究有限公司 The method and apparatus of picture is presented when playing music
CN105825850A (en) * 2016-04-29 2016-08-03 腾讯科技(深圳)有限公司 Audio processing method and device
CN105975568A (en) * 2016-04-29 2016-09-28 腾讯科技(深圳)有限公司 Audio processing method and apparatus

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
刘继新: "基于矢量量化技术的音频信息隐藏算法的研究", 《中国博士学位论文全文数据库 信息科技辑》 *
李伟 等: "数字音频指纹技术综述", 《小型微型计算机系统》 *
沈迤淳: "歌曲中相似片段的检测及其应用", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
郭永帅: "基于音频指纹和版本识别的音乐检索技术研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113641423A (en) * 2021-08-31 2021-11-12 青岛海信传媒网络技术有限公司 Display device and system starting method
CN113641423B (en) * 2021-08-31 2023-07-07 青岛海信传媒网络技术有限公司 Display device and system starting method

Also Published As

Publication number Publication date
CN108205550B (en) 2021-03-12

Similar Documents

Publication Publication Date Title
US11657798B2 (en) Methods and apparatus to segment audio and determine audio segment similarities
WO2017181852A1 (en) Song determining method and device, and storage medium
KR101363534B1 (en) Beat extraction device and beat extraction method
KR101292698B1 (en) Method and apparatus for attaching metadata
Williams Tracking timbral changes in metal productions from 1990 to 2013
JP2008504741A (en) Method for characterizing the overlap of two media segments
CN111640411B (en) Audio synthesis method, device and computer readable storage medium
CN104036788B (en) The acoustic fidelity identification method of audio file and device
CN107103915A (en) A kind of audio data processing method and device
US9704507B2 (en) Methods and systems for decreasing latency of content recognition
CN108280074A (en) The recognition methods of audio and system
Turchet et al. Real-time hit classification in a Smart Cajón
WO2015111014A1 (en) A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use
CN105975568A (en) Audio processing method and apparatus
US20190172432A1 (en) Systems and methods for analyzing components of audio tracks
JP5395399B2 (en) Mobile terminal, beat position estimating method and beat position estimating program
CN109271501B (en) Audio database management method and system
KR20040101299A (en) Feature-based audio content identification
CN106531202A (en) Audio processing method and device
CN108205550A (en) The generation method and device of audio-frequency fingerprint
CN106782601A (en) A kind of multimedia data processing method and its device
CN107886941A (en) A kind of audio mask method and device
JP6263383B2 (en) Audio signal processing apparatus, audio signal processing apparatus control method, and program
KR20160056104A (en) Analyzing Device and Method for User's Voice Tone
Tang et al. Melody Extraction from Polyphonic Audio of Western Opera: A Method based on Detection of the Singer's Formant.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant