CN105791973A

CN105791973A - Resolving method and resolving device based on sound wave watermark

Info

Publication number: CN105791973A
Application number: CN201610130157.5A
Authority: CN
Inventors: 李万欣; 王智鹏; 遆宁; 林荣越; 曹晨; 王娜
Original assignee: Dalian Leyun Information Technology Co Ltd
Current assignee: Dalian Leyun Information Technology Co Ltd
Priority date: 2016-03-07
Filing date: 2016-03-07
Publication date: 2016-07-20

Abstract

The invention discloses a resolving method based on a sound wave watermark. The resolving method comprises the following steps of a step S100, interpolating a sound wave watermark ID into an audio/video file by a sound wave watermark processing module; and a step S200, receiving the played audio/video file by the sound wave water mark processing module, and resolving the sound wave watermark ID. The resolving method has advantages of high identification rate, high interference resistance, real-time detecting performance. One or more program segment can be identified in one second. Furthermore simple system structure and wide application range.

Description

A kind of analytic method based on sound wave watermark and device

Technical field

The present invention relates to voice recognition technology field, particularly relate to a kind of analytic method based on sound wave watermark and device.

Background technology

At present, the voice recognition of scene application has two ways:

First: by Waveform Matching identification.This method is equipped with substantial amounts of wave file on data server, and in time opening reception, resolver can record a section audio, go into wave file and upload to data server, compare with a large amount of wave files therein, carry out voice recognition, and then carry out next step command operating.Such as: wechat shakes function.

There is the problem that in a noisy environment, resolve bad, Consumer's Experience is bad；Wave file to be put in data server in advance.

Second: identify by playing sound near field.Its detailed process is, playing device plays one section of specific sound, after receiving device reception, specific sound is resolved, identifies, and then carry out next step command operating.

There is the problem that needing large-scale data backstage to intercept analyzes.

Summary of the invention

In order to overcome the problem of above-mentioned existing two kinds of voice recognition technologies, the invention provides a kind of analytic method based on sound wave watermark and device.

A kind of analytic method based on sound wave watermark provided by the invention comprises the following steps: step S100: sound wave watermark processing module inserts a sound wave watermark ID in audio-video document；Step S200: sound wave watermark parsing module receives the audio-video document play, and parses sound wave watermark ID.

Wherein, described step S200 includes following sub-step: step S210: receives submodule and listens to audio-video document；Step S220: the sound wave watermark ID in audio-video document is separated by segregant module, and is uploaded to sound wave watermark analyzing platform；Step S230: sound wave watermark ID is decrypted by sound wave watermark analyzing platform.

Wherein, described step S200 farther includes following sub-step: step S240: the ID instruction that the storage of preset table submodule is corresponding with sound wave watermark ID, and this ID instruction is analyzed, is distributed, and reaches implementation sub-module；Step S250: implementation sub-module, according to this ID instruction, performs corresponding order.

Wherein, described sound wave watermark ID is the audio file that high-frequency sound wave changes into hexadecimal sequence form.

The present invention additionally provides a kind of resolver based on sound wave watermark, including: sound wave watermark processing module, for inserting a sound wave watermark ID in audio-video document；Sound wave watermark parsing module, for receiving the audio-video document of broadcasting, parses sound wave watermark ID.

Wherein, described sound wave watermark parsing module farther includes: receives submodule, is used for listening to audio-video document；Segregant module, for being separated by the sound wave watermark ID of the hexadecimal sequence number in audio-video document, and is uploaded to sound wave watermark analyzing platform；Sound wave watermark analyzing platform, for being decrypted the sound wave watermark ID of hexadecimal sequence number.

Wherein, described sound wave watermark parsing module farther includes: preset table submodule, for storing the ID instruction corresponding with sound wave watermark ID, and this ID instruction is analyzed, is distributed, reaches implementation sub-module；Implementation sub-module, for according to this ID instruction, performing corresponding order.

The invention has the beneficial effects as follows:

(1) discrimination is high: applications of sound waves is mainly the coding of sound wave in a core of new media and intelligent terminal and identifies parsing, it is necessary to ensure that discrimination 100%, it is clean that sound wave frequency range does not interfere with comparison, it is the basic guarantee improving discrimination, also can be greatly improved discrimination additionally by Optimized Coding Based rule.

(2) precision marketing: sound wave is also equipped with the function of precision marketing, it is possible to select region or fixed point propelling movement scope user, improves accurate information and pushes.

(3) anti-interference: sound wave interaction technique Non-Destructive Testing, information gathering is more accurate, has safe encrypting and decrypting characteristic, has anti-interference.This is also shake very big difference with wechat, as long as wechat shakes centre brouhaha, shake search less than.Sound wave has then evaded this problem.

(4) detection property in real time: sound wave has the characteristic of detection in real time, as long as the video that plays back of playback equipment or sound contain sound wave watermark, terminal will receive, and then resolves.

(5) per second or within many seconds, may identify which one or more than one program segment.

(6) live, program request, carousel application are gone for；Can be also used for the application such as the loudspeaker of the occasion such as square, station, large-size screen monitors, broadcast.

(7) system structure is simple: front end system has only to increase a ripples watermark and inserts link；Terminal system has only to increase an APP.Other be all use deposit system and equipment.

Accompanying drawing explanation

Fig. 1 is based on the flow chart of the analytic method of sound wave watermark.

Detailed description of the invention

For the technical scheme making to present invention solves the technical problem that, adopting and the technique effect reached clearly, below in conjunction with drawings and Examples, the present invention is described in further detail.It is understood that specific embodiment described herein is used only for explaining the present invention, but not limitation of the invention.It also should be noted that, for the ease of describing, accompanying drawing illustrate only part related to the present invention but not full content.

Refer to Fig. 1, the analytic method based on sound wave watermark comprises the following steps: step S100: sound wave watermark processing module (encoding software) audio-video document (various forms (and MP3 MP4 TS FLV etc.) audio-video document) in insert sound wave watermark ID (hexadecimal sequence number)；Step S200: sound wave watermark parsing module receives the audio-video document play, and parses sound wave watermark ID.

The source code of resolving is as follows:

voidAQRecorder::MyInputBufferHandler(void*inUserData,

AudioQueueRefinAQ,

AudioQueueBufferRefinBuffer,

constAudioTimeStamp*inStartTime,

UInt32inNumPackets,

constAudioStreamPacketDescription*inPacketDesc)

{

AQRecorder*aqr=(AQRecorder*) inUserData；

try{

if(inNumPackets>0){

//writepacketstofile

XThrowIfError(AudioFileWritePackets(aqr->mRecordFile,FALSE,inBuffer->mAudioDataByteSize,

inPacketDesc,aqr->mRecordPacket,&inNumPackets,inBuffer->mAudioData),

"AudioFileWritePacketsfailed")；

Aqr-> mRecordPacket+=inNumPackets；

}

Sscanf (" abc ", " %16 ", inBuffer-> mAudioData)；

if(aqr->IsRunning())

XThrowIfError(AudioQueueEnqueueBuffer(inAQ,inBuffer,0,NULL),"AudioQueueEnqueueBufferfailed")；

}catch(CAXExceptione){

charbuf[256]；

Fprintf (stderr, " Error:%s (%s) n ", e.mOperation, e.FormatError (buf))；

}

Wherein, described step S200 includes following sub-step: step S210: receives submodule (mike of mobile phone) and listens to audio-video document；Step S220: segregant module (is called underlay approach, referring to code) and separated by the sound wave watermark ID in audio-video document, and is uploaded to sound wave watermark analyzing platform (server)；Step S230: sound wave watermark ID is decrypted and (converts the hexadecimal sequence number received to decimal sequence number by sound wave watermark analyzing platform, adopt des encryption standard, in the middle of this sequence numbers match to preset table, find the execution instruction of its correspondence)；Step S240: the ID instruction that the storage of preset table submodule is corresponding with sound wave watermark ID, such as: jump to Baidu, calculate data " with " " percentage rate ", judge be what type instruction (advertisement, news, TV play etc.), and this ID instruction is analyzed, distributes, reach implementation sub-module；Step S250: implementation sub-module, according to this ID instruction, performs corresponding order (mobile phone terminal shows).

Embodiment 1 public transport mobile media

Time by bus, the contents such as news, TV play, various types of programs, advertisement in mobile TV, can be play；But being as the universal of smart mobile phone, the sight line of people has transferred to mobile terminal from mobile TV, and the income of mobile TV advertisement will reduce then, the eyeball of passenger can be attracted back mobile TV by the sound analytic method of the present invention.

Such as mobile TV has various audio frequency and video when playing various types of programs, sound wave watermark ID is added in these programs, when user open mobile phone terminal be simultaneously received in mobile TV play audio frequency, the sound wave watermark ID in program will be resolved to, sound wave watermark ID can be sent to sound wave watermark analyzing platform by network, decipher according to sound wave watermark analyzing platform, then this instruction is performed, return in mobile phone terminal, perform instruction, such as, when playing news, open mobile phone terminal will eject about dependent event and the up-to-date information of news, user may browse through viewing；When playing program, the clothing that occur in program, car, house, cuisines, the content such as cosmetics, by mobile phone terminal, just can jump to link and even can directly buy.

Embodiment 2 home shopping channel is interactive

For the home shopping channel of cable television, current home shopping channel is to make a phone call to place an order or scan Quick Response Code by user to place an order.Sound wave watermark ID can be added in shopping program later, when user opens the audio frequency being simultaneously received home shopping channel broadcasting of mobile phone A PP, the sound wave watermark ID in program will be resolved to, sound wave watermark ID can be sent to sound wave watermark analyzing platform by network, decipher according to sound wave watermark analyzing platform, then perform this instruction, return in mobile phone terminal, performing instruction, such as mobile phone terminal jumps directly to pay the page and places an order；Can as the interactive tool of all television channels, it is possible to understand the TV story of a play or opera for the degree of depth, it is also possible to directly push into the payment page to user mobile phone end for TV play being implanted commodity.

Embodiment 3 TV media carries out the ballot competition of contest

It is in and sees large-scale interaction ballot program simultaneously, open mobile phone terminal, the audio frequency and video sound wave watermark ID of television transmission can be resolved to, it is decrypted by sound wave watermark analyzing platform, corresponding instruction can be performed, mobile phone terminal can redirect the respective program ballot page, the inside has the introduction of program, ballot button etc., this have also been enlarged the scope of ballot, decrease the tedious steps (editing short message sends information ballot etc.) of a lot of programs of voting now, and a cell-phone number can be controlled and can only throw a ticket, be also prevented from brush ticket behavior.

Embodiment 4 TV is interactive with lottery

A lot of satellite TVs now, when playing TV play, advertisement etc., can be illustrated below " shaking " at screen interactive with lottery, the sound identification method of the present invention can be used instead, it is not necessary to shake, as long as receiving sound wave watermark instruction, just can perform, redirect the corresponding interactive page with lottery.

Other application

Also have a lot of places to apply, if commercial circle, night shop, tourism etc. have the place of sound to use.

Last it is noted that various embodiments above is only in order to illustrate technical scheme, it is not intended to limit；Although the present invention being described in detail with reference to foregoing embodiments, it will be understood by those within the art that: the technical scheme described in foregoing embodiments is modified by it, or wherein some or all of technical characteristic is carried out equivalent replacement, does not make the essence of appropriate technical solution depart from the scope of various embodiments of the present invention technical scheme.

Claims

1. the analytic method based on sound wave watermark, it is characterised in that comprise the following steps:

Step S100: sound wave watermark processing module inserts a sound wave watermark ID in audio-video document；

Step S200: sound wave watermark parsing module receives the audio-video document play, and parses sound wave watermark ID.

2. according to claim 1 based on the analytic method of sound wave watermark, it is characterised in that described step S200 includes following sub-step:

Step S210: receive submodule and listen to audio-video document；

Step S220: the sound wave watermark ID in audio-video document is separated by segregant module, and is uploaded to sound wave watermark analyzing platform；

Step S230: sound wave watermark ID is decrypted by sound wave watermark analyzing platform.

3. according to claim 2 based on the analytic method of sound wave watermark, it is characterised in that described step S200 farther includes following sub-step:

Step S240: the ID instruction that the storage of preset table submodule is corresponding with sound wave watermark ID, and this ID instruction is analyzed, distributes, reach implementation sub-module；

Step S250: implementation sub-module, according to this ID instruction, performs corresponding order.

4. based on the analytic method of sound wave watermark according to claim 1,2 or 3, it is characterised in that described sound wave watermark ID is the audio file that high-frequency sound wave changes into hexadecimal sequence form.

5. the resolver based on sound wave watermark, it is characterised in that the described resolver based on sound wave watermark includes:

Sound wave watermark processing module, for inserting a sound wave watermark ID in audio-video document；

Sound wave watermark parsing module, for receiving the audio-video document of broadcasting, parses sound wave watermark ID.

6. the resolver based on sound wave watermark according to claim 5, it is characterised in that described sound wave watermark parsing module farther includes:

Receive submodule, be used for listening to audio-video document；

Segregant module, for being separated by the sound wave watermark ID of the hexadecimal sequence number in audio-video document, and is uploaded to sound wave watermark analyzing platform；

Sound wave watermark analyzing platform, for being decrypted the sound wave watermark ID of hexadecimal sequence number.

7. the resolver based on sound wave watermark according to claim 6, it is characterised in that described sound wave watermark parsing module farther includes:

Preset table submodule, for storing the ID instruction corresponding with sound wave watermark ID, and is analyzed this ID instruction, distributes, reach implementation sub-module；

Implementation sub-module, for according to this ID instruction, performing corresponding order.