CN105791973A - Resolving method and resolving device based on sound wave watermark

Publication number
CN105791973A
Authority
CN
China
Prior art keywords
sound wave
watermark
audio
module
video document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610130157.5A
Other languages
Chinese (zh)
Inventor
李万欣
王智鹏
遆宁
林荣越
曹晨
王娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian Leyun Information Technology Co Ltd
Original Assignee
Dalian Leyun Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-03-07
Filing date
2016-03-07
Publication date
2016-07-20
Application filed by Dalian Leyun Information Technology Co Ltd
Priority to CN201610130157.5A
Publication of CN105791973A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/414 Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41422 Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance located in transportation means, e.g. personal vehicle
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/475 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data
    • H04N21/4758 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data for providing answers, e.g. voting
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/478 Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/47815 Electronic shopping
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/835 Generation of protective data, e.g. certificates
    • H04N21/8358 Generation of protective data, e.g. certificates involving watermark

Abstract

The invention discloses a resolving method based on a sound wave watermark. The resolving method comprises the following steps: step S100, a sound wave watermark processing module inserts a sound wave watermark ID into an audio/video file; and step S200, a sound wave watermark parsing module receives the played audio/video file and resolves the sound wave watermark ID. The resolving method has the advantages of a high recognition rate, strong interference resistance and real-time detection. One or more program segments can be identified per second. Furthermore, the system structure is simple and the range of application is wide.

Description

Resolving method and resolving device based on sound wave watermark
Technical field
The present invention relates to the field of sound recognition technology, and in particular to a resolving method and a resolving device based on a sound wave watermark.
Background art
At present, sound recognition in scene-based applications is performed in two ways:
First: recognition by waveform matching. In this method, a large number of waveform files are stored on a data server. When reception is started, the resolving device records a segment of audio, converts it into a waveform file and uploads it to the data server, where it is compared with the stored waveform files to perform sound recognition before the next command operation is carried out. An example is the Shake function of WeChat.
This approach has the following problems: in a noisy environment the recognition is poor and the user experience suffers; and the waveform files must be placed on the data server in advance.
Second: recognition by near-field sound playback. In this process, a playing device plays a specific sound; after a receiving device receives it, the specific sound is parsed and recognized, and the next command operation is then carried out.
This approach has the following problem: it requires a large-scale data back end for interception and analysis.
Summary of the invention
To overcome the problems of the two existing sound recognition technologies described above, the present invention provides a resolving method and a resolving device based on a sound wave watermark.
The resolving method based on a sound wave watermark provided by the invention comprises the following steps: step S100: a sound wave watermark processing module inserts a sound wave watermark ID into an audio/video file; step S200: a sound wave watermark parsing module receives the played audio/video file and resolves the sound wave watermark ID.
Step S200 includes the following sub-steps: step S210: a receiving submodule listens to the audio/video file; step S220: a separation submodule separates the sound wave watermark ID from the audio/video file and uploads it to a sound wave watermark analysis platform; step S230: the sound wave watermark analysis platform decrypts the sound wave watermark ID.
Step S200 further includes the following sub-steps: step S240: a preset table submodule stores the ID instruction corresponding to the sound wave watermark ID, analyzes and distributes this ID instruction, and passes it to an execution submodule; step S250: the execution submodule executes the corresponding command according to the ID instruction.
The sound wave watermark ID is an audio file in which a high-frequency sound wave has been converted into the form of a hexadecimal sequence.
The present invention further provides a resolving device based on a sound wave watermark, comprising: a sound wave watermark processing module, for inserting a sound wave watermark ID into an audio/video file; and a sound wave watermark parsing module, for receiving the played audio/video file and resolving the sound wave watermark ID.
The sound wave watermark parsing module further comprises: a receiving submodule, for listening to the audio/video file; a separation submodule, for separating the sound wave watermark ID, in the form of a hexadecimal sequence number, from the audio/video file and uploading it to a sound wave watermark analysis platform; and the sound wave watermark analysis platform, for decrypting the hexadecimal sequence number of the sound wave watermark ID.
The sound wave watermark parsing module further comprises: a preset table submodule, for storing the ID instruction corresponding to the sound wave watermark ID, analyzing and distributing this ID instruction, and passing it to an execution submodule; and the execution submodule, for executing the corresponding command according to the ID instruction.
The beneficial effects of the invention are:
(1) High recognition rate: one core of applying sound waves in new media and intelligent terminals is the encoding of the sound wave and its recognition and parsing, for which a recognition rate of 100% must be ensured. The sound wave frequency band is relatively clean and free of interference, which is the basic guarantee for improving the recognition rate; the recognition rate can also be greatly improved by optimizing the coding rules.
(2) Precision marketing: sound waves also support precision marketing. A region or a fixed-point push range of users can be selected, which makes information pushes more accurate.
(3) Anti-interference: the sound wave interaction technology performs lossless detection, gathers information more accurately, has secure encryption and decryption characteristics, and resists interference. This is also a major difference from WeChat Shake: as soon as there is background noise, WeChat Shake fails to find a match. Sound waves avoid this problem.
(4) Real-time detection: sound waves can be detected in real time. As long as the video or sound played by the playback device contains a sound wave watermark, the terminal receives it and then resolves it.
(5) One or more program segments can be identified every second or every several seconds.
(6) The method is applicable to live broadcast, video-on-demand and carousel applications; it can also be used with loudspeakers, large screens and broadcasts in venues such as squares and stations.
(7) Simple system structure: the front-end system only needs an added sound wave watermark insertion stage, and the terminal system only needs an added app. Everything else uses existing systems and equipment.
Brief description of the drawings
Fig. 1 is a flow chart of the resolving method based on a sound wave watermark.
Detailed description of the embodiments
To make the technical problem solved by the invention, the technical solution adopted and the technical effect achieved clearer, the invention is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the invention and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts related to the invention rather than the entire content.
Referring to Fig. 1, the resolving method based on a sound wave watermark comprises the following steps: step S100: the sound wave watermark processing module (encoding software) inserts a sound wave watermark ID (a hexadecimal sequence number) into an audio/video file (in any of various formats such as MP3, MP4, TS or FLV); step S200: the sound wave watermark parsing module receives the played audio/video file and resolves the sound wave watermark ID.
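The text does not say how step S100 places the watermark signal into the audio track. One simple possibility, shown here only as a sketch under that assumption, is additive mixing: the watermark samples are scaled by a small gain and added to the program samples. The function name mixWatermark, the gain parameter and the [-1, 1] floating-point sample format are all hypothetical and do not come from the source.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: adds a watermark signal to program audio.
// Both inputs are mono samples in the range [-1, 1].
// The watermark is repeated cyclically over the whole program length.
std::vector<float> mixWatermark(const std::vector<float>& program,
                                const std::vector<float>& watermark,
                                float gain) {
    std::vector<float> out(program);
    if (watermark.empty()) return out;  // nothing to insert
    for (std::size_t i = 0; i < out.size(); ++i) {
        const float mixed = out[i] + gain * watermark[i % watermark.size()];
        // Keep the result inside the valid sample range (C++17 std::clamp).
        out[i] = std::clamp(mixed, -1.0f, 1.0f);
    }
    return out;
}
```

A low gain keeps the added signal quiet relative to the program; a production encoder would also have to handle compressed formats such as MP3 or MP4, which this sketch ignores.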
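The source code of the resolving process is as follows:
// Requires Apple's AudioToolbox framework. AQRecorder, XThrowIfError and
// CAXException are declared elsewhere in the project.
#include <AudioToolbox/AudioToolbox.h>
#include <cstdio>

// Audio queue input callback: called each time a recording buffer has been filled.
void AQRecorder::MyInputBufferHandler(void *inUserData,
                                      AudioQueueRef inAQ,
                                      AudioQueueBufferRef inBuffer,
                                      const AudioTimeStamp *inStartTime,
                                      UInt32 inNumPackets,
                                      const AudioStreamPacketDescription *inPacketDesc)
{
    AQRecorder *aqr = (AQRecorder *)inUserData;
    try {
        if (inNumPackets > 0) {
            // write packets to file
            XThrowIfError(AudioFileWritePackets(aqr->mRecordFile, FALSE, inBuffer->mAudioDataByteSize,
                                                inPacketDesc, aqr->mRecordPacket, &inNumPackets, inBuffer->mAudioData),
                          "AudioFileWritePackets failed");
            aqr->mRecordPacket += inNumPackets;
        }
        // The source lists the following call at this point. Its format string "%16" has no
        // conversion specifier, so the call is kept verbatim but disabled:
        // sscanf("abc", "%16", inBuffer->mAudioData);
        // if the recorder is still running, hand the buffer back to the queue to be refilled
        if (aqr->IsRunning())
            XThrowIfError(AudioQueueEnqueueBuffer(inAQ, inBuffer, 0, NULL), "AudioQueueEnqueueBuffer failed");
    } catch (CAXException &e) {
        char buf[256];
        fprintf(stderr, "Error: %s (%s)\n", e.mOperation, e.FormatError(buf));
    }
}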
Step S200 includes the following sub-steps:
Step S210: the receiving submodule (the microphone of a mobile phone) listens to the audio/video file;
Step S220: the separation submodule (which calls an underlying method; see the code) separates the sound wave watermark ID from the audio/video file and uploads it to the sound wave watermark analysis platform (a server);
Step S230: the sound wave watermark analysis platform decrypts the sound wave watermark ID (it converts the received hexadecimal sequence number into a decimal sequence number, using the DES encryption standard, and matches this sequence number against the preset table to find the corresponding execution instruction);
Step S240: the preset table submodule stores the ID instruction corresponding to the sound wave watermark ID, for example: jump to Baidu, or calculate data "sums" and "percentages"; it determines what type of instruction it is (advertisement, news, TV drama, etc.), analyzes and distributes the ID instruction, and passes it to the execution submodule;
Step S250: the execution submodule executes the corresponding command according to the ID instruction (displayed on the mobile phone terminal).
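The conversion and lookup described for steps S230 and S240 can be illustrated with a minimal C++17 sketch. It converts a hexadecimal sequence number to its decimal value and looks that value up in a preset table. The DES decryption mentioned in the text is deliberately left out, because the source gives no key or mode. The names InstructionType, Instruction, PresetTable, hexToDecimal and resolveInstruction, as well as the example URL, are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <optional>
#include <string>

// Hypothetical instruction categories, taken from the examples in step S240.
enum class InstructionType { Advertisement, News, TvDrama };

// Hypothetical preset-table entry: what the terminal should do for one ID.
struct Instruction {
    InstructionType type;
    std::string action;  // e.g. a URL to open; the contents are illustrative only
};

// Convert a hexadecimal sequence number such as "1A2B" to its decimal value.
// Returns std::nullopt for empty input, non-hex characters, or more than
// 16 digits (which would not fit in 64 bits).
std::optional<std::uint64_t> hexToDecimal(const std::string& hex) {
    if (hex.empty() || hex.size() > 16) return std::nullopt;
    std::uint64_t value = 0;
    for (char c : hex) {
        int digit;
        if (c >= '0' && c <= '9') digit = c - '0';
        else if (c >= 'a' && c <= 'f') digit = c - 'a' + 10;
        else if (c >= 'A' && c <= 'F') digit = c - 'A' + 10;
        else return std::nullopt;
        value = value * 16 + static_cast<std::uint64_t>(digit);
    }
    return value;
}

using PresetTable = std::map<std::uint64_t, Instruction>;

// Match a received hexadecimal ID against the preset table.
// Returns std::nullopt if the ID is malformed or has no entry.
std::optional<Instruction> resolveInstruction(const PresetTable& table,
                                              const std::string& hexId) {
    const std::optional<std::uint64_t> decimal = hexToDecimal(hexId);
    if (!decimal) return std::nullopt;
    const auto it = table.find(*decimal);
    if (it == table.end()) return std::nullopt;
    return it->second;
}
```

Returning an empty optional for unknown or malformed IDs lets the caller ignore a misheard watermark instead of executing a wrong instruction.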
The sound wave watermark ID is an audio file in which a high-frequency sound wave has been converted into the form of a hexadecimal sequence.
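The source does not describe how hexadecimal digits are mapped to a high-frequency sound wave. As one illustration only, the sketch below assigns each of the 16 hex digits its own tone and recovers the digits with the Goertzel algorithm, a standard single-frequency detector. The sample rate, base frequency, tone spacing, symbol length and amplitude are all assumptions chosen so that each tone falls on an exact analysis bin; none of them comes from the patent. The decoder also assumes it is already aligned to symbol boundaries.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <string>
#include <vector>

// All constants are assumptions for illustration; the source specifies none of them.
constexpr double kPi = 3.14159265358979323846;
constexpr double kSampleRate = 48000.0;   // samples per second
constexpr double kBaseFreq = 18000.0;     // tone for hex digit 0, in Hz
constexpr double kFreqStep = 250.0;       // spacing between adjacent digit tones, in Hz
constexpr std::size_t kSymbolLen = 1920;  // samples per digit (40 ms at 48 kHz)

double digitFrequency(int digit) { return kBaseFreq + kFreqStep * digit; }

// Value of one hex character, or -1 if it is not a hex digit.
int hexValue(char c) {
    if (c >= '0' && c <= '9') return c - '0';
    if (c >= 'a' && c <= 'f') return c - 'a' + 10;
    if (c >= 'A' && c <= 'F') return c - 'A' + 10;
    return -1;
}

// Encode: one sine burst per hex digit. Returns an empty vector on invalid input.
std::vector<double> encodeHexId(const std::string& hexId) {
    std::vector<double> samples;
    samples.reserve(hexId.size() * kSymbolLen);
    for (char c : hexId) {
        const int digit = hexValue(c);
        if (digit < 0) return {};
        const double w = 2.0 * kPi * digitFrequency(digit) / kSampleRate;
        for (std::size_t n = 0; n < kSymbolLen; ++n)
            samples.push_back(0.1 * std::sin(w * static_cast<double>(n)));
    }
    return samples;
}

// Goertzel algorithm: signal power at one target frequency over a block.
double goertzelPower(const double* x, std::size_t len, double freq) {
    const double coeff = 2.0 * std::cos(2.0 * kPi * freq / kSampleRate);
    double s1 = 0.0, s2 = 0.0;
    for (std::size_t n = 0; n < len; ++n) {
        const double s0 = x[n] + coeff * s1 - s2;
        s2 = s1;
        s1 = s0;
    }
    return s1 * s1 + s2 * s2 - coeff * s1 * s2;
}

// Decode: in each symbol-length block, pick the strongest of the 16 candidate tones.
std::string decodeHexId(const std::vector<double>& samples) {
    static const char kDigits[] = "0123456789ABCDEF";
    std::string hexId;
    for (std::size_t start = 0; start + kSymbolLen <= samples.size(); start += kSymbolLen) {
        int best = 0;
        double bestPower = -1.0;
        for (int d = 0; d < 16; ++d) {
            const double p = goertzelPower(&samples[start], kSymbolLen, digitFrequency(d));
            if (p > bestPower) { bestPower = p; best = d; }
        }
        hexId.push_back(kDigits[best]);
    }
    return hexId;
}
```

Encoding an ID and decoding the resulting samples returns the same digits in upper case. A real receiver would additionally need symbol synchronization, a threshold for rejecting blocks that contain no tone, and some form of error detection, all of which are outside this sketch.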
Embodiment 1: Mobile media on public transport
During a bus ride, the mobile TV plays content such as news, TV dramas, variety programs and advertisements. With the spread of smartphones, however, people's attention has shifted from the mobile TV to their mobile terminals, and mobile TV advertising revenue falls as a result. The sound resolving method of the invention can draw passengers' attention back to the mobile TV.
For example, the mobile TV plays various audio and video content across its programs, and sound wave watermark IDs are added to these programs. When a user opens the mobile phone client while it receives the audio played on the mobile TV, the sound wave watermark ID in the program is resolved and sent over the network to the sound wave watermark analysis platform. The platform decrypts it, and the resulting instruction is returned to the mobile phone client, which executes it. For example, when news is playing, the open mobile phone client pops up related events and the latest information about the news item for the user to browse; when a program is playing, content appearing in it, such as clothing, cars, houses, food and cosmetics, can be followed through the mobile phone client to a link, or even purchased directly.
Embodiment 2: Interaction with a home shopping channel
On the home shopping channels of cable television, users currently place orders by telephone or by scanning a QR code. A sound wave watermark ID can instead be added to shopping programs. When the user opens the mobile app while it receives the audio played by the home shopping channel, the sound wave watermark ID in the program is resolved and sent over the network to the sound wave watermark analysis platform. The platform decrypts it, and the resulting instruction is returned to the mobile phone client, which executes it; for example, the client jumps directly to the payment page to place an order. The method can serve as an interactive tool for all television channels: it can be used to explore the plot of a TV drama in depth, or to push the payment page for products placed in a TV drama directly to the user's mobile phone.
Embodiment 3: Voting in televised contests
While watching a large-scale interactive voting program at home, the user opens the mobile phone client, which resolves the sound wave watermark ID in the broadcast audio/video. After decryption by the sound wave watermark analysis platform, the corresponding instruction is executed: the client jumps to the voting page for the program, which contains an introduction to the program, voting buttons and so on. This widens the reach of the vote and removes the tedious steps of many current voting programs (composing and sending a text message to vote, etc.). Each mobile phone number can also be limited to a single vote, which prevents vote rigging.
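The one-vote-per-phone-number rule of Embodiment 3 can be sketched as a set of numbers that have already voted. This is an illustrative assumption about the server-side check, not a mechanism described in the source; the class name VoteRegistry and its methods are hypothetical, and verifying that a caller really owns the phone number is out of scope.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_set>

// Hypothetical vote registry: accepts at most one vote per phone number.
class VoteRegistry {
public:
    // Returns true if the vote was accepted, false if this number has already voted.
    bool castVote(const std::string& phoneNumber) {
        // insert() reports in .second whether the element was newly added.
        return voted_.insert(phoneNumber).second;
    }

    // Number of accepted votes so far.
    std::size_t totalVotes() const { return voted_.size(); }

private:
    std::unordered_set<std::string> voted_;
};
```

A second call to castVote with the same number is rejected and leaves the total unchanged.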
Embodiment 4: TV interaction and prize draws
Many satellite TV channels now display a "Shake" prompt at the bottom of the screen for interactive prize draws while playing TV dramas, advertisements and the like. The sound recognition method of the invention can be used instead: no shaking is needed, and as soon as the sound wave watermark instruction is received it is executed and the client jumps to the corresponding interactive prize-draw page.
Other applications
The method can be applied in many other places; any place with sound, such as shopping districts, nightclubs and tourist sites, can use it.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solution of the invention, not to limit it. Although the invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art will understand that modifying the technical solutions described in the foregoing embodiments, or equivalently replacing some or all of their technical features, does not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the invention.

Claims (7)

1. A resolving method based on a sound wave watermark, characterized in that it comprises the following steps:
Step S100: a sound wave watermark processing module inserts a sound wave watermark ID into an audio/video file;
Step S200: a sound wave watermark parsing module receives the played audio/video file and resolves the sound wave watermark ID.
2. The resolving method based on a sound wave watermark according to claim 1, characterized in that step S200 includes the following sub-steps:
Step S210: a receiving submodule listens to the audio/video file;
Step S220: a separation submodule separates the sound wave watermark ID from the audio/video file and uploads it to a sound wave watermark analysis platform;
Step S230: the sound wave watermark analysis platform decrypts the sound wave watermark ID.
3. The resolving method based on a sound wave watermark according to claim 2, characterized in that step S200 further includes the following sub-steps:
Step S240: a preset table submodule stores the ID instruction corresponding to the sound wave watermark ID, analyzes and distributes this ID instruction, and passes it to an execution submodule;
Step S250: the execution submodule executes the corresponding command according to the ID instruction.
4. The resolving method based on a sound wave watermark according to claim 1, 2 or 3, characterized in that the sound wave watermark ID is an audio file in which a high-frequency sound wave has been converted into the form of a hexadecimal sequence.
5. A resolving device based on a sound wave watermark, characterized in that the resolving device comprises:
a sound wave watermark processing module, for inserting a sound wave watermark ID into an audio/video file;
a sound wave watermark parsing module, for receiving the played audio/video file and resolving the sound wave watermark ID.
6. The resolving device based on a sound wave watermark according to claim 5, characterized in that the sound wave watermark parsing module further comprises:
a receiving submodule, for listening to the audio/video file;
a separation submodule, for separating the sound wave watermark ID, in the form of a hexadecimal sequence number, from the audio/video file and uploading it to a sound wave watermark analysis platform;
the sound wave watermark analysis platform, for decrypting the hexadecimal sequence number of the sound wave watermark ID.
7. The resolving device based on a sound wave watermark according to claim 6, characterized in that the sound wave watermark parsing module further comprises:
a preset table submodule, for storing the ID instruction corresponding to the sound wave watermark ID, analyzing and distributing this ID instruction, and passing it to an execution submodule;
the execution submodule, for executing the corresponding command according to the ID instruction.
CN201610130157.5A 2016-03-07 2016-03-07 Resolving method and resolving device based on sound wave watermark Pending CN105791973A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610130157.5A CN105791973A (en) 2016-03-07 2016-03-07 Resolving method and resolving device based on sound wave watermark

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610130157.5A CN105791973A (en) 2016-03-07 2016-03-07 Resolving method and resolving device based on sound wave watermark

Publications (1)

Publication Number Publication Date
CN105791973A true CN105791973A (en) 2016-07-20

Family

ID=56388235

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610130157.5A Pending CN105791973A (en) 2016-03-07 2016-03-07 Resolving method and resolving device based on sound wave watermark

Country Status (1)

Country Link
CN (1) CN105791973A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5893067A (en) * 1996-05-31 1999-04-06 Massachusetts Institute Of Technology Method and apparatus for echo data hiding in audio signals
CN101115124A (en) * 2006-07-26 2008-01-30 日电(中国)有限公司 Method and apparatus for identifying media program based on audio watermark
CN103428538A (en) * 2013-08-12 2013-12-04 广州信为信息科技有限公司 Method, device and system for interaction of interactive broadcast televisions
CN103957220A (en) * 2014-05-19 2014-07-30 刘飞 Data transmission method and system based on frequency conversion sound waves
CN104320719A (en) * 2014-11-14 2015-01-28 武汉大学 Television program interaction participating method and system based on audio watermarking

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107610036A (en) * 2017-09-26 2018-01-19 武汉斗鱼网络科技有限公司 A kind of method, apparatus and computer equipment for exporting live mark picture
CN107610036B (en) * 2017-09-26 2021-09-07 武汉斗鱼网络科技有限公司 Method and device for outputting live broadcast identification picture and computer equipment
CN113420242A (en) * 2021-08-24 2021-09-21 阿里巴巴(中国)有限公司 Shopping guide method, resource distribution method, content display method and equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160720