CN104754364A - Video advertisement voice interaction system and method - Google Patents

Video advertisement voice interaction system and method Download PDF

Info

Publication number
CN104754364A
CN104754364A CN201510145559.8A CN201510145559A CN104754364A CN 104754364 A CN104754364 A CN 104754364A CN 201510145559 A CN201510145559 A CN 201510145559A CN 104754364 A CN104754364 A CN 104754364A
Authority
CN
China
Prior art keywords
video
voice
video playback
playback client
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510145559.8A
Other languages
Chinese (zh)
Inventor
张云锋
蒋子俊
周盛
姚键
张大伟
曹磊
唐端荣
潘柏宇
卢述奇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unification Infotech (beijing) Co Ltd
Original Assignee
Unification Infotech (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unification Infotech (beijing) Co Ltd filed Critical Unification Infotech (beijing) Co Ltd
Priority to CN201510145559.8A priority Critical patent/CN104754364A/en
Publication of CN104754364A publication Critical patent/CN104754364A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2668Creating a channel for a dedicated end-user group, e.g. insertion of targeted commercials based on end-user profiles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a video advertisement voice interaction system and method, belongs to the technical field of internet video advertisements and aims to solve the problem of the prior art that the user is required to register and pay in case of skipping the video advertisements and suffered with losses once just simply skips the video advertisements. The video advertisement voice interaction system comprises a video play client, an advertisement play server and a voice recognition server. The video advertisement voice interaction method includes that as the video play client plays the video advertisements, the user opens a voice monitoring switch to input voice, the voice monitoring module collects voice information and sends voice data extracted to the voice recognition server, the voice recognition server returns a voice data recognition result back the video play client, and the video play client calls a related connector of a player to trigger a related event.

Description

Video ads voice interactive system and method
Technical field
The present invention is specifically related to a kind of video ads voice interactive system and method, belongs to internet video technical field of advertisement.
Background technology
Video ads has become advertisement form main in the Internet at present, increasing video ads brings very large worry to user, for this reason, number of site starts to provide for some premium customers the video ads can selecting to play, and user can select to skip some advertisement, but this needs user to register and pays, most of user can not select to register and the form of paying, and simply skip video ads, advertiser must be made to incur loss, lose the chance of publicity product.
Summary of the invention
Therefore, to the present invention is directed in prior art user to skip video ads and select to need register and pay, most of user can not select to register and the form of paying, and simply skip video ads, advertiser must be made to incur loss, lose the problem of the chance of publicity product, a kind of video ads voice interactive system is provided, comprise video playback client, advertisement releasing server, speech recognition server, advertisement releasing server is used for providing video ads code to video playback client according to the video ads request of video playback client, it is characterized in that, described video playback client comprises audio monitoring switch, audio monitoring module, audio monitoring module is for collecting voice messaging, extract speech data and send to speech recognition server, speech recognition server is for identifying speech data and resulting text being returned to video playback client.
Described speech recognition server comprises sound identification module, described sound identification module comprises acoustic model, dictionary file, language model, acoustic model obtains after carrying out feature extraction and acoustic training model to sound bank, language model obtains after carrying out language model training according to the text provided in text library, deposits the mapping relations table of word and phoneme in dictionary file.
Described video playback client is mobile phone, panel computer, notebook computer or desktop computer.
The video ads voice interactive method realized by said system, it is characterized in that, described method is: video playback client sends ad-request to advertisement releasing server, advertisement releasing server provides ad code to video playback client, video playback client terminal playing video ads, when audio monitoring on off state is opening, if user carries out phonetic entry, audio monitoring module can collect voice messaging, and speech data is sent to speech recognition server, the resulting text of speech data identification is returned to video playback client by speech recognition server, whether specified command is comprised in video playback client judged result text, if had, the relevant interface triggering dependent event of player is then called with these orders.
Specified command comprises built-in command and in non-built order.
After each trigger event occurs, video playback client carries out log recording by the log recording interface calling advertisement releasing server and provide.
Beneficial effect of the present invention is: adopt video ads voice interactive system of the present invention and method, by interactive voice technology, achieve the interactive voice of user and system, both met client not need to register the demand that paying gets final product skip advertisements, again can by the restriction of voice interactive system, as client needs to say the modes such as advertised product title, the product of advertiser is made to obtain the effect of publicity surpassed the expectation.User can also realize other functions such as replay, time-out by interactive voice.
Accompanying drawing explanation
Fig. 1 is the structural representation of video ads voice interactive system of the present invention;
Fig. 2 is the Play Control flow chart of video playback client;
Fig. 3 is speech-recognition services realization flow figure.
Reference numeral is as follows:
1, video playback client;
2, advertisement releasing server;
3, speech recognition server.
Embodiment
Below in conjunction with accompanying drawing, the specific embodiment of the present invention is described:
As shown in Figure 1, video ads voice interactive system, comprise video playback client 1, advertisement releasing server 2, speech recognition server 3, advertisement releasing server 2 is for providing video ads code to video playback client 1 according to the video ads request of video playback client 1, video playback client 1 comprises audio monitoring switch, audio monitoring module, audio monitoring switch is for opening and closing audio monitoring module, audio monitoring module is for collecting voice messaging, extract speech data and send to speech recognition server, speech recognition server 3 is for identifying speech data and resulting text being returned to video playback client 1.The Play Control flow process of video playback client 1 as shown in Figure 2.
Speech recognition server 3 comprises sound identification module, sound identification module comprises acoustic model, dictionary file, language model, acoustic model obtains after carrying out feature extraction and acoustic training model to sound bank, language model obtains after carrying out language model training according to the text provided in text library, deposits the mapping relations table of word and phoneme in dictionary file.Speech-recognition services realization flow as shown in Figure 3.
Video playback client 1 is mobile phone, panel computer, notebook computer or desktop computer.Be applicable to various platform.
The video ads voice interactive method realized by said system, video playback client 1 sends ad-request to advertisement releasing server 2, advertisement releasing server 2 provides ad code to video playback client 1, ad code is the character string of XML or the JSON form generated according to the good advertisement interaction protocol of predefined, the inside contains variously plays relevant information to advertisement, as: the URL of ad material, the exposure of advertisement and click-through count and the URL finished playing, the exposure and click monitoring URL etc. of advertisement, client meeting analyzing XML or JSON string, then the triggering of advertisement broadcasting and dependent event is carried out.Each attribute having the advertisement of interactive voice effect demand can have " skip advertisements keyword " by name, generally can get the brand name of this advertisement as keyword, the record of newly-increased interactive voice effect daily record, for counting user some interactive information to the advertisement of playing, advertiser's reference can be supplied to.Concrete grammar is a newly-increased node " skipword " under the ad node that each advertisement is corresponding, its value is the keyword of skip advertisements, in addition after skipword node, a node " recurl " is increased newly again, its value is the log interface URL of recording user interbehavior, the parameter comprised in this URL can be recorded in daily record, wherein there is an actid parameter, value be one grand: " ##ACTIONID## ", corresponding value can be replaced to again request corresponding for this URL is sent when actual sending request according to the request of the actual triggering of user.Video playback client 1 playing video advertisement, when audio monitoring on off state is opening, if user carries out phonetic entry, audio monitoring module can collect voice messaging, and speech data is sent to speech recognition server 3, the resulting text of speech data identification is returned to video playback client 1 by speech recognition server 3, whether specified command is comprised in video playback client 1 judged result text, if had, then call the relevant interface triggering dependent event of player with these orders.
Specified command comprises built-in command and in non-built order.Such as
" replay ": built-in command, replays Current ad;
" time-out ": built-in command, suspends and plays Current ad;
" Great Wall ": in non-built order, for this order, keyword (Skipword) is skipped in the advertisement of having said Current ad as user, i.e. the brand name of Current ad, so skip Current ad.
After each trigger event occurs, video playback client 1 carries out log recording by the log recording interface calling advertisement releasing server 2 and provide.
JSON fragment as follows, returning results of the ad placement services end obtained when being certain ad-request of a client transmission is two marketing advertisements of two brands in Great Wall and the Changjiang river respectively.Wherein " ads " is an array, the inside houses multiple " ad " child node, the corresponding advertisement of each " ad " child node, " skipword " child node is had again in each " ad " child node, as user's opening voice listening key and when sending the sound on " Great Wall ", id be 123 advertisement will stop play, leap to next id be 124 advertisement play.
Client is after collecting voice messaging, recurl node below can be checked, if this node exists, then take out its URL, then " ##ACTIONID## " in URL is replaced with the actual event triggered by speech recognition character string out numbering (number format as: 1: to replay, 2: to suspend, 3: skip), then this URL is accessed, this URL corresponds to a log collection service of ad placement services end, can relevant parameter be resolved after this service reception request, and complete the record of daily record.Main JSON code is as follows:
The above is the preferred embodiment of the present invention; it should be pointed out that for those skilled in the art, under the prerequisite not departing from principle of the present invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (6)

1. a video ads voice interactive system, comprise video playback client, advertisement releasing server, speech recognition server, advertisement releasing server is used for providing video ads code to video playback client according to the video ads request of video playback client, it is characterized in that, described video playback client comprises audio monitoring switch, audio monitoring module, audio monitoring module is for collecting voice messaging, extract speech data and send to speech recognition server, speech recognition server is for identifying voice and recognition result text being returned to video playback client.
2. video ads voice interactive system as claimed in claim 1, it is characterized in that, described speech recognition server comprises sound identification module, described sound identification module comprises acoustic model, dictionary file, language model, acoustic model obtains after carrying out feature extraction and acoustic training model to sound bank, language model obtains after carrying out language model training according to the text provided in text library, deposits the mapping relations table of word and phoneme in dictionary file.
3. video ads voice interactive system as claimed in claim 1, it is characterized in that, described video playback client is mobile phone, panel computer, notebook computer or desktop computer.
4. the video ads voice interactive method that the system according to any one of claims 1 to 3 realizes, it is characterized in that, described method is: video playback client sends ad-request to advertisement releasing server, advertisement releasing server provides ad code to video playback client, video playback client terminal playing video ads, when audio monitoring on off state is opening, if user carries out phonetic entry, audio monitoring module can collect voice messaging, and speech data is sent to speech recognition server, the resulting text of speech data identification is returned to video playback client by speech recognition server, whether specified command is comprised in video playback client judged result text, if had, the relevant interface triggering dependent event of player is then called with these orders.
5. video ads voice interactive method as claimed in claim 4, it is characterized in that, described specified command comprises built-in command and in non-built order.
6. video ads voice interactive method as claimed in claim 4, is characterized in that, after each trigger event occurs, video playback client carries out log recording by the log recording interface calling advertisement releasing server and provide.
CN201510145559.8A 2015-03-30 2015-03-30 Video advertisement voice interaction system and method Pending CN104754364A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510145559.8A CN104754364A (en) 2015-03-30 2015-03-30 Video advertisement voice interaction system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510145559.8A CN104754364A (en) 2015-03-30 2015-03-30 Video advertisement voice interaction system and method

Publications (1)

Publication Number Publication Date
CN104754364A true CN104754364A (en) 2015-07-01

Family

ID=53593370

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510145559.8A Pending CN104754364A (en) 2015-03-30 2015-03-30 Video advertisement voice interaction system and method

Country Status (1)

Country Link
CN (1) CN104754364A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105824877A (en) * 2016-03-08 2016-08-03 乐视网信息技术(北京)股份有限公司 Voice-based information searching method and device
CN106792050A (en) * 2016-12-21 2017-05-31 深圳Tcl数字技术有限公司 The method for closing and system of video ads
CN107492380A (en) * 2017-08-15 2017-12-19 上海精数信息科技有限公司 Man-machine recognition methods based on audio and apply its advertisement placement method and system
CN107657471A (en) * 2016-09-22 2018-02-02 腾讯科技(北京)有限公司 A kind of methods of exhibiting of virtual resource, client and plug-in unit
CN108039175A (en) * 2018-01-29 2018-05-15 北京百度网讯科技有限公司 Audio recognition method, device and server
CN109240640A (en) * 2018-08-30 2019-01-18 百度在线网络技术(北京)有限公司 Advertising pronunciation exchange method, device and storage medium
CN109583430A (en) * 2018-12-28 2019-04-05 广州励丰文化科技股份有限公司 A kind of control method and device showing device
CN111339361A (en) * 2020-02-27 2020-06-26 深圳市元征科技股份有限公司 Vehicle diagnostic record processing method, device and equipment and readable storage medium
CN113283947A (en) * 2021-06-18 2021-08-20 北京奇艺世纪科技有限公司 Multimedia file playing and control method, related device and readable storage medium
CN113763046A (en) * 2021-09-07 2021-12-07 四川易海天科技有限公司 Mobile internet vehicle-mounted intelligent delivery system based on big data analysis
CN115412768A (en) * 2022-11-02 2022-11-29 深圳市人马互动科技有限公司 Information recommendation method based on voice interaction system and related device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101034455A (en) * 2006-03-06 2007-09-12 腾讯科技(深圳)有限公司 Method and system for implementing online advertisement
CN103810998A (en) * 2013-12-05 2014-05-21 中国农业大学 Method for off-line speech recognition based on mobile terminal device and achieving method
CN104216990A (en) * 2014-09-09 2014-12-17 科大讯飞股份有限公司 Method and system for playing video advertisement

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101034455A (en) * 2006-03-06 2007-09-12 腾讯科技(深圳)有限公司 Method and system for implementing online advertisement
CN103810998A (en) * 2013-12-05 2014-05-21 中国农业大学 Method for off-line speech recognition based on mobile terminal device and achieving method
CN104216990A (en) * 2014-09-09 2014-12-17 科大讯飞股份有限公司 Method and system for playing video advertisement

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105824877A (en) * 2016-03-08 2016-08-03 乐视网信息技术(北京)股份有限公司 Voice-based information searching method and device
US10950224B2 (en) 2016-09-22 2021-03-16 Tencent Technology (Shenzhen) Company Limited Method for presenting virtual resource, client, and plug-in
CN107657471A (en) * 2016-09-22 2018-02-02 腾讯科技(北京)有限公司 A kind of methods of exhibiting of virtual resource, client and plug-in unit
CN106792050A (en) * 2016-12-21 2017-05-31 深圳Tcl数字技术有限公司 The method for closing and system of video ads
CN107492380A (en) * 2017-08-15 2017-12-19 上海精数信息科技有限公司 Man-machine recognition methods based on audio and apply its advertisement placement method and system
CN108039175A (en) * 2018-01-29 2018-05-15 北京百度网讯科技有限公司 Audio recognition method, device and server
US11398228B2 (en) 2018-01-29 2022-07-26 Beijing Baidu Netcom Science And Technology Co., Ltd. Voice recognition method, device and server
CN109240640A (en) * 2018-08-30 2019-01-18 百度在线网络技术(北京)有限公司 Advertising pronunciation exchange method, device and storage medium
CN109583430A (en) * 2018-12-28 2019-04-05 广州励丰文化科技股份有限公司 A kind of control method and device showing device
CN111339361A (en) * 2020-02-27 2020-06-26 深圳市元征科技股份有限公司 Vehicle diagnostic record processing method, device and equipment and readable storage medium
CN113283947A (en) * 2021-06-18 2021-08-20 北京奇艺世纪科技有限公司 Multimedia file playing and control method, related device and readable storage medium
CN113283947B (en) * 2021-06-18 2024-05-03 北京奇艺世纪科技有限公司 Multimedia file playing and controlling method, related device and readable storage medium
CN113763046A (en) * 2021-09-07 2021-12-07 四川易海天科技有限公司 Mobile internet vehicle-mounted intelligent delivery system based on big data analysis
CN115412768A (en) * 2022-11-02 2022-11-29 深圳市人马互动科技有限公司 Information recommendation method based on voice interaction system and related device
CN115412768B (en) * 2022-11-02 2023-03-14 深圳市人马互动科技有限公司 Information recommendation method based on voice interaction system and related device

Similar Documents

Publication Publication Date Title
CN104754364A (en) Video advertisement voice interaction system and method
US11023931B2 (en) System and method for targeted advertising
CN101996234B (en) Word cloud audio navigation
CN105427121B (en) The system and method for natural language processing selection presentation of advertisements based on phonetic entry
CN107659847B (en) Voice interface method and apparatus
CN110275982B (en) Query response using media consumption history
US8306810B2 (en) Systems and methods to enable interactivity among a plurality of devices
TWI711967B (en) Method, device and equipment for determining broadcast voice
CN109215643B (en) Interaction method, electronic equipment and server
CN100389588C (en) System and method for using voice over a telephone to access, process, and carry out transactions over the internet
US8687776B1 (en) System and method to analyze human voice conversations
CN104038473B (en) For intercutting the method, apparatus of audio advertisement, equipment and system
CN109979450B (en) Information processing method and device and electronic equipment
CN109474562B (en) Display method and device of identifier, and response method and device of request
JP5404728B2 (en) System and method for providing advertisement information by sound recognition
US20120109759A1 (en) Speech recognition system platform
CN108012173A (en) A kind of content identification method, device, equipment and computer-readable storage medium
CN102216945A (en) Networking with media fingerprints
CN107659831A (en) Media data processing method, client and storage medium
JP2014513828A (en) Automatic conversation support
JP2012037797A (en) Dialogue learning device, summarization device, dialogue learning method, summarization method, program
CN104581396A (en) Processing method and device for promotion information
US20080109305A1 (en) Using internet advertising as a test bed for radio advertisements
Siemund et al. SPEECON-Speech Data for Consumer Devices.
CN112839237A (en) Video and audio processing method, computer equipment and medium in network live broadcast

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150701