CN104754364A - Video advertisement voice interaction system and method
- Publication number
- CN104754364A (application CN201510145559.8A)
- Authority
- CN
- China
- Prior art keywords
- video
- voice
- video playback
- playback client
- server
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/25—Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
- H04N21/266—Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
- H04N21/2668—Creating a channel for a dedicated end-user group, e.g. insertion of targeted commercials based on end-user profiles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The invention discloses a video advertisement voice interaction system and method in the technical field of internet video advertising. It addresses two problems in the prior art: users must register and pay in order to skip video advertisements, and advertisers suffer losses when advertisements are simply skipped. The system comprises a video playback client, an advertisement delivery server, and a speech recognition server. In the method, while the video playback client plays a video advertisement, the user turns on a voice monitoring switch and speaks; the voice monitoring module collects the voice information, extracts the speech data, and sends it to the speech recognition server; the speech recognition server returns the recognition result to the video playback client; and the video playback client calls the corresponding player interface to trigger the corresponding event.
Description
Technical field
The present invention relates to a video advertisement voice interaction system and method, and belongs to the technical field of internet video advertising.
Background art
Video advertising has become the main form of advertising on the internet, and the growing volume of video advertisements is a considerable annoyance to users. For this reason, some websites have begun to offer premium users selectable advertisement playback, which lets them skip certain advertisements. This requires the user to register and pay, however, and most users are unwilling to do so. Simply letting users skip video advertisements, on the other hand, inevitably causes advertisers to incur losses, because they lose the chance to publicize their products.
Summary of the invention
Accordingly, the present invention addresses the following problems in the prior art: skipping video advertisements requires users to register and pay, which most users are unwilling to do, while simply skipping video advertisements inevitably causes advertisers to incur losses by losing the chance to publicize their products. The invention provides a video advertisement voice interaction system comprising a video playback client, an advertisement delivery server, and a speech recognition server. The advertisement delivery server provides video advertisement code to the video playback client in response to the client's video advertisement request. The system is characterized in that the video playback client comprises a voice monitoring switch and a voice monitoring module; the voice monitoring module collects voice information, extracts speech data, and sends it to the speech recognition server; and the speech recognition server recognizes the speech data and returns the result text to the video playback client.
The speech recognition server comprises a speech recognition module, which comprises an acoustic model, a dictionary file, and a language model. The acoustic model is obtained by performing feature extraction and acoustic model training on a speech corpus; the language model is obtained by language model training on the text provided in a text corpus; and the dictionary file stores a mapping table between words and phonemes.
The video playback client is a mobile phone, tablet computer, notebook computer, or desktop computer.
In the video advertisement voice interaction method implemented by the above system, the video playback client sends an advertisement request to the advertisement delivery server, and the advertisement delivery server provides advertisement code to the video playback client. The video playback client plays the video advertisement. When the voice monitoring switch is on and the user speaks, the voice monitoring module collects the voice information and sends the speech data to the speech recognition server. The speech recognition server returns the result text of the speech recognition to the video playback client. The video playback client determines whether the result text contains a specified command and, if so, calls the corresponding player interface with that command to trigger the corresponding event.
The specified commands comprise built-in commands and non-built-in commands.
After each triggered event occurs, the video playback client records a log entry by calling the logging interface provided by the advertisement delivery server.
The beneficial effects of the present invention are as follows. With the video advertisement voice interaction system and method, voice interaction technology enables the user to interact with the system by voice. This meets the user's need to skip advertisements without registering or paying. At the same time, the conditions imposed by the voice interaction system, such as requiring the user to say the name of the advertised product, give the advertiser's product a publicity effect beyond expectations. The user can also perform other functions, such as replay and pause, by voice.
Brief description of the drawings
Fig. 1 is a structural schematic diagram of the video advertisement voice interaction system of the present invention;
Fig. 2 is the playback control flowchart of the video playback client;
Fig. 3 is the implementation flowchart of the speech recognition service.
The reference numerals are as follows:
1, video playback client;
2, advertisement delivery server;
3, speech recognition server.
Detailed description
Specific embodiments of the present invention are described below with reference to the accompanying drawings:
As shown in Fig. 1, the video advertisement voice interaction system comprises a video playback client 1, an advertisement delivery server 2, and a speech recognition server 3. The advertisement delivery server 2 provides video advertisement code to the video playback client 1 in response to the video advertisement request of the video playback client 1. The video playback client 1 comprises a voice monitoring switch and a voice monitoring module. The voice monitoring switch turns the voice monitoring module on and off. The voice monitoring module collects voice information, extracts speech data, and sends it to the speech recognition server. The speech recognition server 3 recognizes the speech data and returns the result text to the video playback client 1. The playback control flow of the video playback client 1 is shown in Fig. 2.
The speech recognition server 3 comprises a speech recognition module, which comprises an acoustic model, a dictionary file, and a language model. The acoustic model is obtained by performing feature extraction and acoustic model training on a speech corpus; the language model is obtained by language model training on the text provided in a text corpus; and the dictionary file stores a mapping table between words and phonemes. The implementation flow of the speech recognition service is shown in Fig. 3.
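As a rough illustration of two of the three resources named above, the following Python sketch loads a dictionary file that maps words to phonemes and trains a toy bigram language model from a text corpus. The file format, the phoneme symbols, and the function names are assumptions made for illustration only; the patent does not specify them, and acoustic model training is omitted entirely.

```python
from collections import Counter, defaultdict


def load_lexicon(lines):
    """Parse dictionary-file lines of the assumed form 'WORD PH1 PH2 ...' into a word -> phoneme-list map."""
    lexicon = {}
    for line in lines:
        parts = line.split()
        if len(parts) >= 2:
            lexicon[parts[0]] = parts[1:]
    return lexicon


def train_bigram_model(sentences):
    """Count word bigrams over whitespace-tokenized sentences and return P(next | prev) as nested dicts."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return {
        prev: {word: n / sum(nexts.values()) for word, n in nexts.items()}
        for prev, nexts in counts.items()
    }


# Hypothetical dictionary entries and training text (illustrative values only).
LEXICON = load_lexicon([
    "replay R IY P L EY",
    "pause P AO Z",
    "great G R EY T",
    "wall W AO L",
])
LANGUAGE_MODEL = train_bigram_model(["replay", "pause", "great wall", "great wall"])
```

A real recognizer would combine these two resources with acoustic scores during decoding; this sketch only shows what the dictionary file and the language model might hold.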
The video playback client 1 is a mobile phone, tablet computer, notebook computer, or desktop computer, so the system is applicable to a variety of platforms.
In the video advertisement voice interaction method implemented by the above system, the video playback client 1 sends an advertisement request to the advertisement delivery server 2, and the advertisement delivery server 2 provides advertisement code to the video playback client 1. The advertisement code is a string in XML or JSON format generated according to a predefined advertisement interaction protocol. It contains the information needed to play the advertisement, such as the URL of the advertisement material, the URLs for counting advertisement exposures, clicks, and completed plays, and the URLs for monitoring advertisement exposures and clicks. The client parses the XML or JSON string and then plays the advertisement and triggers the related events.
Each advertisement that requires voice interaction has an attribute named "skip advertisement keyword", which is generally the brand name of the advertisement. A voice interaction log is also added to record statistics on how users interact with the advertisements being played; these statistics can be provided to advertisers for reference. Specifically, a node "skipword" is added under the ad node corresponding to each advertisement, and its value is the keyword for skipping the advertisement. A node "recurl" is added after the skipword node, and its value is the URL of the log interface that records user interaction behavior. The parameters contained in this URL are recorded in the log. One of them is the actid parameter, whose value is the macro "##ACTIONID##". When the request is actually sent, the macro is replaced with the value corresponding to the action the user actually triggered, and the request for this URL is then sent.
The video playback client 1 plays the video advertisement. When the voice monitoring switch is on and the user speaks, the voice monitoring module collects the voice information and sends the speech data to the speech recognition server 3. The speech recognition server 3 returns the result text of the speech recognition to the video playback client 1. The video playback client 1 determines whether the result text contains a specified command and, if so, calls the corresponding player interface with that command to trigger the corresponding event.
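The client-side flow above, in which audio is forwarded to the recognizer only while the switch is on, might be sketched in Python as follows. The class name, the callback signatures, and the stand-in recognizer are hypothetical; in a real client the recognizer call would be a network round trip to the speech recognition server.

```python
from typing import Callable, Optional


class VoiceMonitoringModule:
    """Forwards captured audio to a recognizer only while the voice monitoring switch is on."""

    def __init__(self, recognize: Callable[[bytes], str], on_text: Callable[[str], None]):
        self.switch_on = False        # state of the voice monitoring switch
        self._recognize = recognize   # stands in for the call to the speech recognition server
        self._on_text = on_text       # receives the result text returned by the recognizer

    def set_switch(self, on: bool) -> None:
        self.switch_on = on

    def on_audio(self, audio: bytes) -> Optional[str]:
        """Handle one captured utterance; return the result text, or None if nothing was sent."""
        if not self.switch_on or not audio:
            return None
        text = self._recognize(audio)
        self._on_text(text)
        return text


def fake_recognize(audio: bytes) -> str:
    """Hypothetical recognizer used for illustration: treats the audio bytes as UTF-8 text."""
    return audio.decode("utf-8")
```

Injecting the recognizer and the text handler as callables keeps the switch logic independent of the transport and of the player, which is one possible way to separate the modules the text describes.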
The specified commands comprise built-in commands and non-built-in commands. For example:
"Replay": a built-in command that replays the current advertisement;
"Pause": a built-in command that pauses playback of the current advertisement;
"Great Wall": a non-built-in command. When the user says the skip keyword (skipword) of the current advertisement, that is, the brand name of the current advertisement, the current advertisement is skipped.
After each triggered event occurs, the video playback client 1 records a log entry by calling the logging interface provided by the advertisement delivery server 2.
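The command handling described above can be sketched as a small resolution function. This is a minimal Python sketch under stated assumptions: the English phrases stand in for the spoken commands, the action names are hypothetical, and a simple substring check models "the result text contains a specified command".

```python
# Built-in commands: spoken phrase -> player action (phrases and action names are illustrative).
BUILT_IN_COMMANDS = {"replay": "replay", "pause": "pause"}


def resolve_command(result_text, skipword=None):
    """Map recognized text to a player action, or return None if no specified command is present.

    Built-in commands are checked first; the skip keyword of the current advertisement
    is the non-built-in command and resolves to the "skip" action.
    """
    text = result_text.strip().lower()
    for phrase, action in BUILT_IN_COMMANDS.items():
        if phrase in text:
            return action
    if skipword and skipword.lower() in text:
        return "skip"
    return None
```

The returned action name could then be used both to call the corresponding player interface and to select the action number written to the log.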
The JSON fragment in this example is the result returned by the advertisement delivery server for one advertisement request sent by a client. It contains two marketing advertisements, for the brands Great Wall and Changjiang respectively. "ads" is an array holding multiple "ad" child nodes, each of which corresponds to one advertisement and contains a "skipword" child node. When the user turns on the voice monitoring switch and says "Great Wall", the advertisement with id 123 stops playing and playback jumps directly to the next advertisement, whose id is 124.
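A response of the kind described here might look like the following Python sketch. The ids, the two brand names, and the "ads", "skipword", and "recurl" node names come from the text; the exact nesting, the example.com URLs, and the helper function are assumptions, not the fragment from the original publication.

```python
import json

# Hypothetical advertisement code for two advertisements (structure and URLs are assumed).
AD_RESPONSE = json.loads("""
{
  "ads": [
    {"id": 123, "skipword": "Great Wall",
     "recurl": "http://ads.example.com/log?adid=123&actid=##ACTIONID##"},
    {"id": 124, "skipword": "Changjiang",
     "recurl": "http://ads.example.com/log?adid=124&actid=##ACTIONID##"}
  ]
}
""")


def next_ad_index(ads, current_index, result_text):
    """Return the index to play next: current_index + 1 if the current ad's skipword was spoken,
    otherwise current_index. A result equal to len(ads) means the ad break is over."""
    skipword = ads[current_index].get("skipword")
    if skipword and skipword.lower() in result_text.lower():
        return current_index + 1
    return current_index
```

With this data, saying "Great Wall" while the first advertisement plays moves playback from id 123 to id 124, while saying "Changjiang" at that point leaves the first advertisement playing.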
After collecting the voice information, the client checks the recurl node. If the node exists, the client takes out its URL and replaces "##ACTIONID##" in the URL with the number of the event actually triggered by the recognized text (the numbering is, for example, 1: replay, 2: pause, 3: skip). The client then accesses this URL, which corresponds to a log collection service on the advertisement delivery server. On receiving the request, the service parses the relevant parameters and writes the log record.
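The macro substitution described above can be sketched in Python as follows. The action numbering (1: replay, 2: pause, 3: skip) and the "##ACTIONID##" macro are from the text; the function name, the action names, and the example URL are assumptions, and the HTTP request itself is left out.

```python
from typing import Optional

ACTION_IDS = {"replay": 1, "pause": 2, "skip": 3}  # numbering given in the text
ACTION_MACRO = "##ACTIONID##"


def build_log_url(ad: dict, action: str) -> Optional[str]:
    """Return the log URL with the macro replaced by the action number.

    Returns None when the ad has no recurl node or the action is unknown,
    in which case no log request would be sent.
    """
    recurl = ad.get("recurl")
    if not recurl or action not in ACTION_IDS:
        return None
    return recurl.replace(ACTION_MACRO, str(ACTION_IDS[action]))
```

For an advertisement whose recurl is the hypothetical `http://ads.example.com/log?adid=123&actid=##ACTIONID##`, a skip would produce a URL ending in `actid=3`, which the client would then request from the log collection service.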
The above is a preferred embodiment of the present invention. It should be pointed out that those of ordinary skill in the art can make improvements and modifications without departing from the principle of the present invention, and such improvements and modifications shall also be regarded as falling within the protection scope of the present invention.
Claims (6)
1. A video advertisement voice interaction system, comprising a video playback client, an advertisement delivery server, and a speech recognition server, wherein the advertisement delivery server provides video advertisement code to the video playback client in response to the video advertisement request of the video playback client, characterized in that the video playback client comprises a voice monitoring switch and a voice monitoring module; the voice monitoring module collects voice information, extracts speech data, and sends it to the speech recognition server; and the speech recognition server recognizes the speech and returns the recognition result text to the video playback client.
2. The video advertisement voice interaction system according to claim 1, characterized in that the speech recognition server comprises a speech recognition module; the speech recognition module comprises an acoustic model, a dictionary file, and a language model; the acoustic model is obtained by performing feature extraction and acoustic model training on a speech corpus; the language model is obtained by language model training on the text provided in a text corpus; and the dictionary file stores a mapping table between words and phonemes.
3. The video advertisement voice interaction system according to claim 1, characterized in that the video playback client is a mobile phone, tablet computer, notebook computer, or desktop computer.
4. A video advertisement voice interaction method implemented by the system according to any one of claims 1 to 3, characterized in that: the video playback client sends an advertisement request to the advertisement delivery server; the advertisement delivery server provides advertisement code to the video playback client; the video playback client plays the video advertisement; when the voice monitoring switch is on and the user speaks, the voice monitoring module collects the voice information and sends the speech data to the speech recognition server; the speech recognition server returns the result text of the speech recognition to the video playback client; and the video playback client determines whether the result text contains a specified command and, if so, calls the corresponding player interface with that command to trigger the corresponding event.
5. The video advertisement voice interaction method according to claim 4, characterized in that the specified commands comprise built-in commands and non-built-in commands.
6. The video advertisement voice interaction method according to claim 4, characterized in that, after each triggered event occurs, the video playback client records a log entry by calling the logging interface provided by the advertisement delivery server.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510145559.8A CN104754364A (en) | 2015-03-30 | 2015-03-30 | Video advertisement voice interaction system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104754364A (en) | 2015-07-01 |
Family
ID=53593370
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510145559.8A (Pending), CN104754364A (en) | 2015-03-30 | 2015-03-30 | Video advertisement voice interaction system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104754364A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101034455A (en) * | 2006-03-06 | 2007-09-12 | 腾讯科技(深圳)有限公司 | Method and system for implementing online advertisement |
CN103810998A (en) * | 2013-12-05 | 2014-05-21 | 中国农业大学 | Method for off-line speech recognition based on mobile terminal device and achieving method |
CN104216990A (en) * | 2014-09-09 | 2014-12-17 | 科大讯飞股份有限公司 | Method and system for playing video advertisement |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105824877A (en) * | 2016-03-08 | 2016-08-03 | 乐视网信息技术(北京)股份有限公司 | Voice-based information searching method and device |
US10950224B2 (en) | 2016-09-22 | 2021-03-16 | Tencent Technology (Shenzhen) Company Limited | Method for presenting virtual resource, client, and plug-in |
CN107657471A (en) * | 2016-09-22 | 2018-02-02 | 腾讯科技(北京)有限公司 | A kind of methods of exhibiting of virtual resource, client and plug-in unit |
CN106792050A (en) * | 2016-12-21 | 2017-05-31 | 深圳Tcl数字技术有限公司 | The method for closing and system of video ads |
CN107492380A (en) * | 2017-08-15 | 2017-12-19 | 上海精数信息科技有限公司 | Man-machine recognition methods based on audio and apply its advertisement placement method and system |
CN108039175A (en) * | 2018-01-29 | 2018-05-15 | 北京百度网讯科技有限公司 | Audio recognition method, device and server |
US11398228B2 (en) | 2018-01-29 | 2022-07-26 | Beijing Baidu Netcom Science And Technology Co., Ltd. | Voice recognition method, device and server |
CN109240640A (en) * | 2018-08-30 | 2019-01-18 | 百度在线网络技术(北京)有限公司 | Advertising pronunciation exchange method, device and storage medium |
CN109583430A (en) * | 2018-12-28 | 2019-04-05 | 广州励丰文化科技股份有限公司 | A kind of control method and device showing device |
CN111339361A (en) * | 2020-02-27 | 2020-06-26 | 深圳市元征科技股份有限公司 | Vehicle diagnostic record processing method, device and equipment and readable storage medium |
CN113283947A (en) * | 2021-06-18 | 2021-08-20 | 北京奇艺世纪科技有限公司 | Multimedia file playing and control method, related device and readable storage medium |
CN113283947B (en) * | 2021-06-18 | 2024-05-03 | 北京奇艺世纪科技有限公司 | Multimedia file playing and controlling method, related device and readable storage medium |
CN113763046A (en) * | 2021-09-07 | 2021-12-07 | 四川易海天科技有限公司 | Mobile internet vehicle-mounted intelligent delivery system based on big data analysis |
CN115412768A (en) * | 2022-11-02 | 2022-11-29 | 深圳市人马互动科技有限公司 | Information recommendation method based on voice interaction system and related device |
CN115412768B (en) * | 2022-11-02 | 2023-03-14 | 深圳市人马互动科技有限公司 | Information recommendation method based on voice interaction system and related device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | C06 | Publication | |
 | PB01 | Publication | |
 | C10 | Entry into substantive examination | |
 | SE01 | Entry into force of request for substantive examination | |
 | RJ01 | Rejection of invention patent application after publication | Application publication date: 20150701 |