CN103533391A - Two-way interaction digital television box system with acoustic control type interaction and implementation method - Google Patents

Two-way interaction digital television box system with acoustic control type interaction and implementation method Download PDF

Info

Publication number
CN103533391A
CN103533391A CN201310477049.1A CN201310477049A CN103533391A CN 103533391 A CN103533391 A CN 103533391A CN 201310477049 A CN201310477049 A CN 201310477049A CN 103533391 A CN103533391 A CN 103533391A
Authority
CN
China
Prior art keywords
user
module
instruction
digital television
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310477049.1A
Other languages
Chinese (zh)
Other versions
CN103533391B (en
Inventor
郗登振
王淑荣
纪燕杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
QINGDAO YINGTIANXIA INTELLIGENT TECHNOLOGY Co Ltd
Original Assignee
QINGDAO YINGTIANXIA INTELLIGENT TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by QINGDAO YINGTIANXIA INTELLIGENT TECHNOLOGY Co Ltd filed Critical QINGDAO YINGTIANXIA INTELLIGENT TECHNOLOGY Co Ltd
Priority to CN201310477049.1A priority Critical patent/CN103533391B/en
Publication of CN103533391A publication Critical patent/CN103533391A/en
Application granted granted Critical
Publication of CN103533391B publication Critical patent/CN103533391B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a two-way interaction digital television box system with acoustic control type interaction and an implementation method. The two-way interaction digital television box system comprises a video server system, a VOD (Video-On-Demand) management system, a digital television network, an EPG (Electronic Program Guide) system and a digital television terminal, wherein the EPG system provides a mode with convenient operation, program friendliness and fast program access; by the EPG system, the browsing and query of information of programs played recently by one or more channels are realized; and simultaneously the EPG provides a classifying function, and can help a user browse and select various programs. The EPG system comprises a receiving module, a control conversion module, a storage module and a human-computer interaction operation module. The two-way interaction digital television box system disclosed by the invention has the advantages that the usability of a product is stronger, the layout and the content are normalized, and the operation process of the user in the selection process is simplified, so that the two-way interaction digital television box has the advantage of good user experience.

Description

Bidirectional interdynamic digital television box system and implementation method that a kind of voice-control is mutual
Technical field
The present invention relates to areas of information technology, relate in particular to a kind of voice-control mutual Bidirectional interdynamic digital television box system and implementation method.
Background technology
Digital TV set-top box is a kind of conversion equipment that digital television signal is converted to analog signal, it is to the reduction of decoding through the image of digitalization compressed and voice signal, the video and the voice signal that produce simulation, provide high-quality TV programme by television indicator and stereo set to spectators.Current digital TV set-top box has become a kind of embedded computing equipment, there is perfect real time operating system, powerful CPU computing capability is provided, be used for coordinating controller top box each several part hardware facility, and provide colourful graphic user interface, as the electronic program guides of enhanced-quality television, to user, provide program introduction and the background information that both pictures and texts are excellent.Meanwhile, Set Top Box has " fool's computer " ability, by house software function, network is carried out to bidirectional rebuilding, is convenient to realize as internet browsing, video request program, household electric business, telephone communication many services.
Bidirectional interdynamic digital television Set Top Box still rests in controlling of traditional remote controller formula mostly, lacks the interface layout of unified assembly type and the mode of operation of flexible intelligence.Although current existing related interfaces present with the patent of intelligent interaction mode also can realize chunk interface and touch-control, acoustic control, gesture is intelligentized controls, but it is mutual one to one to realize all interface controls and operational order, thereby intellectualized operation is applied to each chunk instruction, for live telecast, the video-on-demand applications function of bi-directional digital television box, be difficult to realize the intelligence experience of the mutual and hommization of What You See Is What You Get.
Generally speaking, need at present the urgent technical problem solving of those skilled in the art to be:
The first, how at the display end of Digital Television, realize a kind of interface layout of novel component, making interface present can standardize and unify, and the otherness of the various display terminals of automatic shield.
The second, how a kind of interaction characteristic and method of Bidirectional interdynamic digital television box system are provided, support the two-way interactive operation of novel chunk, realize the mutual effect of seeing and obtaining.
Summary of the invention
The interface that the present invention is directed to traditional Bidirectional interdynamic digital television box presents and man-machine interaction relates to the problem of existence, a kind of voice-control mutual Bidirectional interdynamic digital television box system and implementation method have been proposed, this system emphasis improves the EPG system in Bidirectional interdynamic digital television box system, define a kind of UI based on chunk and present interface, form by application resource content with assembly encapsulates, the terminal interface of Bidirectional interdynamic digital television box is realized interactive operation instruction and interface assembly event binding one to one, complete the application choice function of What You See Is What You Get, reciprocal process can reduce unnecessary repeated interaction, realize controlled mutual effect flexibly, the Bidirectional interdynamic digital television box system that the method realizes has stronger Product's Ease of Use, normalized layout and content, simplified the operating process in user's selection course, make Bidirectional interdynamic digital television box there is good user and experience advantage, and realized a kind of novel application and present layout and support touch-control, acoustic control, the intelligentized man-machine interaction mode such as gesture, facilitate user to browse and inquire about programme information, personalized service is provided.
To achieve these goals, the present invention adopts following technical scheme:
The Bidirectional interdynamic digital television box system that a kind of voice-control is mutual, comprise video server system, VOD management system, digital TV network, EPG system and digital TV terminal, described video server system transfers data to VOD management system, described VOD management system is transferred to EPG system by data by digital TV network, between described EPG system and digital TV terminal, it is two-way communication, the digital television case of described digital TV terminal obtains the information of EPG system by interface, and presents in the display unit of digital TV terminal;
EPG system comprises receiver module, control modular converter, instruction memory module and the voice-control man-machine interactive operation module of communication successively;
Described voice-control man-machine interactive operation module, for realizing the man-machine interactive operation of chunk, detects and identifies user's sound instruction, responds and carries out this operational order, and operating result is fed back to digital TV set-top box; Described sound interactive operation can be by user the information gathering of sound carry out free definition, allow user to define different sound and carry out the operating function in expression system.
Described receiver module comes from the various data messages of digital TV network for receiving, and obtains electric program menu information by demodulation, demultiplexing, decoding and audio/video coding, and transfers data to receiver module;
Described control modular converter, for by generate programme content index and the extend information being associated be converted to chunk version, and chunk talked about to version be transferred to instruction memory module;
Described instruction memory module, for storing the program guide information of the chunk structure after conversion, and definitions section block instruction collection, mapping relations storehouse between the raw information of storage user input and the operation information of sign command function, mapping relations exist with the form of the corresponding key value of keyword, between described instruction memory module and man-machine interactive operation module, are two-way communications;
In described mapping relations storehouse, input instruction set and exist as keyword, for the information of match user input, and free definition is carried out in the information gathering of inputting by user; The operation information that characterizes command function exists as key value, being mapped as one to one or many-to-one relation of keyword and key value;
Described voice-control man-machine interactive operation module comprises pretreatment module, characteristic extracting module, matching module, Executive Module.
Described video server system comprises: VOD broadcasting server, VOD page directory server, VOD Broadcast Control server;
Described VOD Broadcast Control server is the core of VOD business, is mainly used to carry out the video on-demand request of processing user, and response data is provided, and the data query of Coordination Treatment VOD program request, broadcast file and prepare, broadcast issuing of control command;
Described VOD broadcasting server is mainly used to carry out the program request order of Broadcast Control server, comprises the distribution of IP Information On Demand, the scheduling of program request file, the control of broadcasting file;
Described VOD page directory server is for the treatment of user's page directory browse request, and page data passed to the user of request.
Described VOD management system is responsible for the mandate of this broadcasting user, the detailed inquiry of the charging of user's program request and expense; Major function comprises that subscriber information management, customer data base index, video frequency program source control, user authenticate, monitoring server;
Described digital TV network is used for realizing transmission distribution, Internet Transmission, is written into network function.
Simple operation, program are friendly, a kind of mode of fast access program for user provides for described EPG system, by this system, realize and browse and inquire about the programme information that one or more channels are play in the recent period, meanwhile, EPG provides classification feature, helps user to browse and select various types of programs.
The operation of described pretreatment module for the voice of collecting being carried out to pre-filtering, quantize removing redundant information and noise reduction process, and by the communication after processing to characteristic extracting module;
Characteristic extracting module, carries out feature extraction to carrying out the voice of typing, obtains characteristic vector, and describes according to characteristic vector the keyword dictionary of setting up sound bank, stores instruction memory module into;
Matching module extracts the characteristic vector obtaining and whether belongs to some keywords for judging user input instruction, and the coupling operational order corresponding with this keyword, by Executive Module, identify and respond and carry out this operational order, finally operating result being fed back to internet television terminal;
In addition, man-machine interactive operation module also comprises self adaptation identification module, user's voice are carried out to self study, allow user to define the operational order that different sound instructions is used as function in system, thereby the sound model that early stage, sampling obtained is carried out to necessary correction, further to improve the accuracy rate of identification.
Described digital TV terminal comprises display unit and digital television case, and display unit is for resolving the EPG information receiving and showing with the interface of chunk form; Digital television case is for the input message of obtaining and identify user of film data, and described digital television case comprises the microphone of realizing sound input function.
The method of work that described system adopts, step is as follows:
Step (1): start, video server system provides the program source of video request program, and by the mandate billing function of VOD management system management point broadcasting user, programme information source, through the transmission of digital TV network, is transmitted to EPG system;
Step (2): receive and come from the various data messages of digital TV network by EPG system receiving module, and obtain electric program menu information by demodulation, demultiplexing, decoding and audio/video coding technology;
Step (3): controlling modular converter is chunk version by the electric program menu content information receiving and index translation, and store the group block structured program guide information after conversion in memory module; Then the display unit to digital TV terminal by program guide communication, carries out presenting of interface;
Step (4): user's reciprocal process is carried out the typing of primitive operation instruction by the microphone of the digital television case of digital TV terminal, and support user to set self-defining operational order;
Step (5): by man-machine interactive operation module, input identification and the detection of instruction, judge user input instruction whether can with mapping relations storehouse in keyword match, if just enter step (6); Just enter if not step (7);
Step (6): system is carried out the function event of the operational order of corresponding keyword, and by terminal display device, present the result interface of operational correspondence; Finish;
Step (7): show miscue information, finish.
The result interface of the operational correspondence of described step (6) generates automatically by setting up mathematical logic model and applying algorithm, described mathematical logic model refers to the structure that presents that represents chunk interface with tree structure, chunk interface is as the root node of dendrogram, the node that has two kinds, be respectively node He Fu district, primary area node, wherein primary area node is the node that must exist, the node permission of auxiliary district exists as the district of object container as required, and the degree of depth of every one deck of tree structure represents the type of the node that it is represented.
The specific works method of the voice-control man-machine interactive operation module of described step (5) is as follows:
Step1 carries out the collection of voice messaging, because the voice operating instruction at chunk interface is corresponding one by one with the instruction of distance type operation, therefore the collection of voice messaging only need to gather the phonetic order of specific distance type operational correspondence, three class instructions have been defined: macro-instruction, chunk instruction and function command;
Phonetic order after Step2 gathers forms sound bank, and for each the voice signal oscillogram in sound bank, the value of extracting its every spacer segment frame obtains the characteristic vector f of a n dimension, thereby obtains characteristic vector set F;
Step3 generates search key dictionary set D to the method for characteristic set F application K-means cluster, capacity is d, keyword g corresponding to each class averaged and obtained by all characteristic vector f in such, the execution instruction of the corresponding chunk of each keyword g, the mapping relations of itself and chunk operational order key value, store in memory module;
Step4, for the sound instruction to be identified of input, obtains the characteristic vector m of a n dimension equally according to the method for step1 and step2;
Step5 is in keyword dictionary set D, between searching and characteristic vector m, Manhattan is apart from minimum keyword g, if this distance is less than the threshold value of appointment, f is the vector of coupling, the instruction of its corresponding instruction for carrying out, and m is belonged in the class that this keyword is corresponding, such feature is described and is updated to g=(D*g+m)/(d+1).
Beneficial effect of the present invention:
1 provides a kind of intellectuality, broadband multimedia services platform open, that support multiple services, to have standard layout format EPG system, solution business index and navigation lack the problem of consolidation form, and transmit static or resource the entertainment service of digital television bidirectional interaction is provided dynamically by Ethernet.
A kind of UI based on chunk of 2 definition presents interface, and the form by application resource content with assembly encapsulates, and by setting up mathematical logic model, has realized a kind of novel application and has presented layout and interactive mode.At the terminal interface of Digital Television, realize interactive operation instruction and interface assembly event binding one to one, complete the application choice function of What You See Is What You Get, reciprocal process can reduce unnecessary repeated interaction, improved the efficiency of response, reach controlled mutual effect flexibly, realize a kind of novel digital TV program menu and present layout and two-way interaction pattern.
3 provide a kind of touch Bidirectional interdynamic digital television box based on chunk interaction technique, and the mutual Bidirectional interdynamic digital television cartridge device of touch that uses chunk interaction technique principle to realize, can support the touch control operation of single-point and multiple spot, and carry out high-precision action recognition, respond fast all kinds of touch control operations, make Bidirectional interdynamic digital television box there is stronger Product's Ease of Use, simplified operating process when user selects, make internet television there is good user and experience.
4 by novel chunk UI interface alternation method, is different from the distance type interactive mode of operation of traditional selections such as only having upper and lower, left and right, confirm and exit.The method is without the complicated alternative events of definition, have easily know, easy to learn, easy-to-use interaction characteristic.Two-way interaction mode has met the demand of user to different business level, the more selection channel of free multicomponent is provided, and chunk exchange method is supported multiple modes of operation, can autonomous configuration, expand to multiple Intelligent control mode, method of operation is flexible and changeable, is applicable to the mutual of miscellaneous service information and application resource.
The advantage of 5 layouts due to chunk interface, acoustic control instruction does not need loaded down with trivial details and huge instruction database, only by the very few instruction corresponding with interface group block, can realize interactive operation, therefore when feature extraction, also can obtain characteristic vector by simpler and more direct mode, shorten match time, guaranteed matching efficiency.
The chunk at 6 chunk interfaces is arranged and is adopted the combining form that is not more than at most 9, therefore acoustic control instruction at most only need to be mated 9 voice of 1~9, the sound instruction storage capacity that order extracts greatly reduces, by definition keyword dictionary, sound instruction for user's input, make keyword that feature extraction obtains more close to matching result, and the operating efficiency of coupling sound instruction also obviously improve.
In a word, this exchange method makes Bidirectional interdynamic digital television box have stronger Product's Ease of Use, normalized layout and content, simplified the operating process in user's selection course, by this UI layout and interaction design, can make the mode of operation hommization more of Bidirectional interdynamic digital television box, thereby significantly the user of improving product experiences.
The present invention has built a kind of interface layout form and interactive mode of Bidirectional interdynamic digital television box system of novel chunk formula, by interface assembly and response events one to one, realizes the mutual effect of What You See Is What You Get.Be different from traditional only have on, under, left, right, confirm and exit the single remote control interactive operator scheme of selection, the Bidirectional interdynamic digital television box system that adopting said method is realized can provide high-quality user to experience service to user, method of operation is flexible and changeable, and can expand to touch-control, acoustic control, the mutual field of the intelligent operations such as gesture, be applicable to the mutual of miscellaneous service information and application resource, realize each generic operation of response fast, system is easily known, easy to learn, easy-to-use convenient interactive mode can be applicable to people's group operation widely to be used, allow user experience intellectuality, the amusement of hommization is enjoyed.
Accompanying drawing explanation
Fig. 1 is the Mathematical Modeling schematic diagram of Bidirectional interdynamic digital television box;
Fig. 2 is Bidirectional interdynamic digital television box system construction drawing;
Fig. 3 is the Sound Match of Bidirectional interdynamic digital television box system and the method step of identification;
Fig. 4 is the chunk exchange method flow chart of Bidirectional interdynamic digital television box system.
Embodiment
As shown in Figure 1, the internet television system that the present invention realizes presents and relates to alternately the problem of existence for conventional internet TV, a kind of internet television service implementation method based on chunk interaction technique has been proposed, first this implementation method is improved the interface that presents of internet television terminal, define a kind of UI based on chunk and present interface, form by application resource content with assembly encapsulates, and has realized a kind of novel application and has presented layout.
The interface layout content of described chunk form comprises: main demonstration block, in order to show the first carrying chunk; Auxiliary demonstration block, in order to show the second carrying chunk; Described auxiliary demonstration block is positioned at upside, downside, left side, the right side of described main demonstration block or is suspended in top.While having the block of a plurality of suspended states, adopt the form that level goes forward one by one to show, i.e. the suspended state block of up-to-date ejection is always positioned at highlighting foremost of interface, and the interface block of other levels shows by level transparency is set.
Described system comprises initial interface and a plurality of processes interface, and initial interface is identical with the appearance form at process interface, and main demonstration block has nine first carrying chunks, arranges and is palace lattice shape; There are nine second carrying chunks auxiliary viewing area, laterally or is longitudinally arranged in order, and shows nine carrying chunks in each block, if when in block, chunk surpasses nine, and need be by the tenth and above carrying chunk Pagination Display.
With the chunk interface phase ratio relating in existing publication, in the present invention, for interface has defined Mathematical Modeling, and can generate automatically initial interface and process interface by algorithm, method for expressing is as follows:
The interface that represents chunk with tree structure presents structure, chunk interface is as the root node of dendrogram, five child nodes that have two kinds, be respectively primary area node (E district node) He Fu district node (1 ,Fu district 2, auxiliary district ... auxiliary district M), wherein primary area node is the node that must exist, auxiliary district node can be as required as district's existence of object container, and the degree of depth of every one deck of tree structure represents the type of the node that it is represented.As shown in Figure 1, as root node, its level degree of depth is 1 at each interface (comprising initial interface and process interface), and the level degree of depth of district's node is 2, and in district, the level degree of depth of chunk node is 3.
Chunk model, to gather Q={q|q=(primary area (chunk E 1, chunk E 2chunk E n) ,Fu district 1 (chunk A 1, chunk A 2chunk A n) ,Fu district 2 (chunk B 1, chunk B 2chunk B n) ... auxiliary district M (chunk M 1, chunk M 2chunk M n)), primary area ≠ ∮ wherein, n≤9} represents, the primary area at chunk interface can not be sky, and the chunk number also having in each district can not surpass 9.In addition, the tree structure that initial interface and process interface obtain, can generate automatically according to rendering content, obtains the child node of allocation tree structure.
As shown in Figure 2, a kind of Bidirectional interdynamic digital television box system, comprise video server system, VOD management system, digital TV network, EPG system and digital TV terminal, described video server system transfers data to VOD management system, described VOD management system is transferred to EPG system by data by digital TV network, between described EPG system and digital TV terminal, it is two-way communication, the digital television case of described digital TV terminal obtains the list of all issue films above EPG system by interface, the information such as program category and film title, and present in the display unit of digital TV terminal.
Described video server system comprises: VOD broadcasting server, VOD page directory server, VOD Broadcast Control server.
Described VOD Broadcast Control server is the core of VOD business, is mainly used to carry out the video on-demand request of processing user, and response data is provided, and the data query of Coordination Treatment VOD program request, broadcast file and prepare, broadcast issuing of control command.
Described VOD broadcasting server is mainly used to carry out the program request order of Broadcast Control server, comprises the distribution (VPID, APID) of IP Information On Demand, the scheduling of program request file, the control of broadcasting file.
Described VOD page directory server is for the treatment of user's page directory browse request, and page data passed to the user of request.
Described VOD management system is responsible for the mandate of this broadcasting user, the detailed inquiry of the charging of user's program request and expense.Major function comprises that subscriber information management, customer data base index, video frequency program source control, user authenticate, monitoring server etc.
Described digital TV network is used for realizing transmission distribution, Internet Transmission, is written into the functions such as network.
Simple operation, program are friendly, a kind of mode that can fast access program for user provides for described EPG system, by this system, realize and browse and inquire about the programme information that one or more channels are play in the recent period, simultaneously, EPG can provide classification feature, can help user to browse and select various types of programs.EPG system comprises receiver module, controls modular converter, memory module and man-machine interactive operation module.
Described receiver module comes from the various data messages of digital TV network for receiving, and obtains electric program menu information by technology such as demodulation, demultiplexing, decoding and audio/video codings, and transfers data to receiver module;
Described control modular converter, for by generate programme content index and the extend information being associated be converted to chunk version, and chunk talked about to version be transferred to instruction memory module;
Described instruction memory module, for storing the program guide information of the chunk structure after conversion, and definitions section block instruction collection, mapping relations storehouse between the raw information of storage user input and the operation information of sign command function, mapping relations exist with the form of the corresponding key value of keyword, between described instruction memory module and man-machine interactive operation module, are two-way communications.In described mapping relations storehouse, input instruction set and exist as keyword, for the information of match user input, and free definition is carried out in the information gathering that can input by user; The operation information that characterizes command function exists as key value, being mapped as one to one or many-to-one relation of keyword and key value.
Described voice-control man-machine interactive operation module is for realizing the man-machine interactive operation of chunk, comprise pretreatment module, characteristic extracting module, matching module, Executive Module, wherein pretreatment module is for carrying out the operation that redundant information and noise reduction process are removed in pre-filtering, quantification etc. to the voice of collecting, characteristic extracting module, carries out feature extraction to carrying out the voice of typing, obtains characteristic vector, and the keyword dictionary of setting up sound bank is described according to characteristic vector, store instruction memory module into.Matching module extracts the characteristic vector obtaining and whether belongs to some keywords for judging user input instruction, and the coupling operational order corresponding with this keyword, by Executive Module, identify and respond and carry out this operational order, finally operating result being fed back to internet television terminal.In addition, man-machine interactive operation module also comprises self adaptation identification module, can carry out self study to user's voice, allow user to define the operational order that different sound instructions is used as function in system, thereby the sound model that early stage, sampling obtained is carried out to necessary correction, further to improve the accuracy rate of identification.
Described digital TV terminal comprises display unit and digital television case, and display unit is for resolving the EPG information receiving and showing with the interface of chunk form; Digital television case is for the input message of obtaining and identify user of film data, and described digital television case comprises the microphone of realizing sound input function.
As shown in Figure 4, the method for work step that said system adopts is as follows:
Step (1): start, video server system provides the program source of video request program, and by the functions such as mandate charging of VOD management system management point broadcasting user, programme information source, through the transmission of digital TV network, is transmitted to EPG system.
Step (2): receive and come from the various data messages of digital TV network by EPG system receiving module, and obtain electric program menu information by technology such as demodulation, demultiplexing, decoding and audio/video codings;
Step (3): controlling modular converter is chunk version by the electric program menu content information receiving and index translation, and store the group block structured program guide information after conversion in memory module; Then the display unit to digital TV terminal by program guide communication, carries out presenting of interface.
Step (4): user's reciprocal process is carried out the typing of primitive operation instruction by the microphone of the digital television case of digital TV terminal, and support user to set self-defining operational order.
Step (5): by man-machine interactive operation module, input identification and the detection of instruction, judge user input instruction whether can with mapping relations storehouse in keyword match, if just enter step (6); Just enter if not step (7);
Step (6): system is carried out the function event of the operational order of corresponding keyword, and by terminal display device, present the result interface of operational correspondence; Finish;
Step (7): show miscue information, finish.
The result interface of the operational correspondence of described step (6) generates automatically by setting up mathematical logic model and applying algorithm, described mathematical logic model refers to the structure that presents that represents chunk interface with tree structure, chunk interface is as the root node of dendrogram, the node that has two kinds, be respectively node He Fu district, primary area node, wherein primary area node is the node that must exist, the node permission of auxiliary district exists as the district of object container as required, and the degree of depth of every one deck of tree structure represents the type of the node that it is represented.
As shown in Figure 3, the specific works method of the voice-control man-machine interactive operation module of described step (5) is as follows:
Step1 carries out the collection of voice messaging, because the voice operating instruction at chunk interface is corresponding one by one with the instruction of distance type operation, therefore the collection of voice messaging only need to gather the phonetic order of specific distance type operational correspondence, three class instructions have been defined: macro-instruction, chunk instruction and function command.The mapping relations of the division of chunk instruction set and configuration-direct and chunk operational order refer to patent " adopting the human-computer interaction device of voice-control " (application number 201310119989.3).
Phonetic order after Step2 gathers forms sound bank, and for each the voice signal oscillogram in sound bank, the value of extracting its every spacer segment frame obtains the characteristic vector f of a n dimension, thereby obtains characteristic vector set F;
Step3 generates search key dictionary set D to the method for characteristic set F application K-means cluster, capacity is d, keyword g corresponding to each class averaged and obtained by all characteristic vector f in such, the execution instruction of the corresponding chunk of each keyword g, the mapping relations of itself and chunk operational order key value, store in memory module;
Step4, for the sound instruction to be identified of input, obtains the characteristic vector m of a n dimension equally according to the method for step1 and step2;
Step5 is in keyword dictionary set D, between searching and characteristic vector m, Manhattan is apart from minimum keyword g, if this distance is less than the threshold value of appointment, f is the vector of coupling, the instruction of its corresponding instruction for carrying out, and m is belonged in the class that this keyword is corresponding, such feature is described and is updated to g=(D*g+m)/(d+1).
Although above-mentioned, by reference to the accompanying drawings the specific embodiment of the present invention is described; but be not limiting the scope of the invention; one of ordinary skill in the art should be understood that; on the basis of technical scheme of the present invention, those skilled in the art do not need to pay various modifications that creative work can make or distortion still in protection scope of the present invention.

Claims (10)

1. the Bidirectional interdynamic digital television box system that voice-control is mutual, it is characterized in that, comprise video server system, VOD management system, digital TV network, EPG system and digital TV terminal, described video server system transfers data to VOD management system, described VOD management system is transferred to EPG system by data by digital TV network, between described EPG system and digital TV terminal, it is two-way communication, the digital television case of described digital TV terminal obtains EPG system information by interface, and presents in the display unit of digital TV terminal;
EPG system comprises receiver module, control modular converter, instruction memory module and the voice-control man-machine interactive operation module of communication successively;
Described voice-control man-machine interactive operation module, for realizing the man-machine interactive operation of chunk, detects and identifies user's sound instruction, responds and carries out this operational order, and operating result is fed back to digital television case; Described sound interactive operation can be by user the information gathering of sound carry out free definition, allow user to define different sound and carry out the operating function in expression system.
2. the mutual Bidirectional interdynamic digital television box system of a kind of voice-control as claimed in claim 1, is characterized in that,
Described receiver module comes from the various data messages of digital TV network for receiving, and obtains electric program menu information by demodulation, demultiplexing, decoding and audio/video coding, and transfers data to receiver module;
Described control modular converter, for by generate programme content index and the extend information being associated be converted to chunk version, and chunk talked about to version be transferred to instruction memory module;
Described instruction memory module, for storing the program guide information of the chunk structure after conversion, and definitions section block instruction collection, mapping relations storehouse between the raw information of storage user input and the operation information of sign command function, mapping relations exist with the form of the corresponding key value of keyword, between described instruction memory module and man-machine interactive operation module, are two-way communications;
In described mapping relations storehouse, input instruction set and exist as keyword, for the information of match user input, and free definition is carried out in the information gathering of inputting by user; The operation information that characterizes command function exists as key value, being mapped as one to one or many-to-one relation of keyword and key value;
Described voice-control man-machine interactive operation module comprises pretreatment module, characteristic extracting module, matching module, Executive Module.
3. the mutual Bidirectional interdynamic digital television box system of a kind of voice-control as claimed in claim 1, is characterized in that,
Described video server system comprises: VOD broadcasting server, VOD page directory server, VOD Broadcast Control server;
Described VOD Broadcast Control server is the core of VOD business, is mainly used to carry out the video on-demand request of processing user, and response data is provided, and the data query of Coordination Treatment VOD program request, broadcast file and prepare, broadcast issuing of control command;
Described VOD broadcasting server is mainly used to carry out the program request order of Broadcast Control server, comprises the distribution of IP Information On Demand, the scheduling of program request file, the control of broadcasting file;
Described VOD page directory server is for the treatment of user's page directory browse request, and page data passed to the user of request.
4. the mutual Bidirectional interdynamic digital television box system of a kind of voice-control as claimed in claim 1, is characterized in that,
Described VOD management system is responsible for the mandate of this broadcasting user, the detailed inquiry of the charging of user's program request and expense; Major function comprises that subscriber information management, customer data base index, video frequency program source control, user authenticate, monitoring server;
Described digital TV network is used for realizing transmission distribution, Internet Transmission, is written into network function.
5. the mutual Bidirectional interdynamic digital television box system of a kind of voice-control as claimed in claim 1, is characterized in that,
Simple operation, program are friendly, a kind of mode of fast access program for user provides for described EPG system, by this system, realize and browse and inquire about the programme information that one or more channels are play in the recent period, meanwhile, EPG provides classification feature, helps user to browse and select various types of programs.
6. the mutual Bidirectional interdynamic digital television box system of a kind of voice-control as claimed in claim 2, is characterized in that,
The operation of described pretreatment module for the voice of collecting being carried out to pre-filtering, quantize removing redundant information and noise reduction process, and by the communication after processing to characteristic extracting module;
Characteristic extracting module, carries out feature extraction to carrying out the voice of typing, obtains characteristic vector, and describes according to characteristic vector the keyword dictionary of setting up sound bank, stores instruction memory module into;
Matching module extracts the characteristic vector obtaining and whether belongs to some keywords for judging user input instruction, and the coupling operational order corresponding with this keyword, by Executive Module, identify and respond and carry out this operational order, finally operating result being fed back to internet television terminal;
In addition, man-machine interactive operation module also comprises self adaptation identification module, user's voice are carried out to self study, allow user to define the operational order that different sound instructions is used as function in system, thereby the sound model that early stage, sampling obtained is carried out to necessary correction, further to improve the accuracy rate of identification.
7. the mutual Bidirectional interdynamic digital television box system of a kind of voice-control as claimed in claim 2, is characterized in that,
Described digital TV terminal comprises display unit and digital television case, and display unit is for resolving the EPG information receiving and showing with the interface of chunk form; Digital television case is for the input message of obtaining and identify user of film data, and described digital television case comprises the microphone of realizing sound input function.
8. the method for work that the system described in above-mentioned arbitrary claim adopts, is characterized in that, step is as follows:
Step (1): start, video server system provides the program source of video request program, and by the mandate billing function of VOD management system management point broadcasting user, programme information source, through the transmission of digital TV network, is transmitted to EPG system;
Step (2): receive and come from the various data messages of digital TV network by EPG system receiving module, and obtain electric program menu information by demodulation, demultiplexing, decoding and audio/video coding technology;
Step (3): controlling modular converter is chunk version by the electric program menu content information receiving and index translation, and store the group block structured program guide information after conversion in memory module; Then the display unit to digital TV terminal by program guide communication, carries out presenting of interface;
Step (4): user's reciprocal process is carried out the typing of primitive operation instruction by the microphone of the digital television case of digital TV terminal, and support user to set self-defining operational order;
Step (5): by man-machine interactive operation module, input identification and the detection of instruction, judge user input instruction whether can with mapping relations storehouse in keyword match, if just enter step (6); Just enter if not step (7);
Step (6): system is carried out the function event of the operational order of corresponding keyword, and by terminal display device, present the result interface of operational correspondence; Finish;
Step (7): show miscue information, finish.
9. method as claimed in claim 8, it is characterized in that, the result interface of the operational correspondence of described step (6) generates automatically by setting up mathematical logic model and applying algorithm, described mathematical logic model refers to the structure that presents that represents chunk interface with tree structure, chunk interface is as the root node of dendrogram, the node that has two kinds, be respectively node He Fu district, primary area node, wherein primary area node is the node that must exist, the node permission of auxiliary district exists as the district of object container as required, the degree of depth of every one deck of tree structure represents the type of the node that it is represented.
10. method as claimed in claim 8, is characterized in that, the specific works method of the voice-control man-machine interactive operation module of described step (5) is as follows:
Step1 carries out the collection of voice messaging, because the voice operating instruction at chunk interface is corresponding one by one with the instruction of distance type operation, therefore the collection of voice messaging only need to gather the phonetic order of specific distance type operational correspondence, three class instructions have been defined: macro-instruction, chunk instruction and function command;
Phonetic order after Step2 gathers forms sound bank, and for each the voice signal oscillogram in sound bank, the value of extracting its every spacer segment frame obtains the characteristic vector f of a n dimension, thereby obtains characteristic vector set F;
Step3 generates search key dictionary set D to the method for characteristic set F application K-means cluster, capacity is d, keyword g corresponding to each class averaged and obtained by all characteristic vector f in such, the execution instruction of the corresponding chunk of each keyword g, the mapping relations of itself and chunk operational order key value, store in memory module;
Step4, for the sound instruction to be identified of input, obtains the characteristic vector m of a n dimension equally according to the method for step1 and step2;
Step5 is in keyword dictionary set D, between searching and characteristic vector m, Manhattan is apart from minimum keyword g, if this distance is less than the threshold value of appointment, f is the vector of coupling, the instruction of its corresponding instruction for carrying out, and m is belonged in the class that this keyword is corresponding, such feature is described and is updated to g=(D*g+m)/(d+1).
CN201310477049.1A 2013-10-12 2013-10-12 The method of work of the Bidirectional interdynamic digital television box system that a kind of voice-control is mutual Expired - Fee Related CN103533391B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310477049.1A CN103533391B (en) 2013-10-12 2013-10-12 The method of work of the Bidirectional interdynamic digital television box system that a kind of voice-control is mutual

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310477049.1A CN103533391B (en) 2013-10-12 2013-10-12 The method of work of the Bidirectional interdynamic digital television box system that a kind of voice-control is mutual

Publications (2)

Publication Number Publication Date
CN103533391A true CN103533391A (en) 2014-01-22
CN103533391B CN103533391B (en) 2016-09-14

Family

ID=49935000

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310477049.1A Expired - Fee Related CN103533391B (en) 2013-10-12 2013-10-12 The method of work of the Bidirectional interdynamic digital television box system that a kind of voice-control is mutual

Country Status (1)

Country Link
CN (1) CN103533391B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105245964A (en) * 2015-09-30 2016-01-13 天脉聚源(北京)科技有限公司 Method and device for setting interactive information of interactive television system
CN105355196A (en) * 2015-09-28 2016-02-24 大连楼兰科技股份有限公司 Speech instruction recognition method for intelligent glasses applied to field of car maintenance
CN106331781A (en) * 2016-09-09 2017-01-11 深圳市九洲电器有限公司 Analysis push method and analysis push system based on household voice
CN107240400A (en) * 2017-07-03 2017-10-10 重庆小雨点小额贷款有限公司 Terminal operation method and device
CN107277745A (en) * 2016-04-02 2017-10-20 英特尔Ip公司 Blue tooth voice contrast means and method
CN108053674A (en) * 2018-01-16 2018-05-18 湖州华科信息咨询有限公司 A kind of method and apparatus for being used for traffic lights fault cues and repair
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US10663938B2 (en) 2017-09-15 2020-05-26 Kohler Co. Power operation of intelligent devices
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US11921794B2 (en) 2017-09-15 2024-03-05 Kohler Co. Feedback for water consuming appliance

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020035477A1 (en) * 2000-09-19 2002-03-21 Schroder Ernst F. Method and apparatus for the voice control of a device appertaining to consumer electronics
CN101257619A (en) * 2008-03-21 2008-09-03 华为技术有限公司 Method, system and equipment for controlling interactive video service
CN102740014A (en) * 2011-04-07 2012-10-17 青岛海信电器股份有限公司 Voice controlled television, television system and method for controlling television through voice
CN103248919A (en) * 2013-05-22 2013-08-14 青岛旲天下智能科技有限公司 IPTV (Internet Protocol Television) system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020035477A1 (en) * 2000-09-19 2002-03-21 Schroder Ernst F. Method and apparatus for the voice control of a device appertaining to consumer electronics
CN101257619A (en) * 2008-03-21 2008-09-03 华为技术有限公司 Method, system and equipment for controlling interactive video service
CN102740014A (en) * 2011-04-07 2012-10-17 青岛海信电器股份有限公司 Voice controlled television, television system and method for controlling television through voice
CN103248919A (en) * 2013-05-22 2013-08-14 青岛旲天下智能科技有限公司 IPTV (Internet Protocol Television) system

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105355196A (en) * 2015-09-28 2016-02-24 大连楼兰科技股份有限公司 Speech instruction recognition method for intelligent glasses applied to field of car maintenance
CN105245964A (en) * 2015-09-30 2016-01-13 天脉聚源(北京)科技有限公司 Method and device for setting interactive information of interactive television system
CN107277745A (en) * 2016-04-02 2017-10-20 英特尔Ip公司 Blue tooth voice contrast means and method
CN106331781A (en) * 2016-09-09 2017-01-11 深圳市九洲电器有限公司 Analysis push method and analysis push system based on household voice
CN107240400A (en) * 2017-07-03 2017-10-10 重庆小雨点小额贷款有限公司 Terminal operation method and device
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US10663938B2 (en) 2017-09-15 2020-05-26 Kohler Co. Power operation of intelligent devices
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US11314215B2 (en) 2017-09-15 2022-04-26 Kohler Co. Apparatus controlling bathroom appliance lighting based on user identity
US11921794B2 (en) 2017-09-15 2024-03-05 Kohler Co. Feedback for water consuming appliance
US11949533B2 (en) 2017-09-15 2024-04-02 Kohler Co. Sink device
CN108053674A (en) * 2018-01-16 2018-05-18 湖州华科信息咨询有限公司 A kind of method and apparatus for being used for traffic lights fault cues and repair

Also Published As

Publication number Publication date
CN103533391B (en) 2016-09-14

Similar Documents

Publication Publication Date Title
CN103533391B (en) The method of work of the Bidirectional interdynamic digital television box system that a kind of voice-control is mutual
CN103501445A (en) Gesture-based interaction two-way interactive digital TV box system and implementation method
CN103533415B (en) Internet television system based on sound control man-machine interaction technology and its implementation
CN101833854B (en) Interacting method of remote control code of universal remote control with USB interface
CN203151689U (en) Image processing apparatus and image processing system
CN111372109B (en) Intelligent television and information interaction method
CN103501446A (en) Internet television system based on gesture man-machine interaction technology and realization method of Internet television system
US20170251259A1 (en) Methods and systems of recommending media assets to users based on content of other media assets
CN110737840A (en) Voice control method and display device
CN102497521B (en) Equipment and method for selecting video and audio signal input channels by means of previewing
CN103281580A (en) Television set remote control method for separating user interface and system thereof
CN103248919B (en) A kind of IPTV system
CN114155855A (en) Voice recognition method, server and electronic equipment
CN111866568B (en) Display device, server and video collection acquisition method based on voice
CN111625716A (en) Media asset recommendation method, server and display device
CN104038825A (en) Virtual channel management method and network multimedia reproduction system
CN202334803U (en) Digital television set-top box
CN102508543A (en) Man-machine interactive system for digital terminal
CN104717536A (en) Voice control method and system
CN109564758A (en) Electronic equipment and its audio recognition method
CN106454463A (en) Control method and device based on television
CN102843598A (en) Browser interaction method for smart television
CN104254016A (en) Method and device for realizing interaction between set top box and smart mobile terminal
CN112929717B (en) Focus management method and display device
CN112883144A (en) Information interaction method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160914

Termination date: 20171012

CF01 Termination of patent right due to non-payment of annual fee