CN103517147A - Display apparatus, interactive server, and method for providing response information

Info

Publication number: CN103517147A
Application number: CN201310175179.XA
Authority: CN
Inventors: 许惠贤; 孙譓琳; 申俊亨
Original assignee: Samsung Electronics Co Ltd
Current assignee: Samsung Electronics Co Ltd
Priority date: 2012-06-14
Filing date: 2013-05-13
Publication date: 2014-01-15
Also published as: US20130339020A1; JP2014003610A; US9219949B2; KR20130140423A; EP2675153A1

Abstract

A display apparatus, an interactive server, and a method for providing response information are provided. The display apparatus includes: a voice collector which collects a user's uttered voice; a communication unit which communicates with an interactive server; and a controller which, if response information corresponding to the uttered voice transmitted to the interactive server is received from the interactive server, controls to perform an operation corresponding to the user's uttered voice based on the response information, wherein the response information is generated in a different form according to a function of the display apparatus which is classified based on an utterance element extracted from the uttered voice. Accordingly, the display apparatus can execute the function corresponding to each uttered voice and can output the response message corresponding to each uttered voice, even if a variety of uttered voices are input from the user.

Description

Display apparatus, interactive server, and method for providing response information

Cross-reference to related application

This application claims priority from Korean Patent Application No. 10-2012-0063811, filed on June 14, 2012, in the Korean Intellectual Property Office, the entire disclosure of which is incorporated herein by reference.

Technical field

Methods and apparatuses consistent with example embodiments relate to a display apparatus, an interactive server, and a method for providing response information, and more particularly, to a display apparatus and an interactive server that provide response information corresponding to a user's uttered voice, and a method for providing the response information.

Background

A display apparatus capable of recognizing voice collects a user's uttered voice and transmits the collected voice to an external server over a network. The display apparatus then receives, from the external server, information on the uttered voice that has been converted into a form the display apparatus can recognize, analyzes that information, and determines the meaning of the user's uttered voice. Based on the result, the display apparatus performs the function corresponding to the user's uttered voice and, if necessary, outputs a guide message regarding the uttered voice.

However, such a related-art display apparatus is limited in the functions it can perform in response to the user's uttered voice: it simply either performs or does not perform the function corresponding to the uttered voice.

Specifically, when the display apparatus enters a voice recognition mode at the user's request, it displays on a screen the commands by which the user's uttered voice can control the operation of the display apparatus. The user then refers to the operation control commands displayed on the screen and utters the voice corresponding to the function he or she wants performed.

When the user's uttered voice is input, the display apparatus receives information on the uttered voice from the external server, analyzes that information, and either performs the function requested by the user or displays on the screen a text message asking the user to utter the voice again.

That is, the related-art display apparatus only performs an operation corresponding to the user's uttered voice based on preset commands, or asks the user to utter the voice again; it cannot provide different response information in response to a variety of user utterances. There is therefore a need for an interactive system that provides different response information in response to a variety of user utterances.

Summary of the invention

One or more example embodiments may overcome the above disadvantages and other disadvantages not described above. However, it should be understood that one or more example embodiments are not required to overcome the disadvantages described above, and may not overcome any of the problems described above.

One or more example embodiments provide a display apparatus that provides different response information in response to a variety of user utterances.

According to an aspect of an example embodiment, there is provided a display apparatus including: a voice collector which collects a user's uttered voice; a communication unit which communicates with an interactive server; and a controller which, if response information corresponding to the uttered voice transmitted to the interactive server is received from the interactive server, controls to perform an operation corresponding to the user's uttered voice based on the response information, wherein the response information is generated in a different form according to a function which is classified based on an utterance element extracted from the uttered voice.

The function may include at least one of an electronic program guide (EPG)-related function and a function of controlling an operation of the display apparatus.

The display apparatus may further include an output unit. If the uttered voice includes an EPG-related utterance element or an utterance element related to controlling the operation of the display apparatus, the controller may, based on the response information received from the interactive server, perform at least one of outputting a response message and executing a function.

If the uttered voice includes EPG-related utterance elements relating to a plurality of requests, the controller may output a voice re-request message based on the response information received from the interactive server.

The display apparatus may further include an output unit. If the uttered voice includes a limited utterance element, the controller may, based on the response information received from the interactive server, control to output a disallowance message regarding the operation corresponding to the uttered voice.

The display apparatus may further include: a storage which matches a user's face image with user information and stores the matched information; and a photographing unit which photographs the user's face. The controller may transmit, to the interactive server, the user information matched with the face image generated by the photographing unit, together with the uttered voice. If a forbidden utterance element is associated with the user's age, the controller may control to output the disallowance message regarding the operation corresponding to the uttered voice according to the response information generated based on the user information.

The interactive server may include: a first server which converts the collected uttered voice into text information; and a second server which generates response information corresponding to the uttered voice converted into the text information. The controller may convert the collected uttered voice into a digital signal and transmit the converted voice to the first server and, if text information on the uttered voice is received from the first server, transmit the text information to the second server and receive the response information corresponding to the uttered voice.

According to an aspect of another example embodiment, there is provided an interactive server including: a communication unit which communicates with a display apparatus; an extraction unit which extracts an utterance element from an uttered voice received from the display apparatus; and a controller which generates response information corresponding to the uttered voice in a different form according to a function classified based on the extracted utterance element, and transmits the response information to the display apparatus.

The function may include at least one of an EPG-related function and a function of controlling an operation of the display apparatus.

The interactive server may further include a storage which stores EPG information. If the extracted utterance element is an EPG-related utterance element, the controller may determine, based on the EPG information stored in the storage, whether EPG information corresponding to the uttered voice can be provided. If the EPG information can be provided, the controller may generate response information corresponding to the uttered voice based on the EPG information; if the EPG information cannot be provided, the controller may generate alternative response information regarding the uttered voice based on at least one of the EPG information and an Internet search.

If the uttered voice includes EPG-related utterance elements relating to a plurality of requests, the controller may generate a voice re-request message so that the display apparatus asks the user to utter the voice again.

If the extracted utterance element is an utterance element related to controlling the operation of the display apparatus, the controller may determine, based on the utterance element, whether the operation of the display apparatus corresponding to the uttered voice can be controlled. If the operation can be controlled, the controller may generate response information for controlling the operation of the display apparatus; if it cannot, the controller may generate response information regarding at least one of a method for controlling the operation of the display apparatus and a current state notification.

The interactive server may further include a storage which stores a table of forbidden utterance elements. If the extracted utterance element includes a forbidden utterance element, the controller may generate a disallowance message regarding the operation corresponding to the uttered voice.

The communication unit may further receive user information from the display apparatus. If the extracted utterance element is associated with the user's age, the controller may determine, based on the user information, whether to generate the disallowance message regarding the operation corresponding to the uttered voice.

According to an aspect of still another example embodiment, there is provided a method for providing response information corresponding to a user's uttered voice in an interactive server which interlocks with a display apparatus, the method including: receiving the user's uttered voice from the display apparatus; extracting an utterance element from the uttered voice; generating response information corresponding to the uttered voice in a different form according to a function classified based on the extracted utterance element; and transmitting the response information to the display apparatus.

The function may include at least one of an EPG-related function and a function of controlling an operation of the display apparatus. The generating may include: determining whether the extracted utterance element is an EPG-related utterance element; if so, determining, based on pre-stored EPG information, whether EPG information corresponding to the uttered voice can be provided; if the EPG information can be provided, generating response information corresponding to the uttered voice based on the EPG information; and if the EPG information cannot be provided, generating alternative response information regarding the uttered voice based on at least one of the EPG information and an Internet search.

The generating may further include: if the extracted utterance element is determined to be an EPG-related utterance element, checking whether the uttered voice includes EPG-related utterance elements relating to a plurality of requests; and, if so, generating a voice re-request message so that the display apparatus asks the user to utter the voice again.

The generating may further include: if the extracted utterance element is determined to be an utterance element related to controlling the operation of the display apparatus, checking, based on the utterance element, whether the operation of the display apparatus corresponding to the uttered voice can be controlled; if the operation can be controlled, generating response information for controlling the operation of the display apparatus; and if the operation cannot be controlled, generating response information regarding at least one of a method for controlling the operation of the display apparatus and a current state notification.

The method may further include: checking, with reference to a pre-stored table of forbidden utterance elements, whether the extracted utterance element includes a forbidden utterance element; and, if so, generating a disallowance message regarding the operation corresponding to the uttered voice.

The receiving may further include receiving user information from the display apparatus, and the method may further include: if the extracted utterance element does not include a forbidden utterance element, checking whether the extracted utterance element is associated with the user's age; and, if so, generating, based on the user information, a disallowance message regarding the operation corresponding to the uttered voice.

According to the above-described example embodiments, the display apparatus can perform the function corresponding to each uttered voice and can output the response message corresponding to each uttered voice, even when a variety of uttered voices are input from the user.

Brief description of the drawings

The above and/or other aspects will become more apparent by describing example embodiments in detail with reference to the accompanying drawings, in which:

Fig. 1 is a view illustrating a first example of an interactive system which provides response information for a user's uttered voice according to an example embodiment;

Fig. 2 is a view illustrating a second example of an interactive system which provides response information for a user's uttered voice according to another example embodiment;

Fig. 3 is a first flowchart illustrating a method for providing response information for a user's uttered voice in an interactive system according to an example embodiment;

Fig. 4 is a second flowchart illustrating a method for providing response information for a user's uttered voice in an interactive system according to another example embodiment;

Fig. 5 is a block diagram illustrating a display apparatus according to an example embodiment;

Fig. 6 is a detailed block diagram illustrating a display apparatus according to an example embodiment;

Fig. 7 is a block diagram illustrating an interactive server according to an example embodiment;

Fig. 8 is a flowchart illustrating a method for performing an operation in a display apparatus based on response information appropriate to a user's uttered voice according to an example embodiment;

Fig. 9 is a flowchart illustrating a method for providing response information appropriate to a user's uttered voice in an interactive server according to an example embodiment; and

Fig. 10 is a flowchart illustrating a method by which an interactive server generates response information corresponding to a user's uttered voice when the uttered voice includes an EPG-related utterance element, according to an example embodiment.

Detailed description

Hereinafter, example embodiments will be described in greater detail with reference to the accompanying drawings.

In the following description, the same reference numerals are used for the same elements even when they are depicted in different drawings. The matters defined in the description, such as detailed construction and elements, are provided to assist in a comprehensive understanding of the example embodiments. Thus, it is apparent that the example embodiments can be carried out without those specifically defined matters. Also, functions or elements known in the related art are not described in detail, since they would obscure the example embodiments with unnecessary detail.

Fig. 1 is a view illustrating a first example of an interactive system which provides response information for a user's uttered voice according to an example embodiment.

As shown in Fig. 1, an interactive system according to an example embodiment includes a display apparatus 100 and an interactive server 200. The display apparatus 100 may be implemented as any of various types of electronic apparatus capable of accessing the Internet, such as a smart television (TV), a mobile phone including a smartphone, a desktop personal computer (PC), a laptop PC, or a navigation device.

The display apparatus 100 collects a user's uttered voice and performs an operation corresponding to the uttered voice. For example, if a user's uttered voice requesting a channel change is input, the display apparatus 100 tunes to the corresponding channel and displays it. In this case, the display apparatus 100 may also provide a response message corresponding to the function. In the above example, the display apparatus 100 may output information on the changed channel as voice or as a text-format image. Likewise, if a user's uttered voice inquiring about the broadcast time of a specific program is input, the display apparatus 100 may output the broadcast time of that program as voice or as a text-format image.

To achieve this, the display apparatus 100 transmits the collected uttered voice to the interactive server 200. The interactive server 200 that has received the uttered voice analyzes the meaning of the user's uttered voice received from the display apparatus 100, generates response information for controlling the operation of the display apparatus 100, and transmits the response information to the display apparatus 100. That is, when the user's uttered voice is received from the display apparatus 100, the interactive server 200 extracts utterance elements from the uttered voice, generates response information regarding the user's uttered voice based on the extracted utterance elements, and transmits the response information. An utterance element is a keyword in the user's uttered voice that is used to perform the operation requested by the user. For example, if the user's uttered voice is "When is ooo broadcast this Saturday?", the utterance elements may be "this Saturday", "ooo (program title)", "when", and "broadcast".
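
The keyword extraction described above could be sketched roughly as follows. This is only an illustrative Python sketch: the `KEYWORDS` vocabulary, the `extract_utterance_elements` function, and the simple substring matching are assumptions made for the example, since the text does not specify how the interactive server 200 identifies utterance elements.

```python
from typing import List

# Hypothetical keyword vocabulary, grouped by role (an assumption for this sketch).
KEYWORDS = {
    "date": ["this saturday", "today", "tomorrow"],
    "title": ["ooo"],
    "question": ["when"],
    "action": ["broadcast", "record"],
}


def extract_utterance_elements(text: str) -> List[str]:
    """Return the known keywords found in the text, ordered by position."""
    lowered = text.lower()
    hits = []
    for phrases in KEYWORDS.values():
        for phrase in phrases:
            pos = lowered.find(phrase)
            if pos != -1:
                hits.append((pos, phrase))
    # Sort by position so the elements follow the order of the utterance.
    return [phrase for _, phrase in sorted(hits)]


elements = extract_utterance_elements("When is ooo broadcast this Saturday?")
```

A production system would more plausibly use a trained language-understanding model than a fixed vocabulary; the fixed table is used here only to keep the example short.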

If the utterance elements extracted from the uttered voice as described above include a limited or forbidden utterance element (for example, a keyword related to prohibited drugs or profanity), the interactive server 200 generates response information indicating that the operation corresponding to the uttered voice is not allowed, and transmits it to the display apparatus 100. The interactive server 200 may also receive the user's user information from the display apparatus 100 together with the user's uttered voice. Accordingly, if the utterance elements extracted by analyzing the user's uttered voice include an utterance element associated with the user's age (for example, a keyword related to obscenity or violence), the interactive server 200 may, based on the user information, generate response information indicating that the operation corresponding to the uttered voice is not allowed, and transmit it to the display apparatus 100. If the extracted utterance elements do not include any such forbidden utterance element, the interactive server 200 generates, based on the utterance elements extracted from the user's uttered voice, response information for performing the operation corresponding to the user's uttered voice, and transmits the response information to the display apparatus 100.
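
As a rough illustration of the two checks described above (always-forbidden keywords first, then age-dependent keywords), consider the following Python sketch. The keyword sets, the `ADULT_AGE` threshold, the message wording, and the choice to treat an unknown age as restricted are all assumptions made for the example and are not taken from the text.

```python
from typing import List, Optional

FORBIDDEN = {"drugs"}          # hypothetical always-forbidden keywords
AGE_RESTRICTED = {"violence"}  # hypothetical age-dependent keywords
ADULT_AGE = 19                 # assumed age threshold


def check_elements(elements: List[str], user_age: Optional[int] = None) -> Optional[str]:
    """Return a disallowance message, or None if the request may proceed."""
    # Check 1: keywords that are never allowed, regardless of the user.
    if any(e in FORBIDDEN for e in elements):
        return "The requested operation is not allowed."
    # Check 2: keywords that are disallowed only for under-age users.
    if any(e in AGE_RESTRICTED for e in elements):
        # Assumption: an unknown age is treated as restricted.
        if user_age is None or user_age < ADULT_AGE:
            return "The requested operation is not allowed for this user."
    return None
```

For instance, `check_elements(["violence"], user_age=15)` returns a disallowance message, while the same elements with `user_age=30` return `None` and normal response generation can continue.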

For example, the display apparatus 100 may collect from the user an uttered voice related to an electronic program guide (EPG) (for example, "When is program ooo broadcast this Saturday?"). When such an uttered voice is collected, the display apparatus 100 transmits it to the interactive server 200. The uttered voice may be an analog signal. Accordingly, the display apparatus 100 may convert the collected uttered voice into a digital signal and then transmit the converted digital signal to the interactive server 200. On receiving the uttered voice converted into the digital signal, the interactive server 200 generates text information from it, analyzes the text information, and generates response information corresponding to the user's uttered voice. However, this should not be considered limiting. The display apparatus 100 may transmit the collected uttered voice to the interactive server 200 without separate signal processing. In this case, the interactive server 200 may convert the user's uttered voice received from the display apparatus 100 into a digital signal and then generate text information on the user's uttered voice based on the converted digital signal. Methods of generating text information from a user's uttered voice are well known in the related art, and a detailed description is therefore omitted.

As described above, when text information on the uttered voice "When is program ooo broadcast this Saturday?" is generated, the interactive server 200 analyzes the uttered voice and extracts utterance elements. The extracted utterance elements may be "this Saturday", "ooo (program title)", "when", and "broadcast". Once the utterance elements are extracted, the interactive server 200 generates response information regarding the broadcast time of the program based on the extracted utterance elements, and transmits the response information to the display apparatus 100. Accordingly, based on the received response information, the display apparatus 100 may output a response message such as "The program is broadcast at 7 o'clock" as voice or as a text-format image.

As another example, the display apparatus 100 may collect from the user the uttered voice "Please schedule a recording of program ooo, which is broadcast this Saturday". When such an uttered voice is collected, the display apparatus 100 transmits it to the interactive server 200. In this case, the interactive server 200 extracts utterance elements from the uttered voice, generates, based on the extracted utterance elements, response information that includes a control command for scheduling a recording of program ooo at its broadcast time and a response message stating "The recording of the program has been scheduled", and transmits the response information to the display apparatus 100. Accordingly, based on the response information, the display apparatus 100 schedules the recording of the program and outputs the response message "Recording of program ooo has been scheduled" as voice or as a text-format image.
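
The recording example above suggests that response information bundles a control command with a response message. A minimal Python sketch of such a structure is given below; the `ResponseInfo` class, its field names, and the `schedule_recording` command name are hypothetical and are chosen only to illustrate the idea.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ResponseInfo:
    """Hypothetical container for what the server sends back to the device."""
    control_command: Optional[dict] = None   # what the device should do
    response_message: Optional[str] = None   # what the device should tell the user


def build_record_response(title: str, when: str) -> ResponseInfo:
    """Build response information for a 'schedule a recording' request."""
    return ResponseInfo(
        control_command={"action": "schedule_recording", "title": title, "when": when},
        response_message=f"Recording of program {title} has been scheduled.",
    )


info = build_record_response("ooo", "this Saturday")
```

Both fields are optional in this sketch because a response might carry only a message (for example, a broadcast time) or only a command.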

The interactive server 200 described above may include: a first server 10 which generates text information on the user's uttered voice converted into a digital signal; and a second server 20 which generates response information corresponding to the uttered voice based on the text information. Hereinafter, an interactive system which provides response information appropriate to the user's uttered voice through the display apparatus 100, the first server 10, and the second server 20 is described in detail.

Fig. 2 is a view illustrating a second example of an interactive system which provides response information for a user's uttered voice according to another example embodiment.

As shown in Fig. 2, when the display apparatus 100 collects a user's uttered voice, it converts the collected uttered voice into a digital signal and transmits it to the first server 10. On receiving the uttered voice converted into the digital signal, the first server 10 generates text information on the user's uttered voice according to specific pre-stored patterns for various uttered voices, and transmits the text information to the display apparatus 100.

The display apparatus 100, having received the text information on the user's uttered voice from the first server 10, transmits that text information to the second server 20. The second server 20 analyzes the received text information, extracts utterance elements, generates, based on the extracted utterance elements, response information for performing the operation corresponding to the user's uttered voice, and transmits the response information to the display apparatus 100.
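
The two-server exchange of Fig. 2 can be pictured with the following Python sketch, in which both servers are replaced by local stand-in functions. The function names, the canned speech-to-text table, and the response contents are assumptions for illustration; a real first server 10 would perform actual speech recognition, and the network transport is omitted entirely.

```python
def first_server_stt(digital_voice: bytes) -> str:
    """Stand-in for the first server 10: maps digitized voice to text."""
    # Hypothetical canned lookup instead of real speech recognition.
    canned = {b"\x01": "Please record ooo this Saturday"}
    return canned.get(digital_voice, "")


def second_server_respond(text: str) -> dict:
    """Stand-in for the second server 20: maps text to response information."""
    if "record" in text.lower():
        return {
            "control_command": "schedule_recording",
            "response_message": "Recording has been scheduled.",
        }
    # Nothing usable was recognized, so ask the user to speak again.
    return {"control_command": None, "response_message": "Please say that again."}


def display_device_round_trip(digital_voice: bytes) -> dict:
    """The device relays between the servers, as in Fig. 2."""
    text = first_server_stt(digital_voice)  # device -> first server -> device
    return second_server_respond(text)      # device -> second server -> device
```

The point of the sketch is the routing: the text information returns to the display apparatus before being forwarded, rather than passing directly from the first server to the second.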

Up to now, the operation of providing response information corresponding to a user's uttered voice in the interactive system including the display apparatus 100 and the interactive server 200 has been outlined. Hereinafter, a method for providing response information corresponding to a user's uttered voice in that interactive system is outlined.

Fig. 3 is a first flowchart illustrating a method for providing response information for a user's uttered voice in an interactive system according to an example embodiment.

As described above with reference to Fig. 1, the interactive system may include the display apparatus 100 and the interactive server 200. In this case, as shown in Fig. 3, the display apparatus 100 collects a user's uttered voice (operation S310). The collected uttered voice is an analog signal. Accordingly, the display apparatus 100 converts the collected uttered voice into a digital signal (operation S320). Thereafter, the display apparatus 100 transmits the user's uttered voice converted into the digital signal to the interactive server 200 (operation S230). Specifically, when a mode for collecting the user's voice is initiated, the display apparatus 100 collects the voice uttered by a user located within a predetermined distance of the display apparatus, converts the collected voice into a digital signal, and transmits the converted voice to the interactive server 200.

To achieve this, the display apparatus 100 may include a microphone to receive the user's uttered voice. In this case, the microphone may be embedded in the display apparatus 100 or may be mounted on a remote controller that controls the display apparatus 100. However, this should not be considered limiting; the microphone may be a handheld type held by the user separately from the remote controller, or a type that can be placed on a table.

On receiving the uttered voice from the display apparatus 100, the interactive server 200 generates text information on the uttered voice, analyzes the text information, and extracts utterance elements from the uttered voice (operations S330 and S340). An utterance element is a keyword in the user's uttered voice that is used to perform the operation requested by the user. For example, if the user's uttered voice is "When is program ooo broadcast this Saturday?", the utterance elements may be "this Saturday", "ooo (program title)", "when", and "broadcast".

Once the utterance elements are extracted, the interactive server 200 generates, based on the extracted utterance elements, response information for performing the operation corresponding to the user's uttered voice, and transmits the response information to the display apparatus 100 (operations S350 and S360). Accordingly, the display apparatus 100 receives the response information from the interactive server 200 and performs the operation corresponding to the user's uttered voice based on the response information (operations S360 and S370). The response information may include at least one of a control command for controlling a function of the display apparatus 100 and information for outputting a response message to the uttered voice collected by the display apparatus 100 (hereinafter referred to as a response message).
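
On the display apparatus side, handling response information that may contain a control command, a response message, or both could look roughly like the Python sketch below. The dictionary keys, the `schedule_recording` action, and the `device_state` structure are hypothetical names introduced for the example only.

```python
from typing import List


def handle_response_info(info: dict, device_state: dict) -> List[str]:
    """Apply the control command (if any) and return the messages to output."""
    outputs = []
    command = info.get("control_command")
    # Execute the command part, if the response information carries one.
    if command is not None and command.get("action") == "schedule_recording":
        device_state.setdefault("scheduled", []).append(command["title"])
    # Output the message part, if the response information carries one.
    message = info.get("response_message")
    if message is not None:
        outputs.append(message)
    return outputs


state = {}
shown = handle_response_info(
    {
        "control_command": {"action": "schedule_recording", "title": "ooo"},
        "response_message": "Recording of program ooo has been scheduled.",
    },
    state,
)
```

In this sketch the returned list stands in for output as voice or as a text-format image; how the message is actually rendered is outside its scope.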

For example, if the user's uttered voice "Please record program ooo (program title), which is broadcast this Saturday" is received, the interactive server 200 generates, based on the utterance elements of the uttered voice, response information that includes a control command for scheduling a recording of program ooo at its broadcast time and the response message "Recording of ooo has been scheduled", and transmits the response information to the display apparatus 100.

Accordingly, based on the response information, the display apparatus 100 schedules the recording of the program and outputs the response message "Recording of program ooo has been scheduled" as voice or as a text-format image.

As described above with reference to Fig. 2, the interactive server 200 may include the first server 10 and the second server 20. Hereinafter, a method for providing response information corresponding to a user's uttered voice in an interactive system that includes the display apparatus 100 and the interactive server 200 comprising the first server 10 and the second server 20 is outlined.

Fig. 4 is a second flowchart illustrating a method for providing response information for a user's uttered voice in an interactive system according to another example embodiment.

As described above with reference to Fig. 2, the interactive system may provide response information appropriate to the user's uttered voice through the display apparatus 100 and the interactive server 200 comprising the first server 10 and the second server 20.

As shown in Figure 4, display device 100 is collected the voice that user sends, and the speech conversion of then user being sent becomes digital signal (operation S410).The voice of collecting are analog signals.Correspondingly, if collect the voice that user sends, display device 100 becomes digital signal by the speech conversion of sending of analog signal.Particularly, if initiate to collect the pattern of user speech, display device 100 is collected and is positioned at the voice that the user of preset distance sends, and the speech conversion of collection is become to digital signal, and the voice that send through conversion to first server 10.Above-mentioned in order to realize, display device 100 can comprise the voice that microphone sends to receive user.In this case, microphone can be embedded in display device 100, or can be arranged on the remote controller of controlling display device 100.Yet this should not be considered as restriction, and microphone can have with remote controller and separate the form holding by user, or can have the form that can be placed on desk.

If the speech conversion that user is sent becomes digital signal, display device 100 sends and has been converted into the voice (operation S420) that the user of digital signal sends to first server 10.Received the first server 10 that is converted into the voice that the user of digital signal sends according to the various relevant text messages (operating S430) of voice that the relevant specific pre-stored mode producing of voice and user send that send.Thereafter, first server 10 sends the relevant text message (operation S440) of voice sending with user, and display device 100 sends the relevant text message (operation S450) of voice sending with the user who receives from first server 10 to second server 20.Received the relevant second server 20 analysis text messages of text message and the language element (operation S460) of the voice that extraction user sends of voice sending with user.

A language element may be a keyword in the user's uttered voice that is needed to perform the operation the user requests. For example, if the user's uttered voice is "When is program ooo broadcast this Saturday?", the language elements may be "this Saturday", "ooo (program title)", "when", and "broadcast".
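
To illustrate the example above, the following minimal Python sketch spots language elements in the text information by simple substring matching. The vocabulary list and the function name `extract_language_elements` are assumptions made for illustration only; the description does not specify how the second server 20 performs the extraction.

```python
def extract_language_elements(text, vocabulary):
    """Return the vocabulary entries that occur in the text (case-insensitive).

    Naive substring matching: a hypothetical stand-in for the extraction
    performed by the second server, which the description does not detail.
    """
    lowered = text.lower()
    return [term for term in vocabulary if term.lower() in lowered]


# Hypothetical vocabulary: time expressions, program titles, and request words.
VOCABULARY = ["this Saturday", "ooo", "when", "broadcast"]

elements = extract_language_elements(
    "When is program ooo broadcast this Saturday?", VOCABULARY
)
print(elements)
```

A practical extractor would need proper tokenization and a much larger vocabulary; substring matching is used here only to keep the sketch short.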

Once the language elements are extracted, the second server 20 generates, based on the extracted language elements, response information for performing the operation corresponding to the user's uttered voice, and transmits the response information to the display device 100 (operations S470 and S480). The display device 100 receives the response information from the interactive server 200 and performs the operation corresponding to the user's uttered voice based on it (operation S490). The response information may include at least one of a control command for controlling a function of the display device 100 and information for outputting, on the display device 100, a message responding to the collected uttered voice (hereinafter referred to as a response message).

For example, if the uttered voice "Please record program ooo (program title) broadcast this Saturday" is received, the interactive server 200 generates, based on the extracted language elements, response information that includes a control command to schedule recording of program ooo when it is broadcast and a response message stating "Recording of program ooo has been scheduled", and transmits the response information to the display device 100. The display device 100 then schedules the recording based on the response information and outputs the response message "Recording of program ooo has been scheduled" as voice or as a text-format image.
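
The recording example can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the `ResponseInfo` container, the `DisplayDevice` class, and the `schedule_recording` action name are all hypothetical and are not taken from the description, which only states that the response information may carry a control command and a message for the user.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ResponseInfo:
    """Hypothetical container for what the server sends back."""
    control_command: Optional[dict] = None   # e.g. {"action": ..., "program": ...}
    response_message: Optional[str] = None   # text to show or speak to the user


class DisplayDevice:
    """Hypothetical display device that acts on received response information."""

    def __init__(self):
        self.scheduled_recordings = []
        self.outputs = []  # stands in for the display and the audio output

    def handle(self, info: ResponseInfo) -> None:
        cmd = info.control_command
        if cmd and cmd.get("action") == "schedule_recording":
            self.scheduled_recordings.append(cmd["program"])
        if info.response_message:
            self.outputs.append(info.response_message)


device = DisplayDevice()
device.handle(ResponseInfo(
    control_command={"action": "schedule_recording", "program": "ooo"},
    response_message="Recording of program ooo has been scheduled",
))
```

Keeping the control command and the message as separate optional fields mirrors the "at least one of" wording: either part may be absent.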

So far, the method of providing response information suitable for the user's uttered voice in the interactive system has been described. Hereinafter, the elements of the display device 100 and the interactive server 200 are described in detail.

Fig. 5 is a block diagram of a display device according to an example embodiment.

As shown in Fig. 5, the display device 100 includes a communication unit 110, a voice gatherer 120, a controller 130, and an output unit 140.

The communication unit 110 communicates with the interactive server 200, which provides response information suitable for the user's uttered voice. Specifically, the communication unit 110 communicates with the interactive server 200 according to various communication methods and transmits the user's uttered voice to it. To this end, the communication unit 110 may include various communication modules, such as a short-range wireless communication module (not shown) and a wireless communication module (not shown). The short-range wireless communication module communicates wirelessly with an external device located at a short distance and may use, for example, Bluetooth or Zigbee. The wireless communication module connects to an external network and communicates according to a wireless communication protocol (for example, WiFi or an IEEE standard). In addition, the wireless communication module may include a mobile communication module that connects to a mobile communication network according to various mobile communication standards (for example, third generation (3G), Third Generation Partnership Project (3GPP), and Long Term Evolution (LTE)).

The voice gatherer 120 processes the collected voice and generates a user voice signal. That is, the voice gatherer 120 may remove noise (for example, the sound of an air conditioner or vacuum cleaner, or music) from the collected voice to generate the user voice signal. Specifically, when the user's uttered voice is input in analog form, the voice gatherer 120 samples it and converts it into a digital signal. The voice gatherer 120 then determines whether the digitized uttered voice contains noise and, if so, removes the noise from the digital signal. Once the voice gatherer 120 has converted the user's uttered voice into a digital signal, the communication unit 110 transmits the digitized uttered voice to the interactive server 200. As described above, the interactive server 200 may include the first server 10, which generates text information for the user's uttered voice, and the second server 20, which generates response information corresponding to the uttered voice based on the text information. Accordingly, the communication unit 110 transmits the digital signal to the first server 10 and, when the uttered voice converted into text information is received from the first server 10, transmits that text information to the second server 20.

However, this should not be considered limiting. The interactive server 200 may be a single server that both generates the text information for the user's uttered voice and generates, based on the text information, the response information corresponding to the uttered voice. In this example embodiment, the interactive server 200 is described as including the first server 10 and the second server 20.

When response information corresponding to the user's uttered voice, which has been converted into text information, is received from the second server 20, the controller 130 performs control so that the operation corresponding to the uttered voice is carried out based on the response information. Specifically, when the user's uttered voice is input, the controller 130 converts it into a digital signal through the voice gatherer 120. The controller 130 then transmits the digitized uttered voice to the first server 10 through the communication unit 110 and receives the text information for the uttered voice from the first server 10. Upon receiving the text information, the controller 130 transmits it to the second server 20 through the communication unit 110 and receives the response information corresponding to the user's uttered voice.

The output unit 140 outputs at least one of voice and an image. Specifically, when response information corresponding to the user's uttered voice is received from the second server 20, the output unit 140 may, under the control of the controller 130, output the response message for the uttered voice as voice or as a text-format image based on the received response information. To this end, the output unit 140 may include a display 141 and an audio output unit 143.

Specifically, the display 141 may be implemented as a liquid crystal display (LCD), an organic light-emitting diode (OLED) display, or a plasma display panel (PDP), and may present the various screens provided by the display device 100. In particular, the display 141 may display the response message corresponding to the user's uttered voice in the form of text or an image. The display 141 may also be implemented as a touch screen that forms a layered structure with a touch pad, and the touch screen may be configured to detect the position, area, and pressure of a touch input. However, the configuration of the display is not limited to this.

The audio output unit 143 may be implemented as an output port such as a speaker or a jack, and may output the response message for the user's uttered voice in voice form.

As described above, the response information received from the second server 20 is generated in a different form depending on the function, which is classified based on the language elements extracted from the user's uttered voice. The function classified based on the extracted language elements may include at least one of an electronic program guide (EPG)-related function and a function related to controlling the operation of the display device 100. For example, if the language elements extracted from the user's uttered voice relate to a broadcast program, the function is an EPG-related function; if the language elements relate to turning the power of the display device 100 on or off or changing its volume, the function is a display device operation control function.
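
The classification described above can be sketched as a simple keyword lookup. The term sets, the category names, and the rule that control terms are checked first are all assumptions made for this illustration; the description states only that the function is classified from the extracted language elements.

```python
# Hypothetical keyword sets; a real system would use a far richer vocabulary.
EPG_TERMS = {"program", "broadcast", "record", "today", "this saturday"}
CONTROL_TERMS = {"channel", "change", "volume", "power"}


def classify_function(language_elements):
    """Classify an utterance by the language elements extracted from it."""
    elements = {e.lower() for e in language_elements}
    if elements & CONTROL_TERMS:   # checked first: an assumption of this sketch
        return "device_control"
    if elements & EPG_TERMS:
        return "epg"
    return "unrelated"


print(classify_function(["volume", "up"]))
print(classify_function(["today", "program ooo", "record"]))
print(classify_function(["nearby", "restaurant"]))
```

The third category covers utterances that match neither function, which the server must still answer in some way.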

Accordingly, when the response information is received from the second server 20, the controller 130 performs control so that the operation corresponding to the user's uttered voice is carried out based on the response information.

For example, if the uttered voice "Please change the channel to MBC" is input from the user, the controller 130 converts it into a digital signal through the voice gatherer 120 and transmits the converted uttered voice to the first server 10. Then, when the text information for the uttered voice "Please change the channel to MBC" is received from the first server 10, the controller 130 transmits that text information to the second server 20.

The second server 20 extracts the language elements "MBC", "channel", and "change" from the text information for the uttered voice "Please change the channel to MBC" and determines, based on the extracted language elements, that the uttered voice concerns an operation control function of the display device 100. The second server 20 then transmits to the display device 100 response information that includes a control command to change the channel and a response message stating "The channel has been changed to MBC".

The controller 130 changes the current channel to MBC according to the control command included in the response information. Based on the response message included in the response information, the controller 130 also controls the output unit 140 to output the message "The channel has been changed to MBC" as at least one of an image and voice. The response message may thus be output as voice through the audio output unit 143 or as a text-format image through the display 141.

As another example, if the uttered voice "Please record program ooo (program title) broadcast today" is input from the user, the controller 130 converts it into a digital signal through the voice gatherer 120 and transmits the converted uttered voice to the first server 10. Then, when the text information for the uttered voice "Please record program ooo (program title) broadcast today" is received from the first server 10, the controller 130 transmits that text information to the second server 20.

The second server 20 extracts the language elements "today", "program ooo (program title)", and "record" from the text information for the uttered voice "Please record program ooo (program title) broadcast today" and determines, based on the extracted language elements, that the uttered voice concerns an EPG-related function. The second server 20 then transmits to the display device 100 response information that includes a control command to schedule recording of program ooo (program title) and a response message stating "Recording of program ooo has been scheduled".

The controller 130 schedules recording of program ooo according to the control command included in the response information. Based on the response message included in the response information, the controller 130 also controls the output unit 140 to output the message "Recording of program ooo has been scheduled" as at least one of an image and voice. The response message may thus be output as voice through the audio output unit 143 or as a text-format image through the display 141.

As described above, the controller 130, which performs the operation corresponding to the user's uttered voice based on the response information received from the second server 20, may also receive from the second server 20 a voice re-request message relating to the user's uttered voice and output it through the output unit 140.

According to an example embodiment, if the language elements extracted from the user's uttered voice do not satisfy a predetermined condition, the controller 130 receives a voice re-request message relating to the user's uttered voice from the second server 20 and outputs it through the output unit 140.

For example, the second server 20 may receive from the display device 100 the text information for the uttered voice "Please schedule viewing of the 9 o'clock news". In this case, based on the language elements extracted from that text information, the second server 20 may transmit to the display device 100 response information that includes the voice re-request message "Is it the 9 o'clock news of KBS or MBC?". That is, the second server 20 determines whether the extracted language elements satisfy the predetermined condition and, if they do not, generates response information that includes a voice re-request message asking for what is needed to satisfy the condition, and transmits the response information to the display device 100.

Based on the voice re-request message included in the response information, the controller 130 controls the display 141 and the audio output unit 143 to output the message "Is it the 9 o'clock news of KBS or MBC?" as at least one of an image and voice. The controller 130 may then receive from the user an additional uttered voice responding to the voice re-request message output through the display 141 and the audio output unit 143. For example, if the additional uttered voice "MBC 9 o'clock news" is input from the user, the controller 130 transmits the text information for that uttered voice to the second server 20. If the text information for the additional uttered voice satisfies the predetermined condition, the second server 20 transmits to the display device 100 response information that includes a control command to schedule a channel change to the "MBC 9 o'clock news" and a response message stating "Viewing of the MBC 9 o'clock news has been scheduled".

The controller 130 schedules the channel change to the MBC 9 o'clock news according to the control command included in the response information. Under the control of the controller 130, the output unit 140 outputs the response message "Viewing of the MBC 9 o'clock news has been scheduled" as voice through the audio output unit 143 or as a text-format image through the display 141.
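
The re-request exchange for the 9 o'clock news can be sketched as below. The assumption in this sketch is that the "predetermined condition" means the request identifies exactly one program; the EPG entries, the function name, the action name, and the "not found" wording are likewise hypothetical.

```python
# Hypothetical EPG entries; real EPG data would come from the server's storage.
EPG = [
    {"title": "9 o'clock news", "channel": "KBS"},
    {"title": "9 o'clock news", "channel": "MBC"},
]


def respond_to_viewing_request(title, channel=None):
    """Return a re-request if the program is ambiguous, else a control command."""
    matches = [
        entry for entry in EPG
        if entry["title"] == title and channel in (None, entry["channel"])
    ]
    if not matches:
        # Wording invented for this sketch.
        return {"re_request": f"No program named {title} was found. Please try again."}
    if len(matches) > 1:
        channels = " or ".join(entry["channel"] for entry in matches)
        return {"re_request": f"Is it the {title} of {channels}?"}
    entry = matches[0]
    return {
        "control_command": {"action": "schedule_viewing", **entry},
        "response_message": f"Viewing of the {entry['channel']} {title} has been scheduled",
    }


first = respond_to_viewing_request("9 o'clock news")
second = respond_to_viewing_request("9 o'clock news", channel="MBC")
```

The first call yields a re-request because two channels match; the second, carrying the user's additional utterance, resolves to a single program.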

According to another example embodiment, if language elements relating to a plurality of requests are included in the user's uttered voice, the controller 130 receives a voice re-request message relating to the user's uttered voice from the second server 20 and outputs it through the output unit 140.

For example, the second server 20 may receive from the display device 100 the text information for the uttered voice "Please schedule viewing of program ooo broadcast this week, and please record program ooo". In this case, the uttered voice includes language elements relating to a plurality of requests: "program ooo (program title)" and "schedule viewing", and "program ooo (program title)" and "record".

Accordingly, the second server 20 determines that language elements relating to a plurality of requests are included in the text information for the uttered voice and transmits response information that includes a voice re-request message to the display device 100. The controller 130 outputs the voice re-request message as at least one of an image and voice through the display 141 and the audio output unit 143 of the output unit 140. Once the voice re-request message has been output, the user can make the request again by uttering only one of "Please schedule viewing of program ooo (program title) broadcast this week" and "Please record program ooo (program title) broadcast this week".
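
A minimal sketch of detecting a plurality of requests follows. The set of request keywords, the function name, and the wording of the re-request message are assumptions for illustration; the description does not give the text of the re-request message in this case.

```python
# Hypothetical request keywords recognised among the language elements.
REQUEST_TERMS = {"schedule viewing", "record"}


def check_single_request(language_elements):
    """Ask the user to repeat if more than one request keyword is present."""
    requests = [e for e in language_elements if e in REQUEST_TERMS]
    if len(requests) > 1:
        options = " or ".join(requests)
        # Message wording invented for this sketch.
        return {"re_request": f"Please make one request at a time: {options}?"}
    return {"request": requests[0] if requests else None}


multi = check_single_request(
    ["program ooo", "schedule viewing", "program ooo", "record"]
)
single = check_single_request(["program ooo", "record"])
```

Here `multi` carries a re-request because two request keywords were found, while `single` passes through with the one request it contains.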

The controller 130 may also receive an uttered voice that is unrelated to both the EPG-related function and the operation control function of the display device 100.

For example, if the uttered voice "Please tell me about nearby restaurants" is input from the user, the controller 130 transmits to the second server 20 the text information for that uttered voice received from the first server 10. Upon receiving the text information, the second server 20 extracts the language elements "nearby" and "restaurant" from it and determines that the extracted language elements are unrelated to both the EPG-related function and the operation control function of the display device 100. The second server 20 therefore transmits to the display device 100 alternative response information stating "Alternative information can be obtained through the Internet. Would you like to receive it?". Such alternative response information may include the language elements extracted from the uttered voice.

When the alternative response information is received, the controller 130 outputs the response message "Alternative information can be obtained through the Internet. Would you like to receive it?" as at least one of an image and voice according to the alternative response information received from the second server 20. That is, the controller 130 controls the display 141 and the audio output unit 143 to output the response message included in the alternative response information as at least one of an image and voice.

Then, if an uttered voice indicating that the user wishes to receive the alternative information through the web is input from the user, the controller 130 performs an Internet search based on the language elements included in the alternative response information and obtains alternative information about restaurants located close to the position of the display device 100.

However, this should not be considered limiting. The second server 20 may interlock with an Internet server (not shown). In that case, as described above, if the extracted language elements are unrelated to both the EPG-related function and the operation control function of the display device 100, the second server 20 transmits to the display device 100 the alternative response information stating "Alternative information can be obtained through the Internet. Would you like to receive it?". Then, if an uttered voice indicating that the user wishes to receive the alternative information through the Internet is received from the display device 100, the second server 20 obtains, through the Internet server (not shown) and based on the extracted language elements, the alternative information relating to the user's uttered voice and transmits it to the display device 100.

When such alternative information is obtained by the controller 130 or received from the second server 20, the controller 130 controls the display 141 and the audio output unit 143 to output the alternative information as at least one of an image and voice. The user can thus identify restaurants near his or her location from the alternative information output through the display 141 and the audio output unit 143.
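
The alternative-information exchange can be sketched as two steps: build the alternative response, then search only if the user accepts. The function names and the `fake_search` stub are hypothetical; a real system would query an actual Internet search service, which the description does not name.

```python
def build_alternative_response(language_elements):
    """Alternative response for utterances unrelated to EPG or device control."""
    return {
        "response_message": (
            "Alternative information can be obtained through the Internet. "
            "Would you like to receive it?"
        ),
        # The extracted elements travel with the response so they can be
        # reused as the search query if the user accepts.
        "language_elements": list(language_elements),
    }


def fetch_alternative_info(alternative_response, user_accepts, search):
    """Run the search only when the user has asked for the information."""
    if not user_accepts:
        return None
    query = " ".join(alternative_response["language_elements"])
    return search(query)


def fake_search(query):
    """Stub standing in for an Internet search; returns placeholder results."""
    return [f"result for '{query}'"]


response = build_alternative_response(["nearby", "restaurant"])
results = fetch_alternative_info(response, user_accepts=True, search=fake_search)
```

Passing `search` as a parameter lets the same flow run on the display device or on the server interlocked with an Internet server, matching the two variants described above.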

If the uttered voice includes a prohibited language element, the controller 130 may output, based on the response information received from the second server 20, a disallowance message regarding the operation corresponding to the uttered voice.

For example, if an uttered voice containing profanity or a reference to prohibited drugs is input from the user, the controller 130 receives the text information for the uttered voice from the first server 10 and transmits it to the second server 20. Upon receiving the text information, the second server 20 extracts language elements from it and checks whether the extracted language elements are pre-stored prohibited language elements. If the check shows that an extracted language element is a prohibited language element, the second server 20 transmits to the display device 100 response information that includes a disallowance message regarding the operation corresponding to the uttered voice.

Accordingly, the controller 130 controls the output unit 140, according to the response information, to output the disallowance message "The request is rejected" as at least one of an image and voice. The disallowance message may be output as voice through the audio output unit 143 or as a text-format image through the display 141.

However, this should not be considered limiting. If the uttered voice contains profanity or a reference to prohibited drugs, the controller 130 may itself determine whether to perform the operation corresponding to the uttered voice by referring to a language element table pre-stored in the storage device 150. This pre-stored table is one that the user has set in advance in order to restrict operations corresponding to the user's uttered voice. For example, if the user's uttered voice includes the language element "drugs" and this language element is recorded in the language element table in the storage device 150, the controller 130 may output the disallowance message "The request is rejected" through at least one of the display 141 and the audio output unit 143.

If the language element "drugs" is not recorded in the language element table, the controller 130 transmits the text information for the user's uttered voice to the second server 20. The controller 130 then receives from the second server 20 response information that includes a disallowance message regarding the operation corresponding to the uttered voice and, as described above, may output the disallowance message "The request is rejected" as at least one of an image and voice through at least one of the display 141 and the audio output unit 143.
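
The two-stage check described above (a table stored on the display device first, then the server) can be sketched as follows. The table contents, the function names, and the stub server are assumptions for illustration only.

```python
REJECTED = "The request is rejected"


def stub_server_check(language_elements):
    """Hypothetical server-side check against its own prohibited elements."""
    server_prohibited = {"drugs"}  # assumed contents
    if any(e in server_prohibited for e in language_elements):
        return REJECTED
    return None  # nothing prohibited found


def handle_utterance(language_elements, local_table, ask_server):
    """Check the device's own table first; otherwise defer to the server."""
    if any(e in local_table for e in language_elements):
        return ("local", REJECTED)
    return ("server", ask_server(language_elements))


blocked_locally = handle_utterance(["drugs"], {"drugs"}, stub_server_check)
blocked_by_server = handle_utterance(["drugs"], set(), stub_server_check)
allowed = handle_utterance(["news"], set(), stub_server_check)
```

The first element of each result records where the decision was made, which makes the local shortcut visible: a hit in the local table never reaches the server.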

The display device 100 may further include a photographing unit 160 that photographs the user's face. The storage device 150 may store the user's face image and user information matched with each other.

Accordingly, when the photographing unit 160 generates a face image, the controller 130 obtains from the storage device 150 the user information that matches the generated face image, and may transmit the user information together with the text information for the user's uttered voice to the second server 20. According to an example embodiment, if a language element associated with the user's age is included in the user's uttered voice, the controller 130 may control the output unit 140, according to response information generated based on the user information, to output a disallowance message regarding the operation corresponding to the uttered voice.

For example, if an uttered voice relating to changing to an adult broadcast channel is input from the user, the controller 130 receives the text information for the user's uttered voice from the first server 10 and transmits it to the second server 20. At this time, the controller 130 extracts from the storage device 150 the user information that matches the face image captured by the photographing unit 160 and transmits the user information to the second server 20. Upon receiving the text information and the user information, the second server 20 extracts language elements from the text information and checks whether the extracted language elements are recorded in a pre-stored table of language elements subject to user age restrictions. If the check shows that an extracted language element is associated with a user age restriction, the second server 20 checks, based on the user information, whether the user satisfies the age restriction. If the user is not authorized to view the requested adult broadcast channel, the second server 20 transmits to the display device 100 response information that includes a disallowance message regarding the operation corresponding to the user's uttered voice.

Accordingly, the controller 130 controls the output unit 140, according to the response information, to output the disallowance message "The request is rejected" as at least one of an image and voice. The disallowance message may be output as a text-format image or as voice through at least one of the display 141 and the audio output unit 143.

However, this should not be considered limiting. If an uttered voice relating to changing to an adult broadcast channel is input from the user, the controller 130 may itself check whether the user satisfies the age restriction, based on the user information that matches the face image captured by the photographing unit 160. If the check shows that the user is not authorized to view the requested adult broadcast channel, the controller 130 controls the output unit 140 to output the disallowance message "The request is rejected" as at least one of an image and voice. The disallowance message may be output as a text-format image or as voice through at least one of the display 141 and the audio output unit 143.
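
The age check can be sketched as below. Face matching is reduced to a dictionary lookup keyed by a hypothetical face identifier, and the minimum age of 19 is an assumed value; the description specifies neither the matching method nor the age threshold.

```python
# Hypothetical stored user information, keyed by a matched face identifier.
USERS = {"face_001": {"name": "A", "age": 15}, "face_002": {"name": "B", "age": 30}}

# Hypothetical table of age-restricted language elements -> minimum age.
AGE_RESTRICTED_TERMS = {"adult channel": 19}


def check_age_restriction(language_elements, user_info):
    """Reject the request if any element is age-restricted for this user."""
    for element in language_elements:
        min_age = AGE_RESTRICTED_TERMS.get(element)
        if min_age is not None and user_info["age"] < min_age:
            return {"response_message": "The request is rejected"}
    return {"control_command": {"action": "change_channel"}}


minor = check_age_restriction(["adult channel", "change"], USERS["face_001"])
adult = check_age_restriction(["adult channel", "change"], USERS["face_002"])
```

The same function could run on the second server 20 or on the display device 100, since both variants described above perform the same comparison on the same user information.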

Hereinafter, the display device 100 is described in more detail.

Fig. 6 is a detailed block diagram of a display device according to an example embodiment.

As shown in Fig. 6, in addition to the elements shown in Fig. 5, the display device 100 may further include an input unit 170, a receiver 180, and a signal processor 190. Elements identical to those in Fig. 5 have the same functions, and their detailed description is therefore omitted.

The input unit 170 is an input means that receives various user manipulations and passes them to the controller 130, and may be implemented as an input panel. The input panel may be implemented as a touch pad, a keypad having various function keys, numeric keys, special keys, and character keys, or a touch screen. The input unit 170 may also be implemented as an infrared (IR) receiver (not shown) that receives a remote control signal transmitted from a remote controller for controlling the display device 100. However, the input panel is not limited to these examples.

The input unit 170 may receive various user manipulations for controlling functions of the display device 100, depending on the type of the display device 100. For example, if the display device 100 is a smart television (TV), the input unit 170 may receive user manipulations for controlling functions of the smart TV, such as power on/off, channel change, and volume change. When such a user manipulation is input through the input unit 170, the controller 130 may control the other elements to perform the function corresponding to it. For example, when a power-off command is input, the controller 130 may cut off the power supplied to the elements of the display device 100, and when a channel change command is input, the controller 130 may control the receiver 180 to tune to the channel selected by the user manipulation.

The input unit 170 also receives a user command for initiating a voice recognition mode in which the user's voice is collected. When the user command for initiating the voice recognition mode is input through the input unit 170, the controller 130 activates the voice gatherer 120 to collect the user's voice uttered within a preset distance of the display device 100.

The storage device 150 described above is a storage medium that stores the various programs needed to operate the display device 100, and may be implemented as a memory or a hard disk drive (HDD), but is not limited to these. For example, the storage device 150 may include a read-only memory (ROM) that stores programs for performing the operations of the controller 130, and a random access memory (RAM) that temporarily stores data generated by the operations of the controller 130. The storage device 150 may further include an electrically erasable programmable ROM (EEPROM) that stores various reference data.

Specifically, the storage device 150 may store various response messages suitable for the user's uttered voice as voice information or text information. The controller 130 may then read from the storage device 150 the voice information or text information for the response message suitable for the user's uttered voice and output it through at least one of the display 141 and the audio output unit 143. When the response message is to be output in voice form, the controller 130 performs signal processing such as decoding on the voice information read from the storage device 150, amplifies the decoded voice data, and outputs it through the audio output unit 143. Likewise, when the response message is to be output as a text-format image, the controller 130 performs signal processing such as decoding on the text information read from the storage device 150, generates a user interface (UI) screen containing the text that makes up the text information, and outputs the UI screen through the display 141.

However, this should not be considered limiting. The controller 130 may perform the above processing operations on the response message included in the response information received from the second server 20, and output the response message as a text image or voice through at least one of the display 141 and the audio output unit 143.

The receiver 180 receives broadcast program content through a broadcast network. Specifically, the receiver 180 may receive content from a broadcasting station that transmits broadcast program content through the broadcast network, or from an Internet server that transmits content files through the Internet. The receiver 180 may also receive content from various recording medium reproducing apparatuses provided in or connected to the display device 100. A recording medium reproducing apparatus reproduces content recorded on various recording media (for example, a CD, DVD, hard disk, Blu-ray disc, memory card, or USB memory).

When content is received from a broadcasting station, the receiver 180 may include a tuner (not shown), a demodulator (not shown), and an equalizer (not shown). When content is received from a source device such as an Internet server, the receiver 180 may be a network interface (not shown). Likewise, when content is received from various recording medium reproducing apparatuses, the receiver 180 may be an interface unit (not shown) connected to the recording medium reproducing apparatus. As described above, the receiver 180 may be implemented in various forms depending on the example embodiment.
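
The correspondence between content source and receiver form can be summarised as a small lookup. The enum and function names are hypothetical; the component names are the ones given in the paragraph above.

```python
from enum import Enum


class Source(Enum):
    BROADCAST = "broadcasting station"
    INTERNET = "Internet server"
    RECORDING_MEDIUM = "recording medium reproducing apparatus"


# Receiver form for each content source, as listed in the description.
RECEIVER_COMPONENTS = {
    Source.BROADCAST: ["tuner", "demodulator", "equalizer"],
    Source.INTERNET: ["network interface"],
    Source.RECORDING_MEDIUM: ["interface unit"],
}


def receiver_for(source):
    """Return the components the receiver would comprise for a given source."""
    return RECEIVER_COMPONENTS[source]


print(receiver_for(Source.BROADCAST))
```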

The signal processor 190 performs signal processing on the content received through the receiver 180 so that the content can be output through the output unit 140. Specifically, the signal processor 190 may perform signal processing such as decoding, scaling, and frame rate conversion on the video signal included in the content so that the video signal can be output on the display 141. Likewise, the signal processor 190 may perform signal processing such as decoding on the audio signal included in the content so that the audio signal can be output through the audio output unit 143. The display 141 and the audio output unit 143 can thus output the video signal and the audio signal included in the content processed by the signal processor 190.

The operation of receiving, from the interactive server 200, response information suited to the user's uttered voice and performing the corresponding operation in the display device 100 has been described in detail above. Hereinafter, the operation in which the interactive server 200 generates response information suited to the user's uttered voice received through the display device 100, and transmits that response information to the display device 100, is described in detail.

Fig. 7 is a block diagram of an interactive server according to an example embodiment.

The interactive server shown in Fig. 7 is the above-described second server 20. It receives, from the display device 100, the user's uttered voice that has been converted into text information by the first server 10, extracts utterance elements from that text information, and transmits response information suited to the user's uttered voice to the display device 100. This interactive server includes a communication unit 710, an extraction unit 720, a storage device 730, and a controller 740.

The communication unit 710 communicates with the display device 100, and the extraction unit 720 extracts utterance elements from the uttered voice received from the display device 100 through the communication unit 710. The storage device 730 records conversation history information for each user's uttered voice and stores EPG information. The controller 740 generates response information corresponding to the user's uttered voice in a different form depending on the function classified by the utterance elements that the extraction unit 720 extracts from the uttered voice. The functions classified by utterance element may include at least one of an EPG-related function and a function for controlling the operation of the display device 100. Accordingly, the controller 740 determines, from the extracted utterance elements, whether the user's uttered voice relates to the EPG or to controlling the operation of the display device 100, and generates the corresponding response information according to the result.
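The classification step performed by the controller 740 can be illustrated with a minimal Python sketch. The vocabularies EPG_TERMS and CONTROL_TERMS and the function name classify_function are hypothetical; the description names only the two function classes and does not say how the classification is implemented.

```python
# Hypothetical vocabularies; the description only names the two function classes.
EPG_TERMS = {"program", "record", "start", "cast"}
CONTROL_TERMS = {"channel", "volume", "power", "brightness"}


def classify_function(utterance_elements):
    """Return "device_control", "epg", or "unknown" for a list of utterance elements."""
    elements = {element.lower() for element in utterance_elements}
    # Checking control terms first is an arbitrary choice made for this sketch.
    if elements & CONTROL_TERMS:
        return "device_control"
    if elements & EPG_TERMS:
        return "epg"
    return "unknown"
```

For instance, the elements "MBC", "channel", and "change" would be classified as device control, while "today", "program", and "record" would be classified as EPG-related.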

According to an example embodiment, if the utterance elements extracted from the user's uttered voice relate to the EPG, the controller 740 determines, based on the EPG information pre-stored in the storage device 730, whether EPG information corresponding to the uttered voice can be provided. If it can, the controller 740 generates response information corresponding to the uttered voice based on the EPG information. If it cannot, the controller 740 generates alternative response information on the uttered voice based on at least one of the EPG information pre-stored in the storage device 730 and an Internet search.

Specifically, when text information on the user's uttered voice is received, the extraction unit 720 may extract from it utterance elements including a dialogue act, a main goal, and a key element. The dialogue act is a label indicating the illocutionary force of the user's uttered voice; for example, it may be a statement, a request, or a question. The main goal is a label indicating the user's actual intention in the uttered voice, and may be TV on/off, program search, program time search, or program scheduling. The key element may be a genre, a program title, a time, a channel name, or an actor's name.

For example, if the user's uttered voice is "When does program ooo (program title) start?", the dialogue act may be a question, indicated by the question mark "?", and the main goal may be a program time search, indicated by the word "start". The key element may be the program title ooo (program title).
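The extraction of the three utterance elements can be sketched with simple keyword rules. This is only an illustration under stated assumptions: the keyword table MAIN_GOAL_KEYWORDS, the UtteranceElements type, and the extract_elements function are hypothetical, and the description does not specify the extraction technique actually used by the extraction unit 720.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical keyword-to-goal table; not taken from the description.
MAIN_GOAL_KEYWORDS = {
    "start": "program_time_search",
    "record": "program_scheduling",
    "search": "program_search",
}


@dataclass
class UtteranceElements:
    dialogue_act: str              # "statement", "request", or "question"
    main_goal: Optional[str]       # None when no goal keyword is found
    key_element: Optional[str]     # e.g. a program title; None when absent


def extract_elements(text: str, known_titles: List[str]) -> UtteranceElements:
    lowered = text.lower()
    # Dialogue act: a question mark marks a question, "please" marks a request.
    if "?" in text:
        act = "question"
    elif "please" in lowered:
        act = "request"
    else:
        act = "statement"
    # Main goal: the first keyword that occurs in the text decides the goal.
    goal = next((g for kw, g in MAIN_GOAL_KEYWORDS.items() if kw in lowered), None)
    # Key element: here, only program titles known in advance are recognized.
    title = next((t for t in known_titles if t.lower() in lowered), None)
    return UtteranceElements(act, goal, title)
```

Calling extract_elements("When does program ooo start?", ["ooo"]) yields a question with a program time search goal and the key element "ooo", matching the example in the text.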

Accordingly, when the text information on the user's uttered voice is "When does program ooo (program title) start?", the extraction unit 720 extracts utterance elements including the dialogue act, the main goal, and the key element. Once the utterance elements are extracted, the controller 740 refers to the EPG information stored in the storage device 730 to determine whether the extracted utterance elements relate to EPG information. If they do, the controller 740 determines whether the utterance elements satisfy the condition for generating response information corresponding to the user's uttered voice.

According to an example embodiment, if the utterance elements extracted from the user's uttered voice include all of the dialogue act, the main goal, and the key element, the controller 740 determines that the condition for generating response information corresponding to the uttered voice is satisfied. In the example above, the utterance elements extracted from the uttered voice "When does program ooo (program title) start?" include all three, so the controller 740 determines that the condition is satisfied. By contrast, the uttered voice "When does it start?" includes only a dialogue act, which indicates a question by its question mark, and the main goal "start"; it includes no key element. In this case, the controller 740 determines that the condition is not satisfied and, based on the conversation history information pre-stored in the storage device 730, generates alternative response information that asks for the missing key element. For example, the controller 740 may generate response information for the inquiry "Which broadcast program?".
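The completeness condition described above can be sketched as a small check over the three utterance elements. The dictionary layout and the function name check_condition are hypothetical, and the fallback prompt used when something other than the key element is missing is an assumption; only the prompt "Which broadcast program?" comes from the text.

```python
REQUIRED = ("dialogue_act", "main_goal", "key_element")


def check_condition(elements):
    """Return whether all three elements are present and, if not, what to ask."""
    missing = [name for name in REQUIRED if not elements.get(name)]
    if not missing:
        return {"satisfied": True, "ask": None}
    if missing == ["key_element"]:
        ask = "Which broadcast program?"
    else:
        # Hypothetical generic prompt; the description gives no wording for this case.
        ask = "Please repeat your request."
    return {"satisfied": False, "ask": ask}
```

With the elements of "When does it start?" (a question with the main goal "start" but no key element), the check fails and the key element is requested.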

If the above operations establish that the condition for generating response information corresponding to the user's uttered voice is satisfied, the controller 740 determines, based on the EPG information stored in the storage device 730, whether EPG information can be provided for the utterance elements extracted from the uttered voice. If it can, the controller 740 generates response information corresponding to the uttered voice based on the EPG information; if it cannot, the controller 740 may generate alternative response information on the uttered voice.

If the user's uttered voice is "Please record the program ooo (program title) broadcast this week", the utterance elements may be "this week", "program ooo (program title)", "record", and "please". Once these utterance elements are extracted, the controller 740 may obtain program information and start time information on the program ooo (program title) from the EPG information stored in the storage device 730. Accordingly, the controller 740 may generate response information that includes a control command for scheduling the recording of the program ooo, based on the obtained program information and start time information, and a response message generated based on the conversation history information pre-stored in the storage device 730.

If the user's uttered voice is "Who plays the lead in program ooo?", the utterance elements may be "program ooo (program title)", "lead", and "who". Once these utterance elements are extracted, the controller 740 checks whether information on the lead actor of the program ooo is included in the EPG information stored in the storage device 730. If that information cannot be obtained from the pre-stored EPG information, the controller 740 generates alternative response information asking whether the user wishes to receive alternative information on the uttered voice through the EPG information or an Internet search. For example, if the user then inputs an uttered voice indicating a wish to receive alternative information from the EPG information, the controller 740 obtains information on the cast of the program ooo from the pre-stored EPG information. Once alternative information on the uttered voice has been obtained from the EPG information, the controller 740 may generate alternative response information including the obtained alternative information, based on the conversation history information pre-stored in the storage device 730.
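The EPG availability check and the fallback to alternative response information can be sketched as a dictionary lookup. The sample EPG record, the field names, and the function answer_from_epg are all hypothetical and serve only to illustrate the branching described above.

```python
# Hypothetical EPG record; real EPG data is not described at this level of detail.
SAMPLE_EPG = {
    "ooo": {"start_time": "21:00", "cast": ["Actor A", "Actor B"]},
}


def answer_from_epg(epg, title, field):
    """Answer from EPG data when possible, otherwise offer alternative information."""
    entry = epg.get(title, {})
    if field in entry:
        return {"type": "response", "value": entry[field]}
    # The requested detail is not in the EPG: ask how the user wants to proceed.
    return {
        "type": "alternative",
        "message": "Would you like alternative information from the EPG or an Internet search?",
    }
```

Here a request for "start_time" is answered directly, while a request for a "lead_actor" field that the sample record lacks produces the alternative inquiry; a follow-up lookup of "cast" would then supply the alternative information.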

If the utterance elements extracted from the user's uttered voice relate to EPG information, the controller 740 determines whether they are EPG-related utterance elements for a plurality of requests. If they are, the controller 740 may generate a voice re-request message so that the display device 100 asks the user to utter the request again.

For example, if the user's uttered voice is "Please record the program ooo (program title) broadcast this week and please schedule viewing of the program Δ Δ Δ (program title)", the utterance elements may be "this week", "program ooo (program title)", "program Δ Δ Δ (program title)", "record", "watch", and "please". Once these utterance elements are extracted, the controller 740 determines that they include utterance elements for a plurality of requests ("program ooo (program title)", "program Δ Δ Δ (program title)", "record", "watch"). Accordingly, the controller 740 may generate, based on the conversation history information pre-stored in the storage device 730, the voice re-request message "Please make only one request".
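Detecting a plurality of requests can be sketched by counting the distinct action words among the extracted utterance elements. The set REQUEST_ACTIONS and the function name re_request_if_multiple are hypothetical; the description does not state how the controller 740 recognizes multiple requests.

```python
# Hypothetical set of words that each signal one request.
REQUEST_ACTIONS = {"record", "watch"}

RE_REQUEST_MESSAGE = "Please make only one request"


def re_request_if_multiple(utterance_elements):
    """Return the re-request message when more than one action is requested."""
    actions = {e for e in utterance_elements if e in REQUEST_ACTIONS}
    return RE_REQUEST_MESSAGE if len(actions) > 1 else None
```

For the example above, both "record" and "watch" are present, so the re-request message is returned; with "record" alone the function returns None and normal processing continues.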

If the utterance elements extracted from the user's uttered voice relate to controlling the operation of the display device 100, the controller 740 determines, based on the extracted utterance elements, whether the operation of the display device 100 corresponding to the uttered voice can be controlled. If it can, the controller 740 may generate response information for controlling the operation of the display device 100.

According to an example embodiment, the storage device 730 may store manual information for controlling the operation of the display device 100. The manual information includes information for controlling the operation of the display device 100 according to the user's uttered voice, and information for controlling the operation of the display device 100 according to control commands other than the uttered voice. Accordingly, when utterance elements relating to control of the display device 100 are extracted, the controller 740 determines, based on the manual information stored in the storage device 730, whether the operation of the display device 100 can be controlled according to the uttered voice. If it can, the controller 740 may generate response information including a control command for performing the operation corresponding to the uttered voice.

For example, if the user's uttered voice is "Please change the channel to MBC", the utterance elements are "MBC", "channel", and "change". Once these utterance elements are extracted, the controller 740 determines that they relate to function control of the display device 100. Thereafter, the controller 740 refers to the manual information pre-stored in the storage device 730 to determine whether the channel of the display device 100 can be changed according to the extracted utterance elements. If the channel can be changed according to the uttered voice, the controller 740 may generate response information including a control command for changing the current channel of the display device 100 to MBC.

If the operation of the display device 100 cannot be controlled according to the user's uttered voice, the controller 740 may generate response information on at least one of a method of controlling the operation of the display device 100 and a notification of its current state.

For example, if the user's uttered voice is "Please brighten the screen", the utterance elements "screen", "brighten", and "please" may be extracted. Once these utterance elements are extracted, the controller 740 determines that they relate to function control of the display device 100. Thereafter, the controller 740 refers to the manual information pre-stored in the storage device 730 to determine whether the brightness of the display device 100 can be adjusted according to the extracted utterance elements. If the screen brightness cannot be adjusted according to the uttered voice, the controller 740 may refer to the manual information pre-stored in the storage device 730 and generate response information on the method of adjusting the screen brightness of the display device 100.
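The manual-information lookup for device control can be sketched as follows. The table MANUAL_INFO, its field names, the guidance text, and the function control_response are hypothetical; the description says only that the manual information determines whether an operation can be controlled by voice and, if not, supplies the method of control.

```python
# Hypothetical manual information; entries and wording are illustrative only.
MANUAL_INFO = {
    "change_channel": {"voice_controllable": True},
    "adjust_brightness": {
        "voice_controllable": False,
        "method": "Adjust the brightness from the picture settings menu.",
    },
}


def control_response(operation, argument=None):
    """Build a control command, or guidance when voice control is not possible."""
    entry = MANUAL_INFO.get(operation)
    if entry is None:
        return {"type": "unsupported"}
    if entry["voice_controllable"]:
        return {"type": "control_command", "operation": operation, "argument": argument}
    return {"type": "guidance", "message": entry["method"]}
```

Under these assumptions, control_response("change_channel", "MBC") produces a control command, while control_response("adjust_brightness") produces guidance on how to adjust the brightness by other means.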

According to another example embodiment, the storage device 730 may store a table of prohibited utterance elements. For example, the table may record prohibited words such as terms for prohibited drugs or profanity. Accordingly, when utterance elements are extracted from the user's uttered voice, the controller 740 refers to the table of prohibited utterance elements in the storage device 730 to determine whether the extracted utterance elements are prohibited. If they are, the controller 740 may generate, based on the conversation history information stored in the storage device 730, a disapproval message regarding the operation corresponding to the user's uttered voice.

If the utterance elements extracted from the user's uttered voice are associated with the user's age, the controller 740 may determine, based on user information received from the display device 100 through the communication unit 710, whether to generate a disapproval message regarding the operation corresponding to the uttered voice. Utterance elements associated with the user's age may relate to obscenity or violence, but are not limited to these. For example, if the user's uttered voice is "Would you change the channel to the adult broadcasting channel?", the utterance elements "adult broadcasting channel", "channel", "change", and "would you" may be extracted. Once these utterance elements are extracted, the controller 740 refers to the table of prohibited utterance elements pre-stored in the storage device 730 and determines that the extracted utterance elements are associated with the user's age. Accordingly, the controller 740 checks, based on the user information received from the display device 100, whether the user satisfies a predetermined age restriction.

If it is determined that the user is not authorized to watch the adult broadcasting channel, the controller 740 generates the disapproval message "The service is unavailable". On the other hand, if it is determined that the user is authorized to watch the adult broadcasting channel, the controller 740 may generate, based on the EPG information stored in the storage device 730, response information including a control command for changing the channel to the channel that provides the adult broadcasting service.
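The prohibited-element table and the age check can be sketched together. The table contents, the age threshold ADULT_AGE, and the function screen_elements are hypothetical; the two message strings follow the examples given in this description ("The service is unavailable" and "The request is rejected").

```python
# Hypothetical table: "always" entries are rejected outright,
# "age" entries are rejected only for users below the age threshold.
PROHIBITED_TABLE = {
    "profanity": "always",
    "adult broadcasting channel": "age",
}

ADULT_AGE = 19  # hypothetical threshold; the description does not give a number


def screen_elements(utterance_elements, user_age):
    """Return a disapproval message, or None when the request may proceed."""
    for element in utterance_elements:
        rule = PROHIBITED_TABLE.get(element)
        if rule == "always":
            return "The request is rejected"
        if rule == "age" and user_age < ADULT_AGE:
            return "The service is unavailable"
    return None
```

A request for the adult broadcasting channel from a 15-year-old user is refused, while the same request from a 30-year-old user passes the screen and continues to normal processing.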

The elements of the interactive server that provides response information suited to the user's uttered voice according to an example embodiment have been described in detail. Hereinafter, a method of performing an operation in the above-described display device based on response information suited to the user's uttered voice is described in detail.

Fig. 8 is a flowchart of a method of performing an operation in a display device based on response information suited to the user's uttered voice, according to an example embodiment.

As shown in Fig. 8, when a user command to start a voice recognition mode is input, the display device enters the voice recognition mode, in which it recognizes the user's voice, according to the input command (operation S810). In the voice recognition mode, the display device receives the user's uttered voice (operation S820). When the uttered voice is input, the display device collects it, converts the collected voice into a digital signal, and transmits the digital signal to a first server (operations S830 and S840). The first server may be a server that converts the user's uttered voice, received as a digital signal, into text information. When text information on the uttered voice is received from the first server, the display device transmits that text information to a second server (operation S850). The second server may be a server that generates response information suited to the user's uttered voice according to the utterance elements extracted from it.
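The round trip between the display device and the two servers can be sketched with the servers passed in as plain callables. The function handle_utterance and its parameters are hypothetical, and the analog-to-digital conversion is replaced by a trivial stand-in, since the description does not specify the signal format or the transport.

```python
from typing import Callable


def handle_utterance(
    audio: bytes,
    speech_to_text: Callable[[bytes], str],   # stands in for the first server
    get_response: Callable[[str], dict],      # stands in for the second server
) -> dict:
    """Follow operations S830 to S850 and return the response information."""
    digital_signal = bytes(audio)            # S830: stand-in for digitizing the voice
    text = speech_to_text(digital_signal)    # S840: first server returns text information
    return get_response(text)                # S850: second server returns response information
```

Passing the servers as callables keeps the sketch independent of any particular network protocol; the returned response information would then drive operation S860.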

When response information suited to the user's uttered voice is received from the second server, the display device performs the operation corresponding to the uttered voice based on the response information (operation S860). The response information may be generated in a different form depending on the function classified by the utterance elements extracted from the uttered voice. The functions classified by the extracted utterance elements may include at least one of an EPG-related function and a function relating to control of the operation of the display device. For example, if the utterance elements extracted from the uttered voice relate to a broadcast program, the function may be an EPG-related function; if they relate to power on/off, channel change, or volume change of the display device, the function may be a function relating to control of the operation of the display device. Accordingly, when the response information is received from the second server, the display device may, based on the response information, perform at least one of outputting a response message corresponding to the uttered voice and executing a function.

For example, if the uttered voice "Please record the program ooo (program title) broadcast today" is input from the user, the display device converts that uttered voice into a digital signal and transmits it to the first server. The first server then converts the received digital signal into text information and transmits the text information to the display device. Thereafter, when the display device receives from the first server the text information on the uttered voice "Please record the program ooo (program title) broadcast today", it transmits that text information to the second server.

Accordingly, the second server extracts the utterance elements "today", "program ooo", and "record" from the text information on the user's uttered voice, and determines from the extracted utterance elements that the function is an EPG-related function. Thereafter, the second server transmits to the display device response information that includes a control command for scheduling the recording of the program ooo and the response message "The recording of program ooo has been scheduled".

Accordingly, the display device schedules the recording of the program ooo (program title) according to the control command included in the response information. Based on the response message included in the response information, the display device also outputs the response message "The recording of program ooo has been scheduled" through at least one of an image and a voice. That is, the response message may be output as a voice or as an image in text form.
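How the display device might act on response information that carries both a control command and a response message can be sketched as follows. The dictionary keys "control_command" and "response_message" and the function apply_response are hypothetical names chosen for illustration; the description does not define a data format.

```python
def apply_response(response_information, execute, output):
    """Run the control command, if any, then output the response message, if any."""
    command = response_information.get("control_command")
    if command is not None:
        execute(command)    # e.g. schedule the recording
    message = response_information.get("response_message")
    if message is not None:
        output(message)     # e.g. render as on-screen text or synthesized voice
```

The execute and output callables stand in for the device's function execution and its image or voice output, so either part of the response information can be absent without affecting the other.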

The example embodiment in which the utterance elements extracted from the user's uttered voice relate to controlling the operation of the display device has been described with reference to Fig. 5, and a detailed description is therefore omitted here.

If the user's uttered voice includes utterance elements relating to a plurality of requests, the display device receives from the second server a voice re-request message on the uttered voice, and outputs the voice re-request message.

For example, the second server may receive from the display device the text information on the uttered voice "Please schedule viewing of the program (program title) broadcast this week, and please record program ooo". In this case, the uttered voice includes utterance elements relating to a plurality of requests ("program ooo (program title)", "schedule viewing", "program ooo (program title)", and "record").

Accordingly, the second server determines that the text information on the uttered voice includes utterance elements relating to a plurality of requests, and transmits to the display device response information including a voice re-request message. The display device then outputs the voice re-request message received from the second server through at least one of an image and a voice. Accordingly, the user makes only one of the two requests again: "Please schedule viewing of the program (program title) broadcast this week" or "Please record the program ooo broadcast this week".

If the user's uttered voice includes a prohibited utterance element, the display device may output, based on the response information received from the second server, a disapproval message regarding the operation corresponding to the uttered voice.

For example, the display device may transmit to the second server text information on an uttered voice that includes utterance elements relating to profanity or prohibited drugs. In this case, the second server extracts the utterance elements from the text information and determines whether they are pre-stored prohibited utterance elements. If they are, the second server transmits to the display device response information including a disapproval message regarding the operation corresponding to the uttered voice. Accordingly, the display device outputs the disapproval message "The request is rejected" through at least one of an image and a voice according to the response information.

As in the example embodiment described above with reference to Fig. 5, the display device may receive from the second server response information generated in a different form according to the user's uttered voice, and may perform the operation corresponding to the uttered voice based on that response information.

Up to this point, the method of performing an operation in the display device based on response information suited to the user's uttered voice has been described in detail. Hereinafter, a method in which the interactive server according to an example embodiment generates response information suited to the user's uttered voice and provides it to the display device is described.

Fig. 9 is a flowchart of a method of providing, in an interactive server, response information suited to the user's uttered voice, according to an example embodiment.

As shown in Fig. 9, the interactive server receives, from the display device, text information corresponding to the user's uttered voice (operation S910). The interactive server is the above-described second server, and may receive from the display device the user's uttered voice that has been converted into text information by the first server. When the uttered voice is received, the interactive server extracts utterance elements from it (operation S920).

The utterance elements include a dialogue act, a main goal, and a key element. The dialogue act may be a label indicating the illocutionary force of the user's uttered voice; for example, it may be a statement, a request, or a question. The main goal may be a label indicating the user's actual intention in the uttered voice, and may be TV on/off, program search, program time search, or program scheduling. The key element may be a genre, a program title, a time, a channel name, or an actor's name.

For example, if the user's uttered voice is "When does program ooo start?", the dialogue act may be a question, indicated by the question mark "?", and the main goal may be a program time search, indicated by the word "start". The key element may be the program title ooo. Accordingly, when the text information on the uttered voice "When does program ooo start?" is received, the interactive server extracts from it utterance elements including the dialogue act, the main goal, and the key element.

Once the utterance elements are extracted, the interactive server checks whether they are recorded in a pre-stored table of prohibited utterance elements, and thereby determines whether the extracted utterance elements are prohibited (operation S930). If they are, the interactive server generates a disapproval message and transmits it to the display device (operation S940).

For example, the user's uttered voice may include profanity or terms for prohibited drugs. When utterance elements relating to profanity or prohibited drugs are extracted from the uttered voice, the interactive server refers to the pre-stored table of prohibited utterance elements to determine whether the extracted utterance elements are recorded in it. If they are, the interactive server may generate a disapproval message regarding the operation corresponding to the user's uttered voice.

As another example, the user's uttered voice may be subject to an age restriction. For example, if the uttered voice is "Would you change the channel to the adult broadcasting channel?", the utterance elements "adult broadcasting", "channel", "change", and "would you" may be extracted. Once these utterance elements are extracted, the interactive server refers to the table of prohibited utterance elements pre-stored in the storage device to determine whether the extracted utterance elements are associated with the user's age. The interactive server then determines, based on the user information received from the display device, whether the user satisfies the age restriction. If the user is not authorized to watch the adult broadcasting channel, the interactive server may generate the disapproval message "The service is unavailable".

If it is determined in operation S930 that the utterance elements extracted from the user's uttered voice are not prohibited, or if the user satisfies the age restriction and is authorized to use the service, the interactive server determines whether the extracted utterance elements are EPG-related (operation S950). If they are, the interactive server generates response information corresponding to the uttered voice based on the pre-stored EPG information, and transmits the response information to the display device (operation S960). When the extracted utterance elements are EPG-related, the interactive server may generate the response information by performing the following operations.

As shown in Fig. 10, when utterance elements are extracted from the user's uttered voice, the interactive server determines whether they include EPG-related utterance elements for a plurality of requests (operation S1010). If they do, the interactive server generates a voice re-request message (operation S1020).

For example, if the user's uttered voice is "Please record the program ooo broadcast this week and please schedule viewing of the program Δ Δ Δ (program title)", the utterance elements may be "this week", "program ooo (program title)", "program Δ Δ Δ (program title)", "record", "watch", and "please". Once these utterance elements are extracted, the interactive server determines that they include utterance elements for a plurality of requests ("program ooo (program title)", "program Δ Δ Δ (program title)", "record", "watch"). Accordingly, the interactive server may generate, based on the pre-stored conversation history information, the voice re-request message "Please make only one request".

If the utterance elements extracted from the user's uttered voice do not include EPG-related utterance elements for a plurality of requests, the interactive server determines, based on the extracted utterance elements, whether the condition for generating response information corresponding to the uttered voice is satisfied (operation S1030). According to an example embodiment, if the extracted utterance elements include all of the dialogue act, the main goal, and the key element, the interactive server determines that the condition is satisfied. For example, the utterance elements extracted from the uttered voice "When does program ooo start?" include all of the dialogue act, the main goal, and the key element. In this case, the interactive server may determine that the condition for generating response information corresponding to the uttered voice is satisfied.

If it is determined that the condition for generating response information corresponding to the user's uttered voice is not satisfied, the interactive server may generate a voice re-request message by performing operation S1020 described above. For example, the uttered voice "When does it start?" includes only a dialogue act, which indicates a question by its question mark, and the main goal indicated by the word "start"; it includes no key element. In this case, the interactive server determines that the condition is not satisfied. Accordingly, the interactive server generates, based on the conversation history information pre-stored in the storage device, response information that asks for the missing key element.

If it is determined in operation S1030 that the condition for generating response information corresponding to the user's uttered voice is satisfied, the interactive server determines, based on the pre-stored EPG information, whether response information corresponding to the uttered voice can be provided (operation S1040). If it can be provided based on the EPG information, the interactive server generates the response information based on the EPG information (operation S1050). If it cannot, the interactive server generates alternative response information on the uttered voice based on at least one of the pre-stored EPG information and an Internet search (operation S1060).
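The branching of Figs. 9 and 10 described above can be summarized in a small decision function. The function next_operation and its boolean parameters are hypothetical; the returned strings are the operation labels used in the text, except "not_epg", which is a placeholder for the branch that this passage does not detail.

```python
def next_operation(prohibited, epg_related, multiple_requests,
                   condition_met, epg_available):
    """Map the outcome of each check to the operation that follows it."""
    if prohibited:
        return "S940"     # generate and transmit a disapproval message
    if not epg_related:
        return "not_epg"  # placeholder: non-EPG handling is outside this passage
    if multiple_requests:
        return "S1020"    # generate a voice re-request message
    if not condition_met:
        return "S1020"    # a missing element also leads to a re-request
    if epg_available:
        return "S1050"    # respond from the EPG information
    return "S1060"        # generate alternative response information
```

Writing the checks in this order mirrors the sequence S930, S950, S1010, S1030, and S1040, so each later check is reached only when the earlier ones pass.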

For example, if the user's uttered voice is "Please record program ooo, which is broadcast this week", the utterance elements are "this week", "program ooo (program title)", "record", and "please". When these utterance elements are extracted, the interactive server may obtain program information and start time information on program ooo (program title) from the pre-stored EPG information. Accordingly, the interactive server may generate response information that includes a control command for scheduling a recording of program ooo based on the obtained program information and start time information, and a response message generated based on pre-stored conversation history information.

If the user's uttered voice is "Who is the main character in program ooo (program title)?", the utterance elements may be "program ooo (program title)", "main character", and "who". When these utterance elements are extracted, the interactive server checks whether information on the main character of program ooo is included in the pre-stored EPG information. If the information on the main character of program ooo cannot be obtained from the pre-stored EPG information, the interactive server generates alternative response information asking whether the user wishes to receive alternative information related to the uttered voice from the EPG information or through an Internet search. For example, if an uttered voice indicating that the user wishes to receive the alternative information from the EPG information is input, the interactive server obtains information on the cast of program ooo from the pre-stored EPG information. Once the alternative information related to the user's uttered voice has been obtained from the EPG information, the interactive server may generate, based on the pre-stored conversation history information, alternative response information including the obtained alternative information.
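The branch between operations S1040, S1050, and S1060 can be illustrated with a short Python sketch. Everything in it is an assumption made for illustration: the `EPG` dictionary and its placeholder values, the functions `lookup_epg` and `respond_to_epg_request`, and the shape of the returned dictionaries do not come from the disclosure.

```python
from typing import Optional

# Hypothetical pre-stored EPG data: program title -> known fields.
# The values are placeholders, not real broadcast data.
EPG = {
    "program ooo": {"start_time": "Sat 19:00", "cast": ["A", "B"]},
}


def lookup_epg(title: str, field: str) -> Optional[object]:
    """Return the requested field from the EPG data, or None if absent."""
    return EPG.get(title, {}).get(field)


def respond_to_epg_request(title: str, field: str) -> dict:
    """Sketch of operations S1040 to S1060."""
    value = lookup_epg(title, field)  # S1040: can the EPG data answer?
    if value is not None:
        # S1050: the EPG data answers the request directly.
        return {"type": "response", "field": field, "value": value}
    # S1060: offer alternative information from the EPG data or the Internet.
    return {
        "type": "alternative",
        "prompt": (
            f"No '{field}' information was found for {title}. "
            "Search the EPG or the Internet for related information?"
        ),
        "sources": ["epg", "internet"],
    }
```

In this sketch a request for the start time is answered from the EPG data, while a request for the main character, which the assumed EPG data does not hold, produces the alternative prompt; a follow-up choice of the EPG source could then call `lookup_epg` for the cast.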

If it is determined at operation S950 that the utterance elements extracted from the user's uttered voice relate to control of the display device, the interactive server determines, based on the extracted utterance elements, whether the operation of the display device corresponding to the user's uttered voice can be controlled (operation S970). If the operation of the display device can be controlled, the interactive server generates response information for controlling the operation of the display device and transmits the response information to the display device (operation S980).

According to an example embodiment, the interactive server may store manual information for controlling the operation of the display device 100. The manual information includes information on operations of the display device 100 that can be controlled according to the user's uttered voice, and information on operations of the display device 100 that are controlled by control commands other than the user's uttered voice. Accordingly, if utterance elements related to control of the display device 100 are extracted, the interactive server determines, based on the stored manual information, whether the operation of the display device 100 can be controlled according to the user's uttered voice. If the operation of the display device 100 can be controlled according to the user's uttered voice, the interactive server may generate response information including a control command for performing the operation corresponding to the user's uttered voice.

For example, if the user's uttered voice is "Please change the channel to MBC", the utterance elements are "MBC", "channel", and "change". When these utterance elements are extracted, the interactive server determines that the extracted utterance elements relate to function control of the display device 100. The interactive server then refers to the pre-stored manual information and determines whether the channel of the display device 100 can be changed according to the extracted utterance elements. If the channel of the display device 100 can be changed according to the user's uttered voice, the interactive server may generate response information including a control command for changing the current channel of the display device 100 to MBC.

If it is determined that the operation of the display device 100 cannot be controlled, the interactive server may generate alternative response information related to the user's uttered voice and transmit the alternative response information to the display device (operation S990). The alternative response information may relate to at least one of a method for controlling the operation of the display device and a current state notification indicating the current state of the display device.

For example, if the user's uttered voice is "Please make the screen brighter", the utterance elements "screen", "brighter", and "please" may be extracted. When these utterance elements are extracted, the interactive server determines that the utterance elements relate to function control of the display device 100. The interactive server then refers to the pre-stored manual information and determines whether the screen of the display device 100 can be adjusted according to the extracted utterance elements. If the screen of the display device 100 cannot be adjusted according to the user's uttered voice, the interactive server may refer to the pre-stored manual information and generate response information on a method for adjusting the screen of the display device.
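The decision made at operations S970 to S990 can be sketched in Python as follows. The sketch is hypothetical throughout: the `MANUAL_INFO` table, its function names and help texts, and the function `respond_to_control_request` are assumptions chosen to mirror the channel-change and screen-adjustment examples, not structures defined by the disclosure.

```python
# Hypothetical manual information: function name -> whether the function
# supports voice control, plus a description of the non-voice method.
MANUAL_INFO = {
    "change_channel": {
        "voice": True,
        "method": "Use the channel buttons on the remote control.",
    },
    "adjust_screen": {
        "voice": False,
        "method": "Open the picture settings menu to adjust the screen.",
    },
}


def respond_to_control_request(function: str, argument: str = "") -> dict:
    """Sketch of operations S970 to S990."""
    info = MANUAL_INFO.get(function)
    # S970: can this operation be controlled by the uttered voice?
    if info is not None and info["voice"]:
        # S980: response information carrying a control command.
        return {"type": "control", "command": function, "argument": argument}
    # S990: alternative response information describing how to control
    # the operation by other means.
    if info is not None:
        message = info["method"]
    else:
        message = "This function is not described in the manual information."
    return {"type": "alternative", "message": message}
```

Under these assumptions, `respond_to_control_request("change_channel", "MBC")` yields a control command, while `respond_to_control_request("adjust_screen")` yields guidance on adjusting the screen manually.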

Example embodiments of the present disclosure have been described above.

The foregoing example embodiments are merely examples and should not be construed as limiting the present disclosure. The example embodiments can readily be applied to other types of apparatuses. Also, the description of the example embodiments is intended to be illustrative and not to limit the scope of the claims, and many alternatives, modifications, and variations will be apparent to those skilled in the art.

Claims

1. A display device, comprising:

a voice collector which collects a user's uttered voice;

a communication unit which communicates with an interactive server; and

a controller which, if response information corresponding to the uttered voice transmitted to the interactive server is received from the interactive server, controls to perform an operation corresponding to the user's uttered voice based on the response information,

wherein the response information is generated in a different form according to a function classified based on an utterance element extracted from the uttered voice.

2. The display device according to claim 1, wherein the function comprises at least one of an electronic program guide (EPG)-related function and an operation control function of the display device.

3. The display device according to claim 2, further comprising an output unit,

wherein, if the uttered voice includes an EPG-related utterance element or an utterance element related to operation control of the display device, the controller performs, based on the response information received from the interactive server, at least one of an operation of outputting a response message and an operation of executing a function.

4. The display device according to claim 3, wherein, if the uttered voice includes EPG-related utterance elements associated with a plurality of requests, the controller outputs a re-utterance request message based on the response information received from the interactive server.

5. The display device according to claim 1, further comprising an output unit,

wherein, if the uttered voice includes a restricted utterance element, the controller controls, based on the response information received from the interactive server, to output a message indicating that the operation corresponding to the uttered voice is not permitted.

6. The display device according to claim 5, further comprising:

a storage unit which matches a face image of the user with user information and stores the matched information; and

a photographing unit which photographs the face of the user,

wherein the controller transmits, to the interactive server, the uttered voice and the user information matching the face image generated by the photographing unit, and, if the restricted utterance element is associated with the age of the user, the controller controls, according to the response information generated based on the user information, to output a message indicating that the operation corresponding to the uttered voice is not permitted.

7. The display device according to claim 1, wherein the interactive server comprises: a first server which converts the collected uttered voice into text information; and a second server which generates the response information corresponding to the uttered voice converted into the text information,

wherein the controller converts the collected uttered voice into a digital signal and transmits the converted uttered voice to the first server, and, if the text information on the uttered voice is received from the first server, transmits the text information to the second server and receives the response information corresponding to the uttered voice.

8. An interactive server, comprising:

a communication unit which communicates with a display device;

an extraction unit which extracts an utterance element from an uttered voice received from the display device; and

a controller which generates response information corresponding to the uttered voice in a different form according to a function classified based on the extracted utterance element, and transmits the response information to the display device.

9. The interactive server according to claim 8, wherein the function comprises at least one of an electronic program guide (EPG)-related function and an operation control function of the display device.

10. The interactive server according to claim 9, further comprising a storage unit which stores EPG information,

wherein, if the extracted utterance element is an EPG-related utterance element, the controller determines, based on the EPG information stored in the storage unit, whether EPG information corresponding to the uttered voice can be provided,

wherein, if the EPG information can be provided, the controller generates the response information corresponding to the uttered voice based on the EPG information, and, if the EPG information cannot be provided, the controller generates alternative response information related to the uttered voice based on at least one of the EPG information and a web search.

11. The interactive server according to claim 10, wherein, if the uttered voice includes EPG-related utterance elements associated with a plurality of requests, the controller generates a re-utterance request message for requesting, at the display device, that the user utter the voice again.

12. The interactive server according to claim 9, wherein, if the extracted utterance element is an utterance element related to operation control of the display device, the controller determines, based on the utterance element, whether the operation of the display device corresponding to the uttered voice can be controlled,

wherein, if the operation of the display device can be controlled, the controller generates response information for controlling the operation of the display device, and, if the operation of the display device cannot be controlled, the controller generates response information related to at least one of a method for controlling the operation of the display device and a current state notification.

13. The interactive server according to claim 8, further comprising a storage unit which stores a table of restricted utterance elements,

wherein, if the extracted utterance element includes a restricted utterance element, the controller generates a message indicating that the operation corresponding to the uttered voice is not permitted.

14. The interactive server according to claim 13, wherein the communication unit further receives user information from the display device,

wherein, if the extracted utterance element is associated with the age of the user, the controller determines, based on the user information, whether to generate the message indicating that the operation corresponding to the uttered voice is not permitted.

15. A method for providing response information corresponding to a user's uttered voice in an interactive server interlocked with a display device, the method comprising:

receiving the user's uttered voice from the display device;

extracting an utterance element from the uttered voice;

generating response information corresponding to the uttered voice in a different form according to a function classified based on the extracted utterance element; and

transmitting the response information to the display device.