CN114283801A - Voice interaction display system and intelligent display terminal - Google Patents

Voice interaction display system and intelligent display terminal Download PDF

Info

Publication number
CN114283801A
CN114283801A CN202111532775.XA CN202111532775A CN114283801A CN 114283801 A CN114283801 A CN 114283801A CN 202111532775 A CN202111532775 A CN 202111532775A CN 114283801 A CN114283801 A CN 114283801A
Authority
CN
China
Prior art keywords
unit
signal
voice
information
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111532775.XA
Other languages
Chinese (zh)
Inventor
徐遥令
徐小清
沈思宽
吴伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Skyworth RGB Electronics Co Ltd
Original Assignee
Shenzhen Skyworth RGB Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Skyworth RGB Electronics Co Ltd filed Critical Shenzhen Skyworth RGB Electronics Co Ltd
Priority to CN202111532775.XA priority Critical patent/CN114283801A/en
Publication of CN114283801A publication Critical patent/CN114283801A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a voice interaction display system and an intelligent display terminal, wherein the system comprises a host, an active optical cable module, a server, a first display screen and a second display screen; the host is respectively connected with the active optical cable module and the server and used for acquiring a program signal and an external sound signal, outputting a program video image signal to the active optical cable module according to the program signal and outputting a voice signal to the server according to the external sound signal; the server is used for returning response information to the host according to the voice signal, and the host outputs text information and interactive information to the active optical cable module according to the response information; the active light module is respectively connected with the first display screen and the second display screen and is used for processing the program video image signals and driving the first display screen to display video images, and processing the text information and the interactive information and driving the second display screen to display text contents and interactive contents. The invention realizes that the user can watch the video image content, the voice information and the interactive information at the same time.

Description

Voice interaction display system and intelligent display terminal
Technical Field
The invention relates to the technical field of intelligent interaction, in particular to a voice interaction display system and an intelligent display terminal.
Background
With the continuous innovation and development of ultra-high-definition televisions, the television interaction mode is changed, the mode of button control by a traditional infrared remote controller is gradually changed into the interaction mode combining wireless communication, artificial intelligent voice and the like, and the television is also added with a display function and a voice control function for providing information such as time, weather and the like for consumers on the basis of single audio and video playing basic functions. When the existing high-definition intelligent television with the artificial intelligent voice interaction function is interacted, the generated interface can shield the video or image on the screen, and the normal watching of a user is influenced; in addition, the display cannot display voice information, interactive information and the like while playing videos or pictures, and the convenience of obtaining information by a user cannot be met.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the utility model provides a voice interaction display system and intelligent display terminal to solve the interface that produces when artificial intelligence voice interaction and shelter from the video or the image of broadcasting at present, and can't show the technical problem of speech information and mutual information etc. when video and image broadcast.
In order to solve the technical problems, the invention adopts the technical scheme that:
a voice interactive display system comprising: the system comprises a host, an active optical cable module, a server, a first display screen and a second display screen;
the host is respectively connected with the active optical cable module and the server and used for acquiring a program signal and an external sound signal, outputting a program video image signal to the active optical cable module according to the program signal and outputting a voice signal to the server according to the external sound signal;
the server is used for returning response information to the host according to the voice signal, and the host outputs text information and interactive information to the active optical cable module according to the response information;
the active light module is respectively connected with the first display screen and the second display screen and is used for processing the program video image signals and driving the first display screen to display video images, and processing the text information and the interactive information and driving the second display screen to display text contents and interactive contents.
Further, the host includes: the device comprises an interface processing module, a sound processing module, an audio and video processing module and a processor module;
the interface processing module is respectively connected with an external signal source and the audio and video processing module and is used for receiving the program signals output by the signal source, and outputting the program signals to the audio and video processing module after signal enhancement, analysis and descrambling processing;
the audio and video processing module is connected with the active optical cable module and is used for decoding the program signals to obtain the program video image signals and the program audio signals, outputting the program video image signals to the active optical cable module and outputting the program audio signals to the processor module;
the processor module is connected with the sound processing module and is used for outputting the program audio signal to the sound processing module;
and the sound processing module is connected with a loudspeaker and is used for driving the loudspeaker to output audio according to the program audio signal.
Further, the host further includes: the system comprises an interaction processing module and a network processing module;
the interactive processing module is connected with the processor module and used for acquiring the external sound signal, identifying the sound signal and outputting an initial audio signal and an identification result to the processor module;
the processor module is respectively connected with the network processing module and the active optical cable module, and is used for extracting the voice signal according to the initial audio signal and the recognition result, outputting the voice signal to the network processing module, and outputting the text information and the interaction information to the active optical cable module according to the response information, wherein the processor module acquires the characteristic parameters of the second display screen through the active optical cable module;
the network processing module is connected with the server and used for packaging the voice signals to obtain voice packets, sending the voice packets to the server, receiving and unpacking message packets sent by the server to obtain the response information and sending the response information to the processor module.
Further, the interaction processing module comprises: the system comprises a microphone unit, a sound acquisition unit and an interaction identification unit;
the processor module includes: a voice extraction unit, an audio encoding unit and a control unit;
the sound processing module comprises a loudspeaker driving unit;
the microphone unit is connected with the sound acquisition unit and used for acquiring the external sound signal and outputting the external sound signal to the sound acquisition unit;
the loudspeaker driving unit is connected with the sound acquisition unit and is used for outputting a driving audio signal for driving the loudspeaker to work to the sound acquisition unit;
the sound acquisition unit is respectively connected with the interactive recognition unit and the voice extraction unit and is used for carrying out cancellation processing on the external sound signal and the driving audio signal to obtain an initial audio signal and outputting the initial audio signal to the interactive recognition unit and the voice extraction unit;
the interaction identification unit is connected with the control unit, a reference audio is arranged in the interaction identification unit and is used for detecting the initial audio signal and outputting an identification signal to the control unit when the initial audio signal is detected to have the reference audio;
the control unit is respectively connected with the voice extraction unit and the audio and video processing module and is used for driving the voice extraction unit to work according to the identification signal and controlling the program audio to be output to the sound processing module;
the voice extracting unit is connected with the audio coding unit and used for extracting the voice signal according to the initial audio signal and outputting the voice signal to the audio coding unit;
the audio coding unit is connected with the network processing module and used for coding the voice signal and outputting the coded voice signal to the network processing module.
Further, the processor module further comprises: a message processing unit, an execution unit, an information processing unit, and a voice synthesis unit;
the message processing unit is connected with the network processing module and used for receiving the response information, acquiring an interactive instruction from the response information and outputting the interactive instruction to the execution unit and acquiring text information and outputting the text information to the information processing unit;
the execution unit is connected with the information processing unit and used for executing interactive operation according to the interactive instruction and outputting interactive information of the interactive operation to the information processing unit;
the information processing unit is connected with the active optical cable module and is used for processing the text information and the interactive information, generating feature structure data which accords with feature parameters of the second display screen and outputting the feature structure data to the active optical cable module;
the voice synthesis unit is respectively connected with the message processing unit and the sound processing module and is used for receiving the text information and the interactive information, converting the text information and the interactive information into interactive audio signals and outputting the interactive audio signals to the sound processing module;
the control unit is used for controlling the voice synthesis unit to output the interactive audio signal to the sound processing module.
Further, the audio/video processing module includes: the device comprises an audio and video decoding unit, an image processing unit, a format processing unit and an audio processing unit;
the audio and video decoding unit is respectively connected with the interface processing module, the audio processing unit and the image processing unit and is used for receiving the program signal, decoding the program signal to obtain a program video and a program audio, outputting the program video to the image processing unit and outputting the program audio to the audio processing unit;
the image processing unit is connected with the format processing unit and is used for carrying out image processing on the program video and outputting the program video to the format processing unit;
the format processing unit is connected with the active optical cable module and is used for converting the program video into the program video image signals in the parallel format and outputting the program video image signals to the active optical cable module;
the audio processing unit is connected with the processor module and used for processing the program audio and outputting the program audio signal to the processor module so that the processor module outputs the program audio signal to the sound processing module.
Further, the server includes: the voice recognition device comprises a first receiving and unpacking unit, a voice recognition unit, an instruction conversion unit, a message generation unit and a first packet sending unit;
the first receiving and unpacking unit is respectively connected with the host and the voice recognition unit and is used for receiving the voice packet, unpacking the voice packet to obtain the voice signal and outputting the voice signal to the voice recognition unit;
the voice recognition unit is respectively connected with the instruction conversion unit and the message generation unit and is used for performing voice recognition on the voice signal, converting the voice signal into corresponding text information and outputting the text information to the information generation unit and the instruction conversion unit;
the instruction conversion unit is connected with the message generation unit and is used for judging and understanding according to the character information, converting the character information into instruction information and outputting the instruction information to the message generation unit;
the message generating unit is connected with the first packet sending unit and used for combining the instruction information and the text information into the response information and outputting the response information to the first packet sending unit;
the first packet sending unit is connected with the host, and is used for packaging the response information to obtain the message packet and sending the message packet to the network processing module.
Further, the active optical cable module includes: the device comprises a pixel coding unit, a pixel decoding unit and an optical fiber transmission unit;
the pixel coding unit is respectively connected with the audio and video processing module and the optical fiber transmission unit and is used for receiving the program video image signals and converting the program video image signals into serial signals to be output to the optical fiber transmission unit;
the optical fiber transmission units are respectively connected with the pixel decoding units and are used for converting the serial signals into optical signals for transmission and converting the optical signals into the serial signals to be output to the pixel decoding units;
the pixel decoding unit is connected with the first display screen and used for carrying out pixel decoding on the serial signals and driving the first display screen to display.
Further, the active optical cable module further includes: the device comprises a data weaving unit, a data analysis unit and a copper cable;
the data weaving unit is connected with the processor module, is connected with the data analysis unit through the copper cable, and is used for receiving the text information and the interactive information output by the processor module, converting the text information and the interactive information into screen display information and outputting the screen display information to the data analysis unit;
the data analysis unit is connected with the second display screen and used for analyzing the screen display information and then driving the second display screen to display.
An intelligent display terminal comprises the voice interactive display system.
The invention has the beneficial effects that: the host acquires a program signal and an external sound signal input by an external signal source, the program signal is processed and then transmitted through the active optical cable module and drives the first display screen to display video image content, the external sound signal is processed to obtain a voice signal and then sent to the server, the server returns response information, the host acquires text information and interaction information according to the response information, the text information and the interaction information are transmitted through the active optical cable module and drives the second display screen to display the text content and the interaction content after the processing, and therefore a user can watch the video image content through the first display screen and look over the voice information and the interaction information through the second display screen at the same time, the situation that a window generated during artificial intelligent voice interaction shields the video image content is avoided, and convenience of the artificial intelligent voice interaction is improved.
Drawings
FIG. 1 is a schematic block diagram of a voice interactive display system according to an embodiment of the present invention;
FIG. 2 is another functional block diagram of a voice interactive display system according to an embodiment of the present invention;
FIG. 3 is a further functional block diagram of a voice interactive display system in accordance with an embodiment of the present invention;
fig. 4 is a functional block diagram of an active optical cable module according to an embodiment of the present invention.
Description of reference numerals:
100. a host; 110. an interface processing module; 111. a signal enhancement unit; 112. analyzing a descrambling unit; 120. a sound processing module; 121. a speaker driving unit; 130. an audio and video processing module; 131. an audio/video decoding unit; 132. an image processing unit; 133. a format processing unit; 134. an audio processing unit; 140. a processor module; 141. a voice extraction unit; 142. an audio encoding unit; 143. a control unit; 144. a message processing unit; 145. an execution unit; 146. an information processing unit; 147. a speech synthesis unit; 150. an interactive processing module; 151. a microphone unit; 152. a sound acquisition unit; 153. an interaction identification unit; 160. a network processing module; 161. a second packet transmission unit; 162. a second receiving and unpacking unit; 200. an active optical cable module; 210. a pixel encoding unit; 220. a pixel decoding unit; 230. an optical fiber transmission unit; 231. an input interface; 232. a fiber optic cable; 233. an output interface; 240. a data weaving unit; 250. a data analysis unit; 260. a copper cable; 300. a first display screen; 400. a second display screen; 500. a server; 510. a first receiving and unpacking unit; 520. a voice recognition unit; 530. an instruction conversion unit; 540. a message generation unit; 550. a first packet transmission unit; 600. a loudspeaker.
Detailed Description
In order to explain technical contents, achieved objects, and effects of the present invention in detail, the following description is made with reference to the accompanying drawings in combination with the embodiments.
Examples
Referring to fig. 1 to 4, an embodiment of the present invention is:
a voice interaction display system is applied to intelligent display terminals such as intelligent televisions, intelligent screens and projection systems, and the intelligent display terminals can be applied to scenes such as families, meetings and teaching.
Referring to fig. 1, the voice interactive display system includes: a host 100, an active optical cable module 200, a server 500, a first display screen 300, and a second display screen 400; the host 100 is respectively connected to the active optical cable module 200 and the server 500, and configured to acquire a program signal and an external sound signal, output a program video image signal to the active optical cable module 200 according to the program signal, and output a voice signal to the server 500 according to the external sound signal; the server 500 is configured to return response information to the host 100 according to the voice signal, and the host 100 outputs text information and interaction information to the active optical cable module 200 according to the response information; the active light module is respectively connected to the first display screen 300 and the second display screen 400, and is configured to process the program video image signal and drive the first display screen 300 to display a video image, and process the text information and the interactive information and drive the second color display screen 400 to display text content and interactive content.
The working principle of the voice interactive display system in the embodiment is as follows: on one hand, the host 100 obtains a program signal input by an external signal source, processes the program signal to obtain a program video image signal, transmits the program video image signal through the active optical cable module 200, and drives the first display screen 300 to display video image content; on the other hand, the host 100 further obtains an external sound signal, processes the external sound signal, extracts a voice signal, outputs the voice signal to the server 500, the server 500 returns response information to the host 100 according to the voice signal, the host 100 processes the response information, obtains text information and interactive information, and transmits the text information and the interactive information through the active optical cable module 200 and drives the second display 400 to display text content and interactive content.
It can be understood that through this kind of setting realization double screen display, the window that produces when can effectively avoiding artificial intelligence voice interaction shelters from the video or the image of present broadcast, and the user also can watch video image content, interactive content and voice content simultaneously, and interactive content is more directly perceived, has effectively improved artificial intelligence voice interaction's convenience. Moreover, when the host 100 transmits high-quality video images to the first display screen 300, the transmitted data volume is large, the distance is long, the problems of signal radiation, signal attenuation and the like exist by adopting the traditional connecting cable, the traditional connecting cable is thick and hard, the wiring layout is inconvenient, and the overall impression can be influenced.
In this embodiment, first display screen 300 is the large screen, second display screen 400 is the small screen, second display screen 400 can through multiple mode with first display screen 300 makes up, and is located same horizontal plane, not only has the originality, still is favorable to reducing the thickness of the display screen after the combination. The interactive information comprises information such as time and weather and interactive operation information, and the text information is corresponding text content obtained by converting the voice signal.
Specifically, the host 100 includes: an interface processing module 110, a sound processing module 120, an audio and video processing module 130 and a processor module 140; the interface processing module 110 is respectively connected to an external signal source and the audio/video processing module 130, and is configured to receive the program signal output by the signal source, perform signal enhancement, analysis and descrambling on the program signal, and output the program signal to the audio/video processing module 130; the audio/video processing module 130 is connected to the active optical cable module 200, and is configured to decode the program signal to obtain the program video image signal and the program audio signal, output the program video image signal to the active optical cable module 200, and output the program audio signal to the processor module 140; the processor module 140 is connected to the sound processing module 120, and is configured to output the program audio signal to the sound processing module 120; the sound processing module 120 is connected to the speaker 600, and is configured to drive the speaker 600 to output audio according to the program audio signal.
In this embodiment, the interface processing module 110 is configured to receive the program signal output by the external signal source, process the program signal and output the processed program signal to the audio processing module, the audio processing module decodes the program signal and obtains a program video image signal and a program audio signal, the active optical cable module 200 transmits the program video image signal and drives the first display screen 300 to display, and the sound processing module 120 is configured to receive the program audio signal and drive the speaker to output an audio signal. It can be understood that, by the host 100 completing the processing of the television audio and video and the processing of the voice interaction, the first display screen 300 and the second display screen 400 only need to display the corresponding content, and therefore, the first display screen 300 and the second display screen 400 occupy less resources of the system, and the system is more stable as a whole.
In this embodiment, the interface processing module 110 includes a signal enhancement unit 111 and an analysis descrambling unit 112, where the signal enhancement unit 111 is connected to the analysis descrambling unit 112, and is configured to receive the program signal, perform signal enhancement processing, and output the signal to the analysis descrambling unit 112; the analysis descrambling unit 112 is connected to the audio/video processing module 130, and is configured to analyze and descramble the program signal and output the program signal to the audio/video processing module 130.
Further, the host 100 further includes: an interaction processing module 150 and a network processing module 160; the interaction processing module 150 is connected to the processor module 140, and is configured to acquire the external sound signal, identify the sound signal, and output an initial audio signal and the identification result to the processor module 140; the processor module 140 is respectively connected to the network processing module 160 and the active optical cable module 200, and is configured to extract the voice signal according to the initial audio signal and the recognition result, output the voice signal to the network processing module 160, and output the text information and the interaction information to the active optical cable module 200 according to the response information, where the processor module 140 obtains the characteristic parameters of the second display screen 400 through the active optical cable module 200; the network processing module 160 is connected to the server 500, and configured to packetize the voice signal to obtain a voice packet, send the voice packet to the server 500, receive a message packet sent by the server 500, depacketize the message packet to obtain the response information, and send the response information to the processor module 140.
It can be understood that, when the user outputs the interactive voice, the interactive processing module 150 obtains an external voice signal and performs recognition, and when it is recognized that the external voice signal includes the interactive voice of the user, outputs the initial audio signal to the processor module 140 to extract the voice signal of the interactive voice of the corresponding user, and sends the voice packet to the server 500 after being packetized by the network processing module 160, the server 500 outputs the message packet with the response information to the network processing module 160 according to the voice signal in the voice packet, and the network processing module 160 unpacks the response message and outputs the response message to the processor module 140. In this embodiment, the server 500 processes the voice signal and returns the response information, which is beneficial for the host 100 to provide more resources for other functions such as program audio and video playing.
Referring to fig. 3, specifically, the interaction processing module 150 includes: a microphone unit 151, a sound acquisition unit 152, and an interaction recognition unit 153; the processor module 140 includes: a speech extraction unit 141, an audio encoding unit 142, and a control unit 143; the sound processing module 120 includes a speaker driving unit 121. In this embodiment, the network processing module 160 includes a second packet sending unit 161 and a second receiving and unpacking unit 162, and the microphone unit 151 may be a microphone array to obtain a better sound pickup effect.
The microphone unit 151 is connected to the sound acquiring unit 152, and is configured to acquire the external sound signal and output the external sound signal to the sound acquiring unit 152; the speaker driving unit 121 is connected to the sound acquiring unit 152, and is configured to output a driving audio signal for driving a speaker to operate to the sound acquiring unit 152; the sound acquiring unit 152 is respectively connected to the interaction identifying unit 153 and the voice extracting unit 141, and configured to perform cancellation processing on the external sound signal and the driving audio signal to obtain an initial audio signal, and output the initial audio signal to the interaction identifying unit 153 and the voice extracting unit 141; the interaction identification unit 153 is connected to the control unit 143, and a reference audio is disposed in the interaction identification unit 153, and is configured to detect the initial audio signal and output an identification signal to the control unit 143 when the initial audio signal is detected to have the reference audio.
With this arrangement, the microphone unit 151 picks up an external sound signal and outputs the sound signal to the sound acquiring unit 152, and the sound acquiring unit 152 performs signal spectrum analysis and intensity analysis on the external sound signal and the driving audio signal output by the speaker driving unit 121, performs cancellation processing and the like on the external sound signal and the driving audio signal, and forms the initial audio signal, thereby preventing the audio output by the speaker 600 from affecting the acquisition of the voice signal, and improving the accuracy of the voice signal extraction. The initial audio signal is respectively sent to the voice extraction unit 141 and the interaction identification unit 153, the reference audio is provided in the interaction identification unit 153, the interaction identification unit 153 detects the initial audio signal, and when the reference audio is identified to exist in the initial audio signal, the identification signal is immediately output to the control unit 143.
The control unit 143 is respectively connected to the voice extracting unit 141 and the audio/video processing module 130, and is configured to drive the voice extracting unit 141 to operate according to the identification signal, and control the program audio to be output to the sound processing module 120; the voice extracting unit 141 is connected to the audio encoding unit 142, and is configured to extract the voice signal according to the initial audio signal and output the voice signal to the audio encoding unit 142; the audio encoding unit 142 is connected to the network processing module 160, and configured to encode the voice signal and output the encoded voice signal to the network processing module 160, in this embodiment, the audio encoding unit 142 is connected to the second packet sending unit 161, and the second packet sending unit 161 is configured to package the encoded voice signal into the voice packet and send the voice packet to the server 500.
With such a configuration, when the control unit 143 receives the identification signal, the voice extracting unit 141 is controlled to enter the working state, the voice extracting unit 141 starts to extract the voice signal with the reference audio in the initial audio signal as seven points and outputs the voice signal to the audio encoding unit 142, and the audio encoding unit 142 encodes the voice signal and then sends the encoded voice signal to the second packet sending unit 161 to be packaged and sent to the server 500. Wherein, the control unit 143 is further configured to control the program audio to be output to the speaker driving unit 121.
Specifically, the server 500 includes: a first receiving and unpacking unit 510, a voice recognition unit 520, an instruction conversion unit 530, a message generation unit 540 and a first packet transmission unit 550; the first receiving and unpacking unit 510 is respectively connected to the host 100 and the voice recognition unit 520, and is configured to receive and unpack the voice packet to obtain the voice signal, and output the voice signal to the voice recognition unit 520; the voice recognition unit 520 is respectively connected to the instruction conversion unit 530 and the message generation unit 540, and is configured to perform voice recognition on the voice signal, convert the voice signal into corresponding text information, and output the text information to the information generation unit and the instruction conversion unit 530; the instruction converting unit 530 is connected to the message generating unit 540, and is configured to perform judgment and understanding according to the text information, convert the text information into instruction information, and output the instruction information to the message generating unit 540; the message generating unit 540 is connected to the first packet sending unit 550, and configured to combine the instruction information and the text information into the response information, and output the response information to the first packet sending unit 550; the first packet sending unit 550 is connected to the host 100, and configured to package the response information to obtain the message packet, and send the message packet to the network processing module 160.
It can be understood that, the first receiving and unpacking unit 510 receives the voice packet, unpacks the voice packet to obtain the voice signal, and the voice recognition unit 520 recognizes the voice signal, converts the voice signal into corresponding text information, and outputs the text information to the instruction conversion unit 530 and the message generation unit 540 respectively. The instruction converting unit 530 determines and understands the text information, converts the text information into the instruction information, and outputs the instruction information to the message generating unit 540. The message generating unit 540 combines the text information and the instruction information into the response information, and sends the response information to the second receiving and unpacking unit 162 of the network processing module 160 by using the packet sending unit as the message packet.
Further, the processor module 140 further includes: a message processing unit 144, an execution unit 145, an information processing unit 146, and a speech synthesis unit 147; the message processing unit 144 is connected to the network processing module 160, and is configured to receive the response message, obtain an interactive instruction from the response message, output the interactive instruction to the execution unit 145, obtain text information, and output the text information to the information processing unit 146; the execution unit 145 is connected to the information processing unit 146, and configured to execute an interactive operation according to the interactive instruction, and output interactive information of the interactive operation to the information processing unit 146; the information processing unit 146 is connected to the active optical cable module 200, and configured to process the text information and the interaction information, generate feature structure data according with feature parameters of the second display screen 400, and output the feature structure data to the active optical cable module 200; the voice synthesis unit 147 is respectively connected to the message processing unit and the sound processing module 120, and is configured to receive the text information and the interactive information, convert the text information and the interactive information into an interactive audio signal, and output the interactive audio signal to the sound processing module 120; wherein, the control unit 143 is configured to control the speech synthesis unit 147 to output the interactive audio signal to the sound processing module 120.
It can be understood that, the second receiving and unpacking unit 162 unpacks the message packet to obtain the response information and sends the response information to the message processing unit 144, the message processing unit 144 processes the response information to obtain the text information and the instruction information, and the execution unit 145 executes corresponding interactive operation according to the instruction information and outputs interactive information of the interactive operation to the information processing unit 146. In this embodiment, the processor module 140 obtains the characteristic parameters of the second display screen 400 through the active optical cable module 200, where the characteristic parameters include the size and shape of the display screen. The information processing unit 146 performs arrangement processing on the text information and the interaction information according to the characteristic parameters of the second display screen 400, generates characteristic structure data conforming to the characteristic parameters of the second display, and outputs the characteristic structure data to the active optical cable module 200. Further, the voice synthesis unit 147 converts the text information or the interactive information into an interactive audio signal and outputs the interactive audio signal to the speaker driving unit 121, so as to output audio by driving the speaker 600. In this embodiment, the control unit 143 includes a register, which is beneficial for a user to determine whether to play the program audio or the interactive audio when the program audio signal and the interactive audio signal are both provided by setting a parameter of the register.
Referring to fig. 2, in this embodiment, the audio/video processing module 130 includes: an audio/video decoding unit 131, an image processing unit 132, a format processing unit 133, and an audio processing unit 134. The audio/video decoding unit 131 is respectively connected to the interface processing module 110, the audio processing unit 134, and the image processing unit 132, and is configured to receive the program signal, decode the program signal to obtain a program video and the program audio, output the program video to the image processing unit 132, and output the program audio to the audio processing unit 134.
The image processing unit 132 is connected to the format processing unit 133, and is configured to perform image processing on the program video and output the program video to the format processing unit 133; the format processing unit 133 is connected to the active optical cable module 200, and is configured to convert the program video into the program video image signal in the parallel format and output the program video image signal to the active optical cable module 200; the audio processing unit 134 is connected to the processor module 140, and is configured to process the program audio and output the program audio signal to the processor module 140, so that the processor module 140 outputs the program audio signal to the sound processing module 120.
It can be understood that the audio/video decoding unit 131 receives the program signal output by the parsing and descrambling unit 112 and decodes the program signal to obtain the program video and the program audio, the image processing unit 132 performs image processing such as color conversion and contrast enhancement on the program video, and the format processing unit 133 performs format conversion processing on the program video after image processing and outputs the program video image signal in a parallel format to the active optical cable module 200. The program audio is subjected to frequency response enhancement, gain adjustment, and the like through the audio processing unit 134, and outputs the program audio signal, under the control of the control unit 143, the program audio signal is output to the speaker driving unit 121, the speaker driving unit 121 outputs a driving audio signal according to the program audio signal, and outputs the driving audio signal to the sound acquiring unit 152 and drives the speaker 600 to output the program audio. By this arrangement, high-quality video images and audio can be obtained, thereby improving the playing effect of the program.
Referring to fig. 1 and 4, in particular, the active optical cable module 200 includes: a pixel encoding unit 210, a pixel decoding unit 220, and an optical fiber transmission unit 230; the pixel encoding unit 210 is respectively connected to the audio/video processing module 130 and the optical fiber transmission unit 230, and is configured to receive the program video image signal and convert the program video image signal into a serial signal, and output the serial signal to the optical fiber transmission unit 230; the optical fiber transmission units 230 are respectively connected to the pixel decoding units 220, and are configured to convert the serial signals into optical signals for transmission, and convert the optical signals into the serial signals to be output to the pixel decoding units 220; the pixel decoding unit 220 is connected to the first display 300, and configured to perform pixel decoding on the serial signal and drive the first display 300 to perform display.
In this embodiment, the optical fiber transmission unit 230 includes an input interface 231, an optical fiber cable 232, and an output interface 233, the optical fibers are respectively connected to the input interface 231 and the output interface 233, the input interface 231 is connected to the pixel encoding unit 210, and the output interface 233 is connected to the output decoding unit. The pixel decoding unit 220 receives the program image processing signal, performs pixel coding processing, converts the program image processing signal into a high-speed serial signal, and outputs the serial signal to the input interface 231, the input interface 231 performs electro-optical conversion on the serial signal into an optical signal, and outputs the optical signal to the output interface 233 through the optical fiber cable 232, the output interface 233 restores the optical signal into the serial signal through electro-optical conversion and outputs only the pixel decoding unit 220, and the pixel decoding unit 220 performs pixel decoding on the serial signal and outputs a first display screen 300 driving signal to drive the first display screen 300 to output video image content. Compared with the traditional copper cable 260, the optical fiber cable 232 is used for transmitting program video information, has better transmission performance, is lighter and has stronger anti-interference capability.
Further, the active optical cable module 200 further includes: a data weaving unit 240, a data analysis unit 250, and a copper cable 260; the data weaving unit 240 is connected to the processor module 140 and the data analysis unit 250 through the copper cable 260, and is configured to receive text information and interactive information output by the processor module 140, convert the text information and interactive information into screen display information, and output the screen display information to the data analysis unit 250; the data analysis unit 250 is connected to the second display screen 400, and is configured to analyze the screen display information and then drive the second display screen 400 to display the screen display information.
In this embodiment, the data weaving unit 240 receives the text information and the interactive information output by the information processing unit 146, performs data weaving processing on the text information and the interactive information, converts the text information and the interactive information into the screen display information, and transmits the screen display information to the data analyzing unit 250 through the copper cable 260, and the data analyzing unit 250 analyzes the screen display information and outputs a second display screen 400 driving signal to drive the second display screen 400 to output the text content and the interactive content. Wherein the processor module 140 obtains the characteristic parameters of the second display screen 400 through a link path composed of the data weaving unit 240, the copper cable 260 and the data analysis unit 250.
The active optical cable module 200 in this embodiment includes an optical fiber cable 232 and a copper cable 260, the optical fiber cable 232 is used to transmit high-rate program video image signals, and the copper cable 260 is used to transmit text information and interactive information of a dropping rate, which is beneficial to improving the transmission performance and the transmission stability between the host 100 and the two display screens.
The embodiment also provides an intelligent display terminal, which comprises the voice interactive display system.
In summary, the voice interaction display system and the intelligent display terminal provided by the invention can effectively prevent the window generated during artificial intelligent voice interaction from blocking the currently played video or image, and the user can also watch the video image content, the interaction content and the voice content at the same time, so that the interaction content is more intuitive, and the convenience of artificial intelligent voice interaction is effectively improved. And, through using active optical cable module, improved transmission performance, it is lighter than traditional ordinary signal cable, the interference killing feature is stronger. In addition, the first display screen and the second display screen occupy less resources of the system, and the whole system is more stable.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent modifications made by the contents of the present specification and the drawings, or applied to the related technical fields directly or indirectly, are included in the scope of the present invention.

Claims (10)

1. A voice interactive display system, comprising: the system comprises a host, an active optical cable module, a server, a first display screen and a second display screen;
the host is respectively connected with the active optical cable module and the server and used for acquiring a program signal and an external sound signal, outputting a program video image signal to the active optical cable module according to the program signal and outputting a voice signal to the server according to the external sound signal;
the server is used for returning response information to the host according to the voice signal, and the host outputs text information and interactive information to the active optical cable module according to the response information;
the active light module is respectively connected with the first display screen and the second display screen and is used for processing the program video image signals and driving the first display screen to display video image contents, and processing the text information and the interactive information and driving the second display screen to display the text contents and the interactive contents.
2. The voice interactive display system of claim 1, wherein the host computer comprises: the device comprises an interface processing module, a sound processing module, an audio and video processing module and a processor module;
the interface processing module is respectively connected with an external signal source and the audio and video processing module and is used for receiving the program signals output by the signal source, and outputting the program signals to the audio and video processing module after signal enhancement, analysis and descrambling processing;
the audio and video processing module is connected with the active optical cable module and is used for decoding the program signals to obtain the program video image signals and the program audio signals, outputting the program video image signals to the active optical cable module and outputting the program audio signals to the processor module;
the processor module is connected with the sound processing module and is used for outputting the program audio signal to the sound processing module;
and the sound processing module is connected with a loudspeaker and is used for driving the loudspeaker to output audio according to the program audio signal.
3. The voice interactive display system of claim 2, wherein the host further comprises: the system comprises an interaction processing module and a network processing module;
the interactive processing module is connected with the processor module and used for acquiring the external sound signal, identifying the sound signal and outputting an initial audio signal and an identification result to the processor module;
the processor module is respectively connected with the network processing module and the active optical cable module, and is used for extracting the voice signal according to the initial audio signal and the recognition result, outputting the voice signal to the network processing module, and outputting the text information and the interaction information to the active optical cable module according to the response information, wherein the processor module acquires the characteristic parameters of the second display screen through the active optical cable module;
the network processing module is connected with the server and used for packaging the voice signals to obtain voice packets, sending the voice packets to the server, receiving and unpacking message packets sent by the server to obtain the response information and sending the response information to the processor module.
4. The voice interactive display system of claim 3, wherein the interactive processing module comprises: the system comprises a microphone unit, a sound acquisition unit and an interaction identification unit;
the processor module includes: a voice extraction unit, an audio encoding unit and a control unit;
the sound processing module comprises a loudspeaker driving unit;
the microphone unit is connected with the sound acquisition unit and used for acquiring the external sound signal and outputting the external sound signal to the sound acquisition unit;
the loudspeaker driving unit is connected with the sound acquisition unit and is used for outputting a driving audio signal for driving the loudspeaker to work to the sound acquisition unit;
the sound acquisition unit is respectively connected with the interactive recognition unit and the voice extraction unit and is used for carrying out cancellation processing on the external sound signal and the driving audio signal to obtain an initial audio signal and outputting the initial audio signal to the interactive recognition unit and the voice extraction unit;
the interaction identification unit is connected with the control unit, a reference audio is arranged in the interaction identification unit and is used for detecting the initial audio signal and outputting an identification signal to the control unit when the initial audio signal is detected to have the reference audio;
the control unit is respectively connected with the voice extraction unit and the audio and video processing module and is used for driving the voice extraction unit to work according to the identification signal and controlling the program audio to be output to the sound processing module;
the voice extracting unit is connected with the audio coding unit and used for extracting the voice signal according to the initial audio signal and outputting the voice signal to the audio coding unit;
the audio coding unit is connected with the network processing module and used for coding the voice signal and outputting the coded voice signal to the network processing module.
5. The voice interactive display system of claim 4, wherein the processor module further comprises: a message processing unit, an execution unit, an information processing unit, and a voice synthesis unit;
the message processing unit is connected with the network processing module and used for receiving the response information, acquiring an interactive instruction from the response information and outputting the interactive instruction to the execution unit and acquiring text information and outputting the text information to the information processing unit;
the execution unit is connected with the information processing unit and used for executing interactive operation according to the interactive instruction and outputting interactive information of the interactive operation to the information processing unit;
the information processing unit is connected with the active optical cable module and is used for processing the text information and the interactive information, generating feature structure data which accords with feature parameters of the second display screen and outputting the feature structure data to the active optical cable module;
the voice synthesis unit is respectively connected with the message processing unit and the sound processing module and is used for receiving the text information and the interactive information, converting the text information and the interactive information into interactive audio signals and outputting the interactive audio signals to the sound processing module;
the control unit is used for controlling the voice synthesis unit to output the interactive audio signal to the sound processing module.
6. The voice interactive display system of claim 4, wherein the audio-video processing module comprises: the device comprises an audio and video decoding unit, an image processing unit, a format processing unit and an audio processing unit;
the audio and video decoding unit is respectively connected with the interface processing module, the audio processing unit and the image processing unit and is used for receiving the program signal, decoding the program signal to obtain a program video and a program audio, outputting the program video to the image processing unit and outputting the program audio to the audio processing unit;
the image processing unit is connected with the format processing unit and is used for carrying out image processing on the program video and outputting the program video to the format processing unit;
the format processing unit is connected with the active optical cable module and is used for converting the program video into the program video image signals in the parallel format and outputting the program video image signals to the active optical cable module;
the audio processing unit is connected with the processor module and used for processing the program audio and outputting the program audio signal to the processor module so that the processor module outputs the program audio signal to the sound processing module.
7. The voice interactive display system of claim 3, wherein the server comprises: the voice recognition device comprises a first receiving and unpacking unit, a voice recognition unit, an instruction conversion unit, a message generation unit and a first packet sending unit;
the first receiving and unpacking unit is respectively connected with the host and the voice recognition unit and is used for receiving the voice packet, unpacking the voice packet to obtain the voice signal and outputting the voice signal to the voice recognition unit;
the voice recognition unit is respectively connected with the instruction conversion unit and the message generation unit and is used for performing voice recognition on the voice signal, converting the voice signal into corresponding text information and outputting the text information to the information generation unit and the instruction conversion unit;
the instruction conversion unit is connected with the message generation unit and is used for judging and understanding according to the character information, converting the character information into instruction information and outputting the instruction information to the message generation unit;
the message generating unit is connected with the first packet sending unit and used for combining the instruction information and the text information into the response information and outputting the response information to the first packet sending unit;
the first packet sending unit is connected with the host, and is used for packaging the response information to obtain the message packet and sending the message packet to the network processing module.
8. The voice interactive display system of claim 2, wherein the active optical cable module comprises: the device comprises a pixel coding unit, a pixel decoding unit and an optical fiber transmission unit;
the pixel coding unit is respectively connected with the audio and video processing module and the optical fiber transmission unit and is used for receiving the program video image signals and converting the program video image signals into serial signals to be output to the optical fiber transmission unit;
the optical fiber transmission units are respectively connected with the pixel decoding units and are used for converting the serial signals into optical signals for transmission and converting the optical signals into the serial signals to be output to the pixel decoding units;
the pixel decoding unit is connected with the first display screen and used for carrying out pixel decoding on the serial signals and driving the first display screen to display.
9. The voice interactive display system of claim 8, wherein the active optical cable module further comprises: the device comprises a data weaving unit, a data analysis unit and a copper cable;
the data weaving unit is connected with the processor module, is connected with the data analysis unit through the copper cable, and is used for receiving the text information and the interactive information output by the processor module, converting the text information and the interactive information into screen display information and outputting the screen display information to the data analysis unit;
the data analysis unit is connected with the second display screen and used for analyzing the screen display information and then driving the second display screen to display.
10. An intelligent display terminal, characterized in that it comprises a voice interactive display system according to any one of claims 1 to 9.
CN202111532775.XA 2021-12-15 2021-12-15 Voice interaction display system and intelligent display terminal Pending CN114283801A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111532775.XA CN114283801A (en) 2021-12-15 2021-12-15 Voice interaction display system and intelligent display terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111532775.XA CN114283801A (en) 2021-12-15 2021-12-15 Voice interaction display system and intelligent display terminal

Publications (1)

Publication Number Publication Date
CN114283801A true CN114283801A (en) 2022-04-05

Family

ID=80872311

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111532775.XA Pending CN114283801A (en) 2021-12-15 2021-12-15 Voice interaction display system and intelligent display terminal

Country Status (1)

Country Link
CN (1) CN114283801A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114945103A (en) * 2022-05-13 2022-08-26 深圳创维-Rgb电子有限公司 Voice interaction system and voice interaction method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114945103A (en) * 2022-05-13 2022-08-26 深圳创维-Rgb电子有限公司 Voice interaction system and voice interaction method
WO2023216414A1 (en) * 2022-05-13 2023-11-16 深圳创维-Rgb电子有限公司 Speech interaction system and speech interaction method

Similar Documents

Publication Publication Date Title
WO2019001347A1 (en) Screen projection method for mobile device, storage medium, terminal and screen projection system
CN201319640Y (en) Digital television receiving terminal capable of synchronously translating in real time
CN109640056B (en) USB camera monitoring system and method based on Android platform
US11500804B2 (en) Method for transmitting control instruction, transmitting device, and receiving device
CN102209232A (en) Remote audio and video monitor system and method thereof
CN103442280B (en) Realize the method, apparatus and its television and mobile terminal of distributed remote control
CN107102694A (en) A kind of display with kinescope recording function
CN106227492A (en) Combination and mobile intelligent terminal interconnected method and device
CN110351533A (en) The network transmission monitoring system and its implementation of video image processing based on Linux
CN114283801A (en) Voice interaction display system and intelligent display terminal
WO2018072337A1 (en) Wireless transmission method, device, terminal and head-mounted display device for virtual reality
CN107707866A (en) A kind of remote video communication method based on Internet of Things
CN111385621B (en) Display device and Bluetooth audio transmission method
CN210807465U (en) Multifunctional video converter
CN206773587U (en) A kind of display with kinescope recording function
CN201805504U (en) Remote audio-video monitoring system
US9992547B2 (en) Method for adjusting backlight brightness of television and box for adjusting backlight brightness of television
KR20130066432A (en) Method and apparatus for providing iptv service based on cloud
US9288433B2 (en) Digital imaging apparatus and control method thereof
CN104754261A (en) Projection equipment and projection method
US9237304B2 (en) Multi-channel sharing apparatus and multi-channel sharing method
WO2013174337A2 (en) Subtitle extraction method and apparatus
CN115278323A (en) Display device, intelligent device and data processing method
CN211531219U (en) Multimedia data sending device, receiving device and transmission system
CN210986124U (en) Transmitting and receiving device and system for audio and video data in long-distance transmission scene

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination