WO2018001088A1 - Method and apparatus for presenting communication information, device and set-top box - Google Patents

Method and apparatus for presenting communication information, device and set-top box

Info

Publication number
WO2018001088A1
Authority
WO
WIPO (PCT)
Prior art keywords
display
exchange information
module
information
voice
Prior art date
Application number
PCT/CN2017/088109
Other languages
French (fr)
Chinese (zh)
Inventor
李晓君
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2018001088A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M11/00: Telephonic communication systems specially adapted for combination with other electrical systems
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434: Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/14: Systems for two-way working

Definitions

  • the embodiments of the invention provide a method, an apparatus and a device for presenting communication information, and a set-top box, so as to facilitate daily communication between unimpaired users and language-impaired users.
  • a set-top box, including: a sign language database, and a voice module, a sign language conversion module, and a display module that are connected to one another, wherein
  • the voice module is configured to acquire audio data and, after recognition processing and correction, recognize the audio data as semantics;
  • the method for presenting communication information in the foregoing embodiment further includes: if multiple kinds of first communication information are collected through two or more paths, presenting the second communication information corresponding to each first communication information in separate pictures.
  • the sign language conversion module is configured to match, in the sign language database, the semantics to be output according to the processed gesture;
  • the display module in the above embodiment is further configured to display the standard gesture corresponding to the captured user gesture, for the user to learn.
  • the speech recognition module 302 analyzes the audio data, proofreads and corrects the semantics, converts the result into text that is passed to the subtitle module 303 and, at the same time, into sign language 309, and outputs both 303 and 309 to the display module 310;
  • the sign language 305 is converted into subtitles 303 by the image recognition module 304 and sent to the display module 310.
  • the central processing module 311 controls the speech and image recognition modules and the display module so that the converted outputs appear in different display areas, allowing users to communicate interactively with ease.
  • finally, 709, 710 and 711 are shown on display 712 according to priority. The three pieces of information are displayed at different positions, each position indicating which party the content comes from, and the transparency, font size, and sign language size of each are adjustable. For example, when users communicate frequently, the fonts of 710 and 711 are enlarged so that users can concentrate on chatting; when there is little communication, the font of 709 is slightly enlarged so that the hearing- and speech-impaired user can focus on watching the TV program.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Provided in the embodiments of the present invention are a method and an apparatus for presenting communication information, a device and a set-top box. The method comprises: collecting first communication information presented in a first presentation manner; parsing the first communication information, acquiring a data content corresponding to the first communication information, and acquiring second communication information corresponding to the data content; and presenting the second communication information in a second presentation manner. The embodiments of the present invention can realize the transformation of data between any different presentation manners to help people having different needs communicate, for example, an unimpaired user can select a voice presentation manner and a language-impaired user can select a sign language presentation manner, such that different users only need to present the contents they want to communicate in the manner they commonly use, and after the transformation of the data content, both parties communicating can understand each other and communicate conveniently, with improved user experience.

Description

Method, apparatus and device for presenting communication information, and set-top box
Technical field
The present invention relates to the field of user communication, and in particular to a method, an apparatus and a device for presenting communication information, and a set-top box.
Background
Sign language emerged to facilitate communication between unimpaired users and language-impaired users, but it requires both groups to learn a considerable amount, which degrades the user experience.
Consequently, most existing sign language translation is done by third-party interpreters. Even for television, a third party translates the content and encodes it into the video sent to end users. In practice, a sign language interpreter is provided only for major breaking news or important live broadcasts; ordinary TV programs are not translated, so hearing- and speech-impaired viewers cannot freely watch the programs they want to see.
Summary of the invention
Embodiments of the present invention provide a method, an apparatus and a device for presenting communication information, and a set-top box, to facilitate daily communication between unimpaired users and language-impaired users.
In one aspect, a method for presenting communication information is provided, including:
collecting first communication information presented in a first presentation manner;
parsing the first communication information, acquiring the data content corresponding to the first communication information, and acquiring second communication information corresponding to the data content;
presenting the second communication information in a second presentation manner.
In one aspect, an apparatus for presenting communication information is provided, including:
a collection module, configured to collect first communication information presented in a first presentation manner;
a processing module, configured to parse the first communication information, acquire the data content corresponding to the first communication information, and acquire second communication information corresponding to the data content;
a presentation module, configured to present the second communication information in a second presentation manner.
In another aspect, a device for presenting communication information is provided, including an interaction module and a processor, wherein
the interaction module is configured to collect first communication information presented in a first presentation manner and output it to the processor, and is further configured to present, in a second presentation manner, the second communication information returned by the processor;
the processor is configured to parse the first communication information, acquire the data content corresponding to the first communication information, acquire second communication information corresponding to the data content, and transmit the second communication information to the interaction module.
In another aspect, a set-top box is provided, including a sign language database, and a voice module, a sign language conversion module, and a display module that are connected to one another, wherein
the voice module is configured to acquire audio data and, after recognition processing and correction, recognize the audio data as semantics;
the sign language conversion module is configured to match, in the sign language database and according to the semantics, the sign language to be output that corresponds to the audio data;
the display module is configured to display the sign language to be output.
In another aspect, a computer storage medium is provided, storing computer-executable instructions that are configured to perform the foregoing method for presenting communication information.
Beneficial effects of the embodiments of the present invention:
Embodiments of the present invention provide a method for presenting communication information: first communication information presented in a first presentation manner is collected and parsed, the data content corresponding to it is acquired, second communication information corresponding to that data content is acquired, and the second communication information is presented in a second presentation manner. Data can thus be converted between any different presentation manners, which helps people with different needs communicate. For example, a voice presentation manner can be selected for unimpaired users and a sign language presentation manner for language-impaired users. Each user then only needs to present what they want to communicate in the manner they normally use; through conversion based on the data content, both parties can understand each other's intent and communicate conveniently, which improves the user experience.
Brief description of the drawings
FIG. 1 is a flowchart of the method for presenting communication information according to the first embodiment of the present invention;
FIG. 2 is a schematic structural diagram of the device for presenting communication information according to the third embodiment of the present invention;
FIG. 3 is a simplified structural diagram of the set-top box according to the fifth embodiment of the present invention;
FIG. 4 is a flowchart of sign-language-to-speech conversion according to the fifth embodiment of the present invention;
FIG. 5 is a flowchart of converting a user's speech to sign language according to the fifth embodiment of the present invention;
FIG. 6 is a flowchart of converting TV program speech to sign language according to the fifth embodiment of the present invention;
FIG. 7 is a detailed structural diagram of the set-top box according to the fifth embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.
The present invention is further explained below through specific embodiments with reference to the accompanying drawings.
First embodiment:
FIG. 1 is a flowchart of the method for presenting communication information according to the first embodiment of the present invention. As shown in FIG. 1, the method provided in this embodiment includes:
S101: collecting first communication information presented in a first presentation manner;
S102: parsing the first communication information, acquiring the data content corresponding to the first communication information, and acquiring second communication information corresponding to the data content;
S103: presenting the second communication information in a second presentation manner.
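Steps S101 to S103 can be read as a small conversion pipeline. The following is a minimal sketch under stated assumptions, not the patent's implementation: the `Message` type, the `PARSERS` and `RENDERERS` tables, and the string payloads are all hypothetical stand-ins for real audio, gesture recognition, and rendering.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Message:
    manner: str   # presentation manner, e.g. "voice", "subtitle", "sign"
    payload: str  # stand-in for raw audio, text, or a gesture label


# Hypothetical parsers: turn a payload in a given manner into neutral data content.
PARSERS: Dict[str, Callable[[str], str]] = {
    "voice": lambda p: p.strip().lower(),
    "sign": lambda p: p.strip().lower(),
}

# Hypothetical renderers: turn data content into a payload in the target manner.
RENDERERS: Dict[str, Callable[[str], str]] = {
    "subtitle": lambda c: c.capitalize(),
    "sign": lambda c: f"<sign:{c}>",
    "voice": lambda c: f"<speech:{c}>",
}


def convert(first: Message, second_manner: str) -> Message:
    """S102: parse the first message into data content, then build the second."""
    content = PARSERS[first.manner](first.payload)
    return Message(second_manner, RENDERERS[second_manner](content))


def present(msg: Message) -> str:
    """S103: stand-in for drawing on screen or playing through a speaker."""
    return f"[{msg.manner}] {msg.payload}"


collected = Message("voice", " Hello ")  # S101: collected first communication information
shown = present(convert(collected, "sign"))
```

Keeping the data content neutral between parsing and rendering is what lets any supported input manner be paired with any supported output manner.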
In some embodiments, the first presentation manner includes a voice manner and the second presentation manner includes a picture manner;
collecting the first communication information presented in the first presentation manner includes: capturing external speech through a speech recognition device, and/or acquiring the first communication information by capturing an audio channel;
presenting the second communication information in the second presentation manner includes: presenting the second communication information on the screen as subtitles and/or gestures.
In some embodiments, the method further includes: if multiple kinds of first communication information are collected through two or more paths, presenting the second communication information corresponding to each first communication information in separate pictures.
In some embodiments, the method further includes: determining, according to the importance of each first communication information, the on-screen position at which the corresponding second communication information is presented.
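As an illustration of choosing screen positions by importance, the sketch below ranks the sources and hands out regions from most to least prominent. The region names, the source names, and the integer importance scores are assumptions made for the example; the text does not specify how importance is measured.

```python
from typing import Dict, List

# Hypothetical screen regions, ordered from most to least prominent.
REGIONS: List[str] = ["center-bottom", "right", "top-left"]


def assign_positions(importance: Dict[str, int]) -> Dict[str, str]:
    """Give the most important source the most prominent region."""
    ranked = sorted(importance, key=lambda s: importance[s], reverse=True)
    # Sources beyond the number of regions share the least prominent one.
    return {s: REGIONS[min(i, len(REGIONS) - 1)] for i, s in enumerate(ranked)}


layout = assign_positions({"tv_audio": 1, "microphone": 3})
```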
In some embodiments, the first presentation manner includes a picture manner and the second presentation manner includes a voice manner;
collecting the first communication information presented in the first presentation manner includes: capturing external gestures and/or text through image recognition, and/or acquiring the first communication information by capturing an image channel;
presenting the second communication information in the second presentation manner includes: presenting the second communication information as simulated speech through a speaker.
Second embodiment:
The apparatus for presenting communication information provided in this embodiment includes:
a collection module, configured to collect first communication information presented in a first presentation manner;
a processing module, configured to parse the first communication information, acquire the data content corresponding to the first communication information, and acquire second communication information corresponding to the data content;
a presentation module, configured to present the second communication information in a second presentation manner.
In some embodiments, the first presentation manner includes a voice manner and the second presentation manner includes a picture manner; the collection module is configured to capture external speech through a speech recognition device, and/or acquire the first communication information by capturing an audio channel; the presentation module is configured to present the second communication information on the screen as subtitles and/or gestures.
In some embodiments, the presentation module is further configured to, if multiple kinds of first communication information are collected through two or more paths, present the second communication information corresponding to each first communication information in separate pictures.
In some embodiments, the presentation module is further configured to determine, according to the importance of each first communication information, the on-screen position at which the corresponding second communication information is presented.
In some embodiments, the first presentation manner includes a picture manner and the second presentation manner includes a voice manner; the collection module is configured to capture external gestures and/or text through image recognition, and/or acquire the first communication information by capturing an image channel; the presentation module is configured to present the second communication information as simulated speech through a speaker.
Third embodiment:
FIG. 2 is a schematic structural diagram of the device for presenting communication information according to the third embodiment of the present invention. As shown in FIG. 2, the device provided in this embodiment includes an interaction module 21 and a processor 22, wherein
the interaction module 21 is configured to collect first communication information presented in a first presentation manner and output it to the processor, and is further configured to present, in a second presentation manner, the second communication information returned by the processor;
the processor 22 is configured to parse the first communication information, acquire the data content corresponding to the first communication information, acquire second communication information corresponding to the data content, and transmit the second communication information to the interaction module.
In some embodiments, the first presentation manner includes a voice manner and the second presentation manner includes a picture manner; the interaction module 21 is configured to capture external speech through a speech recognition device, and/or acquire the first communication information by capturing an audio channel; it is further configured to present the second communication information on the screen as subtitles and/or gestures.
In some embodiments, the interaction module 21 is further configured to, if multiple kinds of first communication information are collected through two or more paths, present the second communication information corresponding to each first communication information in separate pictures.
In some embodiments, the interaction module 21 is further configured to determine, according to the importance of each first communication information, the on-screen position at which the corresponding second communication information is presented.
In some embodiments, the first presentation manner includes a picture manner and the second presentation manner includes a voice manner; the interaction module 21 is configured to capture external gestures and/or text through image recognition, and/or acquire the first communication information by capturing an image channel; it is further configured to present the second communication information as simulated speech through a speaker.
Fourth embodiment:
This embodiment provides a set-top box, including a sign language database, and a voice module, a sign language conversion module, and a display module that are connected to one another, wherein
the voice module is configured to acquire audio data and, after recognition processing and correction, recognize the audio data as semantics;
the sign language conversion module is configured to match, in the sign language database and according to the semantics, the sign language to be output that corresponds to the audio data;
the display module is configured to display the sign language to be output.
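The division of labor among the three modules can be sketched as follows. This is a toy illustration under assumptions: `SIGN_DB`, its asset file names, and the `correct` helper (which stands in for speech recognition plus correction and here only normalizes already-recognized text) are hypothetical, and real matching would operate on semantics rather than single words.

```python
from typing import Dict, List

# Hypothetical sign language database: word -> sign asset identifier.
SIGN_DB: Dict[str, str] = {
    "hello": "sign_hello.gif",
    "thank": "sign_thank.gif",
    "you": "sign_you.gif",
}


def correct(recognized: str) -> List[str]:
    """Stand-in for recognition plus correction: normalize and tokenize."""
    words = (w.strip(".,!?").lower() for w in recognized.split())
    return [w for w in words if w]


def to_sign_sequence(recognized: str) -> List[str]:
    """Match each word against the database; unknown words are skipped."""
    return [SIGN_DB[w] for w in correct(recognized) if w in SIGN_DB]


signs = to_sign_sequence("Thank you!")  # what the display module would show
```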
In some embodiments, the display module is further configured to display the semantics of the audio data, so that the user can confirm whether it is what the unimpaired user intended to express.
In some embodiments, the voice module is configured to separately acquire the audio data of a live TV program and the audio data produced by an unimpaired person through a microphone.
In some embodiments, the set-top box further includes an image module;
the image module is configured to capture the user's gesture, proofread and correct the gesture, and then transmit it to the sign language conversion module;
the sign language conversion module is configured to match, in the sign language database, the semantics to be output according to the processed gesture;
the display module is configured to display the semantics to be output.
In some embodiments, the display module is further configured to display the standard gesture corresponding to the captured user gesture, for the user to learn.
In practical applications, all the functional modules in the above embodiments may be implemented by a programmable logic device into which a specific software program has been burned, or by a processor working together with a memory.
Fifth embodiment:
The present invention is further explained below with reference to a specific application scenario.
To make it easier for hearing- and speech-impaired people to watch television, to solve the communication problem between unimpaired people and hearing- and speech-impaired people, to increase the well-being and satisfaction of this group, and to give customers a better experience, this embodiment provides a scheme for converting between sign language and subtitles on a set-top box.
The method provided in this embodiment for converting between sign language and subtitles on a set-top box includes:
Step A: while a TV program is playing, acquire the audio channel data of the live program and pass it to the speech recognition module.
Step B: the speech recognition module analyzes the data and converts it into subtitles, then matches the sign language library and outputs subtitles or sign language to the user.
Step C: when an unimpaired person speaks, the set-top box's voice receiving module passes the content to the speech recognition module over a second audio channel; after data analysis, the speech recognition module converts the speech into subtitles and, at the same time, matches pictures or animations in the sign language library.
Step D: speech and subtitles are presented together to the hearing- and speech-impaired person. If that person responds after seeing the subtitles or sign language, the set-top box's image receiving module passes the content to the image recognition module.
Step E: the image recognition module analyzes the data, compares it with the sign language text library, converts it into subtitles, and presents them to the unimpaired person.
Step F: the user communication channel and the video playback channel are presented independently, at different positions. Which channel is given relative prominence depends entirely on the scenario: when users are communicating frequently, the sign language and subtitles of the conversation are enlarged; otherwise the subtitles of the TV program are enlarged.
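The prominence rule in Step F can be expressed as a small function. The sketch below is only illustrative: measuring "frequent" communication as messages per minute, the threshold of 5, and the scale factors 1.5 and 1.25 are all assumptions, since the text gives no concrete values.

```python
from typing import Dict


def font_scales(chat_messages_last_minute: int, busy_threshold: int = 5) -> Dict[str, float]:
    """Enlarge whichever channel currently deserves the viewer's attention."""
    chatting = chat_messages_last_minute >= busy_threshold
    return {
        "chat": 1.5 if chatting else 1.0,  # user communication channel
        "tv": 1.0 if chatting else 1.25,   # TV program subtitle channel
    }


busy = font_scales(8)
quiet = font_scales(0)
```

Because the function returns a scale for both channels every time, neither channel is ever hidden; only their relative emphasis changes.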
In this embodiment, the set-top box includes: a voice acquisition module, a speech recognition module, a voice conversion module, a sign language matching module, a display module, an image recognition module, an image conversion module, and a central control module, where:
Voice acquisition module: the set-top box's audio is split into multiple channels, and the voice acquisition module can separately acquire the audio data of a live TV program and the audio data produced by an unimpaired person through a microphone.
Speech recognition module: performs recognition processing and correction on the audio data and recognizes it as Chinese text.
Voice conversion module: working with the speech recognition module, converts the Chinese text into the corresponding subtitle data and, together with the sign language matching module, outputs the corresponding sign language information.
Display module: displays subtitle information and sign language information on the screen.
Image recognition module: captures the gestures of the hearing- and speech-impaired person and analyzes them.
Image conversion module: working with the image recognition module, compares the gesture with the sign language text library, proofreads and corrects the gesture, and then outputs text subtitle information.
Sign language matching module: composed of sign language pictures and animations together with a sign language text library; both local and network versions exist.
Central control module: handles the logic of all the flows in a unified way and is responsible for the algorithm that decides primary and secondary display of subtitles and sign language.
Compared with existing solutions, the set-top box in this embodiment adds interactivity, and this presentation does not conflict with normal TV playback. The design uses two paths: one dedicated to outputting the interaction and one carrying the TV program. The TV program's audio is likewise converted to subtitles by speech recognition and delivered to the user. The two paths can switch between primary and secondary roles seamlessly, which greatly improves convenience for hearing- and speech-impaired users.
The method for converting between subtitles and sign language is further described below with reference to FIGS. 3 to 7.
As shown in FIG. 3:
The set-top box provided in this embodiment mainly includes a speech recognition module 302, an image recognition module 304, a display module 310, and a central processing module 311. When an unimpaired person speaks, the sound passes from 301 to the speech recognition module 302. At the same time, the RF 306 TS stream is transmitted to the tuner (TUNER) 307 and then to the demultiplexer 308; the audio data obtained by demultiplexing is likewise sent to 302. The speech recognition module 302 analyzes the audio data, proofreads and corrects the semantics, converts the result into text that is passed to the subtitle module 303 and, at the same time, into sign language 309, and outputs both 303 and 309 to the display module 310. Similarly, the hearing- and speech-impaired person produces sign language 305, which the image recognition module 304 converts into subtitles 303 and sends to the display module 310. Throughout the process, the central processing module 311 controls the speech and image recognition modules and the display module so that the converted outputs appear in different display areas, allowing users to communicate interactively with ease.
As shown in FIG. 4:
The conversion method provided in this embodiment includes:
A person with a hearing or speech impairment makes a sign-language gesture (S401); a camera captures an image of the gesture (S402) and passes the image to the set-top box (S403); the set-top box recognizes the image (S404), compares the result with the local sign-language library (S405), and matches the entry corresponding to the gesture (S406). If no match is found, the set-top box looks for a match in the network sign-language library (S408). If a match is found, the subtitle is output to the subtitle buffer (S407) and then displayed from video memory, where the hearing person can read it (S409).
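The matching steps S405 to S408 amount to a local lookup with a network fallback. The sketch below assumes that image recognition has already reduced the gesture to a string key; the function names, the dictionary-based local library, and the `network_lookup` callable are all hypothetical.

```python
from typing import Callable, Optional


def match_gesture(gesture_id: str,
                  local_library: dict[str, str],
                  network_lookup: Callable[[str], Optional[str]]) -> Optional[str]:
    """Return the entry for a recognized gesture, trying the local library first."""
    entry = local_library.get(gesture_id)   # S405/S406: local sign-language library
    if entry is None:
        entry = network_lookup(gesture_id)  # S408: network sign-language library
    return entry


def gesture_to_subtitle(gesture_id: str,
                        local_library: dict[str, str],
                        network_lookup: Callable[[str], Optional[str]],
                        subtitle_buffer: list[str]) -> bool:
    """Append the matched entry to the subtitle buffer (S407); report success."""
    entry = match_gesture(gesture_id, local_library, network_lookup)
    if entry is None:
        return False  # the source does not say what happens when both lookups fail
    subtitle_buffer.append(entry)
    return True
```

Checking the local library first avoids a network round trip for common gestures, which is presumably why the flow orders the two lookups this way.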
As shown in FIG. 5:
The conversion method provided in this embodiment includes:
A hearing person speaks (S501); a microphone or other recording device captures the sound (S502) and passes it to the set-top box (S503), which performs speech recognition (S504). At this point the set-top box determines whether the sound arrived on the TS-stream channel or from the recording device (S505). If it came from the recording device, the result is compared with the local text library (S506) and matched against the corresponding spoken-word entry (S507). If no match is found, the set-top box looks for a match in the network sign-language library (S509). If a match is found, the subtitle is output to the subtitle buffer (S508); the sign-language library is also matched (S510), and the sign-language image and subtitle information are output to video memory (S511), where the person with a hearing or speech impairment can view them (S512).
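The branch at S505 and the pairing of subtitle and sign image (S508 to S511) might look like the sketch below. It assumes speech recognition has already produced a text string; `AudioSource`, `render_speech`, and the library containers are hypothetical names, and returning `None` for a missing sign image is an assumption, since the source does not describe that case.

```python
from enum import Enum
from typing import Callable, Optional


class AudioSource(Enum):
    TS_STREAM = "ts_stream"  # program audio from the demultiplexer
    RECORDER = "recorder"    # microphone or other recording device


def render_speech(recognized_text: str,
                  source: AudioSource,
                  text_library: set[str],
                  network_lookup: Callable[[str], Optional[str]],
                  sign_library: dict[str, str]) -> Optional[dict]:
    """Build the subtitle/sign pair that would be written to video memory."""
    if source is not AudioSource.RECORDER:
        return None  # S505: TS-stream audio follows the separate FIG. 6 flow
    if recognized_text in text_library:          # S506/S507: local text library
        entry = recognized_text
    else:
        entry = network_lookup(recognized_text)  # S509: network fallback
    if entry is None:
        return None
    return {
        "subtitle": entry,                      # S508: subtitle buffer content
        "sign_image": sign_library.get(entry),  # S510: sign-language library match
    }
```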
As shown in FIG. 6:
The conversion method provided in this embodiment includes:
The set-top box determines whether the sound is TS-stream audio (S601). If the speech recognizer is processing data from the TS-stream audio channel, the audio data is acquired (S602) and input to speech recognition (S603), semantic proofreading and correction are performed (S604), and the set-top box determines whether a corresponding spoken-word entry is matched (S605). If no match is found, it looks for a match in the network sign-language library (S607). If a match is found, the subtitle is output to the subtitle buffer (S606); the sign-language library is also matched (S608), and the sign-language image and subtitle information are output to video memory (S609), where the person with a hearing or speech impairment can view them (S610).
As shown in FIG. 7:
This embodiment makes the two audio paths and the subtitle processing compatible. Specifically, the TS-stream audio 701 is delivered to the speech recognizer through audio channel 1 (704), and the hearing person's voice 702 is delivered to the speech recognizer through audio channel 2 (705). The speech recognizer recognizes each separately (707), and the results are displayed on two layers: layer channel 2 shows the text and sign-language information for the TS-stream audio, and layer channel 1 shows the result of converting the hearing person's voice. The sign-language image 703 passes through a dedicated codec channel 706, undergoes image recognition 708, and is passed to layer channel 3 (711). Finally, 709, 710, and 711 are displayed according to priority (712). The three kinds of information are displayed at different positions, and each position indicates which party is speaking. The transparency, font size, and sign-language image size of each can be adjusted. For example, when the users are communicating frequently, the fonts of 710 and 711 are enlarged somewhat so that the users can concentrate on the conversation; when there is little communication, the font of 709 is enlarged slightly so that the person with a hearing or speech impairment can concentrate on the television program.
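The font adjustment described for FIG. 7 can be illustrated with a small sketch. The layer keys (`"program"`, `"voice"`, `"sign"`), the activity threshold, and the scale factor are assumptions chosen for illustration; the source only states that the conversation layers are enlarged during frequent communication and the program layer otherwise.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    name: str
    priority: int            # lower value is drawn with higher priority (assumed)
    font_scale: float = 1.0


def adjust_layers(layers: dict[str, Layer],
                  chat_events_per_minute: float,
                  busy_threshold: float = 6.0,
                  boost: float = 1.25) -> list[Layer]:
    """Enlarge conversation layers when chat is frequent, else the program layer."""
    chatting = chat_events_per_minute >= busy_threshold
    for key, layer in layers.items():
        is_chat_layer = key in ("voice", "sign")
        # Boost chat layers while chatting, and the program layer while idle.
        layer.font_scale = boost if is_chat_layer == chatting else 1.0
    # Return the layers in the order they should be composited (step 712).
    return sorted(layers.values(), key=lambda layer: layer.priority)
```

Transparency and sign-image size could be handled the same way by adding fields to `Layer`; they are omitted here to keep the sketch short.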
In summary, implementing the embodiments of the present invention provides at least the following beneficial effects:
An embodiment of the present invention provides a communication information presentation method: first communication information presented in a first presentation mode is collected; the first communication information is parsed to obtain its corresponding data content; second communication information corresponding to the data content is obtained; and the second communication information is presented in a second presentation mode. Data can thus be converted between any two different presentation modes, which helps people with different needs communicate. For example, a voice presentation mode can be selected for hearing users and a sign-language presentation mode for users with speech impairments. Each user only needs to present what they want to say in the way they normally do, and the conversion based on data content lets both parties understand each other's intent and communicate conveniently, improving the user experience.
Those skilled in the art will understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage and optical storage) containing computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Sixth embodiment:
An embodiment of the present invention further provides a storage medium that includes a stored program, wherein the program, when run, performs the method of any one of the above.
Optionally, in this embodiment, the storage medium may include, but is not limited to, various media capable of storing program code, such as a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
An embodiment of the present invention further provides a processor for running a program, wherein the program, when run, performs the steps of any one of the above methods.
Optionally, for specific examples in this embodiment, refer to the examples described in the foregoing embodiments and optional implementations; details are not repeated here.
Those skilled in the art will understand that the modules or steps of the present invention described above may be implemented with general-purpose computing devices. They may be concentrated on a single computing device or distributed over a network of multiple computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device. In some cases, the steps shown or described may be performed in an order different from the one given here, or the modules or steps may each be made into a separate integrated circuit module, or several of them may be made into a single integrated circuit module. The present invention is therefore not limited to any specific combination of hardware and software.
The above are merely preferred embodiments of the present invention and are not intended to limit it. Those skilled in the art may make various modifications and changes to the present invention. Any modification, equivalent replacement, or improvement made within the principles of the present invention shall fall within its scope of protection.
Industrial applicability
With the communication information presentation method provided by the embodiments of the present invention, first communication information presented in a first presentation mode is collected; the first communication information is parsed to obtain its corresponding data content; second communication information corresponding to the data content is obtained; and the second communication information is presented in a second presentation mode. Data can thus be converted between any two different presentation modes, which helps people with different needs communicate. For example, a voice presentation mode can be selected for hearing users and a sign-language presentation mode for users with speech impairments. Each user only needs to present what they want to say in the way they normally do, and the conversion based on data content lets both parties understand each other's intent and communicate conveniently, improving the user experience.

Claims (18)

  1. A communication information presentation method, comprising:
    collecting first communication information presented in a first presentation mode;
    parsing the first communication information to obtain data content corresponding to the first communication information, and obtaining second communication information corresponding to the data content;
    presenting the second communication information in a second presentation mode.
  2. The communication information presentation method according to claim 1, wherein the first presentation mode comprises a voice mode and the second presentation mode comprises a picture mode;
    collecting the first communication information presented in the first presentation mode comprises: capturing external speech through a speech recognition device, and/or obtaining the first communication information by capturing an audio channel;
    presenting the second communication information in the second presentation mode comprises: presenting the second communication information on a picture in the form of subtitles and/or gestures.
  3. The communication information presentation method according to claim 2, further comprising: if multiple kinds of first communication information are collected through two or more paths, presenting, through a plurality of pictures, the second communication information corresponding to each item of first communication information.
  4. The communication information presentation method according to claim 3, further comprising: determining, according to the importance of each item of first communication information, the picture position at which the corresponding second communication information is presented.
  5. The communication information presentation method according to any one of claims 1 to 4, wherein the first presentation mode comprises a picture mode and the second presentation mode comprises a voice mode;
    collecting the first communication information presented in the first presentation mode comprises: capturing external gestures and/or text through image recognition, and/or obtaining the first communication information by capturing an image channel;
    presenting the second communication information in the second presentation mode comprises: presenting the second communication information through a loudspeaker as simulated speech.
  6. A communication information presentation apparatus, comprising:
    a collection module configured to collect first communication information presented in a first presentation mode;
    a processing module configured to parse the first communication information to obtain data content corresponding to the first communication information, and to obtain second communication information corresponding to the data content;
    a presentation module configured to present the second communication information in a second presentation mode.
  7. The communication information presentation apparatus according to claim 6, wherein the first presentation mode comprises a voice mode and the second presentation mode comprises a picture mode; the collection module is configured to capture external speech through a speech recognition device, and/or to obtain the first communication information by capturing an audio channel; and the presentation module is configured to present the second communication information on a picture in the form of subtitles and/or gestures.
  8. The communication information presentation apparatus according to claim 7, wherein the presentation module is further configured to, if multiple kinds of first communication information are collected through two or more paths, present through a plurality of pictures the second communication information corresponding to each item of first communication information.
  9. The communication information presentation apparatus according to claim 8, wherein the presentation module is further configured to determine, according to the importance of each item of first communication information, the picture position at which the corresponding second communication information is presented.
  10. The communication information presentation apparatus according to any one of claims 6 to 9, wherein the first presentation mode comprises a picture mode and the second presentation mode comprises a voice mode; the collection module is configured to capture external gestures and/or text through image recognition, and/or to obtain the first communication information by capturing an image channel; and the presentation module is configured to present the second communication information through a loudspeaker as simulated speech.
  11. A communication information presentation device, comprising an interaction module and a processor, wherein
    the interaction module is configured to collect first communication information presented in a first presentation mode and output it to the processor, and is further configured to present, in a second presentation mode, second communication information returned by the processor;
    the processor is configured to parse the first communication information to obtain data content corresponding to the first communication information, obtain second communication information corresponding to the data content, and transmit it to the interaction module.
  12. A set-top box, comprising a sign-language database, and a voice module, a sign-language conversion module, and a display module that are connected to one another, wherein
    the voice module is configured to obtain audio data and, after recognition processing and correction of the audio data, recognize it as semantics;
    the sign-language conversion module is configured to match, in the sign-language database and according to the semantics, the sign language to be output that corresponds to the audio data;
    the display module is configured to display the sign language to be output.
  13. The set-top box according to claim 12, wherein the display module is further configured to display the semantics of the audio data.
  14. The set-top box according to claim 12, wherein the voice module is configured to separately obtain audio data of a live television program and audio data produced by a hearing person through a microphone.
  15. The set-top box according to any one of claims 12 to 14, further comprising an image module;
    the image module is configured to capture a user's gesture and, after proofreading and correction of the gesture, transmit it to the sign-language conversion module;
    the sign-language conversion module is configured to match, in the sign-language database and according to the processed gesture, the corresponding semantics to be output;
    the display module is configured to display the semantics to be output.
  16. The set-top box according to claim 15, wherein the display module is further configured to display a standard gesture corresponding to the captured user gesture.
  17. A storage medium comprising a stored program, wherein the program, when run, performs the method according to any one of claims 1 to 5.
  18. A processor for running a program, wherein the program, when run, performs the method according to any one of claims 1 to 5.
PCT/CN2017/088109 2016-06-30 2017-06-13 Method and apparatus for presenting communication information, device and set-top box WO2018001088A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610512638.2A CN107566863A (en) 2016-06-30 2016-06-30 A kind of exchange of information methods of exhibiting, device and equipment, set top box
CN201610512638.2 2016-06-30

Publications (1)

Publication Number Publication Date
WO2018001088A1 true WO2018001088A1 (en) 2018-01-04

Family

ID=60785795

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/088109 WO2018001088A1 (en) 2016-06-30 2017-06-13 Method and apparatus for presenting communication information, device and set-top box

Country Status (2)

Country Link
CN (1) CN107566863A (en)
WO (1) WO2018001088A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110121097A (en) * 2019-05-13 2019-08-13 深圳市亿联智能有限公司 Multimedia playing apparatus and method with accessible function
CN111327961A (en) * 2020-03-30 2020-06-23 上海句石智能科技有限公司 Video subtitle switching method and system
CN113076967B (en) * 2020-12-08 2022-09-23 无锡乐骐科技股份有限公司 Image and audio-based music score dual-recognition system

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5982853A (en) * 1995-03-01 1999-11-09 Liebermann; Raanan Telephone for the deaf and method of using same
US6477239B1 (en) * 1995-08-30 2002-11-05 Hitachi, Ltd. Sign language telephone device
CN101502094A (en) * 2006-06-15 2009-08-05 威瑞森数据服务公司 Methods and systems for a sign language graphical interpreter
CN101539994A (en) * 2009-04-16 2009-09-23 西安交通大学 Mutually translating system and method of sign language and speech
CN101594434A (en) * 2009-06-16 2009-12-02 中兴通讯股份有限公司 The sign language processing method and the sign language processing mobile terminal of portable terminal
CN202652435U (en) * 2012-06-29 2013-01-02 广西工学院 Digital television set top box capable of automatically generating subtitles
CN102984496A (en) * 2012-12-21 2013-03-20 华为技术有限公司 Processing method, device and system of video and audio information in video conference
CN106254960A (en) * 2016-08-30 2016-12-21 福州瑞芯微电子股份有限公司 A kind of video call method for communication disorders and system
CN106713974A (en) * 2015-11-12 2017-05-24 中兴通讯股份有限公司 Data conversion method and device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006139138A (en) * 2004-11-12 2006-06-01 Matsushita Electric Ind Co Ltd Information terminal and base station
CN101794528B (en) * 2010-04-02 2012-03-14 北京大学软件与微电子学院无锡产学研合作教育基地 Gesture language-voice bidirectional translation system
CN102236986A (en) * 2010-05-06 2011-11-09 鸿富锦精密工业(深圳)有限公司 Sign language translation system, device and method
CN103188548A (en) * 2011-12-30 2013-07-03 乐金电子(中国)研究开发中心有限公司 Digital television sign language dubbing method and digital television sign language dubbing device
CN102708866A (en) * 2012-06-01 2012-10-03 武汉大学 Semantic-computing-based interaction system and method for person with hearing or language disorder
US9697630B2 (en) * 2014-10-01 2017-07-04 Sony Corporation Sign language window using picture-in-picture

Also Published As

Publication number Publication date
CN107566863A (en) 2018-01-09

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17819085

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17819085

Country of ref document: EP

Kind code of ref document: A1