CN101500127A - Method for synchronously displaying subtitle in video telephone call - Google Patents
Method for synchronously displaying subtitle in video telephone call
- Publication number
- CN101500127A
- Authority
- CN
- China
- Prior art keywords
- captions
- memory block
- synthesis unit
- image synthesis
- viewing area
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Abstract
The invention relates to a method for displaying captions synchronously during a video call. The method is implemented mainly by a voice recognition module, a caption processing module and an image synthesis module. During the video call, captions are generated using voice recognition technology, superimposed on the local terminal image according to caption display rules preset by the user, and, after processing, transmitted to the remote user. Adding caption display to a video call is an important supplement to the call and can improve call quality and make communication more effective. In addition, captions generated by this method can be displayed on any mobile phone that supports video telephony, without additional software or hardware, which makes the method convenient and practical.
Description
Technical field
The present invention relates to the field of mobile communications and, specifically, to a method for synchronously displaying subtitles in a video telephone call. A user of the invention can, without affecting the normal call, convert voice information into caption data, overlay it on the local image, and send the result to the remote user.
Background art
With the development of mobile communication technology, video telephony services have spread quickly thanks to their vivid and intuitive user experience. Current video telephony mainly works by capturing the video and audio data of both parties and transmitting it according to an agreed protocol, thereby achieving information exchange. During a call, however, communication still relies mainly on speech. When the sound is transmitted unclearly, call quality and communication inevitably suffer, and even the intuitive visual channel cannot then meet the needs of communication.
Summary of the invention
The present invention aims to provide a method that improves call quality and effectively assists communication during a video call: the user's voice information is converted into captions, which are overlaid on the local image and sent to the remote user.
To achieve this, the basic idea of the invention is as follows. During a video telephone call, speech recognition technology is used to generate a subtitle file. The captions are then overlaid on the local image according to the caption settings selected by the user, such as display area, font, color and size, merged with the image data, and sent to the remote user. The local user and the remote user can both see the captioned video on their display screens at the same time. This process is carried out mainly by a speech recognition module, a caption processing module and an image synthesis unit.
Said video telephony includes two-party calls, multi-party calls and video conferences. The invention is described here using only a two-party call as the example.
Said speech recognition module converts spoken language word for word into the corresponding text, producing captions, and stores them in the caption processing module. Speech recognition may be implemented in software or in hardware, depending on the specific requirements of the mobile phone.
Said caption processing module delivers a certain number of recognized characters to the image synthesis unit according to the preset display mode.
Said image synthesis unit overlays the received captions on the background video according to the caption settings, generating a captioned video data stream.
Said caption settings include the font, number of lines, number of characters per line, text color, dwell time, update time, caption display area, size, and so on.
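The caption settings listed above could be held in a single record. The following Python sketch is illustrative only: the field names, the default values, and the assumption that "one group of captions" holds lines × characters-per-line characters are not taken from the patent text.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class CaptionConfig:
    """Hypothetical container for the user-preset caption settings."""
    font: str = "sans"                  # font name (placeholder value)
    lines: int = 2                      # number of caption lines
    chars_per_line: int = 12            # characters per line
    color: str = "white"                # text color
    dwell_time_s: float = 3.0           # dwell (residence) time of a caption group
    update_time_s: float = 1.5          # how long to wait for new recognized text
    region: Tuple[int, int, int, int] = (0, 200, 320, 40)  # x, y, width, height
    font_size: int = 16

    @property
    def group_size(self) -> int:
        # Assumption: one caption group fills every line of the display area.
        return self.lines * self.chars_per_line


def wrap_group(text: str, cfg: CaptionConfig) -> List[str]:
    """Split one caption group into display lines of at most chars_per_line."""
    text = text[: cfg.group_size]
    step = cfg.chars_per_line
    return [text[i:i + step] for i in range(0, len(text), step)]
```

For example, with `CaptionConfig(lines=2, chars_per_line=4)` the group size is 8 characters, and `wrap_group` cuts a longer string down to two lines of four characters each.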
Synchronously displaying captions that render the user's speech as text is an important supplement to a video call: it can improve call quality, assist the conversation and make communication more effective. The variety of caption display formats also makes video telephony more flexible and more engaging. In addition, the invention places no special requirements on the device that receives the captions, so a mobile phone that supports video telephony can display them without any extra software or hardware, which is convenient and practical.
Description of drawings
Fig. 1 is a structural representation of the present invention.
Fig. 2 is a functional flowchart of method two of the caption display mode of the present invention.
Embodiment
Specific embodiments of the invention are described below with reference to the accompanying drawings.
Fig. 1 is a structural representation of the present invention. As shown in the figure, the method of the invention is carried out mainly by speech recognition module 1, caption processing module 2 and image synthesis unit 3.
Preferably, the caption display mode can use either of the following two methods:
Method one: whenever the storage area in the caption processing module is updated, the image synthesis unit is notified immediately to overlay the captions on the image, and the overlaid image data is then put through codec processing, transmitted and displayed. Once the number of characters in the caption display area reaches the limit set in the caption settings, the display area is cleared and waits for the storage area to be updated. If the update time elapses without the storage area being updated, the caption display area is likewise cleared completely, pending the next round of processing.
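A minimal Python sketch of method one follows, under stated assumptions. The class and method names are hypothetical, timestamps are passed in explicitly so the logic can be exercised without a real clock, and "clearing once the display area is full" is interpreted as clearing just before the next character is shown, so that a full caption remains visible until more text arrives. The patent text does not fix these details.

```python
class MethodOneDisplay:
    """Incremental caption display: show text as soon as it is recognized."""

    def __init__(self, capacity: int, update_time: float) -> None:
        self.capacity = capacity        # characters the display area can hold
        self.update_time = update_time  # seconds to wait for a storage update
        self.shown = ""                 # text currently in the display area
        self.last_update = None         # time of the last storage update

    def on_update(self, text: str, now: float) -> str:
        """The storage area received new text: overlay it immediately."""
        for ch in text:
            if len(self.shown) >= self.capacity:
                self.shown = ""         # display area full: clear, then continue
            self.shown += ch
        self.last_update = now
        return self.shown

    def on_tick(self, now: float) -> str:
        """Periodic check: clear the display if no update arrived in time."""
        if (self.shown and self.last_update is not None
                and now - self.last_update >= self.update_time):
            self.shown = ""
        return self.shown
```

The string returned by each call stands for what the image synthesis unit would overlay on the next local frame before encoding and transmission.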
Method two: once the number of characters in the caption storage area is enough for one caption group as required by the caption settings, the image synthesis unit is notified to overlay the group, and the overlaid image data is then put through codec processing, transmitted and displayed. If the update time elapses without the storage area being updated, the data in the storage area that amounts to less than one caption group is overlaid by the image synthesis unit and then processed, transmitted and displayed in the same way; otherwise, the module continues to wait for the storage area to be updated. When a caption group has been displayed for the dwell time, the caption display area is cleared and monitoring of the storage area continues.
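Method two can be sketched as a small buffer with two timers. This Python sketch is illustrative only: the names are hypothetical, time is injected by the caller, and the choice to show at most one group per event is an assumption rather than something stated in the patent.

```python
class MethodTwoCaptioner:
    """Group-based caption display with an update timer and a dwell timer."""

    def __init__(self, group_size: int, update_time: float, dwell_time: float) -> None:
        self.group_size = group_size    # characters in one full caption group
        self.update_time = update_time  # wait this long for more text
        self.dwell_time = dwell_time    # how long a group stays on screen
        self.buffer = ""                # caption storage area
        self.shown = ""                 # caption display area
        self.last_update = 0.0
        self.shown_since = 0.0

    def _show_next_group(self, now: float) -> None:
        # Move up to one group from the storage area to the display area.
        self.shown = self.buffer[: self.group_size]
        self.buffer = self.buffer[self.group_size:]
        self.shown_since = now

    def on_recognized(self, text: str, now: float) -> str:
        """New recognized text arrives in the storage area."""
        self.buffer += text
        self.last_update = now
        if len(self.buffer) >= self.group_size:
            self._show_next_group(now)  # a full group is ready: overlay it
        return self.shown

    def on_tick(self, now: float) -> str:
        """Periodic check of the dwell timer and the update timer."""
        if self.shown and now - self.shown_since >= self.dwell_time:
            self.shown = ""             # dwell time reached: clear the display
        if self.buffer and now - self.last_update >= self.update_time:
            self._show_next_group(now)  # no update in time: flush a partial group
        return self.shown
```

With a group size of 4, an update time of 1 s and a dwell time of 3 s, feeding "ab" and then "cd" displays "abcd"; a lone "e" that arrives afterwards is flushed as a partial group once a second passes with no further text, and is cleared three seconds after that.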
With reference to Fig. 2, and taking method two of the caption display mode as the example, the functional flow of synchronous caption display according to the invention is described in detail below.
First, the user sets the caption information: font, number of displayed lines, characters per line, text color, dwell time, update time, caption display area, size, and so on.
After the video call is connected, speech recognition module 1 monitors the voice information in the call, converts it word for word into the corresponding text, and stores it in the caption storage area of caption processing module 2. When the storage area has been updated and the number of stored characters is enough for one caption group as required by the preset caption settings, caption processing module 2 notifies image synthesis unit 3 to start overlaying. Image synthesis unit 3 overlays the received captions on the local video image according to the caption settings, generating a captioned video data stream. The overlaid image data is then encoded and otherwise processed, after which the encoded audio and video data are multiplexed and otherwise handled according to the video telephony transmission protocol and delivered to the remote end for display. If the update time elapses without the storage area being updated, the captions in the storage area that amount to less than one group are overlaid by image synthesis unit 3 and then processed, transmitted and displayed in the same way; otherwise, the module continues to wait for the storage area to be updated. When a caption group has been displayed for the dwell time, its display is cancelled, and monitoring of the storage area continues so that the next display can be carried out.
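The overlay, encode and multiplex chain in this flow can be illustrated with a toy Python sketch. Everything here is a stand-in: a frame is a list of text rows rather than pixels, `encode_video` is not a real codec, and `multiplex` uses a simple length prefix instead of the call's actual transmission protocol. The point it shows is that the caption becomes part of the picture before encoding, so the receiving side only needs to decode ordinary video.

```python
from typing import Callable, List, Tuple

Frame = List[str]  # toy "image": each string is one row of character pixels


def overlay_caption(frame: Frame, caption: str, row: int) -> Frame:
    """Return a copy of the frame with the caption written into one row."""
    out = list(frame)
    width = len(out[row])
    out[row] = caption[:width].ljust(width)
    return out


def encode_video(frame: Frame) -> bytes:
    """Placeholder for the terminal's video encoder."""
    return "\n".join(frame).encode("utf-8")


def multiplex(video: bytes, audio: bytes) -> bytes:
    """Placeholder multiplexer: 4-byte video length, video, then audio."""
    return len(video).to_bytes(4, "big") + video + audio


def demultiplex(packet: bytes) -> Tuple[bytes, bytes]:
    """Inverse of multiplex, as the remote terminal would apply it."""
    n = int.from_bytes(packet[:4], "big")
    return packet[4:4 + n], packet[4 + n:]


def send_frame(frame: Frame, audio: bytes, caption: str, row: int,
               transmit: Callable[[bytes], None]) -> bytes:
    """Overlay the caption (if any), encode, multiplex and transmit one frame."""
    composed = overlay_caption(frame, caption, row) if caption else frame
    packet = multiplex(encode_video(composed), audio)
    transmit(packet)
    return packet
```

A caller would pass the caption currently in the display area (or an empty string when it has been cleared) together with each captured local frame and its audio.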
The above is only one embodiment of the present invention and is not intended to limit it. Any modification made within the spirit and principles of the invention shall fall within the scope of the claims of the invention.
Claims (8)
1. A method for synchronously displaying subtitles in a video telephone call, characterized in that: using a speech recognition module (1), a caption processing module (2) and an image synthesis unit (3), during the video telephone call, speech recognition technology is first applied to generate captions; the captions are then overlaid on the local image according to caption display rules preset by the user and, after processing, sent to the remote user.
2. The speech recognition module (1) as claimed in claim 1, characterized in that it converts spoken language word for word into the corresponding text, producing captions, and stores them in the caption processing module (2).
3. The speech recognition module (1) as claimed in claim 1, characterized in that software or hardware recognition technology can be selected as circumstances require.
4. The caption processing module (2) as claimed in claim 1, characterized in that it delivers a certain number of captions to the image synthesis unit (3) according to the preset display mode.
5. The image synthesis unit (3) as claimed in claim 1, characterized in that it overlays the received captions on the background video according to the caption settings, generating a captioned video data stream.
6. The caption settings as claimed in claim 5, characterized in that they include the font, number of lines, number of characters per line, text color, dwell time, update time, caption display area, size, and so on.
7. The caption display mode as claimed in claim 4, which can use the following method:
Whenever the storage area in the caption processing module is updated, the image synthesis unit (3) is notified immediately to overlay the captions on the image, and the overlaid image data is then put through codec processing, transmitted and displayed. Once the number of characters in the caption display area reaches the limit set in the caption settings, the display area is cleared and waits for the storage area to be updated. If the update time elapses without the storage area being updated, the caption display area is likewise cleared completely, pending the next round of processing.
8. The caption display mode as claimed in claim 4, which can use the following method:
Once the number of characters in the caption storage area is enough for one caption group as required by the caption settings, the image synthesis unit (3) is notified to overlay the group, and the overlaid image data is then put through codec processing, transmitted and displayed. If the update time elapses without the storage area being updated, the data in the storage area that amounts to less than one caption group is overlaid by the image synthesis unit (3) and then processed, transmitted and displayed in the same way; otherwise, the module continues to wait for the storage area to be updated. When a caption group has been displayed for the dwell time, the caption display area is cleared and monitoring of the storage area continues.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2008100569943A CN101500127A (en) | 2008-01-28 | 2008-01-28 | Method for synchronously displaying subtitle in video telephone call |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2008100569943A CN101500127A (en) | 2008-01-28 | 2008-01-28 | Method for synchronously displaying subtitle in video telephone call |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101500127A true CN101500127A (en) | 2009-08-05 |
Family
ID=40946976
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2008100569943A Pending CN101500127A (en) | 2008-01-28 | 2008-01-28 | Method for synchronously displaying subtitle in video telephone call |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101500127A (en) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010148890A1 (en) * | 2009-06-23 | 2010-12-29 | 中兴通讯股份有限公司 | Videophone and communication method thereof |
CN102685413A (en) * | 2012-05-11 | 2012-09-19 | 青岛海信电器股份有限公司 | Method and system for simultaneously displaying caption and menu |
CN105450970A (en) * | 2014-06-16 | 2016-03-30 | 联想(北京)有限公司 | Information processing method and electronic equipment |
US9374536B1 (en) | 2015-11-12 | 2016-06-21 | Captioncall, Llc | Video captioning communication system, devices and related methods for captioning during a real-time video communication session |
CN105791713A (en) * | 2016-03-21 | 2016-07-20 | 安徽声讯信息技术有限公司 | Intelligent device for playing voices and captions synchronously |
US9525830B1 (en) | 2015-11-12 | 2016-12-20 | Captioncall Llc | Captioning communication systems |
CN106899875A (en) * | 2017-02-06 | 2017-06-27 | 合网络技术(北京)有限公司 | The display control method and device of plug-in captions |
CN107172377A (en) * | 2017-06-30 | 2017-09-15 | 福州瑞芯微电子股份有限公司 | A kind of data processing method and device of video calling |
WO2017162010A1 (en) * | 2016-03-21 | 2017-09-28 | 中兴通讯股份有限公司 | Method and apparatus for realizing caption processing |
CN108366182A (en) * | 2018-02-13 | 2018-08-03 | 京东方科技集团股份有限公司 | Text-to-speech synchronizes the calibration method reported and device, computer storage media |
WO2019184650A1 (en) * | 2018-03-29 | 2019-10-03 | 华为技术有限公司 | Subtitle generation method and terminal |
CN110415706A (en) * | 2019-08-08 | 2019-11-05 | 常州市小先信息技术有限公司 | A kind of technology and its application of superimposed subtitle real-time in video calling |
CN111045624A (en) * | 2019-11-27 | 2020-04-21 | 深圳创维-Rgb电子有限公司 | Multi-screen simultaneous display method, display terminal and computer readable storage medium |
CN111447325A (en) * | 2020-04-03 | 2020-07-24 | 上海闻泰电子科技有限公司 | Call auxiliary method, device, terminal and storage medium |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010148890A1 (en) * | 2009-06-23 | 2010-12-29 | 中兴通讯股份有限公司 | Videophone and communication method thereof |
CN102685413A (en) * | 2012-05-11 | 2012-09-19 | 青岛海信电器股份有限公司 | Method and system for simultaneously displaying caption and menu |
CN105450970A (en) * | 2014-06-16 | 2016-03-30 | 联想(北京)有限公司 | Information processing method and electronic equipment |
US9998686B2 (en) | 2015-11-12 | 2018-06-12 | Sorenson Ip Holdings, Llc | Transcribing video communication sessions |
US9525830B1 (en) | 2015-11-12 | 2016-12-20 | Captioncall Llc | Captioning communication systems |
US10972683B2 (en) | 2015-11-12 | 2021-04-06 | Sorenson Ip Holdings, Llc | Captioning communication systems |
US9374536B1 (en) | 2015-11-12 | 2016-06-21 | Captioncall, Llc | Video captioning communication system, devices and related methods for captioning during a real-time video communication session |
US10051207B1 (en) | 2015-11-12 | 2018-08-14 | Sorenson Ip Holdings, Llc | Captioning communication systems |
US11509838B2 (en) | 2015-11-12 | 2022-11-22 | Sorenson Ip Holdings, Llc | Captioning communication systems |
CN105791713A (en) * | 2016-03-21 | 2016-07-20 | 安徽声讯信息技术有限公司 | Intelligent device for playing voices and captions synchronously |
WO2017162010A1 (en) * | 2016-03-21 | 2017-09-28 | 中兴通讯股份有限公司 | Method and apparatus for realizing caption processing |
CN106899875A (en) * | 2017-02-06 | 2017-06-27 | 合网络技术(北京)有限公司 | The display control method and device of plug-in captions |
CN107172377A (en) * | 2017-06-30 | 2017-09-15 | 福州瑞芯微电子股份有限公司 | A kind of data processing method and device of video calling |
CN108366182A (en) * | 2018-02-13 | 2018-08-03 | 京东方科技集团股份有限公司 | Text-to-speech synchronizes the calibration method reported and device, computer storage media |
CN110324723A (en) * | 2018-03-29 | 2019-10-11 | 华为技术有限公司 | Method for generating captions and terminal |
CN110324723B (en) * | 2018-03-29 | 2022-03-08 | 华为技术有限公司 | Subtitle generating method and terminal |
WO2019184650A1 (en) * | 2018-03-29 | 2019-10-03 | 华为技术有限公司 | Subtitle generation method and terminal |
CN110415706A (en) * | 2019-08-08 | 2019-11-05 | 常州市小先信息技术有限公司 | A kind of technology and its application of superimposed subtitle real-time in video calling |
CN111045624A (en) * | 2019-11-27 | 2020-04-21 | 深圳创维-Rgb电子有限公司 | Multi-screen simultaneous display method, display terminal and computer readable storage medium |
CN111447325A (en) * | 2020-04-03 | 2020-07-24 | 上海闻泰电子科技有限公司 | Call auxiliary method, device, terminal and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101500127A (en) | Method for synchronously displaying subtitle in video telephone call | |
CN101309390B (en) | Visual communication system, apparatus and subtitle displaying method | |
JP6179834B1 (en) | Video conferencing equipment | |
US20110131498A1 (en) | Presentation method and presentation system using identification label | |
CN101330339A (en) | Method and apparatus for implementing call business based on IPTV, application server | |
JP2002535932A (en) | Method and apparatus for selecting and displaying multimedia messages | |
CN101291379A (en) | Mobile terminal and picture-phone implementing method thereof | |
CN101931779A (en) | Video telephone and communication method thereof | |
CN101123630A (en) | Communication method and system for voice and text conversion | |
EP1465423A1 (en) | Videophone device and data transmitting/receiving method applied thereto | |
CN102420897B (en) | Mobile phone communication information transmitting method and device | |
CN101764990A (en) | Identifying label presentation method and system thereof as well as video providing device and video receiving device | |
CN102306430B (en) | Self-help system and equipment for realizing AV (audio/video) integration | |
CN101815097A (en) | Method and device for realizing call holding in CTD calling business | |
JP2004356896A (en) | Automatic answering machine and automatic answering system using same, and telephone banking system | |
CN102447874B (en) | Video scheduling system and method | |
CN101610380A (en) | Dialing and answering method and device of video telephone, video telephone | |
CN101742215A (en) | Realization method, mobile terminal and system of video telephone | |
CN101420585A (en) | System and method for transmitting non-video information by visible telephone terminal | |
CN101895717A (en) | Method for displaying pure voice terminal image in video session | |
CN101635820B (en) | Set-top box system with multimedia communication function | |
CN100531360C (en) | Set-top box system with multimedia communication function | |
JP6064209B2 (en) | Call system and call relay method | |
JP5136823B2 (en) | PoC system with fixed message function, communication method, communication program, terminal, PoC server | |
KR20040039603A (en) | System and method for providing ringback tone |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication | Open date: 20090805 |