CN101500127A - Method for synchronously displaying subtitle in video telephone call - Google Patents

Method for synchronously displaying subtitle in video telephone call Download PDF

Info

Publication number
CN101500127A
CN101500127A CNA2008100569943A CN200810056994A CN101500127A CN 101500127 A CN101500127 A CN 101500127A CN A2008100569943 A CNA2008100569943 A CN A2008100569943A CN 200810056994 A CN200810056994 A CN 200810056994A CN 101500127 A CN101500127 A CN 101500127A
Authority
CN
China
Prior art keywords
captions
memory block
synthesis unit
image synthesis
viewing area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008100569943A
Other languages
Chinese (zh)
Inventor
郭晓丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TECHFAITH INTELLIGENT HANDSET TECHNOLOGY (BEIJING) Co Ltd
Original Assignee
TECHFAITH INTELLIGENT HANDSET TECHNOLOGY (BEIJING) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TECHFAITH INTELLIGENT HANDSET TECHNOLOGY (BEIJING) Co Ltd filed Critical TECHFAITH INTELLIGENT HANDSET TECHNOLOGY (BEIJING) Co Ltd
Priority to CNA2008100569943A priority Critical patent/CN101500127A/en
Publication of CN101500127A publication Critical patent/CN101500127A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to a method for displaying captions synchronously in a video call process. The method is mainly realized by a voice recognition module, a caption processing module and an image synthesis module. In the video call process, the captions are generated by applying the voice recognition technology, and then the captions are superposed on a local terminal image according to caption display rules preset by a user and transmitted to a remote terminal user after processing. The increase of the caption display in the video call process is important compensation to the video call, and can improve the video call quality and enhance the communication effect. In addition, the captions generated by utilizing the method can be displayed on mobile phones supporting the video telephone function without increasing software procedures or hardware devices, thus being convenient and practical.

Description

The method of synchronously displaying subtitle in a kind of visual telephone
Technical field
The present invention relates to moving communicating field, concrete, the present invention relates to the method for synchronously displaying subtitle in the visual telephone.Use user of the present invention under the situation that does not influence normal talking, acoustic information to be changed into caption data, and send remote subscriber to after the local terminal image overlay.
Background technology
Along with the development of mobile communication technology, visual telephone business relies on its vividly user experience intuitively, has obtained promoting fast.Present visual telephone function mainly is video and the voice data by the collection both call sides, and abides by the agreement of arranging and transmit, thereby reaches the purpose of information interaction.But at present visual telephone is in communication process, and exchange way mainly still relies on verbal exposition, certainly will influence speech quality and communicative effect when sound transmits when unintelligible, even and have this moment video perception intuitively still can not satisfy the demand of communication.
Summary of the invention
The present invention aims to provide a kind of in video call process, can improve speech quality, and the effectively auxiliary method that exchanges can realize user voice information is changed into captions, and sends remote subscriber to after the local terminal image overlay.
To achieve these goals, basic thought of the present invention is in the video telephony call process, use speech recognition technology to generate subtitle file, captions configuration informations such as the viewing area of selecting according to the user, font, color and size again, the local terminal image that is added to sends remote subscriber to after merging with view data.This end subscriber and remote subscriber can see that the video of band captions shows simultaneously on display screen.Said process mainly uses sound identification module, captions processing module and image synthesis unit to finish.
Described visual telephone comprises both sides' conversation, MPTY and video conference.The present invention is that example describes with both sides' conversation only.
Described sound identification module will give an oral account language word for word be converted to corresponding literal, produce captions, and store the captions processing module into.Speech recognition technology can adopt software or hardware to discern according to the specific requirement of mobile phone.
Described captions processing module, according to the display mode that presets, the word that the identification of some is good is delivered to image synthesis unit.
The image synthesis unit of telling according to the requirement of captions configuration information, superposes captions and the background video of receiving, generates the video data stream of band captions.
Described captions configuration information comprises character script, line number, every capable number of words, text color, residence time, update time, captions viewing area and size etc.
In visual telephone, show the captions that user language carried out the character property explanation synchronously, be important supplement to video calling, speech quality can be improved, and the auxiliary effect that exchanges, improves communicative effect can be played.Because the variation of captions display format has also increased the flexibility and the interest of visual telephone.In addition, the present invention does not have specific (special) requirements for the equipment that receives captions, therefore supports the mobile phone of visual telephone function not need to increase extra software program or hardware device can be seen captions, and is convenient and practical.
Description of drawings
Fig. 1 is a structural representation of the present invention.
Fig. 2 is the functional flow diagram of the scheme two of captions display mode of the present invention.
Embodiment
Below in conjunction with accompanying drawing the specific embodiment of the present invention is described.
Fig. 1 is a structural representation of the present invention.As shown in the figure, method of the present invention is mainly by sound identification module 1, and captions processing module 2 and image synthesis unit 3 are finished.
Preferably, the captions display mode can adopt following two kinds of methods:
Method one: the memory block on the captions processing module promptly notifies image synthesis unit to carry out image overlay if renewal is arranged, and then the view data after the stack is carried out encoding and decoding handle, transmit and show.After the number of words in the captions viewing area satisfies captions requirement is set, promptly empty the viewing area, wait for the renewal of memory block.But do not upgrade if arrive the back storage area update time, then the captions viewing area also will all empty, and wait for next time and handle.
Method two: after the number of words in the captions memory block satisfies captions desired one group of captions are set, promptly notify image synthesis unit to superpose, and then to the view data after the stack carry out that encoding and decoding are handled, transmission and showing.But do not upgrade if arrive the back storage area update time, then the data with one group of captions of less than in the memory block superpose by image synthesis unit, and handle accordingly, transmit and show; Otherwise continue to wait for the memory block renewal.If the demonstration time of one group of captions has reached residence time, then empty the captions viewing area, continue to monitor the renewal of memory block.
In conjunction with Fig. 2, be example with the method two of captions display mode, specify the functional sequence of synchronously displaying subtitle of the present invention.
At first the user sets font, shows caption informations such as line number, every capable number of words, text color, residence time, update time, captions viewing area and size.
After visual telephone was communicated with, sound identification module 1 was monitored voice messaging in video calling, and word for word converts it to corresponding literal, stores the captions memory block of captions processing module 2 into.In the memory block, have captions to upgrade, and after the storage number of words satisfied the captions that preset desired one group of captions are set, captions processing module 2 just notified image synthesis unit 3 to begin stack work.Image synthesis unit 3 superposes captions and the local terminal video image of receiving according to the requirement of captions configuration information, generates the video data stream of band captions.And then to the view data after the stack encode wait to handle after, by the visual telephone host-host protocol Voice ﹠ Video data after encoding are carried out the multiplexing operation of Denging, be delivered to distal displayed.But do not upgrade if arrive the back storage area update time, then the captions with one group of less than in the memory block superpose by image synthesis unit 3, and handle accordingly, transmit and show; Otherwise continue to wait for the memory block renewal.If the demonstration time of one group of captions has reached residence time, then cancel the demonstration of these group captions, continue to monitor the renewal of memory block to carry out demonstration next time.
Below only be one embodiment of the present of invention, in order to restriction the present invention, within the spirit and principles in the present invention all, any modification of being done all is not included within the claim scope of the present invention.

Claims (8)

1. the method for synchronously displaying subtitle in the visual telephone, it is characterized in that: use sound identification module (1), captions processing module (2) and image synthesis unit (3), in the video telephony call process, at first use speech recognition technology and generate captions, captions according to user preset show rule again, in the local terminal image, send subtitle superposition to remote subscriber after treatment.
2. sound identification module as claimed in claim 1 (1) is characterized in that the oral account language word for word is converted to corresponding literal, produces captions, and stores captions processing module (2) into.
3. sound identification module as claimed in claim 1 (1) is characterized in that and can according to circumstances select to use software or hardware identification technology.
4. captions processing module as claimed in claim 1 (2) is characterized in that according to the display mode that presets, and the captions of some are delivered to image synthesis unit (3).
5. image synthesis unit as claimed in claim 1 (3) is characterized in that the requirement according to the captions configuration information, and captions and the background video of receiving superposeed, and generates the video data stream of band captions.
6. captions configuration information as claimed in claim 5 is characterized in that comprising character script, line number, every capable number of words, text color, residence time, update time, captions viewing area and size etc.
7. captions display mode as claimed in claim 4 can adopt following method:
Memory block on the captions processing module promptly notifies image synthesis unit (3) to carry out image overlay if renewal is arranged, and then the view data after the stack is carried out encoding and decoding handle, transmit and show.After the number of words in the captions viewing area satisfies captions requirement is set, promptly empty the viewing area, wait for the renewal of memory block.But do not upgrade if arrive the back storage area update time, then the captions viewing area also will all empty, and wait for next time and handle.
8. captions display mode as claimed in claim 4 can adopt following method:
After the number of words in the captions memory block satisfies captions desired one group of captions is set, promptly notify image synthesis unit (3) to superpose, and then to the view data after the stack carry out that encoding and decoding are handled, transmission and showing.But do not upgrade if arrive the back storage area update time, then the data with one group of captions of less than in the memory block superpose by image synthesis unit (3), and handle accordingly, transmit and show; Otherwise continue to wait for the memory block renewal.If the demonstration time of one group of captions has reached residence time, then empty the captions viewing area, continue to monitor the renewal of memory block.
CNA2008100569943A 2008-01-28 2008-01-28 Method for synchronously displaying subtitle in video telephone call Pending CN101500127A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008100569943A CN101500127A (en) 2008-01-28 2008-01-28 Method for synchronously displaying subtitle in video telephone call

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008100569943A CN101500127A (en) 2008-01-28 2008-01-28 Method for synchronously displaying subtitle in video telephone call

Publications (1)

Publication Number Publication Date
CN101500127A true CN101500127A (en) 2009-08-05

Family

ID=40946976

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008100569943A Pending CN101500127A (en) 2008-01-28 2008-01-28 Method for synchronously displaying subtitle in video telephone call

Country Status (1)

Country Link
CN (1) CN101500127A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010148890A1 (en) * 2009-06-23 2010-12-29 中兴通讯股份有限公司 Videophone and communication method thereof
CN102685413A (en) * 2012-05-11 2012-09-19 青岛海信电器股份有限公司 Method and system for simultaneously displaying caption and menu
CN105450970A (en) * 2014-06-16 2016-03-30 联想(北京)有限公司 Information processing method and electronic equipment
US9374536B1 (en) 2015-11-12 2016-06-21 Captioncall, Llc Video captioning communication system, devices and related methods for captioning during a real-time video communication session
CN105791713A (en) * 2016-03-21 2016-07-20 安徽声讯信息技术有限公司 Intelligent device for playing voices and captions synchronously
US9525830B1 (en) 2015-11-12 2016-12-20 Captioncall Llc Captioning communication systems
CN106899875A (en) * 2017-02-06 2017-06-27 合网络技术(北京)有限公司 The display control method and device of plug-in captions
CN107172377A (en) * 2017-06-30 2017-09-15 福州瑞芯微电子股份有限公司 A kind of data processing method and device of video calling
WO2017162010A1 (en) * 2016-03-21 2017-09-28 中兴通讯股份有限公司 Method and apparatus for realizing caption processing
CN108366182A (en) * 2018-02-13 2018-08-03 京东方科技集团股份有限公司 Text-to-speech synchronizes the calibration method reported and device, computer storage media
WO2019184650A1 (en) * 2018-03-29 2019-10-03 华为技术有限公司 Subtitle generation method and terminal
CN110415706A (en) * 2019-08-08 2019-11-05 常州市小先信息技术有限公司 A kind of technology and its application of superimposed subtitle real-time in video calling
CN111045624A (en) * 2019-11-27 2020-04-21 深圳创维-Rgb电子有限公司 Multi-screen simultaneous display method, display terminal and computer readable storage medium
CN111447325A (en) * 2020-04-03 2020-07-24 上海闻泰电子科技有限公司 Call auxiliary method, device, terminal and storage medium

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010148890A1 (en) * 2009-06-23 2010-12-29 中兴通讯股份有限公司 Videophone and communication method thereof
CN102685413A (en) * 2012-05-11 2012-09-19 青岛海信电器股份有限公司 Method and system for simultaneously displaying caption and menu
CN105450970A (en) * 2014-06-16 2016-03-30 联想(北京)有限公司 Information processing method and electronic equipment
US9998686B2 (en) 2015-11-12 2018-06-12 Sorenson Ip Holdings, Llc Transcribing video communication sessions
US9525830B1 (en) 2015-11-12 2016-12-20 Captioncall Llc Captioning communication systems
US10972683B2 (en) 2015-11-12 2021-04-06 Sorenson Ip Holdings, Llc Captioning communication systems
US9374536B1 (en) 2015-11-12 2016-06-21 Captioncall, Llc Video captioning communication system, devices and related methods for captioning during a real-time video communication session
US10051207B1 (en) 2015-11-12 2018-08-14 Sorenson Ip Holdings, Llc Captioning communication systems
US11509838B2 (en) 2015-11-12 2022-11-22 Sorenson Ip Holdings, Llc Captioning communication systems
CN105791713A (en) * 2016-03-21 2016-07-20 安徽声讯信息技术有限公司 Intelligent device for playing voices and captions synchronously
WO2017162010A1 (en) * 2016-03-21 2017-09-28 中兴通讯股份有限公司 Method and apparatus for realizing caption processing
CN106899875A (en) * 2017-02-06 2017-06-27 合网络技术(北京)有限公司 The display control method and device of plug-in captions
CN107172377A (en) * 2017-06-30 2017-09-15 福州瑞芯微电子股份有限公司 A kind of data processing method and device of video calling
CN108366182A (en) * 2018-02-13 2018-08-03 京东方科技集团股份有限公司 Text-to-speech synchronizes the calibration method reported and device, computer storage media
CN110324723A (en) * 2018-03-29 2019-10-11 华为技术有限公司 Method for generating captions and terminal
CN110324723B (en) * 2018-03-29 2022-03-08 华为技术有限公司 Subtitle generating method and terminal
WO2019184650A1 (en) * 2018-03-29 2019-10-03 华为技术有限公司 Subtitle generation method and terminal
CN110415706A (en) * 2019-08-08 2019-11-05 常州市小先信息技术有限公司 A kind of technology and its application of superimposed subtitle real-time in video calling
CN111045624A (en) * 2019-11-27 2020-04-21 深圳创维-Rgb电子有限公司 Multi-screen simultaneous display method, display terminal and computer readable storage medium
CN111447325A (en) * 2020-04-03 2020-07-24 上海闻泰电子科技有限公司 Call auxiliary method, device, terminal and storage medium

Similar Documents

Publication Publication Date Title
CN101500127A (en) Method for synchronously displaying subtitle in video telephone call
CN101309390B (en) Visual communication system, apparatus and subtitle displaying method
JP6179834B1 (en) Video conferencing equipment
US20110131498A1 (en) Presentation method and presentation system using identification label
CN101330339A (en) Method and apparatus for implementing call business based on IPTV, application server
JP2002535932A (en) Method and apparatus for selecting and displaying multimedia messages
CN101291379A (en) Mobile terminal and picture-phone implementing method thereof
CN101931779A (en) Video telephone and communication method thereof
CN101123630A (en) Communication method and system for voice and text conversion
EP1465423A1 (en) Videophone device and data transmitting/receiving method applied thereto
CN102420897B (en) Mobile phone communication information transmitting method and device
CN101764990A (en) Identifying label presentation method and system thereof as well as video providing device and video receiving device
CN102306430B (en) Self-help system and equipment for realizing AV (audio/video) integration
CN101815097A (en) Method and device for realizing call holding in CTD calling business
JP2004356896A (en) Automatic answering machine and automatic answering system using same, and telephone banking system
CN102447874B (en) Video scheduling system and method
CN101610380A (en) Dialing and answering method and device of video telephone, video telephone
CN101742215A (en) Realization method, mobile terminal and system of video telephone
CN101420585A (en) System and method for transmitting non-video information by visible telephone terminal
CN101895717A (en) Method for displaying pure voice terminal image in video session
CN101635820B (en) Set-top box system with multimedia communication function
CN100531360C (en) Set-top box system with multimedia communication function
JP6064209B2 (en) Call system and call relay method
JP5136823B2 (en) PoC system with fixed message function, communication method, communication program, terminal, PoC server
KR20040039603A (en) System and method for providing ringback tone

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090805