CN107205131A - Method, device and system for implementing a video call - Google Patents

Publication number
CN107205131A
Authority
CN
China
Prior art keywords
packet
video
text
terminal
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610161286.0A
Other languages
Chinese (zh)
Inventor
程岑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp
Priority to CN201610161286.0A
Priority to PCT/CN2017/075195 (WO2017157168A1)
Publication of CN107205131A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/40 Network security protocols
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/141 Systems for two-way working between two video terminals, e.g. videophone
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/75 Media network packet handling
    • H04L65/762 Media network packet handling at the source
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80 Responding to QoS
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/278 Subtitling
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Telephone Function (AREA)

Abstract

A method, device, and system for implementing a video call, including: a first terminal separately collects a digital audio signal and a digital video signal; the first terminal converts the digital audio signal into text information, encapsulates the text information into a text packet, encapsulates the digital audio signal into an audio packet, and encapsulates the digital video signal into a video packet; and the first terminal sends the text packet, the audio packet, and the video packet to a second terminal separately.

Description

Method, device, and system for implementing a video call
Technical field
This document relates to, but is not limited to, the field of video calls, and in particular to a method, device, and system for implementing a video call.
Background art
With the rapid development of mobile and Internet broadband technology, visual communication value-added services have spread quickly among ordinary users. Technology built on these services supports value-added services such as face-to-face communication and online video teaching. If a visual communication service adds subtitles synchronized with the audio, it can serve hearing-impaired users better and can also usefully supplement the actual audio when network conditions are poor.
In the related art, a method for adding voice subtitles to a video call generally includes:
The first terminal separately collects a digital audio signal and a digital video signal; performs voice coding on the collected digital audio signal and encapsulates the voice-coded digital audio signal into an audio packet; converts the collected digital audio signal into text information by speech recognition, superimposes the text information on the collected digital video signal, performs video coding on the composite signal, and encapsulates the video-coded digital video signal into a video packet; and sends the audio packet and the video packet to the second terminal separately.
The second terminal receives the audio packet and the video packet, decodes the voice-coded digital audio signal in the audio packet to obtain the digital audio signal and plays it, and decodes the video-coded digital video signal in the video packet to obtain the digital video signal and displays it.
In the above method, video packets are relatively large, so when network conditions are poor they are more likely to be lost or to suffer jitter. Because the text information is carried inside the video packets, it is lost together with them, which causes information loss during the video call.
Summary of the invention
The embodiments of the present invention propose a method, device, and system for implementing a video call that can reduce information loss during a video call when network conditions are poor.
An embodiment of the present invention proposes a method for implementing a video call, including:
a first terminal separately collects a digital audio signal and a digital video signal;
the first terminal converts the digital audio signal into text information, encapsulates the text information into a text packet, encapsulates the digital audio signal into an audio packet, and encapsulates the digital video signal into a video packet;
the first terminal sends the text packet, the audio packet, and the video packet to a second terminal separately.
Optionally, before the digital audio signal is encapsulated into the audio packet, the method further includes: the first terminal performs voice coding on the digital audio signal;
encapsulating the digital audio signal into the audio packet includes: the first terminal encapsulates the voice-coded digital audio signal into the audio packet.
Optionally, before the digital video signal is encapsulated into the video packet, the method further includes: the first terminal performs video coding on the digital video signal;
encapsulating the digital video signal into the video packet includes: the first terminal encapsulates the video-coded digital video signal into the video packet.
An embodiment of the present invention also proposes a method for implementing a video call, including:
a second terminal receives a text packet from a first terminal;
when the second terminal determines that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, the second terminal displays the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
Optionally, when the second terminal determines that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, the method further includes:
the second terminal caches the received text packet.
Optionally, when the second terminal receives neither an audio packet nor a video packet within a preset time after receiving the text packet, the method further includes:
the second terminal displays the text information in the cached text packets.
Optionally, after the second terminal receives the text packet from the first terminal, and before the second terminal determines that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, the method further includes:
the second terminal determines that the subtitle display function is enabled.
An embodiment of the present invention also proposes a first terminal, including:
an acquisition module, configured to separately collect a digital audio signal and a digital video signal;
a first processing module, configured to convert the digital audio signal into text information, encapsulate the text information into a text packet, encapsulate the digital audio signal into an audio packet, and encapsulate the digital video signal into a video packet;
a sending module, configured to send the text packet, the audio packet, and the video packet to a second terminal separately.
Optionally, the first processing module is specifically configured to:
convert the digital audio signal into text information, perform voice coding on the digital audio signal, encapsulate the text information into the text packet, encapsulate the voice-coded digital audio signal into the audio packet, perform video coding on the digital video signal, and encapsulate the video-coded digital video signal into the video packet.
An embodiment of the present invention also proposes a second terminal, including:
a receiving module, configured to receive a text packet from a first terminal;
a second processing module, configured to, upon determining that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, display the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
Optionally, the second processing module is further configured to:
cache the received text packet upon determining that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed.
Optionally, the second processing module is further configured to:
display the text information in the cached text packets when neither an audio packet nor a video packet is received within a preset time after the text packet is received.
An embodiment of the present invention also proposes a system for implementing a video call, including:
a first terminal, configured to separately collect a digital audio signal and a digital video signal; convert the digital audio signal into text information, encapsulate the text information into a text packet, encapsulate the digital audio signal into an audio packet, and encapsulate the digital video signal into a video packet; and send the text packet, the audio packet, and the video packet to a second terminal separately;
a second terminal, configured to receive the text packet from the first terminal and, upon determining that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, display the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
Optionally, the second terminal is further configured to:
cache the received text packet upon determining that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed.
Optionally, the second terminal is further configured to:
display the text information in the cached text packets when neither an audio packet nor a video packet is received within a preset time after the text packet is received.
Compared with the related art, the technical solution of the embodiments of the present invention includes: a first terminal separately collects a digital audio signal and a digital video signal; the first terminal converts the digital audio signal into text information, encapsulates the text information into a text packet, encapsulates the digital audio signal into an audio packet, and encapsulates the digital video signal into a video packet; and the first terminal sends the text packet, the audio packet, and the video packet to a second terminal separately. With this solution, because the first terminal sends the text packet, the audio packet, and the video packet separately, the loss of video packets under poor network conditions does not cause the text to be lost, which reduces information loss during the video call.
Brief description of the drawings
The drawings of the embodiments of the present invention are described below. They are provided for a further understanding of the present invention and, together with the specification, serve to explain it; they do not limit the scope of protection of the invention.
Fig. 1 is a flowchart of the method for implementing a video call at the sending end according to an embodiment of the present invention;
Fig. 2 is a flowchart of the method for implementing a video call at the receiving end according to an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of the first terminal according to an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of the second terminal according to an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of the system for implementing a video call according to an embodiment of the present invention.
Detailed description of the embodiments
To help those skilled in the art understand the invention, it is further described below with reference to the drawings; this description is not intended to limit the scope of protection of the invention. It should be noted that, provided there is no conflict, the embodiments of this application and the various implementations within them can be combined with one another.
Referring to Fig. 1, an embodiment of the present invention proposes a method for implementing a video call, including:
Step 100: the first terminal separately collects a digital audio signal and a digital video signal.
In this step, the first terminal may collect the digital audio signal at the collection interval specified in G.711 (an audio coding scheme defined by the International Telecommunication Union) and collect the digital video signal according to a preset video frame rate. For example, a digital audio signal is collected every 10 milliseconds (ms) and a digital video signal is collected every 40 ms.
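The collection timing in this example can be sketched as follows. This is only an illustration under stated assumptions: the function name `capture_schedule` and its parameters are hypothetical, and the 10 ms and 40 ms intervals are the example values given above, not values the method requires.

```python
def capture_schedule(duration_ms, audio_interval_ms=10, video_interval_ms=40):
    """Return (timestamp_ms, kind) capture events in time order.

    The intervals default to the example values from the description
    (audio every 10 ms, video every 40 ms); both are assumptions.
    """
    events = [(t, "audio") for t in range(0, duration_ms, audio_interval_ms)]
    events += [(t, "video") for t in range(0, duration_ms, video_interval_ms)]
    # Sorting by timestamp interleaves the two independent capture clocks.
    return sorted(events)


if __name__ == "__main__":
    for timestamp_ms, kind in capture_schedule(80):
        print(timestamp_ms, kind)
```

Each capture timestamp produced here is the value that would later be carried in the timestamp field of the corresponding packet.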
Step 101: the first terminal converts the digital audio signal into text information, encapsulates the text information into a text packet, encapsulates the digital audio signal into an audio packet, and encapsulates the digital video signal into a video packet.
In this step, the first terminal may convert the digital audio signal into text information using speech recognition.
In this step, the text packet, audio packet, or video packet may be encapsulated according to the Real-time Transport Protocol (RTP) packet specification.
The format of the RTP packet header is shown in Table 1.
Table 1
In Table 1, V is the protocol version, 2 bits.
P is the padding bit, 1 bit; when P is set, the end of the RTP packet contains additional padding bytes.
X is the extension bit, 1 bit; when X is set, an extension header follows the RTP packet header.
CC is the number of contributing source (CSRC) identifiers.
M is the marker bit, 1 bit.
PT is the payload type, 7 bits; for a text packet, a type that is unused in the related art, such as 20, can be used.
Sequence number, 16 bits: incremented by 1 for each RTP packet sent. In the embodiments of the present invention, text packets, audio packets, and video packets are numbered independently.
Timestamp, 32 bits: records the sampling instant of the first byte in the RTP packet. For an audio packet or a video packet, the timestamp is the time at which collection started; for a text packet, the timestamp is the time at which collection of the corresponding audio packet started.
Synchronization source identifier (SSRC), 32 bits: identifies the source of the RTP packet; no two sources in the same RTP session may have the same SSRC value.
CSRC list: 0 to 15 items of 32 bits each; this field is not a mandatory part of the RTP packet header.
The timestamp field in a text packet is the time at which the digital audio signal or digital video signal was collected, and the payload type is a speech-text information type (an undefined value, such as 20, can be used).
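The header fields described above can be packed and parsed as in the following minimal sketch. It follows the standard 12-byte fixed RTP header layout, in which CC occupies 4 bits; the function names and the constant `TEXT_PAYLOAD_TYPE` are hypothetical, and the value 20 is only the example payload type mentioned in the description.

```python
import struct

RTP_VERSION = 2
TEXT_PAYLOAD_TYPE = 20  # example value from the description; an assumption


def pack_rtp_header(payload_type, seq, timestamp, ssrc, marker=False, csrcs=()):
    """Build a fixed RTP header (P=0, X=0) followed by an optional CSRC list."""
    if len(csrcs) > 15:
        raise ValueError("at most 15 CSRC identifiers fit in the CC field")
    byte0 = (RTP_VERSION << 6) | len(csrcs)             # V, P=0, X=0, CC
    byte1 = (int(marker) << 7) | (payload_type & 0x7F)  # M, PT
    header = struct.pack("!BBHII", byte0, byte1, seq & 0xFFFF,
                         timestamp & 0xFFFFFFFF, ssrc & 0xFFFFFFFF)
    return header + b"".join(struct.pack("!I", c) for c in csrcs)


def unpack_rtp_header(data):
    """Parse the 12-byte fixed header into a dict of named fields."""
    byte0, byte1, seq, timestamp, ssrc = struct.unpack("!BBHII", data[:12])
    return {
        "version": byte0 >> 6,
        "padding": bool(byte0 & 0x20),
        "extension": bool(byte0 & 0x10),
        "csrc_count": byte0 & 0x0F,
        "marker": bool(byte1 & 0x80),
        "payload_type": byte1 & 0x7F,
        "sequence_number": seq,
        "timestamp": timestamp,
        "ssrc": ssrc,
    }


if __name__ == "__main__":
    header = pack_rtp_header(TEXT_PAYLOAD_TYPE, seq=7, timestamp=1600, ssrc=0x1234)
    print(unpack_rtp_header(header))
```

A receiver could use the parsed `payload_type` to tell text packets apart from audio and video packets before applying the display rules.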
Step 102: the first terminal sends the text packet, the audio packet, and the video packet to the second terminal separately.
In this step, different types of packets may be sent according to different strategies. For example, audio packets are sent at the audio coding sampling frequency, video packets are sent at the agreed frame-rate interval, and text packets are sent at the audio coding sampling frequency.
Optionally, before the digital audio signal is encapsulated into the audio packet, the method further includes: the first terminal performs voice coding on the digital audio signal; accordingly,
encapsulating the digital audio signal into the audio packet includes: the first terminal encapsulates the voice-coded digital audio signal into the audio packet.
Optionally, before the digital video signal is encapsulated into the video packet, the method further includes: the first terminal performs video coding on the digital video signal; accordingly,
encapsulating the digital video signal into the video packet includes: the first terminal encapsulates the video-coded digital video signal into the video packet.
With the solution of the embodiments of the present invention, because the first terminal sends the text packet, the audio packet, and the video packet to the second terminal separately, the loss of video packets under poor network conditions does not cause the text to be lost, which reduces information loss during the video call.
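The sending-side flow of steps 100 to 102 might be sketched as below. The class `Sender`, the `Packet` type, and the injected `recognize`, `encode_audio`, and `encode_video` callables are all hypothetical stand-ins for the speech recognizer and codecs, which the description does not specify; the sketch only shows the three packet types being built separately with independent sequence numbers and the text packet reusing the audio timestamp.

```python
from dataclasses import dataclass
from itertools import count


@dataclass
class Packet:
    kind: str        # "text", "audio" or "video"
    seq: int         # per-kind sequence number
    timestamp: int   # capture time in ms
    payload: bytes


class Sender:
    def __init__(self, recognize, encode_audio, encode_video):
        # The three callables are placeholders for real components.
        self._recognize = recognize
        self._encode_audio = encode_audio
        self._encode_video = encode_video
        # Text, audio and video packets are numbered independently.
        self._seq = {"text": count(), "audio": count(), "video": count()}

    def _make(self, kind, timestamp, payload):
        return Packet(kind, next(self._seq[kind]), timestamp, payload)

    def on_audio(self, timestamp, samples):
        """Build the audio packet and, if speech was recognized, a text packet."""
        packets = [self._make("audio", timestamp, self._encode_audio(samples))]
        text = self._recognize(samples)
        if text:
            # The text packet carries the capture time of its audio.
            packets.append(self._make("text", timestamp, text.encode("utf-8")))
        return packets

    def on_video(self, timestamp, frame):
        return [self._make("video", timestamp, self._encode_video(frame))]


if __name__ == "__main__":
    sender = Sender(lambda pcm: "hello", lambda pcm: pcm, lambda frame: frame)
    for packet in sender.on_audio(0, b"\x00\x01") + sender.on_video(0, b"\xff"):
        print(packet)
```

Because the text travels in its own small packet, dropping a video packet in this sketch leaves the corresponding text packet unaffected.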
Referring to Fig. 2, an embodiment of the present invention also proposes a method for implementing a video call, including:
Step 200: the second terminal receives a text packet from the first terminal.
Step 201: when the second terminal determines that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, it displays the text information in those text packets, among the received text packet and the cached text packets, whose timestamp corresponds to a time less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed.
In this step, the text information may be displayed according to a preset display area and/or font size. Specifically, the maximum number of characters the screen can show at one time can be determined from the display area and/or font size; the number of times the text information of one text packet must be displayed is then calculated; the dwell time of each display is determined from the collection interval of the audio packet corresponding to the text packet; and the text is displayed according to that dwell time.
For example, if the audio packet corresponding to a text packet is collected once every 20 ms, the text packet contains 100 characters in total, and 10 characters can be displayed at a time, then 10 displays are needed and the dwell time of each display is 2 ms.
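The arithmetic in this example can be written out as a short sketch. The function name `plan_caption_pages` and its parameters are hypothetical; the sketch assumes the dwell time is the audio collection interval divided evenly across the pages, which is what the numbers in the example imply.

```python
def plan_caption_pages(text, chars_per_page, audio_interval_ms):
    """Split caption text into screen-sized pages and compute the dwell time.

    chars_per_page is the number of characters the display area can show at
    once; audio_interval_ms is the collection interval of the audio packet
    that the text packet corresponds to.
    """
    pages = [text[i:i + chars_per_page]
             for i in range(0, len(text), chars_per_page)]
    # Share the audio interval evenly between the pages (assumed rule).
    dwell_ms = audio_interval_ms / len(pages) if pages else 0.0
    return pages, dwell_ms


if __name__ == "__main__":
    pages, dwell_ms = plan_caption_pages("x" * 100, chars_per_page=10,
                                         audio_interval_ms=20)
    print(len(pages), dwell_ms)
```

With the example values (100 characters, 10 characters per page, a 20 ms interval) this yields 10 pages with a dwell time of 2 ms each.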
In this step, the text information may be displayed on the graphics layer of the screen, that is, superimposed on the video layer that displays the digital video signal.
Optionally, between step 200 and step 201 the method further includes:
the second terminal determines that the subtitle display function is enabled.
When the second terminal determines that the subtitle display function is disabled, this flow ends.
The method further includes:
when the second terminal determines that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, it caches the received text packet.
The method further includes:
when the second terminal receives neither an audio packet nor a video packet within the preset time, it displays the text information in the cached text packets.
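The receiving-side rules (display when the text timestamp is not ahead of the media being played, cache otherwise, and display cached text when no media arrives within the preset time) can be sketched as follows. The class `CaptionBuffer` and its method names are hypothetical, time values are plain millisecond integers, and the timeout test is one possible reading of "within the preset time after receiving the text packet".

```python
class CaptionBuffer:
    def __init__(self, timeout_ms):
        self._timeout_ms = timeout_ms
        self._cache = []            # entries: (timestamp, order, arrival_ms, text)
        self._order = 0             # tie-breaker that preserves arrival order
        self._last_media_ms = None  # arrival time of the latest audio/video packet

    def on_media(self, now_ms):
        """Record that an audio or video packet arrived."""
        self._last_media_ms = now_ms

    def on_text(self, ts, text, playing_ts, now_ms):
        """Cache the new text packet, then return every text that is now due."""
        self._cache.append((ts, self._order, now_ms, text))
        self._order += 1
        if playing_ts is None:      # nothing is being played or displayed yet
            return []
        self._cache.sort()
        due = [e for e in self._cache if e[0] <= playing_ts]
        self._cache = [e for e in self._cache if e[0] > playing_ts]
        return [e[3] for e in due]

    def flush_if_stalled(self, now_ms):
        """Return cached texts whose media never arrived within the timeout."""
        released, kept = [], []
        for entry in sorted(self._cache):
            _, _, arrival_ms, text = entry
            no_media_since = (self._last_media_ms is None
                              or self._last_media_ms < arrival_ms)
            if no_media_since and now_ms - arrival_ms >= self._timeout_ms:
                released.append(text)
            else:
                kept.append(entry)
        self._cache = kept
        return released


if __name__ == "__main__":
    buf = CaptionBuffer(timeout_ms=500)
    print(buf.on_text(ts=100, text="later", playing_ts=50, now_ms=0))
    print(buf.on_text(ts=40, text="now", playing_ts=50, now_ms=10))
    print(buf.flush_if_stalled(now_ms=600))
```

A subtitle on/off switch, as in the optional step above, could simply skip calling `on_text` while the function is disabled.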
In the above method, after receiving an audio packet and/or a video packet, the second terminal may play or display it according to the rules agreed in the audio/video decoding protocol standard.
Specifically, after receiving an audio packet, the second terminal may play it according to the rules agreed in an audio decoding protocol standard (such as G.711); after receiving a video packet, it may display it according to the rules agreed in a video decoding protocol (such as H.264).
Referring to Fig. 3, an embodiment of the present invention also proposes a first terminal, including:
an acquisition module, configured to separately collect a digital audio signal and a digital video signal;
a first processing module, configured to convert the digital audio signal into text information, encapsulate the text information into a text packet, encapsulate the digital audio signal into an audio packet, and encapsulate the digital video signal into a video packet;
a sending module, configured to send the text packet, the audio packet, and the video packet to a second terminal separately.
In the first terminal of the embodiment of the present invention, the first processing module is specifically configured to:
convert the digital audio signal into text information, perform voice coding on the digital audio signal, encapsulate the text information into the text packet, encapsulate the voice-coded digital audio signal into the audio packet, perform video coding on the digital video signal, and encapsulate the video-coded digital video signal into the video packet.
Referring to Fig. 4, an embodiment of the present invention also proposes a second terminal, including:
a receiving module, configured to receive a text packet from a first terminal;
a second processing module, configured to, upon determining that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, display the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
In the second terminal of the embodiment of the present invention, the second processing module is further configured to:
cache the received text packet upon determining that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed.
In the second terminal of the embodiment of the present invention, the second processing module is further configured to:
display the text information in the cached text packets when neither an audio packet nor a video packet is received within a preset time after the text packet is received.
Referring to Fig. 5, an embodiment of the present invention also proposes a system for implementing a video call, including:
a first terminal, configured to separately collect a digital audio signal and a digital video signal; convert the digital audio signal into text information, encapsulate the text information into a text packet, encapsulate the digital audio signal into an audio packet, and encapsulate the digital video signal into a video packet; and send the text packet, the audio packet, and the video packet to a second terminal separately;
a second terminal, configured to receive the text packet from the first terminal and, upon determining that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, display the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
In the system of the embodiment of the present invention, the second terminal is further configured to:
cache the received text packet upon determining that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed.
In the system of the embodiment of the present invention, the second terminal is further configured to:
display the text information in the cached text packets when neither an audio packet nor a video packet is received within a preset time after the text packet is received.
It should be noted that the embodiments described above are provided only to help those skilled in the art understand the invention and are not intended to limit its scope of protection. Any obvious replacement or improvement that those skilled in the art make to the present invention without departing from its inventive concept falls within the scope of protection of the present invention.

Claims (15)

1. A method for implementing a video call, characterized by including:
a first terminal separately collects a digital audio signal and a digital video signal;
the first terminal converts the digital audio signal into text information, encapsulates the text information into a text packet, encapsulates the digital audio signal into an audio packet, and encapsulates the digital video signal into a video packet;
the first terminal sends the text packet, the audio packet, and the video packet to a second terminal separately.
2. The method according to claim 1, characterized in that, before the digital audio signal is encapsulated into the audio packet, the method further includes: the first terminal performs voice coding on the digital audio signal;
encapsulating the digital audio signal into the audio packet includes: the first terminal encapsulates the voice-coded digital audio signal into the audio packet.
3. The method according to claim 1, characterized in that, before the digital video signal is encapsulated into the video packet, the method further includes: the first terminal performs video coding on the digital video signal;
encapsulating the digital video signal into the video packet includes: the first terminal encapsulates the video-coded digital video signal into the video packet.
4. A method for implementing a video call, characterized by including:
a second terminal receives a text packet from a first terminal;
when the second terminal determines that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, the second terminal displays the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
5. The method according to claim 4, characterized in that, when the second terminal determines that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, the method further includes:
the second terminal caches the received text packet.
6. The method according to claim 5, characterized in that, when the second terminal receives neither an audio packet nor a video packet within a preset time after receiving the text packet, the method further includes:
the second terminal displays the text information in the cached text packets.
7. The method according to claim 4, characterized in that, after the second terminal receives the text packet from the first terminal, and before the second terminal determines that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, the method further includes:
the second terminal determines that the subtitle display function is enabled.
8. A first terminal, characterized by including:
an acquisition module, configured to separately collect a digital audio signal and a digital video signal;
a first processing module, configured to convert the digital audio signal into text information, encapsulate the text information into a text packet, encapsulate the digital audio signal into an audio packet, and encapsulate the digital video signal into a video packet;
a sending module, configured to send the text packet, the audio packet, and the video packet to a second terminal separately.
9. The first terminal according to claim 8, characterized in that the first processing module is specifically configured to:
convert the digital audio signal into text information, perform voice coding on the digital audio signal, encapsulate the text information into the text packet, encapsulate the voice-coded digital audio signal into the audio packet, perform video coding on the digital video signal, and encapsulate the video-coded digital video signal into the video packet.
10. a kind of second terminal, it is characterised in that including:
Receiving module, for receiving the text bag from first terminal;
Second processing module, is less than for the timestamp corresponding time in the text bag judging to receive Or equal to the timestamp corresponding time of the audio pack played or the video bag shown, display connects In the text bag and the text bag of caching that receive, the timestamp field corresponding time, which is less than or equal to, to be broadcast Text in the text bag of the timestamp field corresponding time of the audio pack put or the video bag shown Information.
11. second terminal according to claim 10, it is characterised in that the Second processing module It is additionally operable to:
It is more than the sound played when the timestamp corresponding time in the text bag received of judging Frequency was wrapped or during the timestamp of video bag that shows corresponding time, the text bag received described in caching.
12. The second terminal according to claim 11, characterized in that the second processing module is further configured to:
display the text information in the cached text packets when no audio packet and no video packet is received within a preset time after the text packet is received.
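The display, cache and timeout behaviour recited in claims 10 to 12 can be illustrated with the following minimal sketch. It is not the applicant's implementation. The class `SubtitleSync`, its method names, the millisecond clock values and the 500 ms default for the "preset time" are all assumptions made for illustration. The claims also leave open which text packet starts the preset-time window, so the sketch measures it from the oldest cached packet.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class TextPacket:
    timestamp: int   # time carried in the packet's timestamp field, in ms
    text: str


@dataclass
class SubtitleSync:
    timeout_ms: int = 500                 # hypothetical "preset time"
    playback_ts: int = -1                 # timestamp of the media packet being played
    last_media_ms: Optional[int] = None   # local arrival time of the last media packet
    cache: List[Tuple[int, TextPacket]] = field(default_factory=list)

    def _release(self) -> List[str]:
        # Display every held packet whose timestamp <= the playback timestamp.
        ready = sorted(
            (p for _, p in self.cache if p.timestamp <= self.playback_ts),
            key=lambda p: p.timestamp,
        )
        self.cache = [(t, p) for t, p in self.cache if p.timestamp > self.playback_ts]
        return [p.text for p in ready]

    def on_media(self, timestamp: int, now_ms: int) -> List[str]:
        """An audio or video packet with this timestamp starts playing."""
        self.playback_ts = timestamp
        self.last_media_ms = now_ms
        return self._release()

    def on_text(self, packet: TextPacket, now_ms: int) -> List[str]:
        """Show the packet if playback has reached it, otherwise keep it cached."""
        self.cache.append((now_ms, packet))
        return self._release()

    def on_tick(self, now_ms: int) -> List[str]:
        """Show cached text if no media arrived within the preset time."""
        if not self.cache:
            return []
        oldest_arrival = self.cache[0][0]
        media_since = self.last_media_ms is not None and self.last_media_ms >= oldest_arrival
        if media_since or now_ms - oldest_arrival < self.timeout_ms:
            return []
        shown = [p.text for _, p in self.cache]
        self.cache.clear()
        return shown
```

Each method returns the list of text strings to display at that moment. A caller would invoke `on_media` when playback advances, `on_text` when a text packet arrives, and `on_tick` periodically so that subtitles still appear if the audio and video streams stall.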
13. A system for realizing video calling, characterized by comprising:
a first terminal, configured to separately collect a digital audio signal and a digital video signal; convert the digital audio signal into text information, encapsulate the text information into a text packet, encapsulate the digital audio signal into an audio packet, and encapsulate the digital video signal into a video packet; and send the text packet, the audio packet and the video packet separately to a second terminal;
the second terminal, configured to receive the text packet from the first terminal; and, upon determining that the time corresponding to the timestamp in the received text packet is less than or equal to the time corresponding to the timestamp of the audio packet being played or the video packet being displayed, display the text information in those text packets, among the received text packet and the cached text packets, whose timestamp field corresponds to a time less than or equal to the time corresponding to the timestamp field of the audio packet being played or the video packet being displayed.
14. The system according to claim 13, characterized in that the second terminal is further configured to:
cache the received text packet upon determining that the time corresponding to the timestamp in the received text packet is greater than the time corresponding to the timestamp of the audio packet being played or the video packet being displayed.
15. The system according to claim 14, characterized in that the second terminal is further configured to:
display the text information in the cached text packets when no audio packet and no video packet is received within a preset time after the text packet is received.
CN201610161286.0A 2016-03-18 2016-03-18 A kind of methods, devices and systems for realizing video calling Pending CN107205131A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610161286.0A CN107205131A (en) 2016-03-18 2016-03-18 A kind of methods, devices and systems for realizing video calling
PCT/CN2017/075195 WO2017157168A1 (en) 2016-03-18 2017-02-28 Method, terminal, system and computer storage medium for video calling

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610161286.0A CN107205131A (en) 2016-03-18 2016-03-18 A kind of methods, devices and systems for realizing video calling

Publications (1)

Publication Number Publication Date
CN107205131A true CN107205131A (en) 2017-09-26

Family

ID=59851446

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610161286.0A Pending CN107205131A (en) 2016-03-18 2016-03-18 A kind of methods, devices and systems for realizing video calling

Country Status (2)

Country Link
CN (1) CN107205131A (en)
WO (1) WO2017157168A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109933574A (en) * 2019-02-27 2019-06-25 常州猛犸电动科技有限公司 A kind of unique key generation method, device and terminal device
CN110290341A (en) * 2019-07-24 2019-09-27 长沙世邦通信技术有限公司 Follow video intercom method, system and the storage medium of face synchronously displaying subtitle
CN110415706A (en) * 2019-08-08 2019-11-05 常州市小先信息技术有限公司 A kind of technology and its application of superimposed subtitle real-time in video calling

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1992861A (en) * 2005-12-26 2007-07-04 财团法人工业技术研究院 Recording medium for storing subtitling data structure method for playing the subtitling data
CN101035262A (en) * 2007-04-19 2007-09-12 深圳市融合视讯科技有限公司 Video information transmission method
US20100194979A1 (en) * 2008-11-02 2010-08-05 Xorbit, Inc. Multi-lingual transmission and delay of closed caption content through a delivery system
CN102957892A (en) * 2011-08-24 2013-03-06 三星电子(中国)研发中心 Method, system and device for realizing audio and video conference
CN103685985A (en) * 2012-09-17 2014-03-26 联想(北京)有限公司 Communication method, transmitting device, receiving device, voice processing equipment and terminal equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7561178B2 (en) * 2005-09-13 2009-07-14 International Business Machines Corporation Method, apparatus and computer program product for synchronizing separate compressed video and text streams to provide closed captioning and instant messaging integration with video conferencing
KR20150021258A (en) * 2013-08-20 2015-03-02 삼성전자주식회사 Display apparatus and control method thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
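Lin Jianhao: "Research on Audio and Video Session Technology Based on the SIP Protocol", China Master's Theses Full-text Database, Information Science and Technology *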

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109933574A (en) * 2019-02-27 2019-06-25 常州猛犸电动科技有限公司 A kind of unique key generation method, device and terminal device
CN109933574B (en) * 2019-02-27 2021-03-19 常州猛犸电动科技有限公司 Unique key generation method and device and terminal equipment
CN110290341A (en) * 2019-07-24 2019-09-27 长沙世邦通信技术有限公司 Follow video intercom method, system and the storage medium of face synchronously displaying subtitle
CN110415706A (en) * 2019-08-08 2019-11-05 常州市小先信息技术有限公司 A kind of technology and its application of superimposed subtitle real-time in video calling

Also Published As

Publication number Publication date
WO2017157168A1 (en) 2017-09-21

Similar Documents

Publication Publication Date Title
US7697559B2 (en) Communication terminal, server, relay apparatus, broadcast communication system, broadcast communication method, and program
KR102229109B1 (en) Transmitting apparatus and receiving apparatus and signal processing method thereof
CN1722801B (en) Emergency alert message data structure, emergency alert message processing method and broadcast receiver
KR102225948B1 (en) Method and device for transmitting and receiving broadcast service in hybrid broadcast system on basis of connection of terrestrial broadcast network and internet protocol network
KR101733501B1 (en) Broadcast signal transmitting method, broadcast signal receiving method, broadcast signal transmitting apparatus, and broadcast signal receiving apparatus
JP6523249B2 (en) Method and apparatus for compressing packet header
KR101721884B1 (en) Method for transmitting broadcast signal, method for receiving broadcast signal, apparatus for transmitting broadcast signal, and apparatus for receiving broadcast signal
CN112491781B (en) Broadcast signal transmitting method and apparatus, and broadcast signal receiving method and apparatus
US9578360B2 (en) Information presentation device and method
US20090177952A1 (en) Transcoder and receiver
KR101764634B1 (en) Method for transmitting broadcast signal, method for receiving broadcast signal, apparatus for transmitting broadcast signal, and apparatus for receiving broadcast signal
KR20080047411A (en) Transmission of multiplex protocol data units in physical layer packets
CN105429984A (en) Media play method, equipment and music teaching system
CN107205131A (en) A kind of methods, devices and systems for realizing video calling
KR101792519B1 (en) Broadcasting signal transmitting device, broadcasting signal receiving device, broadcasting signal transmitting method, and broadcasting signal receiving method
US20170171070A1 (en) Broadcast signal transmission device, broadcast signal reception device, broadcast signal transmission method, and broadcast signal reception method
US20060080715A1 (en) Apparatus and method for processing VOD data in a mobile terminal
CN100534198C (en) Information source adapter based on SAF
CN115103228A (en) Video streaming transmission method, device, electronic equipment, storage medium and product
JP4655870B2 (en) Packet transmission / reception system and elapsed time measurement method
JP3452914B2 (en) Data storage communication method and data generation method
KR20210030336A (en) Transmitting apparatus and receiving apparatus and signal processing method thereof
JP2000216828A (en) Data storing communication equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication (application publication date: 20170926)