CN103581604A - Video call device and method - Google Patents

Video call device and method Download PDF

Info

Publication number
CN103581604A
CN103581604A CN201210269514.8A CN201210269514A CN103581604A CN 103581604 A CN103581604 A CN 103581604A CN 201210269514 A CN201210269514 A CN 201210269514A CN 103581604 A CN103581604 A CN 103581604A
Authority
CN
China
Prior art keywords
video
module
data
audio
video data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210269514.8A
Other languages
Chinese (zh)
Inventor
王亮亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Simcom Ltd
Original Assignee
Shanghai Simcom Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Simcom Ltd filed Critical Shanghai Simcom Ltd
Priority to CN201210269514.8A priority Critical patent/CN103581604A/en
Publication of CN103581604A publication Critical patent/CN103581604A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention relates to the technical field of communication, and discloses a video call device. The video call device comprises an audio input and output module, a video input and output module, an audio encoding and decoding module, a video encoding and decoding module, an audio and video synchronizing module, a first network module and a second network module. The audio input and output module is connected with the audio encoding and decoding module, collects called audio data and transmits the called audio data to the audio encoding and decoding module to carry out encoding on the called audio data, and outputs the decoded audio data. The video input and output module is connected with the video encoding and decoding module, collects called video data and transmits the called video data to the video encoding and decoding module to carry out encoding on the called video data, and outputs the decoded video data. The audio and video synchronizing module is respectively connected with the audio encoding and decoding module and the video encoding and decoding module, is used for synchronizing the called encoded audio and video data and synchronizing incoming-call audio and video data which are not decoded. The first network module is connected with the audio encoding and decoding module and receives and transmits the audio data through the circuit domain of the first network module. The second network module is connected with the video encoding and decoding module, and receives and transmits the video data through the packet domain of the second network module.

Description

Video conversation apparatus and method
Technical field
The present invention relates to communication technical field, relate in particular to a kind of video conversation apparatus and method that can realize video calling on dual-mode handset.
Background technology
Day by day universal along with mobile terminals such as mobile phones, the scope of application of mobile terminal is more and more wider, and user is also more and more higher to the requirement of mobile terminal.The video calling of mobile phone of take is example, and 3G network TD-SCDMA and WCDMA, in the visual telephone bandwidth of the reserved 64Kbits of circuit domain, make mobile terminal that the video call service of circuit domain video telephone can be provided at present.Referring to accompanying drawing 1, adopt at present the mobile terminal of TD-SCDMA or WCDMA that the flow chart of video call service is provided.
From data plane, during data acquisition caller, comprise the steps: S11: mobile terminal from microphone collect voice data, from camera collection to video data; S12: voice data is adopted to AMR coding, video data is adopted to H.263 coding; S13: the voice data encoding is sent to the H.223 Level2 of adaptation layer (AL), video data is sent to the H.223 Level3 of adaptation layer (AL), and audio, video data is undertaken multiplexing by multiplex layer (MUX) afterwards; S14: multiplex layer adopts multiplexing code stream translation-protocol (V.34) that audio and video data streams is multiplexed into a stream; S15: multiplexing code stream sends to the base band (BaseBand) of TD-SCDMA or WCDMA; S16: multiplexing code stream sends to the circuit domain of TD-SCDMA or WCDMA by BaseBand.When called, be a reverse process, wherein H.223 at multiplex layer, the code stream receiving resolved into a plurality of Media Streams.
From control plane, comprise the steps: that S17:H.245 communicates control; The control information of S18:H.245 is transmitted by the Level1 of adaptation layer H.223, is guaranteed the correct transmission of control information by control channel IP fragmentation and reassembly agreement (CCSRL) and data packet retransmission agreement (NSRP or WNSRP).H.245 communicate control and mainly comprise that being H.245 responsible for capabilities exchange (uses the method H.245 providing, the ability of exchange mobile terminal), principal and subordinate determines (the method H.245 providing is provided, is determined calling party and callee), opens and close logical channel (setting up or disconnect the logic channel of audio frequency, video and market demand).
Below the primary video call flow that adopts the mobile phone of TD-SCDMA or WCDMA:
1) first, two mobile phones connect;
2) then, two mobile phones are set up data communication;
3) starting to set up visual telephone connects; This process mainly comprises the method that a) H.245 use provides, the ability of exchange mobile terminal; B) use the method H.245 providing, determine calling party and callee; C) send H.223 multi-meter; D) start audio/video encoding/decoding and transceiving data;
4) two cellphone subscribers carry out video calling, in the process of call, can change channel attribute, ability etc. by the method for H.245 definition;
5) two cellphone subscribers finish call;
6) two mobile phones disconnect.
In TD-SCDMA or WCDMA, due to the restriction of circuit domain bandwidth, image transmitting adopts high compression ratio and low rate, causes image effect not ideal.
The dual-mode handset that also has in the market a kind of CDMA+GSM, this dual-mode handset is supported dual-network and dual-standby bilateral specifically, and wherein, bilateral will realize the standby simultaneously of two cards, that is to say that at mobile phone the number of any card can be driven into not under talking state.And because CDMA2000 network is not reserved the visual telephone bandwidth of 64Kbits at the beginning of design in circuit domain, make CDMA2000 that the service of video telephone can only be provided by packet domain, because the data volume of video telephone audio & video is very large, use H.263 Profile 0 Level 10, AMR etc. carry out the coding of audio frequency and video, the network bandwidth takies still very large, affect the popularization of other data increment application, and the current network architecture of upgrading will consider Cost Problems, therefore telecom operators do not provide the service of the visual telephone under 3G network at present.GSM is the network of 2G, and up-downgoing transmission rate is inadequate, so can not support visual telephone.Therefore the dual-mode handset of CDMA+GSM can not support to dial visual telephone at present.
Summary of the invention
Technical problem to be solved by this invention is, a kind of video conversation apparatus and method are provided, solution in TD-SCDMA or WCDMA due to the restriction of circuit domain bandwidth, image transmitting adopts high compression ratio and low rate, cause the less-than-ideal problem of image effect, and multi-module mobile terminal of the prior art cannot be in the situation that the current network architecture of not upgrading be supported the problem of visual telephone.
In order to address the above problem, the invention provides a kind of video conversation apparatus, comprise audio frequency input/output module, video input output module, audio coding decoding module, coding and decoding video module, audio-visual synchronization module, first network module and second network module; Described audio frequency input/output module is connected with described audio coding decoding module, for gathering the voice data of exhalation, and the voice data collecting is sent to described audio coding decoding module encodes, and export decoded voice data; Described video input output module is connected with described coding and decoding video module, for gathering the video data of exhalation, and the video data collecting is sent to described coding and decoding video module encodes, and export decoded video data; Described audio-visual synchronization module is connected with described audio coding decoding module, coding and decoding video module respectively, for voice data and video data after synchronous coding of breathing out, and the front voice data of the decoding of synchronous incoming call and video data; Described first network module is connected with described audio coding decoding module, by the circuit domain receiving and sending audio data of first network; Described second network module is connected with described coding and decoding video module, by the packet domain transmitting-receiving video data of second network.
Described audio-visual synchronization module further comprises a delay cell and a timestamp loading unit being connected with described delay cell, described delay cell postpones for the voice data to after coding, and described timestamp loading unit loads upper timestamp for the voice data to after coding and video data.
Described audio-visual synchronization module further comprises a timestamp comparing unit, the timestamp of voice data and the timestamp of video data of described timestamp comparing unit for relatively receiving, described audio-visual synchronization module is synchronous according to comparative result control voice data and video data.
Further, described first network module is for supporting the communication network module of audio data transmission, and described second network module is for supporting the communication network module of video data transmitting.
In order to address the above problem, the present invention also provides a kind of video calling call-out method, adopt video conversation apparatus of the present invention, comprise the steps: that (1) adopts described audio frequency input/output module and video input output module to gather respectively voice data and video data; (2) adopt described audio coding decoding module and coding and decoding video module respectively to the voice data collecting and coding video data; (3) adopt voice data and the video data after described audio-visual synchronization module synchronous coding; (4) voice data after synchronous is breathed out by the circuit domain of first network, and the video data after simultaneously synchronous is breathed out by the packet domain of second network.
Described audio-visual synchronization module further comprises a delay cell and a timestamp loading unit being connected with described delay cell, step (3) further comprises: (31) postpone the voice data after encoding by described delay cell, carry out afterwards step (32); (32) by described timestamp loading unit for coding after voice data and video data load respectively upper timestamp.
In order to address the above problem, the present invention also provides a kind of video calling incoming call method, adopt video conversation apparatus of the present invention, comprise the steps: that (1) adopts the circuit domain of first network to receive the voice data of incoming call, and adopt the video data of the packet domain reception incoming call of second network; (2) voice data and the video data that adopt described audio-visual synchronization module synchronously to receive; (3) adopt described audio coding decoding module respectively the voice data after synchronizeing and video data to be decoded with coding and decoding video module; (4) adopt described video input output module to show decoded video data, adopt described audio frequency input/output module to export decoded voice data simultaneously.
Described audio-visual synchronization module further comprises a timestamp comparing unit, and step (2) further comprises: the timestamp of the voice data relatively receiving and the timestamp of video data, and synchronous according to comparative result control voice data and video data.
The timestamp of the described audio, video data relatively receiving further adopts: set a time of delay, if the video time stamp of the video data of getting is compared with audio time stamp, postpone to surpass the time of delay setting, abandon this frame video data, again get new video data, carry out the timestamp comparison of audio, video data.
The invention has the advantages that, synchronous by audio, video data provided by the invention and be divided into different Internet Transmissions method and apparatus, support voice and the conforming video calling of image on multi-module mobile terminal have been realized, and because voice data passes through respectively different Internet Transmissions from video data, solved in TD-SCDMA or WCDMA the restriction due to circuit domain bandwidth, image transmitting adopts high compression ratio and low rate, causes the less-than-ideal problem of image effect.
Accompanying drawing explanation
Accompanying drawing 1, adopts the mobile terminal of TD-SCDMA or WCDMA that the flow chart of video call service is provided at present;
Accompanying drawing 2, the configuration diagram of video conversation apparatus one embodiment of the present invention;
Accompanying drawing 3, adopts video conversation apparatus of the present invention to carry out the schematic flow sheet of video calling call-out method;
Accompanying drawing 4, adopts video conversation apparatus of the present invention to carry out the schematic flow sheet of video calling incoming call method;
Accompanying drawing 5 to 7 is schematic flow sheets of three embodiment of video call method of the present invention.
Embodiment
Below in conjunction with accompanying drawing, the embodiment of video conversation apparatus provided by the invention and method is elaborated.In the present invention, video call method comprises call-out method and the incoming call method of video calling; Mobile terminal can be for supporting the how logical multi mode terminal of many nets.
Referring to accompanying drawing 2, the configuration diagram of video conversation apparatus one embodiment of the present invention, comprises audio frequency input/output module M21, audio coding decoding module M22, video input output module M23, coding and decoding video module M24, audio-visual synchronization module M25, first network module M26 and second network module M27.
Described audio frequency input/output module M21 is connected with described audio coding decoding module M22, for gathering the voice data of exhalation, and the voice data collecting is sent to described audio coding decoding module M22 encodes, and output is through the decoded voice data of audio coding decoding module M22.Described audio frequency input/output module M21 can comprise microphone and loud speaker.Communication of mobile terminal both sides video calling link channel enters the video calling stage after setting up; The voice data that described audio frequency input/output module M21 breathes out by microphone collection, and the voice data collecting is sent to described audio coding decoding module M22; And export decoded voice data by loud speaker.
Described audio coding decoding module M22 is further connected with described audio-visual synchronization module M25.The voice data that described audio coding decoding module M22 collects described audio frequency input/output module M21 is encoded, and the voice data after coding is sent to described audio-visual synchronization module M25; Described audio coding decoding module M22 is also for the voice data after described audio-visual synchronization module M25 is synchronous is decoded, and exports decoded voice data by described audio frequency input/output module M21.To the encoding and decoding of voice data, can adopt code encoding/decoding mode of the prior art, for example, adopt AMR encoding and decoding.
Described video input output module M23 is connected with described coding and decoding video module M24, for gathering the video data of exhalation, and the video data collecting is sent to described coding and decoding video module M24 encodes, and output is through the decoded video data of coding and decoding video module M24.Described video input output module comprises camera and display screen.Communication of mobile terminal both sides video calling link channel enters the video calling stage after setting up; The video data that described video input output module M23 breathes out by camera collection, and the video data collecting is sent to described coding and decoding video module M24; And show decoded video data by display screen.
Described coding and decoding video module M24 is further connected with described audio-visual synchronization module M25.The coding video data that described coding and decoding video module M24 collects described video input output module M23, and the video data after coding is sent to described audio-visual synchronization module M25; Described coding and decoding video module M24 is also for the video data after described audio-visual synchronization module M25 is synchronous is decoded, and exports decoded video data by described coding and decoding video module M24.To the encoding and decoding of video data, can adopt code encoding/decoding mode of the prior art, for example, adopt H.263 encoding and decoding.
Described audio-visual synchronization module M25 is connected with described audio coding decoding module M22, coding and decoding video module M24 respectively, for voice data and video data after synchronous coding of breathing out, and the front voice data of the decoding of synchronous incoming call and video data.
For voice data and video data after coding, because the coding rate of voice data is very fast, therefore, in order to guarantee voice that transmitting terminal sends and the time consistency of view data, need to voice data and the video data after coding be carried out synchronously.Described audio-visual synchronization module M25 can comprise the timestamp loading unit M252 that a delay cell M251 is connected with described delay cell with in the present embodiment; Described delay cell M251 postpones for the voice data to after coding, by described delay cell M251, voice data is done to an encoding time delay; By the time after video data encoding completes, more respectively the voice data after coding and video data are loaded to upper timestamp by described timestamp loading unit M252; Thereby the voice data after synchronous coding and video data, then by eating dishes without rice or wine to send to together callee.
For voice data and the video data of incoming call, will be according to relatively speed, flow etc. being controlled of timestamp before decoding, guarantee to carry out again output function after image and sound consistent.Described audio-visual synchronization module M25 can comprise a timestamp comparing unit M253 in the present embodiment, the timestamp of voice data and the timestamp of video data of described timestamp comparing unit M253 for relatively receiving, described audio-visual synchronization module is synchronous according to comparative result control voice data and video data.The Voice & Video data that receive for first network module M26 and second network module M27, before decoding not by the timestamp of described timestamp comparing unit M253 comparing audio data and the timestamp of video data; In general the accuracy rate of audio transmission is higher, so can using audio time stamp as with reference to the time, and set time of delay, for example setting time of delay is 1s, if the video time stamp of the video data of getting is compared with audio time stamp, postpones to surpass 1s, abandon this frame video data, again get new video data, carry out the timestamp comparison of audio, video data, so just can guarantee the consistency of voice and image.
Described first network module M26 is connected with described audio coding decoding module M22, by the circuit domain receiving and sending audio data of first network.Described second network module M27 is connected with described coding and decoding video module M24, by the packet domain transmitting-receiving video data of second network.
For audio, video data, send, in order to support video calling on mobile terminal, the voice data after synchronous can be sent to the circuit domain of first network by the base band of first network module M26, by the circuit domain of first network, breathe out; And the video data after synchronous is sent to the packet domain of second network by the base band of second network module M47, by the packet domain of second network, breathe out.For audio, video data, receive, described first network module M26 receives the voice data of incoming call by its circuit domain; Described second network module M27 receives the video data of incoming call by its packet domain.Be that voice data transmits by first network, concurrent video data transmits by second network, thereby guarantees speech quality.Described first network module M26 is for supporting the communication network module of audio data transmission, described second network module M27 is for supporting the communication network module of video data transmitting, and the base band of the base band of first network module M26 and second network module M27 can be concurrent.For example described first network module M26 is GSM mixed-media network modules mixed-media, and described second network module M27 is CDMA2000, WCDMA or TD-SCDMA mixed-media network modules mixed-media.
With reference to accompanying drawing 3, adopt video conversation apparatus of the present invention to carry out the schematic flow sheet of video calling call-out method, comprise the steps.
S31: adopt described audio frequency input/output module and video input output module to gather respectively voice data and video data.
First mobile terminal is initiated video calling connection to callee and is set up request, and obtains after callee's response, and both sides' video calling link channel has just been set up, and enters afterwards the video calling stage.Described audio frequency input/output module M21 gathers the voice data of breathing out, and described video input output module M22 gathers the video data of breathing out; For example can pass through video input output module M22 camera collection video data; Microphone by audio frequency input/output module M21 gathers voice data.
S32: adopt described audio coding decoding module and coding and decoding video module respectively the Voice & Video data that collect to be encoded.
Audio, video data is encoded and can be adopted coded system of the prior art, for example, voice data is adopted to AMR coding, video data is adopted to H.263 coding.
S33: adopt voice data and video data after described audio-visual synchronization module synchronous coding.
Because the coding rate of voice data is very fast, therefore in order to guarantee that transmitting terminal sends the time consistency of voice and view data, voice data will be done an encoding time delay, by the time video data encoding is stamped timestamp again after completing, thereby the voice data after synchronous coding and video data, then by eating dishes without rice or wine to send to together callee.
In the present embodiment, described audio-visual synchronization module M25 can comprise the timestamp loading unit M252 that a delay cell M251 is connected with described delay cell with, step S33 further realizes by following steps: S331: by described delay cell M251, the voice data after encoding is postponed, carry out afterwards step S332; S332: by described timestamp loading unit M252 for coding after voice data and video data load respectively upper timestamp.Thereby the voice data after synchronous coding and video data, then by eating dishes without rice or wine to send to together callee.
S34: the voice data after synchronous is breathed out by the circuit domain of first network, the video data after the while is synchronous is breathed out by the packet domain of second network.
First, video call data comprises audio & video data, in order to support video calling on mobile terminal, voice data after synchronous can be sent to first network circuit domain at the base band by first network, the video data after simultaneously synchronous sends to second network packet domain by the base band of second network; Be that voice data transmits by first network, concurrent video data transmits by second network, thereby guarantees speech quality.Wherein said first network is for supporting the communication network of audio data transmission, and described second network is for supporting the communication network of video data transmitting; And the base band of first network and the base band of second network can be concurrent.For example described first network is GSM network, and described second network is CDMA2000, WCDMA or TD-SCDMA network.
With reference to accompanying drawing 4, adopt video conversation apparatus of the present invention to carry out the schematic flow sheet of video calling incoming call method, comprise the steps.
S41: adopt the circuit domain of first network to receive the voice data of incoming call, and adopt the video data of the packet domain reception incoming call of second network.
First calling party initiates video calling connection to mobile terminal and sets up request, and obtains after the response of mobile terminal, and both sides' video calling link channel has just been set up, and enters afterwards the video calling stage.First network circuit domain receives the voice data of incoming call, and second network packet domain receives the video data of incoming call, and voice data transmits by first network, and concurrent video data transmits by second network.
S42: the voice data and the video data that adopt described audio-visual synchronization module synchronously to receive.
For voice data and the video data of incoming call, will be according to relatively speed, flow etc. being controlled of timestamp before decoding, guarantee to carry out again output function after image and sound consistent.Described audio-visual synchronization module M25 can comprise a timestamp comparing unit M253 in the present embodiment, step S42 can adopt: the timestamp of the voice data relatively receiving and the timestamp of video data, and synchronous according to comparative result control voice data and video data.The timestamp of the described audio, video data relatively receiving further adopts: set a time of delay, if the video time stamp of the video data of getting is compared with audio time stamp, postpone to surpass the time of delay setting, abandon this frame video data, again get new video data, carry out the timestamp comparison of audio, video data.In general the accuracy rate of audio transmission is higher, so can using audio time stamp as with reference to the time, and set time of delay, for example setting time of delay is 1s, if the video time stamp of the video data of getting is compared with audio time stamp, postpones to surpass 1s, abandon this frame video data, again get new video data, carry out the timestamp comparison of audio, video data, so just can guarantee the consistency of voice and image.
S43: adopt described audio coding decoding module with coding and decoding video module respectively to the Voice & Video decoding data after synchronizeing.
Audio, video data is decoded and can be adopted decoding process of the prior art, for example, voice data is adopted to AMR decoding, video data is adopted to H.263 decoding.
S44: adopt described video input output module to show decoded video data, adopt described audio frequency input/output module to export decoded voice data simultaneously.
Due to voice data and the video data of incoming call, before decoding, according to relatively having realized speed, flow etc. is controlled of timestamp, guarantee the consistent of image and sound.Therefore the video data after can showing synchronously by the display screen of video input output module M22; Simultaneously by the voice data after the microphone output synchronously of audio frequency input/output module M21; Realize the conforming video calling of voice and image.
Synchronous by audio, video data provided by the invention and be divided into different Internet Transmissions method, support voice and the conforming video calling of image on multi-module mobile terminal have been realized, and because voice data passes through respectively different Internet Transmissions from video data, solved in TD-SCDMA or WCDMA the restriction due to circuit domain bandwidth, image transmitting adopts high compression ratio and low rate, causes the less-than-ideal problem of image effect.
Next provide a plurality of embodiment of technique scheme, for make the description of following examples simple and clear only take realize video calling on CDMA+GSM dual-mode handset and be described as example, other multi-module mobile terminal is identical with the principle of CDMA+GSM dual-mode handset.
First, visual telephone data comprise voice and video data, in order to realize video calling on CDMA+GSM dual-mode handset, consider to guarantee the voice quality of call, can be speech data by 2G Internet Transmission.Because the EVDO of GSM and CDMA is that they can be concurrent in two base band; So by the lower GSM lattice network territory voice-bearer data of speed, by the higher EVDO network packet territory of speed carrying video data, then mark a public region of memory on CDMA, speech data and video data gathered together and carry out encoding and decoding.When calling, after the encoded speech data that the video data of camera collection and microphone are received, be put in internal memory shared pool, by CDMA and GSM network, transmit respectively, CDMA guarantees to reside on EVDO network simultaneously, makes the CDMA2000 user of 3G and the GSM user of 2G experience significant function-video calling of 3G.
The Chinese implication that the english abbreviation that adopts in following examples is corresponding is as follows:
H.245 control protocol for multimedia communication;
H.225 call out and control and data packing agreement;
H.223 data-reusing agreement;
V.34 multiplexing code stream translation-protocol;
CCSRL control channel IP fragmentation and reassembly agreement;
NSRP data packet retransmission agreement;
WNSRP transmission control protocol;
RTP RTP;
RTCP RTP control protocol;
AL is adaptation layer H.223;
MUX is multiplex layer H.223;
AMR adaptive multi-rate coding;
BaseBand base band;
H.263 video protocols;
MUX-PDU multiplex layer becomes data module by adaptation layer data-reusing.
Embodiment mono-
Fig. 5 is the flow chart of the present embodiment.The present embodiment is on CDMA+GSM dual-mode handset, to realize the flow process of video calling, and wherein this dual-mode handset is two bilaterals for the treatment of.
Data plane is that video data passes through EVDO Internet Transmission, first the data of camera collection are used and are H.263 encoded, data after coding are used and are H.225 packaged into RTP packet, the carrying of RTP is tcp/ip layer, and the voice data that microphone collects adopts the AMR coding of 12.2kbps speed to send to the H.223 Level2 of adaptation layer.Voice data sends to the circuit domain of GSM by base band, video data sends to the packet domain of EVDO by base band.
Control plane be by H.225.0 layer by H.263 coding after Video stream format change into message, be wherein Q.931 responsible for calling out control.H.245 here except capabilities exchange, outside the functions such as principal and subordinate's decision, also to be responsible for setting up RTP passage.
Below a C+G video calling flow process:
1) H.225 to visual telephone recipient, initiate SET UP request, send and connect the signal of setting up.
2) Call Proceeding, calling party waits for the foundation of Call passage, illustrates that signal is processed by destination.
3) visual telephone recipient starts ring, and Alerting tells calling party's destination node just in ring.
4) if callee gives the response of Accept, both sides' link channel is set up, Connected.
5) start H.245 session.Conversation procedure comprises: a) use the method H.245 providing, by No. 0 control logic channel, the ability of exchange termination; B) use the method H.245 providing, determine calling party and callee; C) set up the logic channel of audio frequency and RTP; D) send H.223 multi-meter, multi-meter has defined various types of adaptation layer data putting in order and length in MUX-PDU; E) start audio/video encoding/decoding; F) video data after coding transmits by being H.225 packaged into RTP, is H.225 also responsible for assembling RTCP bag and carries out statistic of attribute; G) voice data after coding is submitted to the H.223 Level2 of adaptation layer, and is multiplexed into a stream by multiplex layer H.223.
6) carry out video calling.In the process of call, can change channel attribute, ability etc. by the method for H.245 definition.If there is the voice incoming call of outside 1x simultaneously, need to send out Reject message, refusing incoming call because voice incoming call can handover network to 1x pattern, due to cdma network support voice and data service concurrent not, can cause like this transmission termination of video data.If user does data service and can allow on EVDO network.
7) H.245 finish call, stop transmitting voice and video information, by H.245 closing voice and mathematical logic passage, and to opposite end transmission EndSession message, indication opposite end stops call, then caller stops transmitting H.245 message, receives after the response of EndSession of opposite end closing control logical channel.
8) be H.225 responsible for disconnecting link passage, discharge interface-free resources.
Embodiment bis-
Fig. 6 is the flow chart of the present embodiment.The present embodiment is that CDMA+GSM dual-mode handset is realized voice and view data synchronizing process as calling party, and calling party is for the speech data of AMR coding and the view data synchronizing process of H.263 encoding.Therefore because the coding rate of voice data is very fast, in order to guarantee that transmitting terminal sends the time consistency of voice and view data, Audio will do an encoding time delay, stamps timestamp after waiting Video to encode again, and sends to together callee by eating dishes without rice or wine.
Embodiment tri-
Fig. 7 is the flow chart of the present embodiment.The present embodiment is that CDMA+GSM dual-mode handset is realized voice and view data synchronizing process as callee, and callee is for the speech data of AMR coding and the view data synchronizing process of H.263 encoding.Callee receives after the voice and view data that GSM and EVDO transmit respectively, before decoding, first do not comparing both timestamps, in general the accuracy rate of audio transmission is higher, so we using audio time stamp as with reference to the time, if video time stamp is compared with audio time stamp, postpone to surpass 1s, abandon frame of video, again get new video data, so just can guarantee the consistency of voice and image.
The above is only the preferred embodiment of the present invention; it should be pointed out that for those skilled in the art, under the premise without departing from the principles of the invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (9)

1. a video conversation apparatus, is characterized in that, comprises audio frequency input/output module, video input output module, audio coding decoding module, coding and decoding video module, audio-visual synchronization module, first network module and second network module; Described audio frequency input/output module is connected with described audio coding decoding module, for gathering the voice data of exhalation, and the voice data collecting is sent to described audio coding decoding module encodes, and export decoded voice data; Described video input output module is connected with described coding and decoding video module, for gathering the video data of exhalation, and the video data collecting is sent to described coding and decoding video module encodes, and export decoded video data; Described audio-visual synchronization module is connected with described audio coding decoding module, coding and decoding video module respectively, for voice data and video data after synchronous coding of breathing out, and the front voice data of the decoding of synchronous incoming call and video data; Described first network module is connected with described audio coding decoding module, by the circuit domain receiving and sending audio data of first network; Described second network module is connected with described coding and decoding video module, by the packet domain transmitting-receiving video data of second network.
2. video conversation apparatus according to claim 1, it is characterized in that, described audio-visual synchronization module further comprises a delay cell and a timestamp loading unit being connected with described delay cell, described delay cell postpones for the voice data to after coding, and described timestamp loading unit loads upper timestamp for the voice data to after coding and video data.
3. video conversation apparatus according to claim 1, it is characterized in that, described audio-visual synchronization module further comprises a timestamp comparing unit, the timestamp of voice data and the timestamp of video data of described timestamp comparing unit for relatively receiving, described audio-visual synchronization module is synchronous according to comparative result control voice data and video data.
4. video conversation apparatus according to claim 1, is characterized in that, described first network module is for supporting the communication network module of audio data transmission, and described second network module is for supporting the communication network module of video data transmitting.
5. a video calling call-out method, adopts video conversation apparatus described in claim 1, it is characterized in that, comprises the steps: that (1) adopts described audio frequency input/output module and video input output module to gather respectively voice data and video data; (2) adopt described audio coding decoding module and coding and decoding video module respectively to the voice data collecting and coding video data; (3) adopt voice data and the video data after described audio-visual synchronization module synchronous coding; (4) voice data after synchronous is breathed out by the circuit domain of first network, and the video data after simultaneously synchronous is breathed out by the packet domain of second network.
6. video calling call-out method according to claim 5, it is characterized in that, described audio-visual synchronization module further comprises a delay cell and a timestamp loading unit being connected with described delay cell, step (3) further comprises: (31) postpone the voice data after encoding by described delay cell, carry out afterwards step (32); (32) by described timestamp loading unit for coding after voice data and video data load respectively upper timestamp.
7. a video calling incoming call method, video conversation apparatus described in employing claim 1, it is characterized in that, comprise the steps: that (1) adopts the circuit domain of first network to receive the voice data of incoming call, and adopt the video data of the packet domain reception incoming call of second network; (2) voice data and the video data that adopt described audio-visual synchronization module synchronously to receive; (3) adopt described audio coding decoding module respectively the voice data after synchronizeing and video data to be decoded with coding and decoding video module; (4) adopt described video input output module to show decoded video data, adopt described audio frequency input/output module to export decoded voice data simultaneously.
8. video calling incoming call method according to claim 7, it is characterized in that, described audio-visual synchronization module further comprises a timestamp comparing unit, step (2) further comprises: the timestamp of the voice data relatively receiving and the timestamp of video data, and synchronous according to comparative result control voice data and video data.
9. video calling incoming call method according to claim 8, it is characterized in that, the timestamp of the described audio, video data relatively receiving further adopts: set a time of delay, if the video time stamp of the video data of getting is compared with audio time stamp, postpone to surpass the time of delay setting, abandon this frame video data, again get new video data, carry out the timestamp comparison of audio, video data.
CN201210269514.8A 2012-08-01 2012-08-01 Video call device and method Pending CN103581604A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210269514.8A CN103581604A (en) 2012-08-01 2012-08-01 Video call device and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210269514.8A CN103581604A (en) 2012-08-01 2012-08-01 Video call device and method

Publications (1)

Publication Number Publication Date
CN103581604A true CN103581604A (en) 2014-02-12

Family

ID=50052404

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210269514.8A Pending CN103581604A (en) 2012-08-01 2012-08-01 Video call device and method

Country Status (1)

Country Link
CN (1) CN103581604A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103841584A (en) * 2014-02-27 2014-06-04 汉柏科技有限公司 Data transmission method and device
WO2016206165A1 (en) * 2015-06-24 2016-12-29 中兴通讯股份有限公司 Method and device for increasing data transmission rate in video call
CN107872733A (en) * 2016-09-26 2018-04-03 中兴通讯股份有限公司 The video call method and device and server of voice and video shunting transmission
CN109040138A (en) * 2018-10-11 2018-12-18 南方电网科学研究院有限责任公司 A kind of encryption Science Report information sharing system
CN111491127A (en) * 2020-04-21 2020-08-04 新石器慧通(北京)科技有限公司 Video call method and system based on unmanned vehicle remote driving
CN112073789A (en) * 2019-06-10 2020-12-11 海信视像科技股份有限公司 Sound processing method and display device

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1745526A (en) * 2002-04-19 2006-03-08 汤姆森许可公司 Apparatus and method for synchronization of audio and video streams.
CN1791201A (en) * 2004-12-15 2006-06-21 乐金电子(昆山)电脑有限公司 High power and timbre mobile image reproduction device and reproduction method
US20070019739A1 (en) * 2005-07-19 2007-01-25 Nec Viewtechnology, Ltd. Video and audio reproducing apparatus and video and audio reproducing method for reproducing video images and sound based on video and audio streams
CN101047842A (en) * 2006-03-27 2007-10-03 中兴通讯股份有限公司 Method for playing flow media audio-vedio on radio binary operation environment mobile phone
CN101271720A (en) * 2008-04-22 2008-09-24 中兴通讯股份有限公司 Synchronization process for mobile phone stream media audio and video
CN101383940A (en) * 2008-10-10 2009-03-11 中兴通讯股份有限公司 Visible telephone implementing method and device
CN102143349A (en) * 2011-03-22 2011-08-03 宋健 Video session implementation method based on audio/video signal transmission through independent channels respectively
CN102368761A (en) * 2011-03-15 2012-03-07 陈建国 Time division-synchronous code division multiple access (TD-SCDMA)/ IP double network cooperation multimedia telephone, system and communication method thereof

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1745526A (en) * 2002-04-19 2006-03-08 汤姆森许可公司 Apparatus and method for synchronization of audio and video streams.
CN1791201A (en) * 2004-12-15 2006-06-21 乐金电子(昆山)电脑有限公司 High power and timbre mobile image reproduction device and reproduction method
US20070019739A1 (en) * 2005-07-19 2007-01-25 Nec Viewtechnology, Ltd. Video and audio reproducing apparatus and video and audio reproducing method for reproducing video images and sound based on video and audio streams
CN101047842A (en) * 2006-03-27 2007-10-03 中兴通讯股份有限公司 Method for playing flow media audio-vedio on radio binary operation environment mobile phone
CN101271720A (en) * 2008-04-22 2008-09-24 中兴通讯股份有限公司 Synchronization process for mobile phone stream media audio and video
CN101383940A (en) * 2008-10-10 2009-03-11 中兴通讯股份有限公司 Visible telephone implementing method and device
CN102368761A (en) * 2011-03-15 2012-03-07 陈建国 Time division-synchronous code division multiple access (TD-SCDMA)/ IP double network cooperation multimedia telephone, system and communication method thereof
CN102143349A (en) * 2011-03-22 2011-08-03 宋健 Video session implementation method based on audio/video signal transmission through independent channels respectively

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103841584A (en) * 2014-02-27 2014-06-04 汉柏科技有限公司 Data transmission method and device
WO2016206165A1 (en) * 2015-06-24 2016-12-29 中兴通讯股份有限公司 Method and device for increasing data transmission rate in video call
CN106331565A (en) * 2015-06-24 2017-01-11 中兴通讯股份有限公司 Method and device for improving data transmission speed of video call
CN107872733A (en) * 2016-09-26 2018-04-03 中兴通讯股份有限公司 The video call method and device and server of voice and video shunting transmission
CN109040138A (en) * 2018-10-11 2018-12-18 南方电网科学研究院有限责任公司 A kind of encryption Science Report information sharing system
CN112073789A (en) * 2019-06-10 2020-12-11 海信视像科技股份有限公司 Sound processing method and display device
WO2020248699A1 (en) * 2019-06-10 2020-12-17 海信视像科技股份有限公司 Sound processing method and display apparatus
CN111491127A (en) * 2020-04-21 2020-08-04 新石器慧通(北京)科技有限公司 Video call method and system based on unmanned vehicle remote driving
CN111491127B (en) * 2020-04-21 2021-06-29 新石器慧通(北京)科技有限公司 Video call method and system based on unmanned vehicle remote driving

Similar Documents

Publication Publication Date Title
CN103581604A (en) Video call device and method
CN101583009B (en) Video terminal and method thereof for realizing interface content sharing
US20050208962A1 (en) Mobile phone, multimedia chatting system and method thereof
CN101931781B (en) The method of display of calling video and wireless communication terminal
CN100479517C (en) Method for superposing voice in transmitting audio-video file
CN101778485A (en) Method and device for building circuit switch link of WiFi handheld equipment
US20080279177A1 (en) Conjoined Telephony Communication System
WO2012097531A1 (en) Video communication method and terminal
US8451311B2 (en) Method and system for video telephone communications set up, related equipment and computer program product
WO2010139211A1 (en) Video telephone, method for dialing and answering video telephone and device thereof
CN1941819B (en) Method and system for transmitting speech service in Ethernet
EP2317747B1 (en) A realizing method and apparatus for video phone
CN101998102A (en) Method for controlling video quality of mobile video telephone and mobile video telephone
EP1670248A1 (en) Low bit rate video transmission over GSM network
KR100780801B1 (en) Call setup control system of potable device and control method thereof
CN102447874A (en) Video scheduling system and method
CN100362864C (en) Visual telephone system of network based on single-chip
CN101197892B (en) Fax system and method in mobile communications network
US20100002068A1 (en) Communication terminal and method for performing video telephony
KR200265603Y1 (en) The CTI conference system using technology of separate transmission of multi-media
CN100366078C (en) Videophone and method for increasing speech quality using it
CN203225839U (en) Video communication device applied to PSTN
CN101394351B (en) Method and device for controlling audio and video data stream based mobile network visual telephone
CN100421467C (en) Videophone video stream data receiving handling method based on mobile communication network
KR100835772B1 (en) Method for converting vocoder of mobile communication terminal and method for offering concurrent multi service using the same

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140212