CN103152544A - Video call quality optimization method and system thereof - Google Patents

Video call quality optimization method and system thereof

Info

Publication number
CN103152544A
Authority
CN
China
Prior art keywords
key frame
threshold value
video calling
receiving terminal
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013100350566A
Other languages
Chinese (zh)
Other versions
CN103152544B (en)
Inventor
刘灵新
李静
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Dongguan Yulong Telecommunication Technology Co Ltd
Original Assignee
Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Dongguan Yulong Telecommunication Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yulong Computer Telecommunication Scientific Shenzhen Co Ltd and Dongguan Yulong Telecommunication Technology Co Ltd
Priority to CN201310035056.6A
Publication of CN103152544A
Application granted
Publication of CN103152544B
Legal status: Active
Anticipated expiration

Abstract

The invention belongs to the technical field of communications and provides a method and a system for optimizing video call quality. The method comprises the following steps: detecting whether a key frame has been lost in a video call; and controlling the sending of key frames according to the current network condition of the video call. The quality of the video call is thereby optimized.

Description

Method and system for optimizing video call quality
Technical field
The present invention relates to the field of communication technology, and in particular to a method and system for optimizing video call quality.
Background
A videophone based on SIP (Session Initiation Protocol), an application-layer signaling control protocol, carries video and audio with RTP (Real-time Transport Protocol), which transmits the audio and video data over UDP (User Datagram Protocol). RTP's companion protocol, RTCP (Real-time Transport Control Protocol), provides quality-of-service monitoring and feedback, synchronization between media, and identification of members in a multicast group. Because UDP is an unreliable transport, packets may be lost when network conditions are poor. If video packets are lost, image defects such as mosaic artifacts appear. If the lost data is a key frame, the poor quality persists until the next key frame is received. The prior art takes remedial measures when UDP packet loss is detected, but it does not consider how the network condition of the video call affects the sending of key frames. Video call quality suffers as a result.
In summary, existing video call technology clearly has inconveniences and defects in actual use, so improvement is needed.
Summary of the invention
In view of the above defects, the object of the present invention is to provide a method and system for optimizing video call quality.
To achieve this object, the invention provides a method for optimizing video call quality, comprising the following steps:
Detecting whether a key frame has been lost in a video call;
Controlling the sending of key frames according to the current network condition of the video call.
According to the method, the following steps precede the step of detecting whether a key frame has been lost in the video call:
Presetting a threshold for the number of data packets received on the video receiving RTCP (Real-time Transport Control Protocol) port and/or a threshold for the reception time;
Presetting a threshold for the network transmission rate at which key frames are sent in the video call;
The step of detecting whether a key frame has been lost in the video call comprises:
Both parties to the video call judging, according to the threshold for the number of received data packets and/or the threshold for the reception time, whether data packets have been lost;
If data packets are judged to have been lost, detecting whether a key frame has been lost in the video call.
According to the method, the step of controlling the sending of key frames according to the current network condition of the video call comprises:
Measuring the network transmission rate of the current video call;
When the network transmission rate reaches the rate threshold, the video encoder generating a key frame and the key frame being sent to the receiving end of the video call; or
When the network transmission rate does not reach the rate threshold, stopping the sending of key frames and sending a first control frame to the receiving end, requesting the receiving end to stop sending key frames;
The receiving end stopping the sending of key frames after receiving the first control frame.
According to the method, the step of controlling the sending of key frames according to the current network condition of the video call further comprises:
When the measured network transmission rate of the current video call reaches the rate threshold again, the video encoder generating a key frame and the key frame being sent to the receiving end of the video call; sending a second control frame to the receiving end, requesting the receiving end to resume sending key frames;
The receiving end resuming the sending of key frames after receiving the second control frame.
According to the method, the step in which both parties to the video call judge, according to the threshold for the number of received data packets and/or the threshold for the reception time, whether data packets have been lost comprises:
The receiving end of the video call judging whether the number of data packets received reaches the threshold for the number of received data packets;
If the number does not reach the threshold, judging that data packets have been lost and sending a receiver report to the sending end of the video call;
If the number reaches the threshold, judging whether the time taken to receive the data packets reaches the threshold for the reception time; if it does not, judging that data packets have been lost and sending a receiver report to the sending end of the video call; and/or
The sending end of the video call receiving the receiver report and judging, from this receiver report and the previously received receiver report, whether the packet loss exceeds the threshold for the number of received data packets; if so, judging that data packets have been lost.
To achieve another object of the invention, the present invention also provides a system for optimizing video call quality, comprising:
A detection module, for detecting whether a key frame has been lost in a video call;
A control module, for controlling the sending of key frames according to the current network condition of the video call.
According to the system, the system further comprises:
A presetting module, for presetting a threshold for the number of data packets received on the video receiving RTCP port and/or a threshold for the reception time, and a threshold for the network transmission rate at which key frames are sent in the video call;
The detection module comprises:
A judging submodule, for judging, according to the threshold for the number of received data packets and/or the threshold for the reception time, whether data packets have been lost;
If data packets are judged to have been lost, the detection module detects whether a key frame has been lost in the video call.
According to the system, the control module comprises:
A statistics submodule, for measuring the network transmission rate of the current video call;
A first control submodule, for controlling the video encoder to generate a key frame when the network transmission rate reaches the rate threshold, and sending the key frame to the receiving end of the video call; or
A second control submodule, for controlling the sending end of the video call to stop sending key frames when the network transmission rate does not reach the rate threshold, and sending a first control frame to the receiving end, requesting the receiving end to stop sending key frames;
The receiving end stops sending key frames after receiving the first control frame.
According to the system, the control module further comprises:
A third control submodule, for controlling the video encoder to generate a key frame when the network transmission rate measured by the statistics submodule reaches the rate threshold again, and sending the key frame to the receiving end of the video call; and
Sending a second control frame to the receiving end, requesting the receiving end to resume sending key frames;
The receiving end resumes sending key frames after receiving the second control frame.
According to the system, the judging submodule comprises:
A first judging unit, located at the receiving end of the video call, for judging whether the number of data packets received reaches the threshold for the number of received data packets; if it does not, the first judging unit judges that data packets have been lost;
A second judging unit, located at the receiving end of the video call, for judging whether the time taken to receive the data packets reaches the threshold for the reception time; if it does not, the second judging unit judges that data packets have been lost;
A sending unit, for sending a receiver report to the sending end of the video call when the first judging unit and/or the second judging unit judges that data packets have been lost; and/or
A receiving unit, located at the sending end of the video call, for receiving the receiver report;
A third judging unit, for judging, from this receiver report and the previously received receiver report, whether the packet loss exceeds the threshold for the number of received data packets; if so, judging that data packets have been lost.
The invention judges whether data packets have been lost in the video call. If they have, the video encoder generates a key frame and the key frame is sent to the receiving end of the video call. At the same time, key frames carry a large volume of data during video transmission. When video quality is poor, sending key frames frequently increases the terminal's data transmission load, and if the air-interface capacity of the mobile terminal has not yet recovered enough to carry the videophone service smoothly, video quality deteriorates further. By detecting the network condition of the current video call and controlling the sending of key frames accordingly, the invention optimizes video call quality and improves the user experience.
Description of drawings
Fig. 1 is a structural diagram of the video call quality optimization system provided by the first embodiment of the invention;
Fig. 2 is a structural diagram of the video call quality optimization system provided by the second, third and fourth embodiments of the invention;
Fig. 3 is a structural diagram of the video call quality optimization system provided by one embodiment of the invention;
Fig. 4 is a structural diagram of the video call quality optimization system provided by the fifth, sixth and seventh embodiments of the invention;
Fig. 5 is a flowchart of the video call quality optimization method provided by the eighth embodiment of the invention;
Fig. 6A is a flowchart of the video call quality optimization method provided by one embodiment of the invention;
Fig. 6B is a flowchart of the video call quality optimization method provided by one embodiment of the invention.
Embodiments
To make the purpose, technical solution and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. The specific embodiments described here only explain the invention and do not limit it.
Referring to Fig. 1, the first embodiment of the invention provides a system 100 for optimizing video call quality, comprising:
A detection module 10, for detecting whether a key frame has been lost in a video call;
A control module 20, for controlling the sending of key frames according to the current network condition of the video call.
In this embodiment, the detection module 10 first detects whether a key frame has been lost in the video call, because a lost key frame degrades video call quality. If a key frame has been lost, the control module 20 then controls the sending of key frames according to the current network condition of the video call. The reason for this arrangement is that key frames carry a large volume of data throughout video transmission. When video quality and network conditions are both poor, sending key frames frequently increases the terminal's data transmission load. If the air-interface capacity of the mobile terminal has not recovered enough to carry the videophone service smoothly, continuing to send key frames makes video quality worse rather than better. Controlling the sending of key frames according to the actual network condition therefore helps improve video call quality. The key frame is an I frame.
Referring to Fig. 2, in the second embodiment of the invention, the system 100 further comprises:
A presetting module 30, for presetting a threshold for the number of data packets received on the video receiving RTCP port and/or a threshold for the reception time, and a threshold for the network transmission rate at which key frames are sent in the video call;
The detection module 10 comprises:
A judging submodule 11, for judging, according to the threshold for the number of received data packets and/or the threshold for the reception time, whether data packets have been lost;
If data packets are judged to have been lost, the detection module 10 detects whether a key frame has been lost in the video call.
In a SIP-based videophone, audio and video each use a pair of ports to carry data and control information. RTP packets carry only the RTP data; control is provided by the companion protocol RTCP. RTP selects an unused even UDP port number between 1025 and 65535, and RTCP in the same session uses the next odd UDP port number. RTCP has five types of control packet, two of which provide QoS (Quality of Service) feedback: SR (Sender Report) and RR (Receiver Report). The former describes the sending and receiving statistics of the sending end; the latter describes the reception statistics of the receiving end. These statistics include the number of packets sent, the number of bytes sent, the cumulative number of packets lost, the highest sequence number received, and the interarrival jitter. In this embodiment, the presetting module 30 presets the threshold for the number of data packets received on the video receiving RTCP port and/or the threshold for the reception time, as well as the threshold for the network transmission rate at which key frames are sent in the video call. The rate threshold is chosen so that the video call can send audio and video smoothly. For example, the transmission rate of a WIFI wireless network can meet the demands of a video call, and the threshold can be set to 2M/S. A 3G network in particular, because of its relatively high speed, favors the normal progress of a video call. When the judging submodule 11 judges that the number of received data packets has not reached its threshold or that the reception time has reached its threshold, the detection module 10 detects whether a key frame has been lost in the video call.
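The port pairing described above (an unused even UDP port for RTP, the next odd port for RTCP) can be sketched as follows. This is only an illustration: the function name `pick_rtp_rtcp_ports` and the `in_use` set are hypothetical and do not come from the patent, and a real stack would also have to bind the sockets to confirm that the ports are free.

```python
import random


def pick_rtp_rtcp_ports(in_use, low=1025, high=65535, rng=random):
    """Return (rtp_port, rtcp_port): a free even port and the next odd port.

    `in_use` is a hypothetical set of ports already taken on this host.
    """
    # RTP must use an even port; start from the first even number >= low.
    first_even = low if low % 2 == 0 else low + 1
    # Stop before `high` so that rtp + 1 (the RTCP port) stays in range.
    candidates = [
        port
        for port in range(first_even, high, 2)
        if port not in in_use and port + 1 not in in_use
    ]
    if not candidates:
        raise RuntimeError("no free RTP/RTCP port pair")
    rtp_port = rng.choice(candidates)
    return rtp_port, rtp_port + 1
```

Both ports of a pair are checked against `in_use`, so an even port whose odd neighbor is taken is skipped.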
Referring to Fig. 2, in the third embodiment of the invention, the control module 20 comprises:
A statistics submodule 21, for measuring the network transmission rate of the current video call;
A first control submodule 22, for controlling the video encoder 26 to generate a key frame when the network transmission rate reaches the rate threshold, and sending the key frame to the receiving end of the video call; or
A second control submodule 23, for controlling the sending end of the video call to stop sending key frames when the network transmission rate does not reach the rate threshold, and sending a first control frame to the receiving end, requesting the receiving end to stop sending key frames;
The receiving end stops sending key frames after receiving the first control frame.
In this embodiment, the statistics submodule 21 measures the network transmission rate of the video call, and the first control submodule 22 and the second control submodule 23 each compare the measured rate with the preset rate threshold. When the network transmission rate reaches the threshold, the first control submodule 22 controls the video encoder 26 to generate a key frame, and the key frame is sent to the receiving end of the video call. In this way, after data is lost during a video call, a key frame is generated promptly and sent to the receiving end, which improves video call quality. When the network transmission rate does not reach the threshold, the second control submodule 23 promptly controls the sending end of the video call to stop sending key frames and sends a first control frame to the receiving end, requesting the receiving end to stop sending key frames. This reduces the network load so that the network data transmission of the video call can recover as soon as possible.
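The either/or behavior of the first and second control submodules can be summarized in a few lines. The sketch below is an assumption-laden illustration, not the patented implementation: the function name, the action strings, and the reading of the "2M/S" example as 2,000,000 bits per second are all hypothetical.

```python
# Assumption: the "2M/S" example threshold is read as 2,000,000 bits per second.
RATE_THRESHOLD = 2_000_000


def on_key_frame_lost(measured_rate, threshold=RATE_THRESHOLD):
    """Return the actions the sending end takes once a key frame loss is detected.

    The action names are illustrative labels, not identifiers from the patent.
    """
    if measured_rate >= threshold:
        # Network is fast enough: generate a fresh key frame and send it.
        return ["encode_key_frame", "send_key_frame"]
    # Network is too slow: hold back key frames locally and ask the peer
    # (via the first control frame) to stop sending key frames as well.
    return ["stop_key_frames", "send_control_frame_1"]
```

For instance, `on_key_frame_lost(500_000)` takes the second branch and holds key frames back.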
Referring to Fig. 2, in the fourth embodiment of the invention, the control module 20 further comprises:
A third control submodule 24, for controlling the video encoder 26 to generate a key frame when the network transmission rate measured by the statistics submodule 21 reaches the rate threshold again, and sending the key frame to the receiving end of the video call; and
Sending a second control frame to the receiving end, requesting the receiving end to resume sending key frames;
The receiving end resumes sending key frames after receiving the second control frame.
In this embodiment, when the network transmission rate measured by the statistics submodule 21 reaches the rate threshold again, the third control submodule 24 controls the video encoder 26 to generate a key frame and sends it to the receiving end of the video call, which improves video call quality. At the same time, a second control frame is sent to the receiving end, requesting it to resume sending key frames, so that the video call runs smoothly.
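Taken together, the first and second control frames behave like a two-state machine at the sending end: key frames are suspended when the rate drops below the threshold and resumed when it reaches the threshold again. The sketch below is a simplified illustration driven only by rate samples; the class, the state names, and the message labels are hypothetical, and the loss-detection trigger described in the text is left out for brevity.

```python
from enum import Enum


class KeyFrameState(Enum):
    SENDING = "sending"
    SUSPENDED = "suspended"


class KeyFrameController:
    """Hypothetical sending-end controller; all names here are illustrative."""

    def __init__(self, threshold):
        self.threshold = threshold
        self.state = KeyFrameState.SENDING
        self.outbox = []  # messages queued for the receiving end

    def on_rate_sample(self, rate):
        if self.state is KeyFrameState.SENDING and rate < self.threshold:
            # Rate fell below the threshold: suspend key frames on both sides.
            self.state = KeyFrameState.SUSPENDED
            self.outbox.append("CONTROL_FRAME_1")
        elif self.state is KeyFrameState.SUSPENDED and rate >= self.threshold:
            # Rate recovered: send a key frame, then ask the peer to resume.
            self.state = KeyFrameState.SENDING
            self.outbox.append("KEY_FRAME")
            self.outbox.append("CONTROL_FRAME_2")
        return self.state
```

Because a control frame is queued only on a state change, repeated low-rate samples do not flood the peer with stop requests.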
Referring to Fig. 3, one embodiment of the invention provides a structural block diagram of the system 100 for optimizing video call quality, comprising a videophone application 1, a video control unit 2, a video codec unit 3, an audio control unit 4, an audio codec unit 5, and a communication module 6. The video codec unit 3 and the audio codec unit 5 handle the video and audio encoding and decoding in the video call, and the video control unit 2 and the audio control unit 4 control the transmission and the encoding and decoding of the video data and the audio data respectively. The communication module 6 feeds back to the video control unit 2 the current data sending rate that the statistics submodule 21 measures at the physical air-interface layer. This data is updated in the video control unit 2 at a fixed interval during the video call. The video control unit 2 can decide whether to send a key frame according to the current network condition and the number of P frames sent. If network quality is poor, the video control unit 2 also sends a control frame asking the peer to suspend its sending of key frames, which reduces the terminal's network load so that it can recover from congestion as soon as possible.
Referring to Fig. 4, in the fifth embodiment of the invention, the judging submodule 11 comprises:
A first judging unit 111, located at the receiving end of the video call, for judging whether the number of data packets received reaches the threshold for the number of received data packets; if it does not, the first judging unit 111 judges that data packets have been lost;
A second judging unit 112, located at the receiving end of the video call, for judging whether the time taken to receive the data packets reaches the threshold for the reception time; if it does not, the second judging unit 112 judges that data packets have been lost;
A sending unit 113, for sending a receiver report to the sending end of the video call when the first judging unit 111 and/or the second judging unit 112 judges that data packets have been lost.
In this embodiment, the receiving end of the video call can judge packet loss on its own. Specifically, the receiving end uses the first judging unit 111 and the second judging unit 112 to judge, respectively, whether the number of received data packets reaches the threshold for the number of received data packets and whether the time taken to receive the data packets reaches the threshold for the reception time. If the number of received data packets does not reach the threshold, data packets have been lost. The threshold for the number of received data packets is set according to the key frame interval of the video encoder 26. For example, if the video encoder 26 sends one key frame after every 30 P frames, then packet loss is judged when 30 P frames have been received without a key frame. The threshold for the reception time is set according to the interval at which the video encoder 26 generates key frames. For example, if receiving 10 key frames should take 10 seconds but actually takes 15 seconds, data packets have been lost.
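The two worked examples above (one key frame per 30 P frames, and 10 key frames expected in 10 seconds) suggest a simple receiver-side check. The sketch below is one possible reading, with stated assumptions: the class and method names are hypothetical, loss is flagged when a further P frame arrives after 30 consecutive P frames, and the time check compares the elapsed time with the expected key frame period multiplied by the number of key frames received.

```python
class ReceiverLossDetector:
    """Hypothetical receiver-side detector; names and defaults are illustrative."""

    def __init__(self, key_frame_interval=30, key_frame_period_s=1.0):
        self.key_frame_interval = key_frame_interval  # P frames per key frame
        self.key_frame_period_s = key_frame_period_s  # expected seconds per key frame
        self.p_frames_since_key = 0

    def on_frame(self, is_key_frame):
        """Count check: return True if this frame's arrival indicates a lost key frame."""
        if is_key_frame:
            self.p_frames_since_key = 0
            return False
        self.p_frames_since_key += 1
        # More P frames in a row than the encoder emits between key frames
        # means the expected key frame did not arrive.
        return self.p_frames_since_key > self.key_frame_interval

    def window_too_slow(self, key_frames_received, elapsed_s):
        """Time check: return True if key frames took longer than expected to arrive."""
        return elapsed_s > key_frames_received * self.key_frame_period_s
```

With the default values, `window_too_slow(10, 15.0)` reports loss, matching the 15-second example, while `window_too_slow(10, 10.0)` does not.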
Referring to Fig. 4, in the sixth embodiment of the invention, the judging submodule 11 further comprises:
A receiving unit 114, located at the sending end of the video call, for receiving the receiver report;
A third judging unit 115, for judging, from this receiver report and the previously received receiver report, whether the packet loss exceeds the threshold for the number of received data packets; if so, judging that data packets have been lost.
In this embodiment, both parties to the video call, that is, the receiving end and the sending end, jointly judge whether data packets have been lost. Specifically, after the receiving unit 114 receives the receiver report, the third judging unit 115 compares it with the previously received receiver report to judge whether the packet loss exceeds the threshold for the number of received data packets. If the difference between the two reports exceeds the threshold, data packets are judged to have been lost.
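The sender-side comparison of two consecutive receiver reports can be sketched as a difference of their cumulative loss counters. This is a minimal illustration assuming that each report exposes the cumulative number of packets lost (one of the statistics the description lists); the `ReceiverReport` class and the function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ReceiverReport:
    """Hypothetical subset of an RTCP receiver report used by the sending end."""

    cumulative_lost: int  # cumulative number of packets lost so far


def loss_between_reports(previous, current):
    """Packets lost in the interval between two consecutive receiver reports."""
    return current.cumulative_lost - previous.cumulative_lost


def packets_lost(previous, current, loss_threshold):
    """Return True when the newly reported loss exceeds the preset threshold."""
    return loss_between_reports(previous, current) > loss_threshold
```

Comparing the difference rather than the raw counter keeps old losses from triggering a new key frame on every report.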
Referring to Fig. 4, in the seventh embodiment of the invention, the control module 20 comprises:
A notification submodule 25, located at the sending end, for notifying the video encoder 26 to generate a key frame when the judging submodule 11 judges that data packets have been lost;
A video encoder 26, for generating the key frame and then sending it to the receiving end of the video call.
In this embodiment, the receiving end sends an RR report to the sending end. After the sending end receives it and judges that the number of lost packets has increased, the notification submodule 25 notifies the video encoder 26 to generate a key frame, and the generated key frame is sent to the receiving end of the video call. Video call quality is thereby improved.
In the embodiments above, the system 100 for optimizing video call quality may be a software unit, a hardware unit, or a combined software and hardware unit built into a communication terminal. The communication terminal may be a mobile phone, a PDA (Personal Digital Assistant), a tablet computer, or the like.
Referring to Fig. 5, the eighth embodiment of the invention provides a method for optimizing video call quality, comprising the following steps:
In step S501, detecting whether a key frame has been lost in a video call; this step is performed by the detection module 10;
In step S502, controlling the sending of key frames according to the current network condition of the video call; this step is performed by the control module 20.
In this embodiment, the detection module 10 first checks the video call for key frame loss. If a key frame is found to have been lost, the control module 20 controls the sending of key frames according to the current network condition of the video call, so that the quality of the video call is optimized for that condition.
In the ninth embodiment of the invention, the following steps precede step S501:
Presetting a threshold for the number of data packets received on the video receiving RTCP port and/or a threshold for the reception time; this step is performed by the presetting module 30.
Presetting a threshold for the network transmission rate at which key frames are sent in the video call; this step is performed by the presetting module 30.
Step S501 comprises:
Both parties to the video call judging, according to the threshold for the number of received data packets and/or the threshold for the reception time, whether data packets have been lost; this step is performed by the judging submodule 11.
If data packets are judged to have been lost, detecting whether a key frame has been lost in the video call.
In this embodiment, the user can preset the parameters used to judge whether a key frame has been lost, as well as the reference network transmission rate at which a video call can be carried, so that the video call can be kept at high quality according to these values.
In this embodiment, the threshold for the number of data packets received on the video receiving RTCP port and/or the threshold for the reception time are preset first, and these two thresholds serve as the reference for judging whether data packets have been lost in the video call. When data packets are judged to have been lost, the video encoder 26 generates a key frame and the key frame is sent to the receiving end of the video call, which optimizes the quality of the video call.
In the tenth embodiment of the present invention, step S502 comprises:
Measuring the network transmission speed of the current video call; this step is implemented by statistics submodule 21;
When the network transmission speed reaches the network-transmission-speed threshold, video encoder 26 encodes a key frame and the key frame is sent to the receiving end of the video call; this step is implemented by first control submodule 22; or
When the network transmission speed does not reach the network-transmission-speed threshold, the sending end stops sending key frames and sends a first control frame to the receiving end, requesting the receiving end to stop sending key frames; this step is implemented by second control submodule 23.
After receiving the first control frame, the receiving end stops sending key frames.
In the 11th embodiment of the present invention, step S502 further comprises:
When the measured network transmission speed of the current video call again reaches the network-transmission-speed threshold, video encoder 26 encodes a key frame and the key frame is sent to the receiving end of the video call; a second control frame is sent to the receiving end, requesting the receiving end to resume sending key frames; this step is implemented by third control submodule 24.
After receiving the second control frame, the receiving end resumes sending key frames.
In the above two embodiments, when a key frame has been lost, the transmission of key frames is controlled according to the measured network rate, thereby optimizing the quality of the video call.
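The behaviour of the tenth and 11th embodiments can be sketched as a small state machine. This is only an illustrative sketch: the class name, the action strings, and the reading of "reaches the threshold" as greater than or equal are assumptions, and the choice not to repeat the first control frame while sending is already suspended is likewise an assumption that the text does not settle.

```python
from typing import List


class KeyFrameController:
    """Hypothetical sender-side controller for key frame transmission."""

    def __init__(self, speed_threshold: float) -> None:
        # Preset network transmission speed threshold (units are up to the caller).
        self.speed_threshold = speed_threshold
        # True once the first control frame has been sent to the receiving end.
        self.suspended = False

    def on_key_frame_lost(self, current_speed: float) -> List[str]:
        """Return the actions to take after a key frame loss has been detected."""
        if current_speed >= self.speed_threshold:
            # Speed is sufficient: encode a key frame and send it.
            actions = ["encode_and_send_key_frame"]
            if self.suspended:
                # Speed has recovered: ask the peer to resume sending key frames.
                actions.append("send_second_control_frame")
                self.suspended = False
            return actions
        # Speed is too low: stop sending key frames.
        actions = ["stop_sending_key_frames"]
        if not self.suspended:
            # Ask the peer to stop sending key frames as well (assumed: sent once).
            actions.append("send_first_control_frame")
            self.suspended = True
        return actions
```

A caller would feed the controller one measured speed per detected loss; the returned action names stand in for the encoder and control-frame calls of a real terminal.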
In the 12nd embodiment of the present invention, described step S502 comprises:
Whether the number of the packet that the receiving terminal judgement of video calling receives reaches the threshold value of described receive data bag number;
If do not reach the threshold value of described receive data bag number, judge that described packet loses, and to the transmitting terminal transmitting and receiving terminal report of described video calling; This step is realized by the first judging unit 111 and transmitting element 113.
If reach the threshold value of described receive data bag number, judge further whether the time that receives described packet reaches the threshold value of described time of reception, do not judge that described packet loses if reach, and to the transmitting terminal transmitting and receiving terminal report of described video calling; This step is realized by the second judging unit 112 and transmitting element 113.
In this embodiment, the concrete receiving terminal at video calling judges whether packet has loss, notifies the video calling transmitting terminal if having, and the video calling transmitting terminal sends to receiving terminal after controlling video encoder 26 establishment key frames, has optimized video speech quality.
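The two-stage judgment at the receiving end can be written as a single predicate. The following is a minimal sketch that follows the wording of this embodiment literally; the function and parameter names are hypothetical, and the text does not specify how the reception time is measured or in which units.

```python
def receiver_detects_loss(packets_received: int, reception_time: float,
                          count_threshold: int, time_threshold: float) -> bool:
    """Return True when the receiving end should send a receiver report."""
    # First judgment: the packet count has not reached its threshold.
    if packets_received < count_threshold:
        return True
    # Second judgment, only when the count threshold is reached:
    # the reception time has not reached its threshold.
    return reception_time < time_threshold
```

When the predicate returns True, the receiving end would send a receiver report to the sending end, as described above.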
In the 13rd embodiment of the present invention, described and also comprise after the step of the transmitting terminal transmitting and receiving terminal of described video calling report:
The transmitting terminal of described video calling receives described receiving terminal report, judge according to described receiving terminal report and the last described receiving terminal report that receives whether the loss of described packet surpasses the threshold value of described receive data bag number, if judge described data-bag lost.This step is realized by receiving element 114 and the 3rd judging unit 115.
In this embodiment, after video calling receiving terminal judgement data are surrounded by the generation of loss situation, judge further by the transmitting terminal of video calling whether packet has to lose and send.Concrete pass through to contrast the report that twice receiving terminal return and judge, if the threshold value that in twice report, the difference of packet surpasses receive data bag number judges described data-bag lost.By after described video encoder 26 establishment key frames, described key frame is sent to the receiving terminal of described video calling.Described step S303 comprises: when the described data-bag lost of described transmitting terminal judgement, described transmitting terminal is notified described video encoder 26 establishment key frames; This step is realized by notice submodule 25; After described video encoder 26 establishment key frames, described key frame is sent to the receiving terminal of described video calling.
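The comparison of two consecutive receiver reports at the sending end might look as follows. This is a sketch under stated assumptions: the text does not say which report field is compared, so a cumulative lost-packet counter (a field of that kind exists in standard RTCP receiver reports) is assumed here, and the type and function names are hypothetical.

```python
from typing import NamedTuple


class ReceiverReport(NamedTuple):
    """Hypothetical, reduced view of one receiver report."""
    cumulative_lost: int  # assumed field: packets lost since the call started


def sender_confirms_loss(previous: ReceiverReport, current: ReceiverReport,
                         count_threshold: int) -> bool:
    """Return True when the loss between two reports exceeds the threshold."""
    newly_lost = current.cumulative_lost - previous.cumulative_lost
    # "Exceeds" is read as strictly greater than the threshold.
    return newly_lost > count_threshold
```

A True result corresponds to the point at which the sending end notifies the video encoder to encode a key frame.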
Referring to Fig. 6A and Fig. 6B, one embodiment of the present invention provides a method for optimizing a video call. In this embodiment, whether packets have been lost is determined from the SR received on the video channel's RTCP; if packets have been lost, video encoder 26 is notified to force the encoding of a key frame and send it to the opposite end, so as to improve video quality. Fig. 6A shows the workflow of the receiving end of the video call, described as follows:
In step S601, the video call starts;
In step S602, the packet-count threshold and the time threshold are set;
In step S603, it is determined whether the packet-count threshold has been reached; if yes, step S604 is executed; otherwise, step S605 is executed;
In step S604, it is determined whether the time threshold for receiving data packets has been reached; if yes, step S606 is executed; otherwise, the flow returns to step S603;
In step S605, an RR is sent.
In step S606, it is determined whether a BYE, namely the instruction to end the video call, has been received; if yes, the video call ends; otherwise, the flow returns to step S603.
Fig. 6B shows the workflow of the sending end of the video call, described as follows:
In step S701, the video call starts;
In step S702, the sending end prepares to receive an SR;
In step S703, it is determined whether an SR has been received; if yes, step S704 is executed; otherwise, the flow returns to step S702;
In step S704, the received values are compared with those received last time; if the number of lost packets and the number of lost bytes are greater than the received-packet threshold, step S705 is executed; otherwise, the flow returns to step S702;
In step S705, video encoder 26 is notified to encode a key frame, and the key frame is sent;
In step S706, it is determined whether a BYE, namely the instruction to end the video call, has been received; if yes, the video call ends; otherwise, the flow returns to step S702.
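The sending-end flow of Fig. 6B can be approximated as an event loop. The sketch below makes several simplifying assumptions: reports and the BYE instruction arrive as items of one event sequence, each report is reduced to a hypothetical pair of counters (lost packets, lost bytes), both differences are compared against the same threshold as step S704 literally states, and BYE is checked whenever an event arrives rather than only after step S705.

```python
from typing import Iterable, Optional, Tuple, Union

Report = Tuple[int, int]  # (lost_packets, lost_bytes); hypothetical layout


def run_sender_flow(events: Iterable[Union[str, Report]], threshold: int) -> int:
    """Return how many times the encoder would be asked for a key frame."""
    key_frames_requested = 0
    previous: Optional[Report] = None
    for event in events:                      # S702/S703: wait for the next report
        if isinstance(event, str):
            if event == "BYE":                # S706: end-of-call instruction
                break
            continue                          # ignore any other string event
        if previous is not None:
            lost_packets = event[0] - previous[0]
            lost_bytes = event[1] - previous[1]
            # S704: compare with the values received last time.
            if lost_packets > threshold and lost_bytes > threshold:
                key_frames_requested += 1     # S705: notify the encoder and send
        previous = event
    return key_frames_requested
```

The first report has nothing to be compared with, so the sketch only stores it; what a real terminal does in that case is not stated in the text.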
Preferably, in the above embodiments, the key frame is an I-frame.
In summary, the present invention detects the occurrence of key frame loss in a video call and controls the transmission of key frames according to the current network condition of the video call. This avoids the situation in which, after a key frame has been lost and while the network data rate is low, both parties to the video call nevertheless keep sending key frames, congesting the network transmission channel of the video call and making the call quality even worse. The quality of the video call is thereby optimized and the user experience improved.
Of course, the present invention may have various other embodiments. Without departing from the spirit and essence of the present invention, those of ordinary skill in the art may make various corresponding changes and modifications according to the present invention, but all such changes and modifications shall fall within the protection scope of the appended claims of the present invention.

Claims (10)

1. A method for optimizing video call quality, characterized by comprising the following steps:
Detecting the occurrence of key frame loss in a video call;
Controlling the transmission of the key frame according to the current network condition of the video call.
2. The method according to claim 1, characterized in that, before the step of detecting the occurrence of key frame loss in the video call, the method comprises:
Presetting a threshold for the number of data packets received on the video-receiving RTCP (Real-time Transport Control Protocol) port and/or a threshold for the reception time;
Presetting a threshold for the network transmission speed at which the key frame is sent in the video call;
The step of detecting the occurrence of key frame loss in the video call comprises:
Both parties to the video call determine, according to the received-packet-count threshold and/or the reception-time threshold, whether data packets have been lost;
If it is determined that data packets have been lost, the occurrence of key frame loss in the video call is detected.
3. The method according to claim 2, characterized in that the step of controlling the transmission of the key frame according to the current network condition of the video call comprises:
Measuring the network transmission speed of the current video call;
When the network transmission speed reaches the network-transmission-speed threshold, a video encoder encodes a key frame and the key frame is sent to the receiving end of the video call; or
When the network transmission speed does not reach the network-transmission-speed threshold, stopping the sending of key frames and sending a first control frame to the receiving end, requesting the receiving end to stop sending key frames;
After receiving the first control frame, the receiving end stops sending key frames.
4. The method according to claim 3, characterized in that the step of controlling the transmission of the key frame according to the current network condition of the video call further comprises:
When the measured network transmission speed of the current video call again reaches the network-transmission-speed threshold, the video encoder encodes a key frame and the key frame is sent to the receiving end of the video call; a second control frame is sent to the receiving end, requesting the receiving end to resume sending key frames;
After receiving the second control frame, the receiving end resumes sending key frames.
5. The method according to claim 2, characterized in that the step in which both parties to the video call determine, according to the received-packet-count threshold and/or the reception-time threshold, whether data packets have been lost comprises:
The receiving end of the video call determines whether the number of received data packets reaches the received-packet-count threshold;
If the received-packet-count threshold is not reached, it is determined that data packets have been lost, and a receiver report is sent to the sending end of the video call;
If the received-packet-count threshold is reached, it is determined whether the time of receiving the data packets reaches the reception-time threshold; if it does not, it is determined that data packets have been lost, and a receiver report is sent to the sending end of the video call; and/or
The sending end of the video call receives the receiver report and determines, from this receiver report and the previously received receiver report, whether the loss of data packets exceeds the received-packet-count threshold; if so, it is determined that data packets have been lost.
6. A system for optimizing video call quality, characterized by comprising:
A detection module, configured to detect the occurrence of key frame loss in a video call;
A control module, configured to control the transmission of the key frame according to the current network condition of the video call.
7. The system according to claim 6, characterized in that the system further comprises:
A presetting module, configured to preset a threshold for the number of data packets received on the video-receiving RTCP (Real-time Transport Control Protocol) port and/or a threshold for the reception time, and a threshold for the network transmission speed at which the key frame is sent in the video call;
The detection module comprises:
A judgment submodule, configured to determine, according to the received-packet-count threshold and/or the reception-time threshold, whether data packets have been lost;
If it is determined that data packets have been lost, the detection module detects the occurrence of key frame loss in the video call.
8. The system according to claim 6, characterized in that the control module comprises:
A statistics submodule, configured to measure the network transmission speed of the current video call;
A first control submodule, configured to control a video encoder to encode a key frame when the network transmission speed reaches the network-transmission-speed threshold, and to send the key frame to the receiving end of the video call; or
A second control submodule, configured to control the sending end of the video call to stop sending key frames when the network transmission speed does not reach the network-transmission-speed threshold, and to send a first control frame to the receiving end, requesting the receiving end to stop sending key frames;
After receiving the first control frame, the receiving end stops sending key frames.
9. The system according to claim 8, characterized in that the control module further comprises:
A third control submodule, configured to control the video encoder to encode a key frame when the network transmission speed of the current video call measured by the statistics submodule again reaches the network-transmission-speed threshold, and to send the key frame to the receiving end of the video call; and
To send a second control frame to the receiving end, requesting the receiving end to resume sending key frames;
After receiving the second control frame, the receiving end resumes sending key frames.
10. The system according to claim 7, characterized in that the judgment submodule comprises:
A first judging unit, arranged at the receiving end of the video call and configured to determine whether the number of received data packets reaches the received-packet-count threshold; if the received-packet-count threshold is not reached, the first judging unit determines that data packets have been lost;
A second judging unit, arranged at the receiving end of the video call and configured to determine whether the time of receiving the data packets reaches the reception-time threshold; if the reception-time threshold is not reached, the second judging unit determines that data packets have been lost;
A transmitting unit, configured to send a receiver report to the sending end of the video call when the first judging unit and/or the second judging unit determines that data packets have been lost; and/or
A receiving unit, arranged at the sending end of the video call and configured to receive the receiver report;
A third judging unit, configured to determine, from the receiver report and the previously received receiver report, whether the loss of data packets exceeds the received-packet-count threshold; if so, it is determined that data packets have been lost.
CN201310035056.6A 2013-01-29 2013-01-29 Video call quality optimization method and system thereof Active CN103152544B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310035056.6A CN103152544B (en) 2013-01-29 2013-01-29 Video call quality optimization method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310035056.6A CN103152544B (en) 2013-01-29 2013-01-29 Video call quality optimization method and system thereof

Publications (2)

Publication Number Publication Date
CN103152544A true CN103152544A (en) 2013-06-12
CN103152544B CN103152544B (en) 2016-04-06

Family

ID=48550389

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310035056.6A Active CN103152544B (en) 2013-01-29 2013-01-29 Video call quality optimization method and system thereof

Country Status (1)

Country Link
CN (1) CN103152544B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104427286A (en) * 2013-08-20 2015-03-18 中国移动通信集团公司 Method and system for making video call
CN112584081A (en) * 2020-12-01 2021-03-30 北京融讯科创技术有限公司 Video transmission method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101345962A (en) * 2008-08-15 2009-01-14 宇龙计算机通信科技(深圳)有限公司 Call mode switch control method, system and mobile terminal
CN101651815A (en) * 2009-09-01 2010-02-17 中兴通讯股份有限公司 Visual telephone and method for enhancing video quality by utilizing same
CN101931799A (en) * 2010-09-14 2010-12-29 中兴通讯股份有限公司 Method and device for smoothing video bit stream
CN102752670A (en) * 2012-06-13 2012-10-24 广东威创视讯科技股份有限公司 Method, device and system for reducing phenomena of mosaics in network video transmission

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104427286A (en) * 2013-08-20 2015-03-18 中国移动通信集团公司 Method and system for making video call
CN104427286B (en) * 2013-08-20 2019-01-01 中国移动通信集团公司 A kind of method and system carrying out video calling
CN112584081A (en) * 2020-12-01 2021-03-30 北京融讯科创技术有限公司 Video transmission method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN103152544B (en) 2016-04-06

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant