CN103152544B

CN103152544B - The method that video speech quality is optimized and system thereof

Info

Publication number: CN103152544B
Application number: CN201310035056.6A
Authority: CN
Inventors: 刘灵新; 李静
Original assignee: Yulong Computer Telecommunication Scientific Shenzhen Co Ltd; Dongguan Yulong Telecommunication Technology Co Ltd
Current assignee: Yulong Computer Telecommunication Scientific Shenzhen Co Ltd; Dongguan Yulong Telecommunication Technology Co Ltd
Priority date: 2013-01-29
Filing date: 2013-01-29
Publication date: 2016-04-06
Anticipated expiration: 2033-01-29
Also published as: CN103152544A

Abstract

The present invention is applicable to communication technical field, provides method and the system thereof of the optimization of a kind of video speech quality, comprises the steps: the generation detecting key frame loss situation in video calling; According to the network condition of current described video calling, control the transmission of described key frame.Whereby, present invention optimizes the quality of video calling.

Description

The method that video speech quality is optimized and system thereof

Technical field

The present invention relates to communication technical field, particularly relate to method and the system thereof of the optimization of a kind of video speech quality.

Background technology

Based on the visual telephone that SIP (SessionInitiationProtocol) is the signaling control protocol of an application layer, video and audio frequency adopt RTP(Real-timeTransportProtocol, RTP) agreement passes through UDP(UserDatagramProtocol, User Datagram Protocol) mode transmits audio, video data, by sister's agreement that RTCP (Real-timeTransportControlProtocol, RTCP Real-time Transport Control Protocol) is realtime transmission protocol RTP.Realize synchronous with between feedback, media of the supervision of service quality, and the mark of member in multicast group.Because UDP belongs to non-reliable connection, when network condition is poor may have packet loss situation occur.If video is surrounded by loss, the phenomenon of the poor image quality such as mosaic will be produced, if what lose is that key frame also can cause continue for some time of poor quality, until next key frame receives.In the prior art, although detecting that UDP has packet loss to adopt remedial measures, the impact that video calling network condition sends key frame is not but being considered.Therefore, cause video speech quality to be affected.

In summary, in actual use, obviously there is inconvenience and defect, so be necessary to be improved in existing video call technology.

Summary of the invention

For above-mentioned defect, a kind of method that the object of the present invention is to provide video speech quality to optimize and system thereof, to optimize the quality of video calling.

To achieve these goals, the invention provides a kind of method that video speech quality is optimized, comprise the steps:

Detect the generation of key frame loss situation in video calling;

According to the network condition of current described video calling, control the transmission of described key frame.

According to described method, comprise before the step of the generation of key frame loss situation in described detection video calling:

Preset the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception;

Be preset in the threshold value of the network transmission speed sending described key frame in described video calling;

In described detection video calling, the step of the generation of key frame loss situation comprises:

According to the threshold value of described reception number-of-packet object threshold value and/or time of reception, video calling both sides judge whether described packet has loss;

If judge, described data are surrounded by loss, then the generation of key frame loss situation in described video calling detected.

According to described method, the described network condition according to current described video calling, the step controlling the transmission of described key frame comprises:

Add up the network transmission speed in current described video calling;

When described network transmission speed reaches the threshold value of described network transmission speed, video encoder establishment key frame, and described key frame is sent to the receiving terminal of described video calling; Or

When described network transmission speed does not reach the threshold value of described network transmission speed, stop sending key frame, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame;

After described receiving terminal receives described first control frame, stop sending key frame.

According to described method, the described network condition according to current described video calling, the step controlling the transmission of described key frame also comprises:

When adding up the network transmission speed in current described video calling and again reaching the threshold value of described network transmission speed, video encoder establishment key frame, and described key frame is sent to the described receiving terminal of described video calling; Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;

After described receiving terminal receives described second control frame, recover to send key frame.

According to described method, according to the threshold value of described reception number-of-packet object threshold value and/or time of reception, described video calling both sides judge whether described packet has the step of loss to comprise:

The receiving terminal of video calling judges whether the number of the packet received reaches described reception number-of-packet object threshold value;

If do not reach described reception number-of-packet object threshold value, then judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling;

If reach described reception number-of-packet object threshold value, whether the time then judging to receive described packet reaches the threshold value of described time of reception, if do not reach, judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; And/or

The transmitting terminal of described video calling receives the report of described receiving terminal, judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost according to the described receiving terminal report that described receiving terminal report and last time receive.

In order to realize another goal of the invention of the present invention, present invention also offers the system that a kind of video speech quality is optimized, comprising:

Detection module, for detecting the generation of key frame loss situation in video calling;

Control module, for the network condition according to current described video calling, controls the transmission of described key frame.

According to described system, described system also comprises:

Presetting module, for the reception number-of-packet object threshold value of default video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; And in described video calling, send the threshold value of network transmission speed of described key frame;

Described detection module comprises:

Judge submodule, for judging whether described packet has loss according to the threshold value of described reception number-of-packet object threshold value and/or time of reception;

If judge, described data are surrounded by loss, then described detection module detects the generation of key frame loss situation in described video calling.

According to described system, described control module comprises:

Statistics submodule, for adding up the network transmission speed in current described video calling;

First controls submodule, for when described network transmission speed reaches the threshold value of described network transmission speed, controls video encoder establishment key frame, and described key frame is sent to the receiving terminal of described video calling; Or

Second controls submodule, for when described network transmission speed does not reach the threshold value of described network transmission speed, the transmitting terminal controlling described video calling stops sending key frame, and sends the first control frame to described receiving terminal, asks described receiving terminal to stop sending key frame;

According to described system, described control module also comprises:

3rd controls submodule, for when adding up the network transmission speed in the current described video calling of submodule statistics and again reaching the threshold value of described network transmission speed, control video encoder establishment key frame, and described key frame is sent to the described receiving terminal of described video calling; And

Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;

According to described system, described judgement submodule comprises:

First judging unit, is arranged at the receiving terminal of described video calling, for judging whether the number of the packet received reaches described reception number-of-packet object threshold value; If do not reach described reception number-of-packet object threshold value, then described first judging unit judges that described packet is lost;

Second judging unit, is arranged at the receiving terminal of described video calling, and whether the time for judging to receive described packet reaches the threshold value of described time of reception; If do not reach the threshold value of described time of reception, then described second judging unit judges that described packet is lost;

Transmitting element, during for judging that at described first judging unit and/or the second judging unit described packet occurs to lose, the transmitting terminal transmitting and receiving terminal to described video calling is reported; And/or

Receiving element, is arranged at the transmitting terminal of described video calling, for receiving the report of described receiving terminal;

For the described receiving terminal report received according to described receiving terminal report and last time, 3rd judging unit, judges whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost.

The present invention is by judging in described video calling, whether packet has loss; If judge, described data are surrounded by loss, then video encoder establishment key frame, and described key frame are sent to the receiving terminal of described video calling.Simultaneously, due in the process of video data transmission, the volume of transmitted data of key frame is very large, and when poor video quality, sending key frame frequently will increase the weight of the load of terminal transmission data, when the ability of eating dishes without rice or wine of mobile terminal do not return to can smooth and easy carrying visual telephone business time, video quality can worsen further, by detecting the network condition of current video call, controls the transmission of described key frame, thus optimize the quality of video calling, improve Consumer's Experience.

Accompanying drawing explanation

Fig. 1 is the system construction drawing of the video speech quality optimization that first embodiment of the invention provides;

Fig. 2 be the present invention second and third, the system construction drawing optimized of the video speech quality that provides of four embodiments;

Fig. 3 is the system construction drawing of the video speech quality optimization that one embodiment of the invention provides;

Fig. 4 be the present invention the 5th, six, the system construction drawing optimized of the video speech quality that provides of seven embodiments;

Fig. 5 is the method flow diagram of the video speech quality optimization that eighth embodiment of the invention provides;

Fig. 6 A is the method flow diagram of the video speech quality optimization that one embodiment of the invention provides;

Fig. 6 B is the method flow diagram of the video speech quality optimization that one embodiment of the invention provides.

Embodiment

In order to make object of the present invention, technical scheme and advantage clearly understand, below in conjunction with drawings and Examples, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.

See Fig. 1, in the first embodiment of the present invention, provide the system 100 that a kind of video speech quality is optimized, comprising:

Detection module 10, for detecting the generation of key frame loss situation in video calling;

Control module 20, for the network condition according to current described video calling, controls the transmission of described key frame.

In this embodiment, first by whether there being the transmission of crucial LOF situation in detection module 10 video calling, losing due to key frame and video speech quality will be caused to be deteriorated, therefore first detecting above-mentioned situation by this detection module 10 and whether occur.Then, if the generation of key frame loss situation, then by the network condition of control module 20 according to current described video calling, control the transmission of described key frame.Such setting is due in the process of whole video data transmission, and the volume of transmitted data of key frame is very large, and when poor video quality, and when network condition is poor, sending key frame frequently will increase the weight of the load of terminal transmission data.As do not return in the ability of eating dishes without rice or wine of mobile terminal can smooth and easy carrying visual telephone business time, continue to send key frame and video quality will be made to worsen further, and can not to increase.Therefore carry out according to concrete network condition the transmission that a step controls key frame, will the raising of video speech quality be conducive to.Described key frame is I frame.

See Fig. 2, in the second embodiment of the present invention, described system 100 also comprises:

Presetting module 30, for the reception number-of-packet object threshold value of default video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; And in described video calling, send the threshold value of network transmission speed of described key frame;

Detection module 10 comprises:

Judge submodule 11, for judging whether described packet has loss according to the threshold value of described reception number-of-packet object threshold value and/or time of reception;

If judge, described data are surrounded by loss, then detection module 10 detects the generation of key frame loss situation in described video calling.

Based in the visual telephone of SIP, audio frequency and video send data and control information respectively by pair of end oral instructions, and RTP grouping only comprises RTP data, and controls to be provided by another matching used rtcp protocol.RTP selects a untapped even number UDP port number between 1025 to 65535, is then using next odd number UDP port number with the RTCP in a session.RTCP controls bag and has five types, wherein for providing QoS(QualityofService, service quality) feed back have two kinds of SR (SenderReport, sender report) and RR (ReceiverReport, receiving terminal report).The former describes the transmission of transmitting terminal and receives statistics; The latter describes the reception statistics of receiving terminal.These statisticss comprise send bag number, send byte number, accumulative number of dropped packets, received telegraph literary composition maximum sequence number, the time of advent space jitter etc.In this embodiment, reception number-of-packet threshold value and/or the time of reception threshold value of video reception RTCP port is pre-set by presetting module 30; And in described video calling, send the threshold value of network transmission speed of described key frame.The threshold value of the network transmission speed of described key frame can send Audio and Video smoothly with video calling and be as the criterion, as the transmission rate of WIFI cordless communication network can meet the demand of video calling, can arrange this threshold value is 2M/S.In addition, particularly 3G network, because its network speed is than very fast, will be conducive to normally carrying out of video calling.Judge that submodule 11 judges to receive packet and do not reach threshold value or time of reception when reaching threshold value, then detection module 10 detects the generation of key frame loss situation in described video calling.

See Fig. 2, in the third embodiment of the present invention, control module 20 comprises:

Statistics submodule 21, for adding up the network transmission speed in current described video calling;

First controls submodule 22, for when described network transmission speed reaches the threshold value of described network transmission speed, controls video encoder 26 and works out key frame, and described key frame is sent to the receiving terminal of described video calling; Or

Second controls submodule 23, for when described network transmission speed does not reach the threshold value of described network transmission speed, the transmitting terminal controlling described video calling stops sending key frame, and sends the first control frame to described receiving terminal, asks described receiving terminal to stop sending key frame;

In this embodiment, statistics submodule 21 adds up the network transmission speed in described video calling, and the threshold value of the network transmission speed that statistics submodule 21 is added up by the first control submodule 22 and the first control submodule 22 respectively and described default network transmission speed contrasts.And when described network transmission speed reaches the threshold value of described network transmission speed, first controls submodule 22 and controls video encoder 26 and work out key frame, and described key frame is sent to the receiving terminal of described video calling.Thus, in video call process, after there is loss of data, carry out the establishment of key frame timely, and be sent to receiving terminal, video speech quality is improved.And when described network transmission speed does not reach the threshold value of described network transmission speed, second controls the transmitting terminal stopping transmission key frame that submodule 23 controls described video calling in time, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame; Reduce offered load thus, the network data transmission of video calling can be recovered as soon as possible.

See Fig. 2 in the fourth embodiment of the present invention, control module 20 also comprises:

3rd controls submodule 24, for when adding up submodule 21 network transmission speed added up in current described video calling and again reaching the threshold value of described network transmission speed, control video encoder 26 and work out key frame, and described key frame is sent to the described receiving terminal of described video calling; And

In this embodiment, when adding up submodule 21 network transmission speed added up in current described video calling and again reaching the threshold value of described network transmission speed, 3rd controls submodule 24 works out control video encoder 26 the described receiving terminal that key frame is sent to described video calling, and video speech quality is improved.Meanwhile, send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame; Make video calling smooth.

See Fig. 3, in one embodiment of the invention, provide the structured flowchart of the system 100 that video speech quality is optimized, include videophone application 1; Video control unit 2, coding and decoding video unit 3, Audio control unit 4, audio coding decoding unit 5, communication module 6.Wherein coding and decoding video unit 3 and audio coding decoding unit 5 are responsible for the Audio and Video encoding and decoding in video calling respectively, and video control unit 2 and Audio control unit 4 control the transmission and the encoding and decoding that comprise video data and voice data respectively.And communication module 6 by statistics submodule 21 physics eat dishes without rice or wine layer statistics current data send Rate Feedback to video control unit 2.These data are updated in video calling video control unit 2 at a certain time interval.Video control unit 2 can determine whether to send key frame according to current network conditions and P frame quantity forwarded.If network quality is poor simultaneously, control unit 2 transmission control frames frequently, require that the transmission of key frame is suspended in opposite end, to reduce the offered load of terminal, can recover from congestion status as early as possible.

See Fig. 4, in the fifth embodiment of the present invention, judge that submodule 11 comprises:

First judging unit 111, is arranged at the receiving terminal of described video calling, for judging whether the number of the packet received reaches described reception number-of-packet object threshold value; If do not reach described reception number-of-packet object threshold value, then described first judging unit 111 judges that described packet is lost;

Second judging unit 112, is arranged at the receiving terminal of described video calling, and whether the time for judging to receive described packet reaches the threshold value of described time of reception; If do not reach the threshold value of described time of reception, then described second judging unit 112 judges that described packet is lost;

Transmitting element 113, during for judging that at described first judging unit 111 and/or the second judging unit 112 described packet occurs to lose, the transmitting terminal transmitting and receiving terminal to described video calling is reported.

In this embodiment, the judgement of data-bag lost unilaterally can be carried out by the receiving terminal of video calling.Concrete, by the first judging unit 111, second judging unit 112, receiving terminal judges whether the number of reception packet reaches respectively and does not arrive described reception number-of-packet object threshold value, and whether the time receiving packet arrives the threshold value of described time of reception.Whether the number receiving packet reaches and does not arrive described reception number-of-packet object threshold value, then illustrate there is data-bag lost; Described reception number-of-packet object threshold value is arranged according to the key frame interval number of described video encoder 26; Such as, have sent 30 P frames by transmission key frame at video encoder 26, and when receiving 30 P frames and not receiving a key frame value, then judge that data are surrounded by loss.And the interval time that the threshold value of described time of reception works out key frame according to described video encoder 26 is arranged.Such as, the time receiving 10 key frames is 10 seconds, and the time receiving 10 key frames needs 15 seconds, then illustrate that data are surrounded by loss.

See Fig. 4, in the sixth embodiment of the present invention, described judgement submodule 11 also comprises:

Receiving element 114, is arranged at the transmitting terminal of described video calling, for receiving the report of described receiving terminal;

For the described receiving terminal report received according to described receiving terminal report and last time, 3rd judging unit 115, judges whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost.

In this embodiment, by the both sides of video calling, namely receiving terminal and transmitting terminal judge whether packet sends loss jointly.Concrete, receiving element 114 to be reported according to described receiving terminal by the 3rd judging unit 115 after receiving the report of described receiving terminal and on the described receiving terminal report that once receives judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if both comparison values have exceeded described reception number-of-packet object threshold value, judge that data are surrounded by send and lost.

See Fig. 4, in the seventh embodiment of the present invention, described control module 20 comprises:

Notice submodule 25, is arranged at described transmitting terminal, is used in when judging that submodule 11 judges described data-bag lost, notifies that described video encoder 26 works out key frame;

Video encoder 26, after working out key frame, is sent to the receiving terminal of described video calling by described key frame.

In this embodiment, send RR and report to transmit end receive end, after transmitting terminal receives this information, after judging that number of dropped packets has increase, then notify that submodule 25 notifies that video encoder 26 works out key frame; And the receiving terminal of described video calling will be sent to after establishment key frame.Thus, make video speech quality obtain to improve.

In above-mentioned multiple embodiment, system 100 system of video speech quality optimization can be the software unit being built in communication terminal, hardware cell or software and hardware combining unit.Communication terminal can be mobile phone, PDA(PersonalDigitalAssistant, personal digital assistant), panel computer etc.

See Fig. 5, in the eighth embodiment of the present invention, provide a kind of method that video speech quality is optimized, comprise the steps:

In step S501, detect the generation of key frame loss situation in video calling; This step is realized by detection module 10;

In step S502, according to the network condition of current described video calling, control the transmission of described key frame; This step is realized by control module 20.

In this embodiment, first detected by key frame loss situation in detection module 10 pairs of video callings, if find, key frame has loss, then by the network condition of control module 20 according to current described video calling, control the transmission of described key frame, with according to current network condition, optimize the quality of video calling.

In the ninth embodiment of the present invention, comprise before described step S501:

Preset the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; This step is realized by presetting module 30.

Be preset in the threshold value of the network transmission speed sending described key frame in described video calling; This step is realized by presetting module 30.

Described step S501 comprises:

According to the threshold value of described reception number-of-packet object threshold value and/or time of reception, video calling both sides judge whether described packet has loss; This step is by judging that submodule 11 realizes.

In this embodiment, user can pre-set the relevant parameter judging whether key frame is lost, and can carry out the network transmission speed reference value of video calling, can control video calling high-quality carry out according to being worth.

In this embodiment, first presetting the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception, being judged whether sending data-bag lost in video calling as with reference to standard by these two threshold values.Judging that described data are surrounded by loss, then video encoder 26 works out key frame, and described key frame is sent to the receiving terminal of described video calling, thus, optimizes the quality of video calling.

In the tenth embodiment of the present invention, described step S502 comprises:

Add up the network transmission speed in current described video calling; This step realizes by adding up submodule 21;

When described network transmission speed reaches the threshold value of described network transmission speed, video encoder 26 works out key frame, and described key frame is sent to the receiving terminal of described video calling; This step controls submodule 22 by first and realizes; Or

When described network transmission speed does not reach the threshold value of described network transmission speed, stop sending key frame, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame; This step controls submodule 23 by second and realizes.

In the 11st embodiment of the present invention, described step S502 also comprises:

When adding up the network transmission speed in current described video calling and again reaching the threshold value of described network transmission speed, video encoder 26 works out key frame, and described key frame is sent to the described receiving terminal of described video calling; Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame; This step controls submodule 24 by the 3rd and realizes.

In the above two embodiments, when by concrete network rate to loss key frame, control the transmission of key frame, optimize the quality of video calling.

In the 12nd embodiment of the present invention, described step S502 comprises:

If do not reach described reception number-of-packet object threshold value, then judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; This step is realized by the first judging unit 111 and transmitting element 113.

If reach described reception number-of-packet object threshold value, whether the time then judging to receive described packet further reaches the threshold value of described time of reception, if do not reach, judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; This step is realized by the second judging unit 112 and transmitting element 113.

In this embodiment, at the receiving terminal of video calling, concrete judges whether packet has loss, if having, notice video calling transmitting terminal, video calling transmitting terminal controls to be sent to receiving terminal after video encoder 26 works out key frame, optimizes video speech quality.

In the 13rd embodiment of the present invention, described and also comprise after the step that the transmitting terminal transmitting and receiving terminal of described video calling is reported:

The transmitting terminal of described video calling receives the report of described receiving terminal, judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost according to the described receiving terminal report that described receiving terminal report and last time receive.This step is realized by receiving element 114 and the 3rd judging unit 115.

In this embodiment, after video calling receiving terminal judges that data are surrounded by the generation of loss situation, judge whether packet has to lose further by the transmitting terminal of video calling and send.The concrete report returned by contrast twice receiving terminal is judged, if the difference of packet exceedes and receives number-of-packet object threshold value in twice report, judges described data-bag lost.After then working out key frame by described video encoder 26, described key frame is sent to the receiving terminal of described video calling.Described step S303 comprises: when described transmitting terminal judges described data-bag lost, and described transmitting terminal notifies that described video encoder 26 works out key frame; This step is by notifying that submodule 25 realizes; After described video encoder 26 works out key frame, described key frame is sent to the receiving terminal of described video calling.

See Fig. 6 A and Fig. 6 B, provide the method that video calling is optimized in one embodiment of the invention, in this embodiment, according to the SR that video channel RTCP receives, judge whether packet loss, if there is packet loss, notify that video encoder 26 is forced compile key frame and send to opposite end, to improve video quality.Wherein Fig. 6 A is the workflow of video calling receiving terminal, is described below:

In step S601, visual telephone starts;

In step S602, setting data bag number thresholding and time threshold;

In step S603, judging whether to reach number-of-packet threshold value, is perform step S604, otherwise performs step S605;

In step S604, judge whether to reach the time gate limit value receiving packet, be perform step S606, otherwise return step S603;

In step S605, send RR.

In step S606, judge whether to receive BYE, namely terminate video calling instruction, be, terminate video calling, otherwise return step S603.

Fig. 6 B is the workflow of video calling transmitting terminal, is described below:

In step S701, visual telephone starts;

In step S702, prepare to receive SR;

In step S703, judge whether to receive SR, be, perform step S704, otherwise return step S702;

In step S704, judge to compare with the value that last time receives, number of dropped packets and lose byte number and be greater than described reception packet threshold value, be perform step S705, otherwise return step S702;

In step S705, notice video encoder 26 is worked out key frame and sends;

In step S706, whether receive BYE, namely terminate video calling instruction, be, terminate video calling, otherwise return step S702.

Preferably, in above-mentioned multiple embodiment, described key frame is I frame.

In sum, the present invention is by detecting the generation of key frame loss situation in video calling; According to the network condition of current described video calling, control the transmission of described key frame.Avoid under generation key frame loss situation, and when the speed ratio of transmitted data on network is lower, video calling both sides but still send key frame, cause the congested of video calling network transmission channels thus cause video speech quality to become worse situation to occur.Thus, optimize the quality of video calling, improve Consumer's Experience.

Certainly; the present invention also can have other various embodiments; when not deviating from the present invention's spirit and essence thereof; those of ordinary skill in the art are when making various corresponding change and distortion according to the present invention, but these change accordingly and are out of shape the protection range that all should belong to the claim appended by the present invention.

Claims

1. a method for video speech quality optimization, is characterized in that, comprise the steps:

Detect the generation of key frame loss situation in video calling;

According to the network condition of current described video calling, control the transmission of described key frame;

Comprise before the step of the generation of key frame loss situation in described detection video calling:

If judge, described data are surrounded by loss, then the generation of key frame loss situation in described video calling detected;

The described network condition according to current described video calling, the step controlling the transmission of described key frame comprises:

Add up the network transmission speed in current described video calling;

2. method according to claim 1, is characterized in that, the described network condition according to current described video calling, and the step controlling the transmission of described key frame also comprises:

3. method according to claim 1, is characterized in that, according to the threshold value of described reception number-of-packet object threshold value and/or time of reception, described video calling both sides judge whether described packet has the step of loss to comprise:

4. a system for video speech quality optimization, is characterized in that, comprising:

Control module, for the network condition according to current described video calling, controls the transmission of described key frame;

Described system also comprises:

Described detection module comprises:

If judge, described data are surrounded by loss, then described detection module detects the generation of key frame loss situation in described video calling;

Described control module comprises:

5. system according to claim 4, is characterized in that, described control module also comprises:

6. system according to claim 4, is characterized in that, described judgement submodule comprises: