CN103152544B - The method that video speech quality is optimized and system thereof - Google Patents

The method that video speech quality is optimized and system thereof Download PDF

Info

Publication number
CN103152544B
CN103152544B CN201310035056.6A CN201310035056A CN103152544B CN 103152544 B CN103152544 B CN 103152544B CN 201310035056 A CN201310035056 A CN 201310035056A CN 103152544 B CN103152544 B CN 103152544B
Authority
CN
China
Prior art keywords
key frame
threshold value
video calling
receiving terminal
packet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310035056.6A
Other languages
Chinese (zh)
Other versions
CN103152544A (en
Inventor
刘灵新
李静
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Dongguan Yulong Telecommunication Technology Co Ltd
Original Assignee
Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Dongguan Yulong Telecommunication Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yulong Computer Telecommunication Scientific Shenzhen Co Ltd, Dongguan Yulong Telecommunication Technology Co Ltd filed Critical Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Priority to CN201310035056.6A priority Critical patent/CN103152544B/en
Publication of CN103152544A publication Critical patent/CN103152544A/en
Application granted granted Critical
Publication of CN103152544B publication Critical patent/CN103152544B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention is applicable to communication technical field, provides method and the system thereof of the optimization of a kind of video speech quality, comprises the steps: the generation detecting key frame loss situation in video calling; According to the network condition of current described video calling, control the transmission of described key frame.Whereby, present invention optimizes the quality of video calling.

Description

The method that video speech quality is optimized and system thereof
Technical field
The present invention relates to communication technical field, particularly relate to method and the system thereof of the optimization of a kind of video speech quality.
Background technology
Based on the visual telephone that SIP (SessionInitiationProtocol) is the signaling control protocol of an application layer, video and audio frequency adopt RTP(Real-timeTransportProtocol, RTP) agreement passes through UDP(UserDatagramProtocol, User Datagram Protocol) mode transmits audio, video data, by sister's agreement that RTCP (Real-timeTransportControlProtocol, RTCP Real-time Transport Control Protocol) is realtime transmission protocol RTP.Realize synchronous with between feedback, media of the supervision of service quality, and the mark of member in multicast group.Because UDP belongs to non-reliable connection, when network condition is poor may have packet loss situation occur.If video is surrounded by loss, the phenomenon of the poor image quality such as mosaic will be produced, if what lose is that key frame also can cause continue for some time of poor quality, until next key frame receives.In the prior art, although detecting that UDP has packet loss to adopt remedial measures, the impact that video calling network condition sends key frame is not but being considered.Therefore, cause video speech quality to be affected.
In summary, in actual use, obviously there is inconvenience and defect, so be necessary to be improved in existing video call technology.
Summary of the invention
For above-mentioned defect, a kind of method that the object of the present invention is to provide video speech quality to optimize and system thereof, to optimize the quality of video calling.
To achieve these goals, the invention provides a kind of method that video speech quality is optimized, comprise the steps:
Detect the generation of key frame loss situation in video calling;
According to the network condition of current described video calling, control the transmission of described key frame.
According to described method, comprise before the step of the generation of key frame loss situation in described detection video calling:
Preset the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception;
Be preset in the threshold value of the network transmission speed sending described key frame in described video calling;
In described detection video calling, the step of the generation of key frame loss situation comprises:
According to the threshold value of described reception number-of-packet object threshold value and/or time of reception, video calling both sides judge whether described packet has loss;
If judge, described data are surrounded by loss, then the generation of key frame loss situation in described video calling detected.
According to described method, the described network condition according to current described video calling, the step controlling the transmission of described key frame comprises:
Add up the network transmission speed in current described video calling;
When described network transmission speed reaches the threshold value of described network transmission speed, video encoder establishment key frame, and described key frame is sent to the receiving terminal of described video calling; Or
When described network transmission speed does not reach the threshold value of described network transmission speed, stop sending key frame, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame;
After described receiving terminal receives described first control frame, stop sending key frame.
According to described method, the described network condition according to current described video calling, the step controlling the transmission of described key frame also comprises:
When adding up the network transmission speed in current described video calling and again reaching the threshold value of described network transmission speed, video encoder establishment key frame, and described key frame is sent to the described receiving terminal of described video calling; Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;
After described receiving terminal receives described second control frame, recover to send key frame.
According to described method, according to the threshold value of described reception number-of-packet object threshold value and/or time of reception, described video calling both sides judge whether described packet has the step of loss to comprise:
The receiving terminal of video calling judges whether the number of the packet received reaches described reception number-of-packet object threshold value;
If do not reach described reception number-of-packet object threshold value, then judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling;
If reach described reception number-of-packet object threshold value, whether the time then judging to receive described packet reaches the threshold value of described time of reception, if do not reach, judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; And/or
The transmitting terminal of described video calling receives the report of described receiving terminal, judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost according to the described receiving terminal report that described receiving terminal report and last time receive.
In order to realize another goal of the invention of the present invention, present invention also offers the system that a kind of video speech quality is optimized, comprising:
Detection module, for detecting the generation of key frame loss situation in video calling;
Control module, for the network condition according to current described video calling, controls the transmission of described key frame.
According to described system, described system also comprises:
Presetting module, for the reception number-of-packet object threshold value of default video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; And in described video calling, send the threshold value of network transmission speed of described key frame;
Described detection module comprises:
Judge submodule, for judging whether described packet has loss according to the threshold value of described reception number-of-packet object threshold value and/or time of reception;
If judge, described data are surrounded by loss, then described detection module detects the generation of key frame loss situation in described video calling.
According to described system, described control module comprises:
Statistics submodule, for adding up the network transmission speed in current described video calling;
First controls submodule, for when described network transmission speed reaches the threshold value of described network transmission speed, controls video encoder establishment key frame, and described key frame is sent to the receiving terminal of described video calling; Or
Second controls submodule, for when described network transmission speed does not reach the threshold value of described network transmission speed, the transmitting terminal controlling described video calling stops sending key frame, and sends the first control frame to described receiving terminal, asks described receiving terminal to stop sending key frame;
After described receiving terminal receives described first control frame, stop sending key frame.
According to described system, described control module also comprises:
3rd controls submodule, for when adding up the network transmission speed in the current described video calling of submodule statistics and again reaching the threshold value of described network transmission speed, control video encoder establishment key frame, and described key frame is sent to the described receiving terminal of described video calling; And
Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;
After described receiving terminal receives described second control frame, recover to send key frame.
According to described system, described judgement submodule comprises:
First judging unit, is arranged at the receiving terminal of described video calling, for judging whether the number of the packet received reaches described reception number-of-packet object threshold value; If do not reach described reception number-of-packet object threshold value, then described first judging unit judges that described packet is lost;
Second judging unit, is arranged at the receiving terminal of described video calling, and whether the time for judging to receive described packet reaches the threshold value of described time of reception; If do not reach the threshold value of described time of reception, then described second judging unit judges that described packet is lost;
Transmitting element, during for judging that at described first judging unit and/or the second judging unit described packet occurs to lose, the transmitting terminal transmitting and receiving terminal to described video calling is reported; And/or
Receiving element, is arranged at the transmitting terminal of described video calling, for receiving the report of described receiving terminal;
For the described receiving terminal report received according to described receiving terminal report and last time, 3rd judging unit, judges whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost.
The present invention is by judging in described video calling, whether packet has loss; If judge, described data are surrounded by loss, then video encoder establishment key frame, and described key frame are sent to the receiving terminal of described video calling.Simultaneously, due in the process of video data transmission, the volume of transmitted data of key frame is very large, and when poor video quality, sending key frame frequently will increase the weight of the load of terminal transmission data, when the ability of eating dishes without rice or wine of mobile terminal do not return to can smooth and easy carrying visual telephone business time, video quality can worsen further, by detecting the network condition of current video call, controls the transmission of described key frame, thus optimize the quality of video calling, improve Consumer's Experience.
Accompanying drawing explanation
Fig. 1 is the system construction drawing of the video speech quality optimization that first embodiment of the invention provides;
Fig. 2 be the present invention second and third, the system construction drawing optimized of the video speech quality that provides of four embodiments;
Fig. 3 is the system construction drawing of the video speech quality optimization that one embodiment of the invention provides;
Fig. 4 be the present invention the 5th, six, the system construction drawing optimized of the video speech quality that provides of seven embodiments;
Fig. 5 is the method flow diagram of the video speech quality optimization that eighth embodiment of the invention provides;
Fig. 6 A is the method flow diagram of the video speech quality optimization that one embodiment of the invention provides;
Fig. 6 B is the method flow diagram of the video speech quality optimization that one embodiment of the invention provides.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearly understand, below in conjunction with drawings and Examples, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.
See Fig. 1, in the first embodiment of the present invention, provide the system 100 that a kind of video speech quality is optimized, comprising:
Detection module 10, for detecting the generation of key frame loss situation in video calling;
Control module 20, for the network condition according to current described video calling, controls the transmission of described key frame.
In this embodiment, first by whether there being the transmission of crucial LOF situation in detection module 10 video calling, losing due to key frame and video speech quality will be caused to be deteriorated, therefore first detecting above-mentioned situation by this detection module 10 and whether occur.Then, if the generation of key frame loss situation, then by the network condition of control module 20 according to current described video calling, control the transmission of described key frame.Such setting is due in the process of whole video data transmission, and the volume of transmitted data of key frame is very large, and when poor video quality, and when network condition is poor, sending key frame frequently will increase the weight of the load of terminal transmission data.As do not return in the ability of eating dishes without rice or wine of mobile terminal can smooth and easy carrying visual telephone business time, continue to send key frame and video quality will be made to worsen further, and can not to increase.Therefore carry out according to concrete network condition the transmission that a step controls key frame, will the raising of video speech quality be conducive to.Described key frame is I frame.
See Fig. 2, in the second embodiment of the present invention, described system 100 also comprises:
Presetting module 30, for the reception number-of-packet object threshold value of default video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; And in described video calling, send the threshold value of network transmission speed of described key frame;
Detection module 10 comprises:
Judge submodule 11, for judging whether described packet has loss according to the threshold value of described reception number-of-packet object threshold value and/or time of reception;
If judge, described data are surrounded by loss, then detection module 10 detects the generation of key frame loss situation in described video calling.
Based in the visual telephone of SIP, audio frequency and video send data and control information respectively by pair of end oral instructions, and RTP grouping only comprises RTP data, and controls to be provided by another matching used rtcp protocol.RTP selects a untapped even number UDP port number between 1025 to 65535, is then using next odd number UDP port number with the RTCP in a session.RTCP controls bag and has five types, wherein for providing QoS(QualityofService, service quality) feed back have two kinds of SR (SenderReport, sender report) and RR (ReceiverReport, receiving terminal report).The former describes the transmission of transmitting terminal and receives statistics; The latter describes the reception statistics of receiving terminal.These statisticss comprise send bag number, send byte number, accumulative number of dropped packets, received telegraph literary composition maximum sequence number, the time of advent space jitter etc.In this embodiment, reception number-of-packet threshold value and/or the time of reception threshold value of video reception RTCP port is pre-set by presetting module 30; And in described video calling, send the threshold value of network transmission speed of described key frame.The threshold value of the network transmission speed of described key frame can send Audio and Video smoothly with video calling and be as the criterion, as the transmission rate of WIFI cordless communication network can meet the demand of video calling, can arrange this threshold value is 2M/S.In addition, particularly 3G network, because its network speed is than very fast, will be conducive to normally carrying out of video calling.Judge that submodule 11 judges to receive packet and do not reach threshold value or time of reception when reaching threshold value, then detection module 10 detects the generation of key frame loss situation in described video calling.
See Fig. 2, in the third embodiment of the present invention, control module 20 comprises:
Statistics submodule 21, for adding up the network transmission speed in current described video calling;
First controls submodule 22, for when described network transmission speed reaches the threshold value of described network transmission speed, controls video encoder 26 and works out key frame, and described key frame is sent to the receiving terminal of described video calling; Or
Second controls submodule 23, for when described network transmission speed does not reach the threshold value of described network transmission speed, the transmitting terminal controlling described video calling stops sending key frame, and sends the first control frame to described receiving terminal, asks described receiving terminal to stop sending key frame;
After described receiving terminal receives described first control frame, stop sending key frame.
In this embodiment, statistics submodule 21 adds up the network transmission speed in described video calling, and the threshold value of the network transmission speed that statistics submodule 21 is added up by the first control submodule 22 and the first control submodule 22 respectively and described default network transmission speed contrasts.And when described network transmission speed reaches the threshold value of described network transmission speed, first controls submodule 22 and controls video encoder 26 and work out key frame, and described key frame is sent to the receiving terminal of described video calling.Thus, in video call process, after there is loss of data, carry out the establishment of key frame timely, and be sent to receiving terminal, video speech quality is improved.And when described network transmission speed does not reach the threshold value of described network transmission speed, second controls the transmitting terminal stopping transmission key frame that submodule 23 controls described video calling in time, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame; Reduce offered load thus, the network data transmission of video calling can be recovered as soon as possible.
See Fig. 2 in the fourth embodiment of the present invention, control module 20 also comprises:
3rd controls submodule 24, for when adding up submodule 21 network transmission speed added up in current described video calling and again reaching the threshold value of described network transmission speed, control video encoder 26 and work out key frame, and described key frame is sent to the described receiving terminal of described video calling; And
Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;
After described receiving terminal receives described second control frame, recover to send key frame.
In this embodiment, when adding up submodule 21 network transmission speed added up in current described video calling and again reaching the threshold value of described network transmission speed, 3rd controls submodule 24 works out control video encoder 26 the described receiving terminal that key frame is sent to described video calling, and video speech quality is improved.Meanwhile, send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame; Make video calling smooth.
See Fig. 3, in one embodiment of the invention, provide the structured flowchart of the system 100 that video speech quality is optimized, include videophone application 1; Video control unit 2, coding and decoding video unit 3, Audio control unit 4, audio coding decoding unit 5, communication module 6.Wherein coding and decoding video unit 3 and audio coding decoding unit 5 are responsible for the Audio and Video encoding and decoding in video calling respectively, and video control unit 2 and Audio control unit 4 control the transmission and the encoding and decoding that comprise video data and voice data respectively.And communication module 6 by statistics submodule 21 physics eat dishes without rice or wine layer statistics current data send Rate Feedback to video control unit 2.These data are updated in video calling video control unit 2 at a certain time interval.Video control unit 2 can determine whether to send key frame according to current network conditions and P frame quantity forwarded.If network quality is poor simultaneously, control unit 2 transmission control frames frequently, require that the transmission of key frame is suspended in opposite end, to reduce the offered load of terminal, can recover from congestion status as early as possible.
See Fig. 4, in the fifth embodiment of the present invention, judge that submodule 11 comprises:
First judging unit 111, is arranged at the receiving terminal of described video calling, for judging whether the number of the packet received reaches described reception number-of-packet object threshold value; If do not reach described reception number-of-packet object threshold value, then described first judging unit 111 judges that described packet is lost;
Second judging unit 112, is arranged at the receiving terminal of described video calling, and whether the time for judging to receive described packet reaches the threshold value of described time of reception; If do not reach the threshold value of described time of reception, then described second judging unit 112 judges that described packet is lost;
Transmitting element 113, during for judging that at described first judging unit 111 and/or the second judging unit 112 described packet occurs to lose, the transmitting terminal transmitting and receiving terminal to described video calling is reported.
In this embodiment, the judgement of data-bag lost unilaterally can be carried out by the receiving terminal of video calling.Concrete, by the first judging unit 111, second judging unit 112, receiving terminal judges whether the number of reception packet reaches respectively and does not arrive described reception number-of-packet object threshold value, and whether the time receiving packet arrives the threshold value of described time of reception.Whether the number receiving packet reaches and does not arrive described reception number-of-packet object threshold value, then illustrate there is data-bag lost; Described reception number-of-packet object threshold value is arranged according to the key frame interval number of described video encoder 26; Such as, have sent 30 P frames by transmission key frame at video encoder 26, and when receiving 30 P frames and not receiving a key frame value, then judge that data are surrounded by loss.And the interval time that the threshold value of described time of reception works out key frame according to described video encoder 26 is arranged.Such as, the time receiving 10 key frames is 10 seconds, and the time receiving 10 key frames needs 15 seconds, then illustrate that data are surrounded by loss.
See Fig. 4, in the sixth embodiment of the present invention, described judgement submodule 11 also comprises:
Receiving element 114, is arranged at the transmitting terminal of described video calling, for receiving the report of described receiving terminal;
For the described receiving terminal report received according to described receiving terminal report and last time, 3rd judging unit 115, judges whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost.
In this embodiment, by the both sides of video calling, namely receiving terminal and transmitting terminal judge whether packet sends loss jointly.Concrete, receiving element 114 to be reported according to described receiving terminal by the 3rd judging unit 115 after receiving the report of described receiving terminal and on the described receiving terminal report that once receives judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if both comparison values have exceeded described reception number-of-packet object threshold value, judge that data are surrounded by send and lost.
See Fig. 4, in the seventh embodiment of the present invention, described control module 20 comprises:
Notice submodule 25, is arranged at described transmitting terminal, is used in when judging that submodule 11 judges described data-bag lost, notifies that described video encoder 26 works out key frame;
Video encoder 26, after working out key frame, is sent to the receiving terminal of described video calling by described key frame.
In this embodiment, send RR and report to transmit end receive end, after transmitting terminal receives this information, after judging that number of dropped packets has increase, then notify that submodule 25 notifies that video encoder 26 works out key frame; And the receiving terminal of described video calling will be sent to after establishment key frame.Thus, make video speech quality obtain to improve.
In above-mentioned multiple embodiment, system 100 system of video speech quality optimization can be the software unit being built in communication terminal, hardware cell or software and hardware combining unit.Communication terminal can be mobile phone, PDA(PersonalDigitalAssistant, personal digital assistant), panel computer etc.
See Fig. 5, in the eighth embodiment of the present invention, provide a kind of method that video speech quality is optimized, comprise the steps:
In step S501, detect the generation of key frame loss situation in video calling; This step is realized by detection module 10;
In step S502, according to the network condition of current described video calling, control the transmission of described key frame; This step is realized by control module 20.
In this embodiment, first detected by key frame loss situation in detection module 10 pairs of video callings, if find, key frame has loss, then by the network condition of control module 20 according to current described video calling, control the transmission of described key frame, with according to current network condition, optimize the quality of video calling.
In the ninth embodiment of the present invention, comprise before described step S501:
Preset the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; This step is realized by presetting module 30.
Be preset in the threshold value of the network transmission speed sending described key frame in described video calling; This step is realized by presetting module 30.
Described step S501 comprises:
According to the threshold value of described reception number-of-packet object threshold value and/or time of reception, video calling both sides judge whether described packet has loss; This step is by judging that submodule 11 realizes.
If judge, described data are surrounded by loss, then the generation of key frame loss situation in described video calling detected.
In this embodiment, user can pre-set the relevant parameter judging whether key frame is lost, and can carry out the network transmission speed reference value of video calling, can control video calling high-quality carry out according to being worth.
In this embodiment, first presetting the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception, being judged whether sending data-bag lost in video calling as with reference to standard by these two threshold values.Judging that described data are surrounded by loss, then video encoder 26 works out key frame, and described key frame is sent to the receiving terminal of described video calling, thus, optimizes the quality of video calling.
In the tenth embodiment of the present invention, described step S502 comprises:
Add up the network transmission speed in current described video calling; This step realizes by adding up submodule 21;
When described network transmission speed reaches the threshold value of described network transmission speed, video encoder 26 works out key frame, and described key frame is sent to the receiving terminal of described video calling; This step controls submodule 22 by first and realizes; Or
When described network transmission speed does not reach the threshold value of described network transmission speed, stop sending key frame, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame; This step controls submodule 23 by second and realizes.
After described receiving terminal receives described first control frame, stop sending key frame.
In the 11st embodiment of the present invention, described step S502 also comprises:
When adding up the network transmission speed in current described video calling and again reaching the threshold value of described network transmission speed, video encoder 26 works out key frame, and described key frame is sent to the described receiving terminal of described video calling; Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame; This step controls submodule 24 by the 3rd and realizes.
After described receiving terminal receives described second control frame, recover to send key frame.
In the above two embodiments, when by concrete network rate to loss key frame, control the transmission of key frame, optimize the quality of video calling.
In the 12nd embodiment of the present invention, described step S502 comprises:
The receiving terminal of video calling judges whether the number of the packet received reaches described reception number-of-packet object threshold value;
If do not reach described reception number-of-packet object threshold value, then judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; This step is realized by the first judging unit 111 and transmitting element 113.
If reach described reception number-of-packet object threshold value, whether the time then judging to receive described packet further reaches the threshold value of described time of reception, if do not reach, judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; This step is realized by the second judging unit 112 and transmitting element 113.
In this embodiment, at the receiving terminal of video calling, concrete judges whether packet has loss, if having, notice video calling transmitting terminal, video calling transmitting terminal controls to be sent to receiving terminal after video encoder 26 works out key frame, optimizes video speech quality.
In the 13rd embodiment of the present invention, described and also comprise after the step that the transmitting terminal transmitting and receiving terminal of described video calling is reported:
The transmitting terminal of described video calling receives the report of described receiving terminal, judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost according to the described receiving terminal report that described receiving terminal report and last time receive.This step is realized by receiving element 114 and the 3rd judging unit 115.
In this embodiment, after video calling receiving terminal judges that data are surrounded by the generation of loss situation, judge whether packet has to lose further by the transmitting terminal of video calling and send.The concrete report returned by contrast twice receiving terminal is judged, if the difference of packet exceedes and receives number-of-packet object threshold value in twice report, judges described data-bag lost.After then working out key frame by described video encoder 26, described key frame is sent to the receiving terminal of described video calling.Described step S303 comprises: when described transmitting terminal judges described data-bag lost, and described transmitting terminal notifies that described video encoder 26 works out key frame; This step is by notifying that submodule 25 realizes; After described video encoder 26 works out key frame, described key frame is sent to the receiving terminal of described video calling.
See Fig. 6 A and Fig. 6 B, provide the method that video calling is optimized in one embodiment of the invention, in this embodiment, according to the SR that video channel RTCP receives, judge whether packet loss, if there is packet loss, notify that video encoder 26 is forced compile key frame and send to opposite end, to improve video quality.Wherein Fig. 6 A is the workflow of video calling receiving terminal, is described below:
In step S601, visual telephone starts;
In step S602, setting data bag number thresholding and time threshold;
In step S603, judging whether to reach number-of-packet threshold value, is perform step S604, otherwise performs step S605;
In step S604, judge whether to reach the time gate limit value receiving packet, be perform step S606, otherwise return step S603;
In step S605, send RR.
In step S606, judge whether to receive BYE, namely terminate video calling instruction, be, terminate video calling, otherwise return step S603.
Fig. 6 B is the workflow of video calling transmitting terminal, is described below:
In step S701, visual telephone starts;
In step S702, prepare to receive SR;
In step S703, judge whether to receive SR, be, perform step S704, otherwise return step S702;
In step S704, judge to compare with the value that last time receives, number of dropped packets and lose byte number and be greater than described reception packet threshold value, be perform step S705, otherwise return step S702;
In step S705, notice video encoder 26 is worked out key frame and sends;
In step S706, whether receive BYE, namely terminate video calling instruction, be, terminate video calling, otherwise return step S702.
Preferably, in above-mentioned multiple embodiment, described key frame is I frame.
In sum, the present invention is by detecting the generation of key frame loss situation in video calling; According to the network condition of current described video calling, control the transmission of described key frame.Avoid under generation key frame loss situation, and when the speed ratio of transmitted data on network is lower, video calling both sides but still send key frame, cause the congested of video calling network transmission channels thus cause video speech quality to become worse situation to occur.Thus, optimize the quality of video calling, improve Consumer's Experience.
Certainly; the present invention also can have other various embodiments; when not deviating from the present invention's spirit and essence thereof; those of ordinary skill in the art are when making various corresponding change and distortion according to the present invention, but these change accordingly and are out of shape the protection range that all should belong to the claim appended by the present invention.

Claims (6)

1. a method for video speech quality optimization, is characterized in that, comprise the steps:
Detect the generation of key frame loss situation in video calling;
According to the network condition of current described video calling, control the transmission of described key frame;
Comprise before the step of the generation of key frame loss situation in described detection video calling:
Preset the reception number-of-packet object threshold value of video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception;
Be preset in the threshold value of the network transmission speed sending described key frame in described video calling;
In described detection video calling, the step of the generation of key frame loss situation comprises:
According to the threshold value of described reception number-of-packet object threshold value and/or time of reception, video calling both sides judge whether described packet has loss;
If judge, described data are surrounded by loss, then the generation of key frame loss situation in described video calling detected;
The described network condition according to current described video calling, the step controlling the transmission of described key frame comprises:
Add up the network transmission speed in current described video calling;
When described network transmission speed reaches the threshold value of described network transmission speed, video encoder establishment key frame, and described key frame is sent to the receiving terminal of described video calling; Or
When described network transmission speed does not reach the threshold value of described network transmission speed, stop sending key frame, and send the first control frame to described receiving terminal, ask described receiving terminal to stop sending key frame;
After described receiving terminal receives described first control frame, stop sending key frame.
2. method according to claim 1, is characterized in that, the described network condition according to current described video calling, and the step controlling the transmission of described key frame also comprises:
When adding up the network transmission speed in current described video calling and again reaching the threshold value of described network transmission speed, video encoder establishment key frame, and described key frame is sent to the described receiving terminal of described video calling; Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;
After described receiving terminal receives described second control frame, recover to send key frame.
3. method according to claim 1, is characterized in that, according to the threshold value of described reception number-of-packet object threshold value and/or time of reception, described video calling both sides judge whether described packet has the step of loss to comprise:
The receiving terminal of video calling judges whether the number of the packet received reaches described reception number-of-packet object threshold value;
If do not reach described reception number-of-packet object threshold value, then judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling;
If reach described reception number-of-packet object threshold value, whether the time then judging to receive described packet reaches the threshold value of described time of reception, if do not reach, judge that described packet is lost, and report to the transmitting terminal transmitting and receiving terminal of described video calling; And/or
The transmitting terminal of described video calling receives the report of described receiving terminal, judge whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost according to the described receiving terminal report that described receiving terminal report and last time receive.
4. a system for video speech quality optimization, is characterized in that, comprising:
Detection module, for detecting the generation of key frame loss situation in video calling;
Control module, for the network condition according to current described video calling, controls the transmission of described key frame;
Described system also comprises:
Presetting module, for the reception number-of-packet object threshold value of default video reception RTCP Real-time Transport Control Protocol port and/or the threshold value of time of reception; And in described video calling, send the threshold value of network transmission speed of described key frame;
Described detection module comprises:
Judge submodule, for judging whether described packet has loss according to the threshold value of described reception number-of-packet object threshold value and/or time of reception;
If judge, described data are surrounded by loss, then described detection module detects the generation of key frame loss situation in described video calling;
Described control module comprises:
Statistics submodule, for adding up the network transmission speed in current described video calling;
First controls submodule, for when described network transmission speed reaches the threshold value of described network transmission speed, controls video encoder establishment key frame, and described key frame is sent to the receiving terminal of described video calling; Or
Second controls submodule, for when described network transmission speed does not reach the threshold value of described network transmission speed, the transmitting terminal controlling described video calling stops sending key frame, and sends the first control frame to described receiving terminal, asks described receiving terminal to stop sending key frame;
After described receiving terminal receives described first control frame, stop sending key frame.
5. system according to claim 4, is characterized in that, described control module also comprises:
3rd controls submodule, for when adding up the network transmission speed in the current described video calling of submodule statistics and again reaching the threshold value of described network transmission speed, control video encoder establishment key frame, and described key frame is sent to the described receiving terminal of described video calling; And
Send the second control frame to described receiving terminal, ask described receiving terminal to recover to send key frame;
After described receiving terminal receives described second control frame, recover to send key frame.
6. system according to claim 4, is characterized in that, described judgement submodule comprises:
First judging unit, is arranged at the receiving terminal of described video calling, for judging whether the number of the packet received reaches described reception number-of-packet object threshold value; If do not reach described reception number-of-packet object threshold value, then described first judging unit judges that described packet is lost;
Second judging unit, is arranged at the receiving terminal of described video calling, and whether the time for judging to receive described packet reaches the threshold value of described time of reception; If do not reach the threshold value of described time of reception, then described second judging unit judges that described packet is lost;
Transmitting element, during for judging that at described first judging unit and/or the second judging unit described packet occurs to lose, the transmitting terminal transmitting and receiving terminal to described video calling is reported; And/or
Receiving element, is arranged at the transmitting terminal of described video calling, for receiving the report of described receiving terminal;
For the described receiving terminal report received according to described receiving terminal report and last time, 3rd judging unit, judges whether the loss of described packet exceedes described reception number-of-packet object threshold value, if then judge described data-bag lost.
CN201310035056.6A 2013-01-29 2013-01-29 The method that video speech quality is optimized and system thereof Active CN103152544B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310035056.6A CN103152544B (en) 2013-01-29 2013-01-29 The method that video speech quality is optimized and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310035056.6A CN103152544B (en) 2013-01-29 2013-01-29 The method that video speech quality is optimized and system thereof

Publications (2)

Publication Number Publication Date
CN103152544A CN103152544A (en) 2013-06-12
CN103152544B true CN103152544B (en) 2016-04-06

Family

ID=48550389

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310035056.6A Active CN103152544B (en) 2013-01-29 2013-01-29 The method that video speech quality is optimized and system thereof

Country Status (1)

Country Link
CN (1) CN103152544B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104427286B (en) * 2013-08-20 2019-01-01 中国移动通信集团公司 A kind of method and system carrying out video calling
CN112584081A (en) * 2020-12-01 2021-03-30 北京融讯科创技术有限公司 Video transmission method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101345962A (en) * 2008-08-15 2009-01-14 宇龙计算机通信科技(深圳)有限公司 Call mode switch control method, system and mobile terminal
CN101651815A (en) * 2009-09-01 2010-02-17 中兴通讯股份有限公司 Visual telephone and method for enhancing video quality by utilizing same
CN101931799A (en) * 2010-09-14 2010-12-29 中兴通讯股份有限公司 Method and device for smoothing video bit stream
CN102752670A (en) * 2012-06-13 2012-10-24 广东威创视讯科技股份有限公司 Method, device and system for reducing phenomena of mosaics in network video transmission

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101345962A (en) * 2008-08-15 2009-01-14 宇龙计算机通信科技(深圳)有限公司 Call mode switch control method, system and mobile terminal
CN101651815A (en) * 2009-09-01 2010-02-17 中兴通讯股份有限公司 Visual telephone and method for enhancing video quality by utilizing same
CN101931799A (en) * 2010-09-14 2010-12-29 中兴通讯股份有限公司 Method and device for smoothing video bit stream
CN102752670A (en) * 2012-06-13 2012-10-24 广东威创视讯科技股份有限公司 Method, device and system for reducing phenomena of mosaics in network video transmission

Also Published As

Publication number Publication date
CN103152544A (en) 2013-06-12

Similar Documents

Publication Publication Date Title
US11349900B2 (en) Voice encoding and sending method and apparatus
US7333439B2 (en) Sending/receiving method, and device for the same
EP3297195B1 (en) Adjustment method and device for service rate
JP4101993B2 (en) Wired and wireless mixed network data distribution apparatus and wired and wireless mixed network data distribution method
CN100574412C (en) Adopt the video transmission of the enhancing of dual network pattern
CN101227482B (en) System, apparatus and method of media negotiation in network telephone call
CN101483494B (en) Method and system for encoding and decoding algorithm dynamic negotiation of SIP terminal
US9143450B2 (en) Communication system and method for assisting with the transmission of TCP packets
CN100574467C (en) A kind of band width control method and terminal equipment
CN101778426A (en) Method and equipment for video data stream transmission in mobile wireless network
CN103118238B (en) The control method of video conference and video conferencing system
US9674737B2 (en) Selective rate-adaptation in video telephony
CN106973066A (en) H264 encoded videos data transmission method and system in a kind of real-time communication
CN105812711A (en) Method and system for optimizing image quality during video call
CN103152544B (en) The method that video speech quality is optimized and system thereof
CN107770473B (en) Audio and video data transmission control method and device
CN107645647A (en) A kind of multichannel audio-video frequency transmission method and device
KR20090078454A (en) Voip terminal and method for automatically altering codec according to quality of voice
CN101552771B (en) Media gateway, media gateway controller, multimedia telephone intercommunication method and system
WO2023155747A1 (en) Data coding bitrate adaptive adjustment method, apparatus and device, and storage medium
CN103347252A (en) Multimedia trunking communication system, communication terminal and switching center
CN102984785A (en) Transmitting data over multiple networks
CN1885879B (en) Method for preventing VOIP system bandwidth overload
CN107404363B (en) Method, system, terminal and network side equipment for adjusting voice code rate
CN101159746B (en) Self-adaptive method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant