WO2012058913A1 - Method and device for implementing a videophone - Google Patents

Method and device for implementing a videophone

Info

Publication number
WO2012058913A1
Authority
WO
WIPO (PCT)
Prior art keywords
frame
background noise
flag
module
data frame
Prior art date
Application number
PCT/CN2011/073686
Other languages
English (en)
Chinese (zh)
Inventor
张林会
Original Assignee
中兴通讯股份有限公司
Priority date
2010-11-03
Filing date
2011-05-05
Publication date
2012-05-10
Application filed by 中兴通讯股份有限公司
Publication of WO2012058913A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/14: Systems for two-way working
    • H04N7/141: Systems for two-way working between two video terminals, e.g. videophone

Definitions

  • the present invention relates to the field of mobile communication technologies, and in particular to a method and apparatus for implementing a videophone.
  • Background
  • 3G mobile communication technology
  • 2G mobile communication technology
  • the real application of 3G is the real-time services it can provide within its bandwidth, such as videophone, multimedia real-time gaming, and remote wireless monitoring.
  • videophone is one of the most widely used 3G applications.
  • the video coding of a videophone can use one of H.261, H.263, H.264, or MPEG-4; the audio coding can use Adaptive Multi-Rate (AMR), Advanced Audio Coding (AAC), Qualcomm Code Excited Linear Predictive (QCELP), Enhanced Variable Rate Codec (EVRC), etc. Because network conditions are limited, a videophone usually has to guarantee the quality of the compressed image and voice while occupying as little channel bandwidth as possible.
  • the main problem of the videophone is that the image is not clear and smooth. This mainly shows up after the mobile terminal moves or an object in the scene moves, when the videophone picture exhibits varying degrees of mosaic; in scenes with high-speed movement, the mosaic is more severe.
  • the current solutions are:
  • a method for implementing a videophone, comprising: determining, by a transmitting end, whether a data frame to be transmitted is a background noise frame; when it is a background noise frame, further determining whether a predetermined flag frame has currently been generated; and if so, sending the flag frame to the opposite end.
  • the method further includes: if the flag frame is not currently generated, sending the background noise frame to the opposite end, and generating a flag frame.
  • the method further includes: the receiving end determines whether the received data frame is a flag frame, and if it is a flag frame, simulates the background noise frame of the transmitting end according to the received background noise frame.
  • the method further includes: determining, when the received data frame is an image data frame, whether a matching condition is met, and if yes, using the simulated background noise frame as the background noise frame corresponding to the image data frame.
  • the determining whether the matching condition is satisfied is: when the received data frame is an image data frame, setting the value of a preset variable to true; then determining whether the value of the variable is true, and if so, the matching condition is satisfied;
  • the method further comprises: setting the value of the variable to false.
  • the flag frame includes: a frame type bit, a frame quality indicator bit, a mode indication bit, a mode request bit, a data check bit, and a flag frame content part; wherein the frame type bit is 4 bits long, and its value is one of 12, 13 or 14; the frame quality indicator bit is 1 bit long, and its value is 1; the mode indication bit and the mode request bit are both 3 bits long, and their value is 12; the data check bit is 8 bits long; the content part of the flag frame is defined as "Noise_flag".
  • An apparatus for implementing a videophone comprising: a frame type determining module, a flag frame control module, wherein
  • the frame type determining module is configured to determine whether the data frame to be sent is a background noise frame, and when it is a background noise frame, send trigger information to the flag frame control module;
  • the flag frame control module is configured to: after receiving the trigger information sent by the frame type determining module, determine whether a preset flag frame has been generated currently, and if yes, send the flag frame to the peer end.
  • the device further includes a background noise control module, wherein the flag frame control module is further configured to: when the flag frame is not currently generated, send the trigger information to the background noise control module, and simultaneously generate the flag frame; the background noise control module is configured to send the background noise frame to the peer end after receiving the trigger information of the flag frame control module.
  • the device further includes a background noise simulation module; wherein the frame type determining module is further configured to determine whether the received data frame is a flag frame, and if yes, send trigger information to the background noise simulation module; the background noise simulation module is configured to: after receiving the trigger information of the frame type determining module, simulate the background noise frame of the transmitting end according to the received background noise frame.
  • the device further includes a background noise and image matching module; wherein the frame type determining module is further configured to generate and send image data frame trigger information according to the notification generated when the image data frame is received; the background noise and image matching module is configured to: when receiving the image data frame trigger information sent by the frame type determining module, determine whether the matching condition is currently met, and if yes, use the background noise frame obtained by the background noise simulation module as the background noise frame corresponding to the image data frame.
  • the background noise and image matching module is specifically configured to: when the image data frame trigger information is received, set the value of a preset variable to true; then determine whether the value of the variable is true, and if yes, the matching condition is satisfied; and after the simulated background noise frame is used as the background noise frame corresponding to the image data frame, set the value of the variable to false.
  • with the method and device for implementing a videophone provided by the present invention, depending on whether the transmitting end is in the silent mode during the videophone service, the resources that the transmitting end would use to send background noise frames in the silent mode are dynamically used to transmit image data and the flag frame, and the receiving end simulates the background noise frame of the transmitting end. This expands the image data transmission bandwidth in the videophone service and effectively solves the mosaic phenomenon of the image in the videophone service, while being simple to implement and widely applicable.
  • Brief description of the drawings
  • FIG. 1 is a schematic flow chart of a method for implementing a videophone according to the present invention
  • FIG. 2 is a schematic structural diagram of a flag frame in a method for implementing a videophone according to the present invention
  • FIG. 3 is a schematic flowchart of a receiving embodiment of a method for implementing a videophone according to the present invention
  • FIG. 4 is a schematic diagram of the structure of the device for implementing a videophone according to the present invention
  • FIG. 5 is a schematic diagram showing the structure of the device shown in FIG. 4 built into the mobile terminal for the videophone service.
  • Detailed description
  • the transmitting end determines whether the data frame to be transmitted is a background noise frame, and if it is a background noise frame, further determines whether a predetermined flag frame has currently been generated; when the flag frame has been generated, the flag frame is sent to the opposite end.
  • FIG. 1 shows a flow of a method of implementing a videophone of the present invention. As shown in FIG. 1, the method includes the following steps:
  • Step 101: the sender determines whether the audio data frame to be sent by the audio processing module is a background noise frame; if not, step 102 is performed, and if so, step 103 is performed;
  • the audio data frame type may be determined from the frame type bit in the audio data frame: when the frame type bit is 8, the audio data frame is a background noise frame, and the videophone at the transmitting end is in the silent mode.
  • Step 102: when the audio data frame to be sent by the audio processing module is a normal voice data frame, it is sent according to the normal voice data frame sending process, for example, transmitted directly through a wireless channel;
  • Step 103: further determine whether a predetermined flag frame has been generated at this time; if not, perform step 104, and if yes, perform step 105. In this step, the structure of the flag frame may refer to FIG. 2; the structure of the flag frame is:
  • Frame Type: occupies 4 bits. According to the 3GPP regulations, audio data frame types 12 to 14 are reserved; therefore, the frame type bit of the flag frame can be defined as one of 12, 13, and 14. In the embodiment of the present invention, the frame type bit of the flag frame is defined as 12.
  • Frame Quality Indicator: occupies 1 bit. It is defined as 1 in the embodiment of the present invention, which denotes a good frame according to the provisions of 3GPP.
  • Mode Indication: occupies 3 bits. To distinguish it from the existing definitions, the mode indication bit is defined as 12.
  • Mode Request: occupies 3 bits. To distinguish it from the existing definitions, the mode request bit is defined as 12.
  • Data check bit: occupies 8 bits.
  • Flag frame content: the content of the frame is defined as "Noise_flag" (noise flag), carried as the corresponding binary.
  • Step 104: send the current background noise frame to the opposite end, and generate a flag frame; the current processing flow ends.
  • Step 105: send the generated flag frame to the opposite end; the current processing flow ends.
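The sending flow of steps 101 to 105 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the implementation of the invention: the names `Frame` and `Sender` are hypothetical, a frame is reduced to a frame type plus a payload, and the sketch assumes that a generated flag frame stays available for the rest of the call, since the text does not say when it is discarded. Only the frame type values 8 and 12 and the content "Noise_flag" are taken from the text.

```python
from dataclasses import dataclass
from typing import Optional

# Values taken from the text: frame type 8 marks a background noise frame,
# 12 is the reserved frame type chosen for the flag frame.
BACKGROUND_NOISE_TYPE = 8
FLAG_FRAME_TYPE = 12


@dataclass(frozen=True)
class Frame:
    """Hypothetical, simplified frame: a type value plus an opaque payload."""
    frame_type: int
    payload: bytes = b""


class Sender:
    """Hypothetical sender-side helper covering steps 101 to 105."""

    def __init__(self) -> None:
        self.flag_frame: Optional[Frame] = None  # no flag frame generated yet

    def frame_to_send(self, frame: Frame) -> Frame:
        # Step 101: is the audio frame a background noise frame?
        if frame.frame_type != BACKGROUND_NOISE_TYPE:
            # Step 102: normal voice frames are sent unchanged.
            return frame
        # Step 103: has a flag frame already been generated?
        if self.flag_frame is None:
            # Step 104: send the noise frame itself and generate the flag frame.
            self.flag_frame = Frame(FLAG_FRAME_TYPE, b"Noise_flag")
            return frame
        # Step 105: send the flag frame in place of the noise frame.
        return self.flag_frame


sender = Sender()
voice = Frame(0, b"speech")                    # frame type 0 is an arbitrary non-noise value
noise = Frame(BACKGROUND_NOISE_TYPE, b"sid")   # payload is a placeholder
print([sender.frame_to_send(f).frame_type for f in (voice, noise, noise, noise)])
# prints [0, 8, 12, 12]
```

In this sketch only the first background noise frame of a silent period is transmitted in full; every later one is replaced by the flag frame, which is where the resources for image data would be freed.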
  • FIG. 3 is a flowchart of a receiving embodiment of a method for implementing a videophone according to the present invention. As shown in FIG. 3, the receiving embodiment includes:
  • Step 201: the receiving end determines whether the received data frame is a flag frame; if not, step 202 is performed, and if so, step 203 is performed;
  • in this step, it is determined whether the frame type bit of the received data frame is 12, the value predefined for the flag frame; if so, the received data frame is a flag frame, and if not, the received data frame is not a flag frame.
  • Step 202: the received data frame is processed according to the normal data frame processing flow, and the current processing flow ends.
  • Step 203: simulate the background noise frame of the transmitting end according to the previously received background noise frame;
  • Step 204: determine whether an image data frame is received; if yes, generate a notification and perform step 205; otherwise, perform step 202;
  • Steps 205 to 207: determine whether the matching condition is met, and if yes, use the simulated background noise frame as the background noise frame corresponding to the received image data frame;
  • in step 205, when the image data frame is received, the value of the variable flag is set to true, where flag is a preset variable; in step 206, it is determined whether the value of the variable flag is true; if so, the matching condition is satisfied and step 207 is performed, otherwise the current processing flow ends. Step 207 uses the background noise frame simulated in step 203 as the background noise frame corresponding to the image data frame received in step 204.
  • Step 208: set the value of the variable flag to false, and return to step 201.
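The receiving flow of steps 201 to 208 can be sketched in the same way. This is a sketch under assumptions rather than the method itself: `Frame` and `Receiver` are hypothetical names, "simulating" the background noise frame is reduced to replaying the payload of the last received background noise frame (the text does not specify the simulation), and the arrival of an image data frame is modelled as a separate callback that sets the preset variable `flag`.

```python
from dataclasses import dataclass
from typing import Optional

BACKGROUND_NOISE_TYPE = 8   # from the text
FLAG_FRAME_TYPE = 12        # from the text


@dataclass(frozen=True)
class Frame:
    """Hypothetical, simplified frame: a type value plus an opaque payload."""
    frame_type: int
    payload: bytes = b""


class Receiver:
    """Hypothetical receiver-side helper covering steps 201 to 208."""

    def __init__(self) -> None:
        self.last_noise: Optional[Frame] = None  # last real background noise frame
        self.flag = False                        # the preset variable "flag"

    def on_image_frame(self) -> None:
        # Steps 204 and 205: an image data frame arrived, so set flag to true.
        self.flag = True

    def on_audio_frame(self, frame: Frame) -> Optional[Frame]:
        """Return the frame to hand to audio processing, or None if there is none."""
        # Step 201: is the received frame a flag frame (frame type 12)?
        if frame.frame_type != FLAG_FRAME_TYPE:
            # Step 202: normal processing; remember noise frames for later simulation.
            if frame.frame_type == BACKGROUND_NOISE_TYPE:
                self.last_noise = frame
            return frame
        # Step 203: simulate the sender's noise frame (assumption: replay the last one).
        if self.last_noise is None:
            return None
        simulated = Frame(BACKGROUND_NOISE_TYPE, self.last_noise.payload)
        # Step 206: the matching condition holds only if flag is true.
        if not self.flag:
            return None
        # Step 207: the simulated frame accompanies the image data frame.
        # Step 208: reset flag before handling the next frame.
        self.flag = False
        return simulated


receiver = Receiver()
receiver.on_audio_frame(Frame(BACKGROUND_NOISE_TYPE, b"sid"))
receiver.on_image_frame()
print(receiver.on_audio_frame(Frame(FLAG_FRAME_TYPE, b"Noise_flag")))
```

Resetting `flag` in step 208 means each simulated background noise frame is paired with at most one image data frame notification.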
  • the device includes: a frame type judging module 10 and a flag frame control module 20; wherein the frame type judging module 10 is configured to determine whether the data frame to be transmitted is a background noise frame, and if it is a background noise frame, send trigger information to the flag frame control module 20; the flag frame control module 20 is configured to: after receiving the trigger information sent by the frame type judging module 10, determine whether a preset flag frame has been generated, and if yes, send the flag frame to the peer end.
  • the device further includes a background noise control module 30, wherein the flag frame control module 20 is further configured to: when the flag frame is not currently generated, send the trigger information to the background noise control module 30, and simultaneously generate the flag frame; the background noise control module 30 is configured to send the background noise frame to the opposite end after receiving the trigger information of the flag frame control module 20.
  • the device further includes a background noise simulation module 40; the frame type judging module 10 is further configured to determine whether the received data frame is a flag frame, and if yes, send trigger information to the background noise simulation module 40; the background noise simulation module 40 is configured to: after receiving the trigger information of the frame type judging module 10, simulate the background noise frame of the transmitting end according to the received background noise frame.
  • the device further includes a background noise and image matching module 50; wherein the frame type judging module 10 is further configured to generate and send image data frame trigger information according to the notification generated when the image data frame is received; the background noise and image matching module 50 is configured to: when the image data frame trigger information sent by the frame type judging module 10 is received, determine whether the matching condition is currently met, and if yes, use the background noise frame obtained by the background noise simulation module 40 as the background noise frame corresponding to the image data frame.
  • the background noise and image matching module 50 is specifically configured to: when the image data frame trigger information is received, set the value of the preset variable to true, and then determine whether the value of the preset variable is true, and if so, the matching condition is satisfied; and after the simulated background noise frame is used as the background noise frame corresponding to the image data frame, set the value of the variable to false.
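The way the sending-side modules 10, 20 and 30 pass trigger information to one another can be illustrated with a small Python sketch. The class names, the `on_trigger` and `on_outgoing` methods, and the `send` callback standing in for transmission to the peer end are all hypothetical, and frames are reduced to their frame type values; only the module roles and the values 8 and 12 come from the text.

```python
from typing import Callable, List, Optional

BACKGROUND_NOISE_TYPE = 8   # from the text
FLAG_FRAME_TYPE = 12        # from the text

Send = Callable[[int], None]  # hypothetical hook: "send this frame type to the peer end"


class BackgroundNoiseControlModule:
    """Module 30: sends the background noise frame when triggered."""

    def __init__(self, send: Send) -> None:
        self._send = send

    def on_trigger(self, noise_frame_type: int) -> None:
        self._send(noise_frame_type)


class FlagFrameControlModule:
    """Module 20: sends the flag frame if it exists, otherwise generates it."""

    def __init__(self, send: Send, noise_control: BackgroundNoiseControlModule) -> None:
        self._send = send
        self._noise_control = noise_control
        self._flag_frame: Optional[int] = None

    def on_trigger(self, noise_frame_type: int) -> None:
        if self._flag_frame is not None:
            self._send(self._flag_frame)
        else:
            # Generate the flag frame and trigger module 30 to send the noise frame.
            self._flag_frame = FLAG_FRAME_TYPE
            self._noise_control.on_trigger(noise_frame_type)


class FrameTypeJudgingModule:
    """Module 10: triggers module 20 for background noise frames."""

    def __init__(self, flag_control: FlagFrameControlModule) -> None:
        self._flag_control = flag_control

    def on_outgoing(self, frame_type: int) -> bool:
        """Return True if the frame was handed to the flag frame control module."""
        if frame_type == BACKGROUND_NOISE_TYPE:
            self._flag_control.on_trigger(frame_type)
            return True
        return False


sent: List[int] = []
noise_control = BackgroundNoiseControlModule(sent.append)
flag_control = FlagFrameControlModule(sent.append, noise_control)
judge = FrameTypeJudgingModule(flag_control)
for frame_type in (8, 8, 8):
    judge.on_outgoing(frame_type)
print(sent)  # prints [8, 12, 12]
```

Keeping the "has the flag frame been generated" state inside module 20 mirrors the description, where module 10 only classifies frames and module 30 only transmits.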
  • FIG. 5 is a schematic diagram showing the structure of the device built into the mobile terminal for the videophone service.
  • the mobile terminal 1 is the transmitting end of the videophone service, and the mobile terminal 2 is the receiving end of the videophone service.
  • the audio processing module, the video encoding module, the real-time transmission protocol/real-time transmission control protocol function module (RTP/RTCP function module) are existing functional modules of the mobile terminal.
  • the video encoding module 104 completes the image compression encoding and the motion compensation algorithm processing, and sends the processed image data frame to the RTP/RTCP function module 103 for processing;
  • the audio processing module 101 converts the sound signal into audio data frames and detects whether the mobile terminal is in the silent mode; if so, it sends the background noise frame to the device 102 implementing the videophone, and if not, it transmits the normal audio data frame.
  • the device 102 implementing the videophone determines the type of the received data frame, and when receiving the background noise frame, further determines whether a predetermined flag frame has been generated; if not, it sends the background noise frame to the mobile terminal 2 and simultaneously generates a flag frame; if the flag frame has been generated, it sends the flag frame to the mobile terminal 2; the device 102 for implementing the videophone at the transmitting end includes a frame type judging module, a flag frame control module, and a background noise control module;
  • the RTP/RTCP function module 103 is responsible for packetizing and processing the received data frame and transmitting it to the mobile terminal 2.
  • the mobile terminal 2 initiates the receiving process: the RTP/RTCP function module 203 is responsible for unpacking the received data packet, and if it is an image data frame, transmitting the image data frame to the video encoding module 204 and notifying the device 202 that implements the videophone; If it is an audio data frame, the audio data frame is sent to the device 202 implementing the videophone;
  • the video encoding module 204 decodes the image data frame and converts it into an image format that can be displayed;
  • the device 202 for implementing the videophone determines the type of the received audio data frame; when it is a normal audio data frame or a background noise frame, the frame is sent to the audio processing module 201 for processing; when it is a flag frame, the background noise frame of the mobile terminal 1 is simulated according to the received background noise frame, and when the notification of the RTP/RTCP function module 203 is received, image data frame trigger information is generated, which triggers the judgment of whether the matching condition of the background noise and the image is satisfied; if it is, the simulated background noise frame is sent to the audio processing module 201.
  • whether the notification of the RTP/RTCP function module 203 has been received can be recorded through a preset variable such as flag.
  • when the notification is received, the device 202 implementing the videophone sets the value of flag to true; it then determines whether the value of flag is true, and if so, the matching condition is satisfied; after sending the simulated background noise frame to the audio processing module 201, the value of flag is set to false; the device 202 implementing the videophone at the receiving end includes a frame type judging module, a background noise simulation module, and a background noise and image matching module;
  • the audio processing module 201 converts the received background noise frame and the normal audio data frame into a sound signal.
  • the mobile terminal 2 can also initiate a videophone call to the mobile terminal 1; the specific process is the same as above, and details are not described again here.

Landscapes

  • Engineering & Computer Science
  • Multimedia
  • Signal Processing
  • Compression Or Coding Systems Of Tv Signals
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like
  • Mobile Radio Communication Systems

Abstract

The invention relates to a method and a device for implementing a videophone. The method comprises the following steps: a transmitting terminal determines whether a data frame to be transmitted is a background noise frame, and if it is, the transmitting terminal further determines whether a predetermined flag frame has currently been generated; if the flag frame has been generated, it is transmitted to the opposite terminal. The method and device apply to a videophone service and, depending on whether the transmitting terminal is in silent mode, dynamically use the resources that the transmitting terminal would use in silent mode to transmit a background noise frame in order to transmit image data and flag frames. A receiving terminal thus obtains the background noise frame of the transmitting terminal by simulation, so that the transmission bandwidth for image data is increased, which effectively solves the mosaic problem in the videophone service. The method and device are easy to implement and highly applicable.
PCT/CN2011/073686 2010-11-03 2011-05-05 Procédé et dispositif de visiophonie WO2012058913A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010531189.9 2010-11-03
CN201010531189.9A CN101990082B (zh) 2010-11-03 2010-11-03 一种实现可视电话的方法及装置

Publications (1)

Publication Number Publication Date
WO2012058913A1 (fr) 2012-05-10

Family

ID=43746390

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/073686 WO2012058913A1 (fr) 2010-11-03 2011-05-05 Procédé et dispositif de visiophonie

Country Status (2)

Country Link
CN (1) CN101990082B (fr)
WO (1) WO2012058913A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101990082B (zh) * 2010-11-03 2014-07-16 中兴通讯股份有限公司 一种实现可视电话的方法及装置
CN105450969B (zh) * 2014-06-16 2019-01-15 联想(北京)有限公司 一种实时视频数据传输方法及电子设备

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1428953A (zh) * 2002-04-22 2003-07-09 西安大唐电信有限公司 一种多通道amr声码器的实现方法和设备
CN101087319A (zh) * 2006-06-05 2007-12-12 华为技术有限公司 一种发送和接收背景噪声的方法和装置及静音压缩系统
CN101990082A (zh) * 2010-11-03 2011-03-23 中兴通讯股份有限公司 一种实现可视电话的方法及装置

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI100932B (fi) * 1995-04-12 1998-03-13 Nokia Telecommunications Oy Äänitaajuussignaalien lähetys radiopuhelinjärjestelmässä
CN100428744C (zh) * 2006-07-27 2008-10-22 华为技术有限公司 通信网络中分组数据的传输方法及其系统
CN101394586B (zh) * 2007-09-20 2012-11-21 华为技术有限公司 一种上行链路资源共享的方法、装置及系统
CN101237299B (zh) * 2007-12-27 2011-04-20 华为技术有限公司 语音业务处理模块、系统及方法
CN101431578B (zh) * 2008-10-30 2010-12-08 南京大学 一种基于g.723.1静音检测技术的信息隐藏方法

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Speech codec speech processing functions; Adaptive Multi-Rate-Wideband (AMR-WB) speech codec; Comfort noise aspects (Release 9).", 3GPP TS 26.192 V9.0.0:, 31 December 2009 (2009-12-31) *
"Speech codec speech processing functions; Adaptive Multi-Rate-Wideband (AMR-WB) speech codec; Frame structure (Release 9).", 3GPP TS 26.201 V9.0.0:, 31 December 2009 (2009-12-31), pages 6 4.1.1 *

Also Published As

Publication number Publication date
CN101990082A (zh) 2011-03-23
CN101990082B (zh) 2014-07-16

Similar Documents

Publication Publication Date Title
CN107196746B (zh) 实时通信中的抗丢包方法、装置和系统
KR101479393B1 (ko) 대역 내 신호들을 이용한 코덱 전개
WO2020006912A1 (fr) Procédé et dispositif permettant d'analyser une qualité de transmission de réseau, équipement informatique et support de stockage
CN103248964B (zh) 基于rtp/rtcp的车载视频传输系统
KR101749006B1 (ko) 화상 전화에서의 비디오 정지 표시
WO2010083737A1 (fr) Procédé et appareil destinés à traiter un signal vocal, procédé et appareil destinés à transmettre un signal vocal
WO2011137837A1 (fr) Procédé, dispositif et système d'obtention d'informations de clé pendant une commutation de canal rapide
CN104167210A (zh) 一种轻量级的多方会议混音方法和装置
JP2007214985A (ja) シームレスハンドオーバにおけるメディアストリーム切替方法、システム及びプログラム
JPWO2005122455A1 (ja) 双方向通信方法と装置、システムならびにプログラム
CN107079132B (zh) 在视频电话中的端口重配置之后馈送经帧内译码的视频帧
JP2018529249A (ja) ビデオ電話におけるディスプレイデバイスを切り替えること
TWI519104B (zh) 聲音資訊傳送方法以及封包通信系統
WO2012058913A1 (fr) Procédé et dispositif de visiophonie
CN100547997C (zh) 移动通信中ip数据压缩发送和接收的方法
CN103188403A (zh) 语音网关在线监听方法
CN112887497B (zh) 通信方法、装置和计算机存储介质
CN114710568A (zh) 音视频数据通信方法、设备及存储介质
KR20010067698A (ko) 아이.엠.티.-이천망을 이용한 무선 원격 멀티미디어멀티캐스팅 서비스 제공방법
WO2012155761A1 (fr) Procédé de mise en œuvre d'un cadre photo dynamique visiophonique et terminal mobile
CN107154913B (zh) 一种ip电话终端通信方法
EP4358591A1 (fr) Procédé de transmission de données et dispositif associé
WO2008083517A1 (fr) Procédé et système permettant de réaliser une compensation vocale dans un réseau de communication mobile
JP2003204575A (ja) マルチメディアデータ送信装置
JP2009055469A (ja) 送信端末

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11837441

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11837441

Country of ref document: EP

Kind code of ref document: A1