WO2012130023A1 - Method and device for improving voice communication - Google Patents

Method and device for improving voice communication Download PDF

Info

Publication number
WO2012130023A1
WO2012130023A1 PCT/CN2012/071981 CN2012071981W WO2012130023A1 WO 2012130023 A1 WO2012130023 A1 WO 2012130023A1 CN 2012071981 W CN2012071981 W CN 2012071981W WO 2012130023 A1 WO2012130023 A1 WO 2012130023A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
user
type
calling user
codec type
Prior art date
Application number
PCT/CN2012/071981
Other languages
French (fr)
Chinese (zh)
Inventor
嵇家刚
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2012130023A1 publication Critical patent/WO2012130023A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres
    • H04M7/006Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer
    • H04M7/0072Speech codec negotiation

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a method and apparatus for improving voice communication. Background technique
  • VoIP high-definition voice over IP
  • the process of talking between high-definition voice users is: if two high-definition voice users talk to each other, because both parties support high-definition voice, the TFO (Tandem Free Operation) negotiation is successful, and the broadband is successful.
  • the voice frame is sent from the calling party.
  • the base station controller BSC, Base Station Controller
  • the voice frame is transmitted from the calling party's BSC to the called party's BSC, the called party.
  • the BSC obtains a wideband speech frame from the TFO frame, and then transmits the wideband speech frame to the called user.
  • the TFO negotiation can be successful, that is, the two parties can implement the high-definition voice call.
  • the TFO negotiation fails, and the high-definition voice (broadband) Switch to narrow-band pulse code modulation (PCM) to reduce the quality of the call.
  • PCM pulse code modulation
  • the inventors of the present invention have found that in the existing implementation manner, the voice codec type supported by the HD terminal is different from the codec type supported by the non-HD (such as fixed line).
  • TFO negotiation is performed. If the calling party (ie, supporting high-definition voice) and the called party have different voice coding types, the TFO negotiation fails, and the base station controller switches the high-definition voice to the narrowband PCM voice, and the codec Switching will result in a loss of voice quality, which will reduce the quality of the call between the two parties.
  • the embodiment of the invention provides a voice communication method and device, which solves the technical problem that the communication parties fail due to different types of voice coding, and improves the call quality.
  • an embodiment of the present invention provides a method for improving voice communication, which is applied to a calling user to communicate with a called user, where the calling user is a high-definition user, and the method includes:
  • the TF0 negotiation is successfully sent to the peer end, and the voice codec type of the calling user is switched to the voice codec type consistent with the called user.
  • the call is coded and decoded by the switched voice codec type of the calling party and sent to the peer end.
  • the invention relates to a device for improving voice communication, which is applied to a communication between a calling user and a called user, wherein the calling user is a high-definition user, and the method comprises:
  • a negotiating unit configured to perform a TFO negotiation with the peer end to perform a secondary codec-free operation after establishing a call by the establishing unit;
  • the determining unit is configured to: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and the first correspondence is set, the triggering sending unit sends the TFO negotiation to the peer end, and triggers The switching unit switches the voice codec type of the calling user to a voice codec type consistent with the called user;
  • a sending unit configured to send a TFO negotiation to the peer end successfully
  • a switching unit configured to switch a voice codec type of the calling user to a voice codec type consistent with the called user
  • the codec unit is configured to encode and decode the call by using the voice codec type of the switched calling party, and then send the call to the peer end.
  • the base station controller finds that the voice codec types of the calling and called parties are inconsistent, and still makes the TFO negotiation successful, and switches the voice codec type to the voice codec type consistent with the called party, so as to avoid switching the voice to narrowband PCM voice. Quality loss. That is to say, the success rate of the TFO negotiation is greatly improved by the voice codec compatible with the narrowband in this embodiment. Further, in the case that the TFO negotiation fails, switching to the narrowband codec reduces the damage of the codec handover to the voice quality, and improves the voice call quality.
  • FIG. 1 is a flowchart of a method for improving voice communication according to an embodiment of the present invention
  • FIG. 2 is a flowchart of an application example of a method for improving voice communication according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of an apparatus for improving voice communication according to an embodiment of the present invention. detailed description
  • FIG. 1 is a flowchart of a method for improving voice communication according to an embodiment of the present invention. The method is applied to a calling user to communicate with a called user, where the calling user is a high-definition user; Includes:
  • Step 101 Establish a call initiated by the calling user to the called user, and negotiate with the peer to perform a secondary codec-free TFO negotiation; wherein the peer end may be a base station controller to which the called user belongs.
  • Step 102 If it is determined that the voice codec type of the called user is different from the voice codec type of the calling user, and the first corresponding relationship is met, the TFO negotiation is successfully sent to the peer end, and the voice of the calling user is sent. The codec type is switched to the voice codec type that is consistent with the called user.
  • Step 103 The call code is coded and decoded by the switched caller's voice codec type, and then sent to the peer end.
  • the voice codec type of the calling user is switched to the voice codec type consistent with the called user: the voice code of the calling user is switched to the codec type supported by both the calling user and the called user.
  • the best quality codec type in medium quality. That is to say, switching to the codec type supported by both the calling user and the called user of the voice codec type consistent by the called user, and selecting the best quality codec type from the supported voice coding types, of course, It can be a good quality codec type, which is not limited in this embodiment.
  • the TFO negotiation is successfully sent to the peer end, and The voice codec type of the calling user is switched to the voice codec type consistent with the called user, including:
  • EVRC—NW or Enhanced Voice Coding Broadband EVRC WB If the calling user's voice coding type is: EVRC—NW or Enhanced Voice Coding Broadband EVRC WB:
  • the voice coding type of the called user is: EVRC_NW
  • the EVRC_NW is not exactly the same as the specific voice coding type included in the calling EVRC_NW
  • the opposite end The TFO negotiation is successful, and the voice codec type of the calling user is switched to the capability operating point COP of the EVRC-WB;
  • the voice coding type of the called user is: enhanced voice coding narrowband EVRC_B
  • the TFO negotiation is successfully sent to the opposite end, and the voice codec type of the calling user is switched to the capability operation point COP of the EVRC_B. .
  • the first correspondence is a preset relationship between a calling voice codec type, a called voice codec type, and a negotiation result, where the calling user and the called user are in the first correspondence. At least one of the supported voice codec types is the same.
  • the method may further include: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and satisfying the set second correspondence, sending a TFO negotiation to the peer end Failed, and switch the calling user's voice codec type to
  • EVRC-B's capability operation point COP specifically includes:
  • the voice coding type of the calling user is: EVRC—NW or EVRC—WB:
  • the voice coding type of the called user is: EVRC; then the TFO negotiation fails to be sent to the opposite end, and the voice of the calling user is edited.
  • the decoding type is switched to the COP of EVRC-B;
  • TFO negotiation result is sent to the opposite end as follows: TFO negotiation fails, and the voice codec type of the calling user is switched to the COP of EVRC_B .
  • the second correspondence relationship is a preset caller codec type and a called party. Correspondence between the type of tone codec and the result of the negotiation. In the second correspondence, the types of voice codecs supported by the calling user and the called user are not the same.
  • the method may further include: setting a voice codec type of the calling user, a first correspondence relationship between the voice codec type of the called user and the TFO negotiation result, and a second correspondence relationship; A correspondence and a second correspondence are stored on the base station controller.
  • the method may further include: querying The correspondence between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result, and the negotiation result that the TFO negotiation succeeds or the TFO negotiation fails.
  • the TFO negotiation when performing the TFO negotiation of the secondary codec-free operation, if the calling base station controller finds that the voice codec types of the calling and called parties are inconsistent, the TFO negotiation is still successful, and the calling party's voice is obtained.
  • the codec type is switched to the voice codec type consistent with the called party, avoiding the quality loss caused by switching the voice to narrowband PCM voice. That is to say, the speech encoding and decoding compatible with the narrowband in this embodiment greatly improves the success rate of the TFO negotiation.
  • switching to the narrowband codec reduces the damage to the voice quality caused by the codec handover, and improves the voice call quality.
  • FIG. 2 is a flowchart of an application example of a method for improving voice communication according to an embodiment of the present invention, including:
  • Step 201 The calling user's voice codec type is EVRC-NW user initiates a call; Step 202: After the calling base station controller receives the call, it performs TFO negotiation with the called base station controller;
  • Step 203 The base station controller of the calling party determines whether the voice codec type of the called user is the same as the voice codec type of the calling party. If not, perform step 204; if they are the same, perform the steps.
  • Step 204 If the voice codec type of the called user is EVRC_WB, the TFO negotiation is successful, and the voice of the calling user is coded and decoded by the EVRC-WB and sent to the called base station controller;
  • the voice codec type of the called user is EVRC_B
  • the TFO negotiation is successful, and the voice of the calling user is coded and decoded by the EVRC-B and sent to the called base station controller;
  • the TFO negotiation is successful, using EVRC-NW
  • the code of the calling user is coded and sent to the called base station controller
  • the TFO negotiation fails, and the EVRC-B is used to encode and decode the voice of the calling user and then send it to the called base station controller; If the voice codec type of the called user is PSTN, the TFO negotiation fails, and the voice of the calling user is coded and decoded by the EVRC-B and sent to the called base station controller;
  • Step 205 If the voice codec type of the called user is also EVRC_NW, the TFO negotiation is successful, and the voice of the calling user is coded and decoded by using the EVRC-WB capability operation point COP, and then sent to the called base station controller;
  • Step 206 The called base station controller decodes the received voice, acquires a broadband voice frame, and sends the broadband voice frame to the called user.
  • Step 207 The calling user and the called user make a call.
  • the voice codec type of the calling party the type of the voice codec of the called party, and the result of the negotiation are as shown in Table 1,
  • EVRC—NW and calling EVRC—WB EVRC—NW includes COP body speech coding type incomplete
  • the EVRC WB PSTN TFO negotiation fails.
  • the caller uses the COP of the EVRC-B.
  • the TFO negotiation is still successful, and the voice codec is decoded.
  • the type is switched to the voice codec type consistent with the called party, avoiding the quality loss caused by switching the voice to narrowband PCM voice.
  • switching to the narrowband speech codec EVRC-B reduces the damage of the codec switching to the voice quality.
  • the embodiment of the present invention further provides an apparatus for improving voice communication, and a schematic structural diagram thereof is shown in FIG. 3.
  • the device is applied to a calling user to communicate with a called user, where the calling user
  • the device includes: an establishing unit 31, a negotiating unit 32, a determining unit 33, a transmitting unit 34, a switching unit 35, and a codec unit 36, wherein the establishing unit 31 is configured to establish a calling user to be Calling the user-initiated call; the negotiating unit 32 is configured to perform a TFO-free negotiation with the peer end after the establishing unit establishes the call; the determining unit 33 is configured to determine the voice of the called user.
  • the trigger sending unit 34 sends the TFO negotiation to the opposite end, and triggers the switching unit 35 to change the voice encoding and decoding type of the calling user.
  • the sending unit 34 configured to send a TFO negotiation to the peer end successfully
  • the switching unit 35 configured to After codec unit 36 for switching use; audio codec is switched to the same type of voice codec called user type
  • the voice codec type of the calling user encodes and decodes the call and sends it to the peer.
  • the switching unit 35 switches the voice codec type of the calling user to the voice codec type consistent with the called user: switching the voice codec type of the calling user to the calling user and the called user.
  • the determining unit 33 is further configured to: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and satisfying the set second correspondence, triggering the sending unit to the opposite end
  • the sending TFO negotiation fails, and triggers the switching unit to switch the voice codec type of the calling user to the capability operating point COP of the EVRC_B;
  • the sending unit 34 is further configured to send a TFO negotiation failure to the peer end;
  • the switching unit 35 is further configured to switch the voice codec type of the calling user to the capability operating point COP of the EVRC-B.
  • the called voice codec type when the calling party's voice codec type is EVRC_NW or EVRC-WB, the called voice codec type is EVRC_NW, and the EVRC_NW and the calling EVRC_NW are included.
  • the specific voice coding type is not exactly the same, EVRC-B, EVRC or PSTN, etc., satisfying the first correspondence; when the calling party's voice codec type is EVRC-NW or EVRC-WB, the called voice codec type is EVRC , EVRC-B or PSTN, satisfying the second correspondence.
  • the apparatus may further include: a storage unit, configured to store a voice codec type of the calling user, a voice codec type of the called user, and a first pair of TFO negotiation results.
  • a storage unit configured to store a voice codec type of the calling user, a voice codec type of the called user, and a first pair of TFO negotiation results.
  • the first correspondence at least one of the voice codec types supported by the calling user and the called user is the same; and/or the voice codec type of the calling user and the voice codec of the called user are stored.
  • a second correspondence between the type and the TFO negotiation result wherein, in the second correspondence, the voice codec types supported by the calling user and the called user are not the same.
  • the apparatus may further include: a query unit, configured to determine that the voice codec type of the called user is different from the voice codec type of the calling user, and meet the set first correspondence or the second In the corresponding relationship, the correspondence between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result is queried, and the negotiation result of the TFO negotiation success or the TFO negotiation failure is obtained.
  • a query unit configured to determine that the voice codec type of the called user is different from the voice codec type of the calling user, and meet the set first correspondence or the second In the corresponding relationship, the correspondence between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result is queried, and the negotiation result of the TFO negotiation success or the TFO negotiation failure is obtained.
  • the device may be a base station controller, or may be integrated in the base station controller of the calling party, or may be integrated in the called base station controller, or may be independently deployed in the network, and this embodiment does not do limit.
  • the TFO negotiation when performing the TFO negotiation of the secondary codec-free operation, if the calling base station controller finds that the voice codec types of the calling and called parties are inconsistent, the TFO negotiation is still successful, and the voice codec type is switched. For the type of speech codec that is consistent with the called party, avoid the quality loss caused by switching the voice to narrowband PCM voice. That is to say, the success rate of the TFO negotiation is greatly improved by the voice codec compatible with the narrowband in this embodiment. Further, in the case that the TFO negotiation fails, switching to the narrowband codec reduces the damage of the codec handover to the voice quality, and improves the voice call quality. It should also be noted that, in this context, relational terms such as first and second, etc.
  • the present invention can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is a better implementation. the way.
  • the technical solution of the present invention may be embodied in the form of a software product in essence or in the form of a software product, which may be stored in a storage medium such as a ROM/RAM or a disk. , an optical disk, etc., includes instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform the methods described in various embodiments of the present invention or portions of the embodiments.

Abstract

Provided in an embodiment of the present invention are a method and device for improving voice communication, applied to the communication between a calling user and a called user, the calling user is a high definition user; the method comprises: establishing a call initiated by a calling user to a called user, and performing a tandem free operation (TFO) consultation with an opposite terminal; if judging that the voice encoding/decoding type of the called user is different from that of the calling user and satisfies a set first corresponding relation, then sending a TFO consultation success message to the opposite terminal and switching the voice encoding/decoding type of the calling user to a type that is consistent with the called user type; encoding/decoding the call using the switched voice encoding/decoding type of the calling user and then sending to the opposite terminal; thus the technical problem of TFO consultation failure due to different voice encoding/decoding types of both parties of communication is solved, thereby improving call quality.

Description

一种提高语音通信方法及装置  Method and device for improving voice communication
本申请要求于 2011 年 3 月 31 日提交中国专利局、 申请号为 201110081738.1、发明名称为"一种提高语音通信方法及装置"的中国专利申 请的优先权,其全部内容通过引用结合在本申请中。 技术领域  The present application claims priority to Chinese Patent Application No. 201110081738.1, entitled "A Method for Improving Voice Communication and Apparatus", filed on March 31, 2011, the entire contents of which is incorporated herein by reference. in. Technical field
本发明涉及通信技术领域,特别涉及一种提高语音通信方法及装置。 背景技术  The present invention relates to the field of communications technologies, and in particular, to a method and apparatus for improving voice communication. Background technique
宽带语音(即高清语音)技术的出现,优化了语音通话质量。 众多企 业一直在逐步将自己的网络向高清的 IP话音( VoIP , Voice over IP )迁移。 但是,由于高清 VoIP采用宽带音频编解码,其与现有的采用窄带音频编码 的公用电话交换网( PSTN , Public Switched Telephone Network )之间的连 接不兼容, 因此,这种迁移无法被广泛接受。  The emergence of broadband voice (ie, high-definition voice) technology optimizes the quality of voice calls. Many companies have been gradually migrating their networks to high-definition voice over IP (VoIP). However, since high-definition VoIP uses wideband audio codec, which is incompatible with the existing connection between the public switched telephone network (PSTN) using narrowband audio coding, this migration is not widely accepted.
目前,在现有技术中 ,高清语音用户之间通话过程为 :如果两个高清 语音用户之间互相通话, 因为双方都支持高清语音,则免汇编操作 ( TFO , Tandem Free Operation )协商成功,宽带语音帧从主叫方发送出来,经过主 叫侧的基站控制器( BSC , Base Station Controller )进行 TFO帧封装处理后, 语音帧从主叫方的 BSC传送到被叫方的 BSC ,被叫方的 BSC从 TFO帧中 获取宽带语音帧,再将该宽带语音帧发送给被叫用户。 现有技术中 ,如果通信两端都支持高清语音,则 TFO协商才能成功, 即双方才能实现高清语音通话,但是,如果通信两端只有一端支持高清语 音, TFO协商失败,会将高清语音 (宽带)切换为窄带脉码调制 ( PCM , Pulse Code Modulation )语音,降低了通话质量。 At present, in the prior art, the process of talking between high-definition voice users is: if two high-definition voice users talk to each other, because both parties support high-definition voice, the TFO (Tandem Free Operation) negotiation is successful, and the broadband is successful. The voice frame is sent from the calling party. After the TFO frame encapsulation process is performed by the base station controller (BSC, Base Station Controller) on the calling side, the voice frame is transmitted from the calling party's BSC to the called party's BSC, the called party. The BSC obtains a wideband speech frame from the TFO frame, and then transmits the wideband speech frame to the called user. In the prior art, if both ends of the communication support high-definition voice, the TFO negotiation can be successful, that is, the two parties can implement the high-definition voice call. However, if only one end of the communication supports the high-definition voice, the TFO negotiation fails, and the high-definition voice (broadband) Switch to narrow-band pulse code modulation (PCM) to reduce the quality of the call.
在对现有技术的研究和实践过程中 ,本发明的发明人发现,现有的实 现方式中 ,高清终端支持的语音编解码类型与非高清(比如固话等)支持 的编解码类型不同,在建立呼叫时,进行 TFO协商,如果主叫方 (即支持 高清语音)与被叫方的语音编码类型不同,则 TFO协商失败,基站控制器 会将高清语音切换为窄带 PCM语音,该编解码切换会造成语音质量的损失, 从而降低了双方的通话质量。 发明内容  In the research and practice of the prior art, the inventors of the present invention have found that in the existing implementation manner, the voice codec type supported by the HD terminal is different from the codec type supported by the non-HD (such as fixed line). When a call is established, TFO negotiation is performed. If the calling party (ie, supporting high-definition voice) and the called party have different voice coding types, the TFO negotiation fails, and the base station controller switches the high-definition voice to the narrowband PCM voice, and the codec Switching will result in a loss of voice quality, which will reduce the quality of the call between the two parties. Summary of the invention
本发明实施例提供一种语音通信方法及装置,以解决通信双方因语音 编码类型不同造成 TFO协商失败的技术问题,提高通话质量。  The embodiment of the invention provides a voice communication method and device, which solves the technical problem that the communication parties fail due to different types of voice coding, and improves the call quality.
为解决上述技术问题,本发明实施例提供一种提高语音通信的方法, 应用于主叫用户与被叫用户进行通信,所述主叫用户为高清用户 ,所述方 法包括:  To solve the above technical problem, an embodiment of the present invention provides a method for improving voice communication, which is applied to a calling user to communicate with a called user, where the calling user is a high-definition user, and the method includes:
建立主叫用户向被叫用户发起的呼叫,并与对端进行免二次编解码操 作 TFO协商;  Establish a call initiated by the calling user to the called user, and negotiate with the peer to perform a secondary codec-free TFO negotiation;
若判断被叫用户的语音编解码类型与主叫用户的语音编解码类型不 同,且满足设定的第一对应关系时,则向对端发送 TF0协商成功,并将主 叫用户的语音编解码类型切换为与被叫用户一致的语音编解码类型; If it is judged that the voice codec type of the called user and the voice codec type of the calling user are not If the first correspondence is set, the TF0 negotiation is successfully sent to the peer end, and the voice codec type of the calling user is switched to the voice codec type consistent with the called user.
利用切换后的主叫用户的的语音编解码类型对该呼叫进行编解码后发 送给对端  The call is coded and decoded by the switched voice codec type of the calling party and sent to the peer end.
一种提高语音通信的装置,应用于主叫用户与被叫用户进行通信,所 述主叫用户为高清用户 ,其特征在于,包括:  The invention relates to a device for improving voice communication, which is applied to a communication between a calling user and a called user, wherein the calling user is a high-definition user, and the method comprises:
建立单元,用于建立主叫用户向被叫用户发起的呼叫;  Establishing a unit, configured to establish a call initiated by the calling user to the called user;
协商单元,用于在建立单元建立呼叫后,与对端进行免二次编解码操 作 TFO协商;  a negotiating unit, configured to perform a TFO negotiation with the peer end to perform a secondary codec-free operation after establishing a call by the establishing unit;
判断单元,用于在判断被叫用户的语音编解码类型与主叫用户的语音 编解码类型不同,且满足设定的第一对应关系时,触发发送单元向对端发 送 TFO协商成功,并触发切换单元将主叫用户的语音编解码类型切换为与 被叫用户一致的语音编解码类型;  The determining unit is configured to: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and the first correspondence is set, the triggering sending unit sends the TFO negotiation to the peer end, and triggers The switching unit switches the voice codec type of the calling user to a voice codec type consistent with the called user;
发送单元,用于向对端发送 TFO协商成功;  a sending unit, configured to send a TFO negotiation to the peer end successfully;
切换单元,用于将主叫用户的语音编解码类型切换为与被叫用户一致 的语音编解码类型;  a switching unit, configured to switch a voice codec type of the calling user to a voice codec type consistent with the called user;
编解码单元,用于利用切换后的主叫用户的语音编解码类型对该呼叫 进行编解码后发送给对端。  The codec unit is configured to encode and decode the call by using the voice codec type of the switched calling party, and then send the call to the peer end.
本发明实施例中 ,在进行免二次编解码操作 TFO协商时,如果主叫的 基站控制器发现主被叫双方的语音编解码类型不一致,仍然让 TFO协商成 功,并将语音编解码类型切换为与被叫方一致的语音编解码类型,避免将 语音切换为窄带 PCM语音造成的质量损失。 也就是说,通过本实施例兼容 窄带的语音编解码,大大提升了 TFO协商的成功率。进一步,在 TFO协商 失败的情况下,切换到窄带的编解码,减少了编解码切换对语音质量的损 伤,提高了语音通话质量。 附图说明 In the embodiment of the present invention, when the TFO negotiation of the secondary codec-free operation is performed, if the calling party The base station controller finds that the voice codec types of the calling and called parties are inconsistent, and still makes the TFO negotiation successful, and switches the voice codec type to the voice codec type consistent with the called party, so as to avoid switching the voice to narrowband PCM voice. Quality loss. That is to say, the success rate of the TFO negotiation is greatly improved by the voice codec compatible with the narrowband in this embodiment. Further, in the case that the TFO negotiation fails, switching to the narrowband codec reduces the damage of the codec handover to the voice quality, and improves the voice call quality. DRAWINGS
图 1为本发明实施例提供的一种提高语音通信的方法的流程图 ; 图 2 为本发明实施例中提供的一种提高语音通信的方法的应用实例的 流程图;  1 is a flowchart of a method for improving voice communication according to an embodiment of the present invention; FIG. 2 is a flowchart of an application example of a method for improving voice communication according to an embodiment of the present invention;
图 3为本发明实施例中提供的一种提高语音通信的装置的结构示意图。 具体实施方式  FIG. 3 is a schematic structural diagram of an apparatus for improving voice communication according to an embodiment of the present invention. detailed description
为了使本技术领域的人员更好地理解本发明实施例的方案,下面结合 附图和实施方式对本发明实施例作进一步的详细说明。  The embodiments of the present invention are further described in detail below with reference to the accompanying drawings and embodiments.
请参阅图 1 ,为本发明实施例提供的一种提高语音通信的方法的流程 图 ,所述方法应用于主叫用户与被叫用户进行通信,所述主叫用户为高清 用户 ;所述方法包括:  FIG. 1 is a flowchart of a method for improving voice communication according to an embodiment of the present invention. The method is applied to a calling user to communicate with a called user, where the calling user is a high-definition user; Includes:
步骤 101:建立主叫用户向被叫用户发起的呼叫,并与对端进行免二次 编解码操作 TFO协商;其中 ,对端可以是被叫用户所属的基站控制器。 步骤 102:若判断被叫用户的语音编解码类型与主叫用户的语音编解码 类型不同,且满足设定的第一对应关系,则向对端发送 TFO协商成功,并 将主叫用户的语音编解码类型切换为与被叫用户一致的语音编解码类型; 步骤 103:利用切换后的主叫用户的语音编解码类型对该呼叫进行编解 码后发送给对端。 Step 101: Establish a call initiated by the calling user to the called user, and negotiate with the peer to perform a secondary codec-free TFO negotiation; wherein the peer end may be a base station controller to which the called user belongs. Step 102: If it is determined that the voice codec type of the called user is different from the voice codec type of the calling user, and the first corresponding relationship is met, the TFO negotiation is successfully sent to the peer end, and the voice of the calling user is sent. The codec type is switched to the voice codec type that is consistent with the called user. Step 103: The call code is coded and decoded by the switched caller's voice codec type, and then sent to the peer end.
优选的,所述将主叫用户的语音编解码类型切换为与被叫用户一致的 语音编解码类型为 :将主叫用户的语音编码切换为主叫用户和被叫用户均 支持的编解码类型中质量最好的编解码类型。 也就是说,切换为被叫用户 一致的语音编解码类型主叫用户和被叫用户都支持的编解码类型,且从都 支持的语音编码类型中选择质量最好的编解码类型,当然,也可以是质量 较好的编解码类型,本实施例不作限制。  Preferably, the voice codec type of the calling user is switched to the voice codec type consistent with the called user: the voice code of the calling user is switched to the codec type supported by both the calling user and the called user. The best quality codec type in medium quality. That is to say, switching to the codec type supported by both the calling user and the called user of the voice codec type consistent by the called user, and selecting the best quality codec type from the supported voice coding types, of course, It can be a good quality codec type, which is not limited in this embodiment.
在该实施例中 ,所述判断被叫用户的语音编解码类型与主叫用户的语 音编解码类型不同,且满足设定的第一对应关系时,则向对端发送 TFO协 商成功,并将主叫用户的语音编解码类型切换为与被叫用户一致的语音编 解码类型,具体包括:  In this embodiment, if it is determined that the voice codec type of the called user is different from the voice codec type of the calling user, and the first correspondence corresponding to the setting is met, the TFO negotiation is successfully sent to the peer end, and The voice codec type of the calling user is switched to the voice codec type consistent with the called user, including:
如果主叫用户的语音编码类型为 : EVRC— NW 或增强语音编码宽带 EVRC WB:  If the calling user's voice coding type is: EVRC—NW or Enhanced Voice Coding Broadband EVRC WB:
当所述被叫用户的语音编码类型为 : EVRC— NW ,且所述 EVRC— NW 与主叫的 EVRC— NW中包括的具体语音编码类型不完全相同时,则向对端 发送 TFO协商成功,并将主叫用户的语音编解码类型切换为 EVRC— WB的 能力操作点 COP; When the voice coding type of the called user is: EVRC_NW, and the EVRC_NW is not exactly the same as the specific voice coding type included in the calling EVRC_NW, then the opposite end The TFO negotiation is successful, and the voice codec type of the calling user is switched to the capability operating point COP of the EVRC-WB;
当所述被叫用户的语音编码类型为 :增强语音编码窄带 EVRC— B时, 则向对端发送 TFO 协商成功 , 并将主叫用户的语音编解码类型切换为 EVRC— B的能力操作点 COP。  When the voice coding type of the called user is: enhanced voice coding narrowband EVRC_B, the TFO negotiation is successfully sent to the opposite end, and the voice codec type of the calling user is switched to the capability operation point COP of the EVRC_B. .
其中 ,所述第一对应关系为预先设置的主叫语音编解码类型、 被叫语 音编解码类型及协商结果之间的对应关系,其中 ,该第一对应关系中 ,主 叫用户和被叫用户所支持的语音编解码类型至少有一个相同。  The first correspondence is a preset relationship between a calling voice codec type, a called voice codec type, and a negotiation result, where the calling user and the called user are in the first correspondence. At least one of the supported voice codec types is the same.
优选的,所述方法还可以进一步包括:在判断被叫用户的语音编解码 类型与主叫用户的语音编解码类型不同,且满足设定的第二对应关系时, 则向对端发送 TFO 协商失败, 并将主叫用户的语音编解码类型切换为 Preferably, the method may further include: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and satisfying the set second correspondence, sending a TFO negotiation to the peer end Failed, and switch the calling user's voice codec type to
EVRC— B的能力操作点 COP;具体包括: EVRC-B's capability operation point COP; specifically includes:
如果主叫用户的语音编码类型为 : EVRC— NW或 EVRC— WB: 当所述被叫用户的语音编码类型为 : EVRC时;则向对端发送 TFO协 商失败,并将主叫用户的语音编解码类型切换为 EVRC— B的 COP;  If the voice coding type of the calling user is: EVRC—NW or EVRC—WB: When the voice coding type of the called user is: EVRC; then the TFO negotiation fails to be sent to the opposite end, and the voice of the calling user is edited. The decoding type is switched to the COP of EVRC-B;
当所述被叫用户的语音编码类型为 :公共交换电话网 PSTN 时;则向 对端发送 TFO协商结果为 : TFO协商失败,并将主叫用户的语音编解码类 型切换为 EVRC— B的 COP。  When the voice coding type of the called user is: public switched telephone network PSTN; the TFO negotiation result is sent to the opposite end as follows: TFO negotiation fails, and the voice codec type of the calling user is switched to the COP of EVRC_B .
其中 ,所述第二对应关系为预先设置的主叫语音编解码类型、 被叫语 音编解码类型及协商结果之间的对应关系。 其中 ,该第二对应关系中 ,主 叫用户和被叫用户所支持的语音编解码类型没有一个相同。 The second correspondence relationship is a preset caller codec type and a called party. Correspondence between the type of tone codec and the result of the negotiation. In the second correspondence, the types of voice codecs supported by the calling user and the called user are not the same.
优选的,所述方法还可以进一步包括:设定主叫用户的语音编解码类 型、 被叫用户的语音编解码类型与 TFO协商结果的第一对应关系和第二对 应关系;并将所述第一对应关系和第二对应关系存储到基站控制器上。  Preferably, the method may further include: setting a voice codec type of the calling user, a first correspondence relationship between the voice codec type of the called user and the TFO negotiation result, and a second correspondence relationship; A correspondence and a second correspondence are stored on the base station controller.
优选的,在判断被叫用户的语音编解码类型与主叫用户的语音编解码 类型不同时,且满足设定的第一对应关系或第二对应关系时,所述方法还 可以进一步包括:查询所述主叫用户的语音编解码类型、 被叫用户的语音 编解码类型与 TFO协商结果的对应关系,得到 TFO协商成功或 TFO协商 失败的协商结果。  Preferably, when it is determined that the voice codec type of the called user is different from the voice codec type of the calling user, and the first correspondence or the second correspondence is set, the method may further include: querying The correspondence between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result, and the negotiation result that the TFO negotiation succeeds or the TFO negotiation fails.
本发明实施例中 ,在进行免二次编解码操作 TFO协商时,如果主叫的 基站控制器发现主被叫双方的语音编解码类型不一致,仍然让 TFO协商成 功,并将主叫方的语音编解码类型切换为与被叫方一致的语音编解码类型, 避免将语音切换为窄带 PCM语音造成的质量损失。 也就是说,通过本实施 例兼容窄带的语音编解码,大大提升了 TFO协商的成功率。  In the embodiment of the present invention, when performing the TFO negotiation of the secondary codec-free operation, if the calling base station controller finds that the voice codec types of the calling and called parties are inconsistent, the TFO negotiation is still successful, and the calling party's voice is obtained. The codec type is switched to the voice codec type consistent with the called party, avoiding the quality loss caused by switching the voice to narrowband PCM voice. That is to say, the speech encoding and decoding compatible with the narrowband in this embodiment greatly improves the success rate of the TFO negotiation.
进一步,在 TFO协商失败的情况下,切换到窄带的编解码,减少了编 解码切换对语音质量的损伤,提高了语音通话质量。  Further, in the case that the TFO negotiation fails, switching to the narrowband codec reduces the damage to the voice quality caused by the codec handover, and improves the voice call quality.
为了便于本领域技术人员的理解,下面以具体的实施例来说明。 还请参阅图 2,为本发明实施例中提供的一种提高语音通信的方法的应 用实例的流程图,包括: In order to facilitate the understanding of those skilled in the art, the following description will be made with specific embodiments. FIG. 2 is a flowchart of an application example of a method for improving voice communication according to an embodiment of the present invention, including:
步骤 201:主叫用户的语音编解码类型为 EVRC— NW的用户发起呼叫; 步骤 202:该主叫的基站控制器接收该呼叫后,与被叫的基站控制器进 行 TFO协商;  Step 201: The calling user's voice codec type is EVRC-NW user initiates a call; Step 202: After the calling base station controller receives the call, it performs TFO negotiation with the called base station controller;
步骤 203:主叫的基站控制器判断被叫用户的语音编解码类型是否与主 叫的语音编解码类型是否相同,若不同,执行步骤 204;若相同,执行步骤 Step 203: The base station controller of the calling party determines whether the voice codec type of the called user is the same as the voice codec type of the calling party. If not, perform step 204; if they are the same, perform the steps.
205; 205;
步骤 204:如果被叫用户的语音编解码类型为 EVRC— WB ,则 TFO协 商成功,使用 EVRC— WB对主叫用户的语音进行编解码后发送给被叫的基 站控制器;  Step 204: If the voice codec type of the called user is EVRC_WB, the TFO negotiation is successful, and the voice of the calling user is coded and decoded by the EVRC-WB and sent to the called base station controller;
如果被叫用户的语音编解码类型为 EVRC— B ,则 TFO协商成功,使用 EVRC— B对主叫用户的语音进行编解码后发送给被叫的基站控制器;  If the voice codec type of the called user is EVRC_B, the TFO negotiation is successful, and the voice of the calling user is coded and decoded by the EVRC-B and sent to the called base station controller;
如果被叫用户的语音编解码类型为 EVRC— NW ,且 EVRC— NW包括的 具体语音编码类型与主叫的 EVRC— NW包括的具体语音编码类型不完全相 同时, TFO协商成功,使用 EVRC— NW对主叫用户的语音进行编解码后发 送给被叫的基站控制器;  If the called user's voice codec type is EVRC_NW, and the specific voice coding type included in the EVRC_NW is not exactly the same as the specific voice coding type included in the calling EVRC-NW, the TFO negotiation is successful, using EVRC-NW The code of the calling user is coded and sent to the called base station controller;
如果被叫用户的语音编解码类型为 EVRC , TFO 协商失败 ,使用 EVRC— B对主叫用户的语音进行编解码后发送给被叫的基站控制器; 如果被叫用户的语音编解码类型为 PSTN , TFO 协商失败 , 使用 EVRC— B对主叫用户的语音进行编解码后发送给被叫的基站控制器; If the called user's voice codec type is EVRC, the TFO negotiation fails, and the EVRC-B is used to encode and decode the voice of the calling user and then send it to the called base station controller; If the voice codec type of the called user is PSTN, the TFO negotiation fails, and the voice of the calling user is coded and decoded by the EVRC-B and sent to the called base station controller;
步骤 205:如果被叫用户的语音编解码类型也是 EVRC— NW ,则 TFO 协商成功,使用 EVRC— WB的能力操作点 COP对主叫用户的语音进行编解 码后发送给被叫的基站控制器;  Step 205: If the voice codec type of the called user is also EVRC_NW, the TFO negotiation is successful, and the voice of the calling user is coded and decoded by using the EVRC-WB capability operation point COP, and then sent to the called base station controller;
步骤 206:被叫的基站控制器对接收到的语音中进行解码,获取宽带语 音帧,并将该宽带语音帧发送给被叫用户 ;  Step 206: The called base station controller decodes the received voice, acquires a broadband voice frame, and sends the broadband voice frame to the called user.
步骤 207:主叫用户和被叫用户进行通话。  Step 207: The calling user and the called user make a call.
在该实施例中 ,主叫的语音编解码类型、 被叫的语音编解码类型以及 协商的结果,具体如表 1所示,  In this embodiment, the voice codec type of the calling party, the type of the voice codec of the called party, and the result of the negotiation are as shown in Table 1,
表 1  Table 1
主叫语音编解码类型 被叫语音编解码类型 协商结果  Calling voice codec type Called voice codec type Negotiation result
EVRC—丽 EVRC— NW ,且所述 TFO协商成功,使  EVRC-丽EV-NW, and the TFO negotiation is successful, so that
EVRC— NW与主叫的 用 EVRC— WB的 EVRC— NW中包括的具 COP 体语音编码类型不完全  EVRC—NW and calling EVRC—WB EVRC—NW includes COP body speech coding type incomplete
相同  the same
EVRC—丽 EVRC— B TFO协商成功,使 用 EVRC— B的 COP EVRC—丽 EVRC—稱 TFO协商成功,使 用 EVRC— WB的 COPEVRC-丽EVRC-B TFO negotiated successfully, using COP of EVRC-B EVRC-丽EVRC-called TFO negotiation success, using EVRC-WB COP
EVRC—丽 EVRC TFO协商失败,使 用 EVRC— Β的 COP EVRC-Li EVRC TFO negotiation failed, using EVRC - Β COP
EVRC—丽 PSTN TFO协商失败,主 叫使用 EVRC— B的 COP EVRC-Li PSTN TFO negotiation failed, the caller used EVRC-B COP
该实施例的实现过程与实施例 1 类似,其不同之处在于:主叫的语音 编解码类型是 EVRC— WB ,且与被叫的语音编码类型不同,并根据被叫的 语音编解码类型不同,其协商的结果如表 2所示所示: 表 2 The implementation process of this embodiment is similar to that of Embodiment 1, except that the calling party's voice codec type is EVRC-WB, and is different from the called voice coding type, and is different according to the called voice codec type. The results of the consultation are shown in Table 2: Table 2
Figure imgf000012_0001
EVRC WB PSTN TFO协商失败,主叫使 用 EVRC— B的 COP 本实施例中 ,如果主叫的基站控制器发现主被叫双方的语音编解码类 型不一致,仍然让 TFO协商成功,并将语音编解码类型切换为与被叫方一 致的语音编解码类型,避免将语音切换为窄带 PCM语音造成的质量损失。 并在 TFO协商失败的情况下,切换到窄带的语音编解码 EVRC— B ,减少了 编解码切换对语音质量的损伤。
Figure imgf000012_0001
The EVRC WB PSTN TFO negotiation fails. The caller uses the COP of the EVRC-B. In this embodiment, if the calling base station controller finds that the voice codec types of the calling and called parties are inconsistent, the TFO negotiation is still successful, and the voice codec is decoded. The type is switched to the voice codec type consistent with the called party, avoiding the quality loss caused by switching the voice to narrowband PCM voice. And in the case of TFO negotiation failure, switching to the narrowband speech codec EVRC-B reduces the damage of the codec switching to the voice quality.
基于上述实施例的实现过程,本发明实施例还提供一种提高语音通信 的装置,其结构示意图详见图 3,所述装置应用于主叫用户与被叫用户进行 通信,所述主叫用户为高清用户 ,所述装置包括:建立单元 31 ,协商单元 32 ,判断单元 33 ,发送单元 34、 切换单元 35和编解码单元 36 ,其中 ,所 述建立单元 31 ,用于建立主叫用户向被叫用户发起的呼叫;所述协商单元 32,用于在建立单元建立呼叫后,与对端进行免二次编解码操作 TFO协商; 所述判断单元 33 ,用于在判断被叫用户的语音编解码类型与主叫用户的语 音编解码类型不同,且满足设定的第一对应关系时,触发发送单元 34向对 端发送 TFO协商成功,并触发切换单元 35将主叫用户的语音编解码类型切 换为与被叫用户一致的语音编解码类型;所述发送单元 34 ,用于向对端发 送 TFO协商成功;所述切换单元 35,用于将主叫用户的语音编解码类型切 换为与被叫用户一致的语音编解码类型;编解码单元 36 ,用于利用切换后 的主叫用户的语音编解码类型对该呼叫进行编解码后发送给对端。 Based on the implementation process of the foregoing embodiment, the embodiment of the present invention further provides an apparatus for improving voice communication, and a schematic structural diagram thereof is shown in FIG. 3. The device is applied to a calling user to communicate with a called user, where the calling user For a high-definition user, the device includes: an establishing unit 31, a negotiating unit 32, a determining unit 33, a transmitting unit 34, a switching unit 35, and a codec unit 36, wherein the establishing unit 31 is configured to establish a calling user to be Calling the user-initiated call; the negotiating unit 32 is configured to perform a TFO-free negotiation with the peer end after the establishing unit establishes the call; the determining unit 33 is configured to determine the voice of the called user. When the decoding type is different from the voice codec type of the calling user, and the first correspondence is set, the trigger sending unit 34 sends the TFO negotiation to the opposite end, and triggers the switching unit 35 to change the voice encoding and decoding type of the calling user. Switching to a voice codec type consistent with the called user; the sending unit 34, configured to send a TFO negotiation to the peer end successfully; the switching unit 35, configured to After codec unit 36 for switching use; audio codec is switched to the same type of voice codec called user type The voice codec type of the calling user encodes and decodes the call and sends it to the peer.
优选的,所述切换单元 35将主叫用户的语音编解码类型切换为与被叫 用户一致的语音编解码类型为 :将主叫用户的语音编解码类型切换为主叫 用户和被叫用户均支持的编解码类型中质量最好的编解码类型。  Preferably, the switching unit 35 switches the voice codec type of the calling user to the voice codec type consistent with the called user: switching the voice codec type of the calling user to the calling user and the called user. The best quality codec type among the supported codec types.
优选的,所述判断单元 33 ,还用于在判断被叫用户的语音编解码类型 与主叫用户的语音编解码类型不同,且满足设定的第二对应关系时,触发 发送单元向对端发送 TFO协商失败,并触发切换单元将主叫用户的语音编解 码类型切换为 EVRC— B的能力操作点 COP;  Preferably, the determining unit 33 is further configured to: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and satisfying the set second correspondence, triggering the sending unit to the opposite end The sending TFO negotiation fails, and triggers the switching unit to switch the voice codec type of the calling user to the capability operating point COP of the EVRC_B;
所述发送单元 34 ,还用于向对端发送 TFO协商失败;  The sending unit 34 is further configured to send a TFO negotiation failure to the peer end;
所述切换单元 35 , 还用于将主叫用户的语音编解码类型切换为 EVRC— B的能力操作点 COP。  The switching unit 35 is further configured to switch the voice codec type of the calling user to the capability operating point COP of the EVRC-B.
在该实施例中 ,当主叫的语音编解码类型为 EVRC— NW或 EVRC— WB , 被叫的语音编解码类型为 EVRC— NW , 且所述 EVRC— NW 与主叫的 EVRC— NW 中包括的具体语音编码类型不完全相同、 EVRC— B、 EVRC 或 PSTN等,满足第一对应关系; 当主叫的语音编解码类型为 EVRC— NW或 EVRC— WB ,被叫的语音编解码类型为 EVRC 、 EVRC— B或 PSTN ,满足 第二对应关系。  In this embodiment, when the calling party's voice codec type is EVRC_NW or EVRC-WB, the called voice codec type is EVRC_NW, and the EVRC_NW and the calling EVRC_NW are included. The specific voice coding type is not exactly the same, EVRC-B, EVRC or PSTN, etc., satisfying the first correspondence; when the calling party's voice codec type is EVRC-NW or EVRC-WB, the called voice codec type is EVRC , EVRC-B or PSTN, satisfying the second correspondence.
优选的,所述装置还可以进一步包括:存储单元,用于存储主叫用户 的语音编解码类型、 被叫用户的语音编解码类型与 TFO协商结果的第一对 应关系,所述第一对应关系中 ,主叫用户和被叫用户所支持的语音编解码 类型至少有一个相同;和 /或存储主叫用户的语音编解码类型、 被叫用户的 语音编解码类型与 TFO协商结果的第二对应关系,其中,所述第二对应关 系中 ,主叫用户和被叫用户所支持的语音编解码类型没有一个相同。 Preferably, the apparatus may further include: a storage unit, configured to store a voice codec type of the calling user, a voice codec type of the called user, and a first pair of TFO negotiation results. It should be noted that, in the first correspondence, at least one of the voice codec types supported by the calling user and the called user is the same; and/or the voice codec type of the calling user and the voice codec of the called user are stored. A second correspondence between the type and the TFO negotiation result, wherein, in the second correspondence, the voice codec types supported by the calling user and the called user are not the same.
优选的,所述装置还可以仅以包括:查询单元,用于在判断被叫用户 的语音编解码类型与主叫用户的语音编解码类型不同,且满足设定的第一 对应关系或第二对应关系时,查询所述主叫用户的语音编解码类型、 被叫 用户的语音编解码类型与 TFO协商结果的对应关系,得到 TFO协商成功或 TFO协商失败的协商结果。  Preferably, the apparatus may further include: a query unit, configured to determine that the voice codec type of the called user is different from the voice codec type of the calling user, and meet the set first correspondence or the second In the corresponding relationship, the correspondence between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result is queried, and the negotiation result of the TFO negotiation success or the TFO negotiation failure is obtained.
优选的,所述装置可以是基站控制器,也可以是集成在主叫的基站控 制器中 ,也可以集成在被叫的基站控制器中 ,也可以独立部署在网络中 , 本实施例不做限制。  Preferably, the device may be a base station controller, or may be integrated in the base station controller of the calling party, or may be integrated in the called base station controller, or may be independently deployed in the network, and this embodiment does not do limit.
本发明实施例中 ,在进行免二次编解码操作 TFO协商时,如果主叫的 基站控制器发现主被叫双方的语音编解码类型不一致,仍然让 TFO协商成 功,并将语音编解码类型切换为与被叫方一致的语音编解码类型,避免将 语音切换为窄带 PCM语音造成的质量损失。 也就是说,通过本实施例兼容 窄带的语音编解码,大大提升了 TFO协商的成功率。进一步,在 TFO协商 失败的情况下,切换到窄带的编解码,减少了编解码切换对语音质量的损 伤,提高了语音通话质量。 还需要说明的是,在本文中 ,诸如第一和第二等之类的关系术语仅仅 用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或 者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。 而且,术 语"包括"、 "包含 "或者其任何其他变体意在涵盖非排他性的包含,从而使得 包括一系列要素的过程、 方法、 物品或者设备不仅包括那些要素,而且还 包括没有明确列出的其他要素,或者是还包括为这种过程、 方法、 物品或 者设备所固有的要素。 在没有更多限制的情况下, 由语句 "包括一个 ...... " 限定的要素,并不排除在包括所述要素的过程、 方法、 物品或者设备中还 存在另外的相同要素。 In the embodiment of the present invention, when performing the TFO negotiation of the secondary codec-free operation, if the calling base station controller finds that the voice codec types of the calling and called parties are inconsistent, the TFO negotiation is still successful, and the voice codec type is switched. For the type of speech codec that is consistent with the called party, avoid the quality loss caused by switching the voice to narrowband PCM voice. That is to say, the success rate of the TFO negotiation is greatly improved by the voice codec compatible with the narrowband in this embodiment. Further, in the case that the TFO negotiation fails, switching to the narrowband codec reduces the damage of the codec handover to the voice quality, and improves the voice call quality. It should also be noted that, in this context, relational terms such as first and second, etc. are used merely to distinguish one entity or operation from another entity or operation, without necessarily requiring or implying such entities or operations. There is any such actual relationship or order between them. Furthermore, the terms "including", "comprising" or "comprising" or "comprising" are intended to encompass a non-exclusive inclusion, such that a process, method, article, or device that includes a plurality of elements includes not only those elements but also Other elements, or elements that are inherent to such a process, method, item, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional elements in the process, method, article, or device that comprises the element.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到本 发明可借助软件加必需的通用硬件平台的方式来实现, 当然也可以通过硬 件,但很多情况下前者是更佳的实施方式。 基于这样的理解,本发明的技 术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体 现出来,该计算机软件产品可以存储在存储介质中 ,如 ROM/RAM、 磁碟、 光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服 务器,或者网络设备等)执行本发明各个实施例或者实施例的某些部分所 述的方法。  Through the description of the above embodiments, those skilled in the art can clearly understand that the present invention can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is a better implementation. the way. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product in essence or in the form of a software product, which may be stored in a storage medium such as a ROM/RAM or a disk. , an optical disk, etc., includes instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform the methods described in various embodiments of the present invention or portions of the embodiments.
以上所述仅是本发明的优选实施方式,应当指出 ,对于本技术领域的 普通技术人员来说,在不脱离本发明原理的前提下,还可以作出若干改进 和润饰,这些改进和润饰也应视为本发明的保护范围。 The above is only a preferred embodiment of the present invention, and it should be noted that those skilled in the art can make some improvements without departing from the principles of the present invention. And retouching, these improvements and retouchings should also be considered as protection scope of the present invention.

Claims

权利要求 Rights request
1、 一种提高语音通信的方法,应用于主叫用户与被叫用户进行通信, 所述主叫用户为高清用户 ,其特征在于,包括:  A method for improving voice communication, which is applied to a communication between a calling user and a called user, wherein the calling user is a high-definition user, and the method includes:
建立主叫用户向被叫用户发起的呼叫,并与对端进行免二次编解码操 作 TFO协商;  Establish a call initiated by the calling user to the called user, and negotiate with the peer to perform a secondary codec-free TFO negotiation;
若判断被叫用户的语音编解码类型与主叫用户的语音编解码类型不 同,且满足设定的第一对应关系时,则向对端发送 TFO协商成功,并将主 叫用户的语音编解码类型切换为与被叫用户一致的语音编解码类型;  If it is determined that the voice codec type of the called user is different from the voice codec type of the calling user, and the first correspondence corresponding to the setting is satisfied, the TFO negotiation is successfully sent to the peer end, and the voice code of the calling user is coded and decoded. The type is switched to the voice codec type consistent with the called user;
利用切换后的主叫用户的语音编解码类型对该呼叫进行编解码后发送 给对端。  The call is coded and decoded by the switched party's voice codec type and sent to the peer.
2、 根据权利要求 1所述的方法,其特征在于,所述将主叫用户的语音 编解码类型切换为与被叫用户一致的语音编解码类型为 :将主叫用户的语 音编码切换为主叫用户和被叫用户均支持的编解码类型中质量最好的编解 码类型。  2. The method according to claim 1, wherein the switching the voice codec type of the calling user to the voice codec type consistent with the called user is: switching the voice code of the calling user to the master The best quality codec type among the codec types supported by both the user and the called user.
3、 根据权利要求 1或 2所述的方法,其特征在于,所述判断被叫用户 的语音编解码类型与主叫用户的语音编解码类型不同,且满足设定的第一 对应关系,则向对端发送 TFO协商成功,并将主叫用户的语音编解码类型 切换为与被叫用户一致的语音编解码类型,包括:  The method according to claim 1 or 2, wherein the determining that the called user's voice codec type is different from the calling user's voice codec type, and satisfying the set first correspondence relationship, The TFO negotiation is successfully sent to the peer end, and the voice codec type of the calling user is switched to the voice codec type consistent with the called user, including:
如果主叫用户的语音编码类型为 : EVRC NW 或增强语音编码宽带 EVRC—稱: If the calling user's voice coding type is: EVRC NW or enhanced voice coding broadband EVRC - said:
当所述被叫用户的语音编码类型为 : EVRC— NW ,且所述 EVRC— NW 与主叫的 EVRC— NW中包括的语音编码类型不完全相同时,则向对端发送 TFO协商成功,并将主叫用户的语音编解码类型切换为 EVRC— WB的能力 操作点 COP;  When the voice coding type of the called user is: EVRC_NW, and the voice coding type included in the EVRC_NW of the calling party is not exactly the same, the TFO negotiation is successfully sent to the opposite end, and Switching the voice codec type of the calling user to the capability operating point COP of the EVRC-WB;
当所述被叫用户的语音编码类型为 :增强语音编码窄带 EVRC— B时, 则向对端发送 TFO 协商成功 , 并将主叫用户的语音编解码类型切换为 EVRC— B的能力操作点 COP。  When the voice coding type of the called user is: enhanced voice coding narrowband EVRC_B, the TFO negotiation is successfully sent to the opposite end, and the voice codec type of the calling user is switched to the capability operation point COP of the EVRC_B. .
4、 根据权利要求 1所述的方法,其特征在于,所述方法还包括: 在判断被叫用户的语音编解码类型与主叫用户的语音编解码类型不 同,且满足设定的第二对应关系时,则向对端发送 TFO协商失败,并将主 叫用户的语音编解码类型切换为 EVRC— B的能力操作点 COP。  The method according to claim 1, wherein the method further comprises: determining that the type of the voice codec of the called user is different from the type of the voice codec of the calling user, and satisfying the second corresponding setting In the case of a relationship, the TFO negotiation failure is sent to the opposite end, and the voice codec type of the calling user is switched to the capability operating point COP of the EVRC-B.
5、 根据权利要求 4所述的方法,其特征在于,所述在判断被叫用户的 语音编解码类型与主叫用户的语音编解码类型不同,且满足设定的第二对 应关系时, 向对端发送 TFO协商失败,并将主叫用户的语音编解码类型切 换为 EVRC— B的能力操作点 COP ,包括:  The method according to claim 4, wherein when determining that the type of the speech codec of the called user is different from the type of the speech codec of the calling user, and satisfying the set second correspondence, The peer sends a TFO negotiation failure, and switches the calling user's voice codec type to the EVRC-B capability operating point COP, including:
如果主叫用户的语音编码类型为 : EVRC— NW或 EVRC— WB: 当所述被叫用户的语音编码类型为 : EVRC时;则向对端发送 TFO协 商失败,并将主叫用户的语音编解码类型切换为 EVRC— B的 COP; 当所述被叫用户的语音编码类型为 :公共交换电话网 PSTN 时;则向 对端发送 TFO协商结果为 : TFO协商失败,并将主叫用户的语音编解码类 型切换为 EVRC— B的 COP。 If the voice coding type of the calling user is: EVRC—NW or EVRC—WB: When the voice coding type of the called user is: EVRC; then the TFO negotiation fails to be sent to the opposite end, and the voice of the calling user is edited. The decoding type is switched to the COP of EVRC-B; When the voice coding type of the called user is: public switched telephone network PSTN; the TFO negotiation result is sent to the opposite end as follows: TFO negotiation fails, and the voice codec type of the calling user is switched to the COP of EVRC_B .
6、 根据权利要求 1至 3任一项所述的方法,其特征在于,所述方法还 包括:  The method according to any one of claims 1 to 3, further comprising:
存储主叫用户的语音编解码类型、 被叫用户的语音编解码类型与 TFO 协商结果的第一对应关系,其中 ,所述第一对应关系中 ,主叫用户和被叫 用户所支持的语音编解码类型至少有一个相同。  The first correspondence between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result, where the voice code supported by the calling user and the called user is included in the first correspondence The decoding type has at least one of the same.
7、 根据权利要求 1至 5任一项所述的方法,其特征在于,所述方法还 包括:  The method according to any one of claims 1 to 5, wherein the method further comprises:
存储主叫用户的语音编解码类型、 被叫用户的语音编解码类型与 TFO 协商结果的第二对应关系,其中 ,所述第二对应关系中 ,主叫用户和被叫 用户所支持的语音编解码类型没有一个相同。  The second corresponding relationship between the voice codec type of the calling user, the voice codec type of the called user, and the TFO negotiation result, where the voice code supported by the calling user and the called user is stored in the second correspondence The decoding types are not the same.
8、 一种提高语音通信的装置,应用于主叫用户与被叫用户进行通信, 所述主叫用户为高清用户 ,其特征在于,包括:  A device for improving voice communication, which is applied to a communication between a calling user and a called user, wherein the calling user is a high-definition user, and the method includes:
建立单元,用于建立主叫用户向被叫用户发起的呼叫;  Establishing a unit, configured to establish a call initiated by the calling user to the called user;
协商单元,用于在建立单元建立呼叫后,与对端进行免二次编解码操 作 TFO协商;  a negotiating unit, configured to perform a TFO negotiation with the peer end to perform a secondary codec-free operation after establishing a call by the establishing unit;
判断单元,用于在判断被叫用户的语音编解码类型与主叫用户的语音 编解码类型不同,且满足设定的第一对应关系时,触发发送单元向对端发 送 TFO协商成功,并触发切换单元将主叫用户的语音编解码类型切换为与被 叫用户一致的语音编解码类型; a determining unit, configured to determine a voice codec type of the called user and a voice of the calling user When the codec type is different and the first correspondence is set, the trigger sending unit sends the TFO negotiation to the peer end successfully, and triggers the switching unit to switch the voice codec type of the calling user to the voice code consistent with the called user. Decoding type;
发送单元,用于向对端发送 TFO协商成功;  a sending unit, configured to send a TFO negotiation to the peer end successfully;
切换单元,用于将主叫用户的语音编解码类型切换为与被叫用户一致 的语音编解码类型;  a switching unit, configured to switch a voice codec type of the calling user to a voice codec type consistent with the called user;
编解码单元,用于利用切换后的主叫用户的语音编码类型对该呼叫进 行编解码后发送给对端。  The codec unit is configured to encode and decode the call by using the voice coding type of the switched calling party, and then send the call to the opposite end.
9、 根据权利要求 8所述的装置,其特征在于,所述切换单元将主叫用 户的语音编解码类型切换为与被叫用户一致的语音编解码类型为 :将主叫 用户的语音编解码类型切换为主叫用户和被叫用户均支持的编解码类型中 质量最好的编解码类型。  The device according to claim 8, wherein the switching unit switches the voice codec type of the calling user to a voice codec type consistent with the called user: encoding and decoding the voice of the calling user Type switching is the best quality codec type among the codec types supported by both the calling user and the called user.
10、 根据权利要求 8所述的装置,其特征在于,  10. Apparatus according to claim 8 wherein:
所述判断单元,还用于在判断被叫用户的语音编解码类型与主叫用户 的语音编解码类型不同,且满足设定的第二对应关系时,触发发送单元向 对端发送 TFO协商失败,并触发切换单元将主叫用户的语音编解码类型切换 为 EVRC— B的能力操作点 COP;  The determining unit is further configured to: when determining that the voice codec type of the called user is different from the voice codec type of the calling user, and satisfying the set second correspondence, the trigger sending unit sends the TFO negotiation failure to the peer end. And triggering the switching unit to switch the voice codec type of the calling user to the capability operating point COP of the EVRC_B;
所述发送单元,还用于向对端发送 TFO协商失败;  The sending unit is further configured to send a TFO negotiation failure to the peer end;
所述切换单元,还用于将主叫用户的语音编解码类型切换为 EVRC— B 的能力操作点 COP。 The switching unit is further configured to switch the voice codec type of the calling user to EVRC-B The ability to operate point COP.
11、 根据权利要求 8至 10任一项所述的装置,其特征在于,还包括: 存储单元,用于存储主叫用户的语音编解码类型、 被叫用户的语音编 解码类型与 TFO协商结果的第一对应关系;其中 ,所述第一对应关系中 , 主叫用户和被叫用户所支持的语音编解码类型至少有一个相同;和 /或,用 于存储主叫用户的语音编解码类型、 被叫用户的语音编解码类型与 TFO协 商结果的第一对应关系和第二对应关系;其中 ,所述第二对应关系中 ,主 叫用户和被叫用户所支持的语音编解码类型没有一个相同。  The device according to any one of claims 8 to 10, further comprising: a storage unit, configured to store a voice codec type of the calling user, a voice codec type of the called user, and a TFO negotiation result. a first correspondence relationship, wherein, in the first correspondence, at least one of a voice codec type supported by the calling user and the called user is the same; and/or, for storing a voice codec type of the calling user And a first correspondence relationship between the voice codec type of the called user and the TFO negotiation result, and a second correspondence relationship, wherein in the second correspondence relationship, the voice codec type supported by the calling user and the called user is not one. the same.
PCT/CN2012/071981 2011-03-31 2012-03-06 Method and device for improving voice communication WO2012130023A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110081738.1 2011-03-31
CN201110081738.1A CN102739605B (en) 2011-03-31 2011-03-31 Method and device for improving speech communication

Publications (1)

Publication Number Publication Date
WO2012130023A1 true WO2012130023A1 (en) 2012-10-04

Family

ID=46929421

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/071981 WO2012130023A1 (en) 2011-03-31 2012-03-06 Method and device for improving voice communication

Country Status (2)

Country Link
CN (1) CN102739605B (en)
WO (1) WO2012130023A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014205821A1 (en) * 2013-06-28 2014-12-31 华为技术有限公司 Method, apparatus and system for voice communication
CN104424951B (en) * 2013-08-19 2018-03-20 中国电信股份有限公司 A kind of different systems TFO and the method and apparatus of TrFO intercommunications conversion
CN107018537B (en) * 2017-03-30 2020-03-17 努比亚技术有限公司 Voice communication method and device
CN108521400B (en) * 2018-03-05 2020-09-22 厦门亿联网络技术股份有限公司 Codec self-adaptive method for delayed backoff
CN116708675A (en) * 2022-11-17 2023-09-05 荣耀终端有限公司 Conversation method and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090003570A1 (en) * 2007-06-26 2009-01-01 Texas Instruments Incorporated Method, system and apparatus for providing endpoint-to-endpoint transcoding free connection
CN101378521A (en) * 2007-08-29 2009-03-04 中国移动通信集团公司 Method and system for implementing encode/decode-free operation
CN101742560A (en) * 2009-11-20 2010-06-16 华为技术有限公司 Data transmission method, data transmission device and network system
US20100273417A1 (en) * 2009-04-23 2010-10-28 Motorola, Inc. Establishing Full-Duplex Audio Over an Asynchronous Bluetooth Link

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090003570A1 (en) * 2007-06-26 2009-01-01 Texas Instruments Incorporated Method, system and apparatus for providing endpoint-to-endpoint transcoding free connection
CN101378521A (en) * 2007-08-29 2009-03-04 中国移动通信集团公司 Method and system for implementing encode/decode-free operation
US20100273417A1 (en) * 2009-04-23 2010-10-28 Motorola, Inc. Establishing Full-Duplex Audio Over an Asynchronous Bluetooth Link
CN101742560A (en) * 2009-11-20 2010-06-16 华为技术有限公司 Data transmission method, data transmission device and network system

Also Published As

Publication number Publication date
CN102739605A (en) 2012-10-17
CN102739605B (en) 2014-12-24

Similar Documents

Publication Publication Date Title
EP1845687B1 (en) A method for ip-based service transmission
US8515053B2 (en) Method for changing session media, method for establishing a call, and equipment thereof
US8885638B2 (en) Method and apparatus for enabling peer-to-peer communication between endpoints on a per call basis
CN102959920B (en) Codec deployment using in-band signals
WO2012130023A1 (en) Method and device for improving voice communication
US20090003570A1 (en) Method, system and apparatus for providing endpoint-to-endpoint transcoding free connection
WO2007118380A1 (en) Method, system and device for negotiating voice coding/decoding in communication system
WO2009024043A1 (en) Video inter-working gateway equipment, system and method for realizing video call service
WO2015000356A1 (en) Webrtc communication method, related device and system
WO2006097045A1 (en) A multimedia call processing method and the system thereof
CN101288320B (en) Method and device for establishing and optimizing bearer path
WO2008098490A1 (en) Method and apparatus for adjusting audio codecs
CN101141807B (en) Coding/decoding negotiation method
WO2012075966A1 (en) Processing method for media streams, and media gateway
WO2014166366A1 (en) Method and device for performing capability negotiation in a long term evolution cluster network
EP1905169A2 (en) Methods and system for communications between equipment using one or more interleaved mobile level stuffing sequences
JP4421187B2 (en) Communications system
WO2010083773A1 (en) Coding-decoding negotiation method, communication system, and device for encrypted voice call
WO2019228534A1 (en) Media transmission method and h323-sip gateway
US20130231103A1 (en) Core network and communication system
WO2009079960A1 (en) Method, system and equipment for outband dtmf signaling interworking
KR100641326B1 (en) Call management method in 3GPP generation mobile communication network
WO2012089064A1 (en) Method and device for interacting control information between access terminal in circuit switch domain and as
RU2446605C2 (en) Method, system and device for reconciliation of session initiation protocol signaling data service
EP2574100B1 (en) Rate adjustment method and apparatus applied to trfo voice call switching

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12764578

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12764578

Country of ref document: EP

Kind code of ref document: A1