WO2009024066A1 - Procédé de commande de détection d'activité vocale et dispositif de commande apparenté - Google Patents

Procédé de commande de détection d'activité vocale et dispositif de commande apparenté Download PDF

Info

Publication number
WO2009024066A1
WO2009024066A1 PCT/CN2008/071995 CN2008071995W WO2009024066A1 WO 2009024066 A1 WO2009024066 A1 WO 2009024066A1 CN 2008071995 W CN2008071995 W CN 2008071995W WO 2009024066 A1 WO2009024066 A1 WO 2009024066A1
Authority
WO
WIPO (PCT)
Prior art keywords
call
called
encoder
calling
ring back
Prior art date
Application number
PCT/CN2008/071995
Other languages
English (en)
French (fr)
Inventor
Zhenhua Liu
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Priority to EP08783986.6A priority Critical patent/EP2099253B1/en
Publication of WO2009024066A1 publication Critical patent/WO2009024066A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42017Customized ring-back tones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Definitions

  • the present invention relates to the field of communication technologies, and in particular, to a method for voice activation detection control and a control device thereof. Background technique
  • the CRBT personalized ring back tone
  • the CRBT service can be applied to mobile networks and fixed networks.
  • the content that is usually played through the ring tones can be a piece of music, a song, or a small story.
  • the call between the calling party and the called party needs to pass a VOIP (Voice Over IP, voice over IP (Internet Protocol) network) (or VOATM (Voice Over ATM) voice in ATM (Asynchronous Transfer) Mode, Asynchronous Transfer Mode)
  • VOIP Voice Over IP
  • VOATM Voice Over ATM
  • ATM ATM
  • Asynchronous Transfer Asynchronous Transfer Mode
  • VAD is a speech discriminating technology.
  • a user makes a phone call, it is usually a two-way communication.
  • the calling party and the called party both listen for an average of half of the time.
  • the user listens the user does not have a strong voice to transmit.
  • the speech signal detected by the encoder is weak, basically the background noise.
  • the encoder only detects part of the background sound and encodes and outputs the same.
  • the voice signal data is small and the bandwidth occupied is narrow. For background noise, it is usually The number of frame passes is reduced, so VAD technology can reduce the codec transmission bandwidth without affecting voice quality.
  • the prior art has at least the following problems:
  • the existing VAD technology is generally effective for human voices, etc., but the spectrum of music cannot be made.
  • the judgment of the effect makes some music not transmitted as a valid sound.
  • the ringtone content such as music and songs is transmitted from the called side to the calling side through the VOIP (VOATM) path
  • the VAD technology in the narrowband compression codec makes the music effect worse, so the music effect of the existing ring tones is often not satisfactory. Sometimes I can't even hear it clearly.
  • the embodiment of the invention provides a method for controlling voice activation detection and a control device thereof, so as to solve the problem that the effect of transmitting CRBT music in the prior art is affected by the VAD technology.
  • an embodiment of the present invention provides a method for voice activation detection control, including the following steps:
  • the ringback tone is a personalized ring back tone, and if not, the process ends, otherwise the voice activation detection VAD function of the encoder in the call path is closed, When the called party enters a call state with the calling party, the VAD function of the encoder is turned on.
  • Another embodiment of the present invention provides a method for voice activation detection control, including the following steps:
  • the VAD function of the encoder in the call path is closed, and detecting whether the called standard ringback tone is received, if a standard ringback is detected Tones, the VAD function of the encoder is turned on. Otherwise, when the called party and the calling party enter a call state, the VAD function of the encoder is turned on.
  • Another embodiment of the present invention provides a method for voice activation detection control, including the following steps:
  • the VAD function of the encoder in the call path is closed, and a timer is started to start timing
  • the VAD function of the encoder in the call path is turned on.
  • the embodiment of the present invention further provides a voice activation detection control device, including an identification unit, a determination unit, and a control unit, where the determination unit is used to determine the ringback of the current call.
  • the tone is a personalized ring back tone
  • the determining unit determines that the ring back tone of the call is a personalized ring back tone
  • the identifying unit is configured to identify whether the calling party and the called party are in a call phase, and when the identifying unit identifies that the calling party and the called party are in a talking phase, instructing the control unit to enable the VAD of the encoder in the calling path Features.
  • the embodiment of the present invention further provides another voice activation detection control device, where the device includes a timing device and a control unit.
  • the control unit When the called side device sends a ringback tone to the calling side device, the control unit The VAD function for starting the timer and turning off the encoder in the call path; when the timer set time expires, or when the called party and the caller enter the call state, the control unit is used to enable the The VAD function of the encoder in the call path.
  • Another embodiment of the present invention further provides a voice activation detection control device, where the device includes an identification unit, a control unit, and a detection unit, where:
  • the identifying unit is configured to identify that the calling and called parties are in a ringback tone phase or a call phase, and transmit the detection result to the control unit;
  • the detecting unit detects whether the ring back tone of the current call is a standard ring back tone, and transmits the detection result to the control unit;
  • the control unit When the identifying unit identifies that the calling and called parties are in a ringback tone phase, the control unit is configured to turn off the VAD function of the encoder in the call path; when the detecting unit detects When the ring back tone of the call is a standard ring back tone, the control unit is used to enable the VAD function of the encoder in the call path.
  • FIG. 1 is a system structural diagram of an embodiment of the present invention
  • FIG. 2 is a schematic block diagram of a called switch according to an embodiment of the present invention.
  • Figure 3 is a flow chart of an embodiment of the present invention.
  • FIG. 4 is a schematic block diagram of a called switch according to another embodiment of the present invention.
  • FIG. 5 is a flowchart of another embodiment of the present invention.
  • FIG. 6 is a schematic block diagram of a called switch according to still another embodiment of the present invention.
  • FIG. 7 is a flow chart of still another embodiment of the present invention. detailed description
  • the technical solution provided by the embodiment of the present invention is: Since the CRBT service is only played when the called ringing is performed, the VAD function is disabled in the ringback tone phase, so that the CRBT content in the ringback phase is not adversely affected by the VAD technology.
  • the VAD function is turned on during the call phase, and the VAD technology is used to ensure that the bandwidth occupied by the codec can be effectively reduced during the call phase.
  • the VOIP device is taken as an example to describe the application of the technical solution provided by the embodiment of the present invention.
  • the technical solution provided by the present invention is also applicable to VOATM and VOFR (voice over ATM voice is transmitted over ATM & voice over frame relay voice in frame relay)
  • VOATM and VOFR voice over ATM voice is transmitted over ATM & voice over frame relay voice in frame relay
  • the above-mentioned voice packet transmission architecture, etc. can be implemented by a person skilled in the art without creative labor, and will not be described here.
  • the VAD can be turned off directly during the ringback tone phase, and the VAD function is enabled during the call phase.
  • the called VOIP device determines whether the call ringback tone is a CRBT, for example:
  • the called switch can directly judge based on the called subscriber's "subscription data"; for example: the CRBT SS-CODE (254) service code carried in the SRI-ACK in the HLR (Home Location Register) to MSC message marks the call Call for the ring tones. User ringtone subscription data is not used to exclude other messages.
  • subscription data for example: the CRBT SS-CODE (254) service code carried in the SRI-ACK in the HLR (Home Location Register) to MSC message marks the call Call for the ring tones.
  • User ringtone subscription data is not used to exclude other messages.
  • the switch between the MSC and the CRBT platform can determine whether it is a CRBT service based on the 12531 prefix.
  • FIG. 1 The system structure diagram of this embodiment is shown in FIG. 1 , and the system package provided by the embodiment of the present invention
  • the calling user terminal, the calling exchange, the called switch, the ring back tone center and the called user terminal wherein:
  • the block diagram of the switch is shown in Figure 2 (only the part related to the present invention is described here, and other parts are not shown), including the call identification unit, the judging unit and the control unit.
  • the judging unit is used to judge whether the ring back tone of the call is The CRBT, when the judging unit judges that the ring back tone of the call is a CRBT, instructs the control unit to disable the VAD function of the encoder in the call path, and the control unit is configured to close the VAD function of the encoder in the call path according to the instruction of the judging unit;
  • the identifying unit is configured to identify whether the calling party and the called party are in a talking phase.
  • the indicating control unit starts the VAD function of the encoder in the calling path, and the control unit is configured to use the Indicates the VAD function that turns on the encoder in the call path.
  • FIG. 3 The flowchart of this embodiment is shown in FIG. 3, and specifically includes the following steps (taking SS7 signaling as an example):
  • Step S301 The calling switch sends an IAM (Initial Address Message) to the called switch according to the call request of the calling user terminal.
  • IAM Initial Address Message
  • Step S302 The called switch responds to the calling switch with an ACM (Address Complete Message);
  • Step S303 After receiving the IAM message, the called switch determines whether the called user has signed the CRBT service according to the subscription data of the called user or the service code and the called number of the current call. If yes, go to step S304, otherwise go to step S306;
  • Step S304 The called switch closes the VAD function of the encoder in the call path, and plays the CRBT to the calling party.
  • Step S305 The user picks up the phone, and the called switch sends an ANM (answer) message to the calling switch, and then performs step S307.
  • ANM answer
  • Step S306 The called switch sends an ANM (Answer) message to the calling switch.
  • Step S307 The calling user and the called user talk, and simultaneously activate the VAD function of the encoder in the call path.
  • the signaling parameters of the ringback tone phase and the call phase are different according to the inter-office signaling.
  • the principles of the technical solution provided by the embodiment are basically the same. It is well known in the art.
  • determining whether the ringback tone is a CRBT is completed by the called switch, but when the technical solution provided by the embodiment of the present invention is specifically applied, the CRBT service is not only implemented by the called side device but also in the call path. Other devices can also be implemented. Therefore, it is not limited to be determined by the called switch to determine whether the ring back tone is a ring back tone. For example, it may be a calling switch, or one or more other switches between the calling party and the called party or Other equipment.
  • the called VOIP device does not determine whether the user has signed the CRBT service.
  • the VOIP device first turns off the VAD function of the encoder in the call path during the ringback phase, and detects whether the called standard ringback tone is received, if the called side is received.
  • the standard ringback tone played by the device indicates that the VAD function of the encoder in the call path is automatically turned on if there is no CRBT. If the standard ringback tone is not detected, it is a CRBT, then the VAD of the encoder in the call path is opened during the call phase.
  • the system structure diagram of this embodiment is as shown in FIG. 1.
  • the system provided by the embodiment of the present invention includes a calling user terminal, a calling switch, a called switch, a ring back tone center and a called user terminal, where:
  • the block diagram of the called switch is shown in FIG. 4, which includes a call identification unit, a control unit and a detecting unit.
  • the detecting unit is configured to detect whether the ring back tone of the call is a standard ring back tone and transmit the detection result to the control unit.
  • the control unit determines to turn on and off the VAD function of the encoder in the call path according to the judgment result transmitted by the detecting unit, and the identification unit is used to identify whether the calling party and the called party are in the ringback phase or the call phase, and the identification unit recognizes the calling party and the calling unit.
  • the control unit When the called party is in the ringback phase, the control unit first turns off the VAD function in the call path, and the detecting unit starts detecting.
  • Step S501 The calling switch sends an IAM (Initial Address) message to the called switch according to the call request of the calling user terminal.
  • Step S502 The called switch responds to the calling switch with the ACM (Address Full Message), and turns off the VAD function of the encoder in the call path.
  • ACM Address Full Message
  • Step S503 After receiving the IAM message, the called switch detects the ring back tone of the current call. If it is detected that the ring back tone of the current call is a standard ring back tone, the process goes to step S504, otherwise, the process goes to step S506;
  • Step S504 Enable the VAD function of the encoder in the call path, and the called switch sends an ANM (Answer) message to the calling switch.
  • ANM Answer
  • Step S505 The calling user and the called user talk
  • Step S506 The called switch sends an ANM (Answer) message to the calling switch. If the VAD function of the encoder in the call path is not enabled, the called switch starts the VAD function of the encoder in the calling path.
  • ANM Anaswer
  • Step S507 The calling user and the called user talk.
  • detecting whether the ring back tone is a standard ring back tone is performed by the called switch, but when the technical solution provided by the embodiment of the present invention is specifically applied, the ring back tone service can be called by the called side device. Other devices in the path can also be implemented. Therefore, the detection of the ringback tone is not limited to being performed by the called switch, for example, it can be a calling switch, or one or more other switches between the calling party and the called party. .
  • Timer X when the timer X configuration time expires or the calling party and the called party enter the call state, the called side VOIP device opens the VAD function of the encoder in the call path, and the timer X configuration time may be the network called The maximum waiting time for no answer on-hook, or the average duration that the user rings to go off-hook or User specified time.
  • the system structure diagram of this embodiment is as shown in FIG. 1.
  • the system provided by the embodiment of the present invention includes a calling user terminal, a calling switch, a called switch, a ring back tone center and a called user terminal, where:
  • the block diagram of the called switch is shown in Figure 6. It includes the control unit and the timer.
  • the control unit closes the VAD function of the encoder in the call path at the beginning of the call and starts the timer. When the timer is set, the time expires or the main When the call and the called party enter the call state, the control unit turns on the VAD function of the encoder in the call path.
  • Step S701 The calling switch sends an IAM (Initial Address) message to the called switch according to the call request of the calling user terminal.
  • IAM Intelligent Address
  • Step S702 The called switch responds to the calling switch with an ACM address full message, and closes the VAD function of the encoder in the call path.
  • Step S703 After the called switch receives the IAM message, the timer is started. Step S704, the timer setting time expires, or the calling party and the called party enter the call state, and the VAD function of the encoder in the call path is enabled.
  • Step S705 The called switch sends an ANM (Answer) message to the calling switch.
  • Step S706 The calling user and the called user talk.
  • the expiration of the timer setting may occur before the called switch sends an ANM (Answer) message to the calling switch, and may also occur after the called switch sends an ANM (Answer) message to the calling switch, depending on the message.
  • the length of time the timer is set and the time the called party rings from ringing to answering.
  • the technical solution provided by the embodiment of the present invention controls the VAD function of the encoder in the call path in stages: the VAD function is turned off during the ring back tone phase, and the VAD function is enabled during the call phase, and the color ring tone is transmitted. It is not affected by VAD, and it takes up as little bandwidth as possible when transmitting call voice, which satisfies the requirements for music effects when the CRBT service is transmitted, and does not occupy too much bandwidth.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Description

语音激活检测控制的方法及其控制设备 技术领域
本发明涉及通信技术领域,特别是涉及语音激活检测控制的方法 及其控制设备。 背景技术
随着增值业务实现技术的快速发展,各种各样的增值业务层出不 穷, 其中的彩铃(个性化回铃音)业务使得主叫用户呼叫被叫时, 在 被叫振铃阶段可以听见被叫用户定制的各种各样的音乐,改变了传统 的主叫用户在等待被叫接听阶段只能听枯燥的回铃声的情况,提升了 用户感受, 因此彩铃业务发展很快。 彩铃业务可以应用于移动网络和 固定网络, 通常通过彩铃播放的内容可以是一段音乐、歌曲或者是小 故事等。
通常来说, 主叫和被叫之间的通话需要通过一段 VOIP ( Voice Over IP, 语音在 IP ( Internet Protocol, 网际协议) 网络上传送)(或 者 VOATM ( Voice Over ATM )语音在 ATM ( Asynchronous Transfer Mode, 异步传输模式) 网络上传送)路径传递, 如果被叫用户申请 了彩铃业务, 则彩铃也需要通过 VOIP (或者 VOATM )路径从被叫 侧传送到主叫侧。 现有技术通常在使用 VOIP (或者 VOATM )路径 时釆用窄带压缩编解码技术对传递的信号进行编解码,为降低窄带编 解码的带宽,在编解码时往往用到 VAD技术。 VAD是一种语音判别 技术, 用户通过电话通话时通常是一种双向交流, 主叫方和被叫方都 平均大致有一半时间是在听,听对方说话时用户没有较强的语音要传 送, 这时编码器检测到的语音信号较弱, 基本都是背景噪音, 编码器 只检测到部分背景声音并对其编码输出,话音信号数据很少因而占用 的带宽就窄, 对背景噪音, 通常可以减少帧传递个数, 因此, VAD 技术可以降低编解码传送带宽并且不会影响话音质量。
在实现本发明的过程中, 发明人发现现有技术至少存在以下问 题: 现有 VAD技术对人声等通常有效, 但对音乐的频谱不能作出有 效的判断, 使得部分音乐没有被当作有效声音传送。 音乐、 歌曲等彩 铃内容通过 VOIP ( VOATM )路径从被叫侧传送到主叫侧时 , 窄带压 缩编解码中的 VAD技术使得音乐效果变差, 因此现有的彩铃的音乐 效果往往效果不理想, 有时候甚至无法听清楚。
发明内容
本发明实施例提供一种语音激活检测控制的方法及其控制设备, 以实现解决现有技术中传送彩铃音乐效果被 VAD技术影响的问题。
为达到上述目的,本发明实施例提供了一种语音激活检测控制的 方法, 包括如下的步骤:
当被叫回铃响应主叫的呼叫请求时,判断所述的回铃音是否是个 性化回铃音, 如果不是, 则结束流程, 否则关闭呼叫路径中的编码器 的语音激活检测 VAD功能, 当所述的被叫与所述的主叫进入通话状 态时, 开启所述的编码器的 VAD功能。
本发明实施例还提供了另一种语音激活检测控制的方法, 包括如 下的步骤:
当所述的被叫回铃响应所述主叫的呼叫请求时,关闭所述的呼叫 路径中编码器的 VAD功能, 并检测是否收到被叫的标准回铃音, 如 果检测到标准回铃音, 则开启所述的编码器的 VAD功能, 否则, 当 所述的被叫与所述的主叫进入通话状态时,开启所述的编码器的 VAD 功能。
本发明实施例还提供了另一种语音激活检测控制的方法, 包括如 下的步骤:
当所述的被叫回铃响应所述主叫的呼叫请求时,关闭所述呼叫路 径中编码器的 VAD功能, 并启动定时器开始计时;
当所述的定时器设置的定时时间届满,或者当主叫和被叫进入通 话状态时, 所述呼叫路径中编码器的 VAD功能开启。
本发明实施例还提供了一种语音激活检测控制设备,包括识别单 元、 判断单元和控制单元, 所述的判断单元用来判断本次呼叫的回铃 音是否是个性化回铃音, 当所述的判断单元判断本次呼叫的回铃音是 个性化回铃音时,指示所述的控制单元关闭呼叫路径中编码器的 VAD 功能; 所述的识别单元用来识别主叫和被叫是否处于通话阶段, 当所 述识别单元识别所述的主叫和被叫处于通话阶段时,指示所述的控制 单元开启所述呼叫路径中编码器的 VAD功能。
本发明实施例还提供了另一种语音激活检测控制设备,所述的设 备包括定时装置和控制单元, 当所述的被叫侧设备向主叫侧设备发送 回铃音时,所述控制单元用来启动定时器并且关闭呼叫路径中编码器 的 VAD功能; 当所述的定时器设置的定时时间届满时, 或者被叫和 主叫进入通话态时,所述的控制单元用来开启所述呼叫路径中编码器 的 VAD功能。
本发明实施例还提供了另一种语音激活检测控制设备,所述的设 备包括识别单元、 控制单元和检测单元, 其中:
所述的识别单元用来识别主叫和被叫处于回铃音阶段还是通话 阶段, 并将检测结果传输到所述的控制单元;
所述的检测单元检测本次呼叫的回铃音是否为标准回铃音,并将 所述的检测结果传输到控制单元;
当所述的识别单元识别所述的主叫和被叫处于回铃音阶段时,所 述的控制单元用来关闭所述的呼叫路径中的编码器的 VAD功能; 当 所述的检测单元检测本次呼叫的回铃音为标准回铃音时,所述的控制 单元用来开启所述的呼叫路径中的编码器的 VAD功能。
附图说明
图 1是本发明一个实施例的系统结构图;
图 2是本发明一个实施例的被叫交换机的原理框图;
图 3是本发明一个实施例的流程图;
图 4是本发明另一个实施例的被叫交换机的原理框图; 图 5是本发明另一个实施例的流程图;
图 6是本发明再一个实施例的被叫交换机的原理框图; 图 7是本发明再一个实施例的流程图。 具体实施方式
本发明实施例提供的技术方案是: 由于彩铃业务只在被叫振铃时 播放, 因此, 在回铃音阶段关闭 VAD功能, 使得回铃阶段的彩铃内 容不会因为 VAD技术而受到不利影响, 在通话阶段打开 VAD功能, 利用 VAD技术保证在通话阶段能够有效的降低编解码占用的带宽。
为了使本发明的目的、技术方案及优点更加清楚明白, 以下结合 附图及实施例, 对本发明进行进一步详细说明。 应当理解, 此处所描 述的具体实施例仅仅用以解释本发明, 并不用于限定本发明。
下面以 VOIP设备为例详细说明本发明实施例提供的技术方案的 应用,本发明提供的技术方案也适用于 VOATM和 VOFR ( voice over ATM语音在 ATM上传送& voice over frame relay语音在帧中继上传 送)等类似语音分组传送架构, 本领域技术人员不需要经过创造性劳 动即可实现, 此不赘述。
本发明的一个实施例:
被叫侧 VOIP设备判断本次呼叫回铃音是彩铃, 则可直接在回铃 音阶段关闭 VAD, 并在通话阶段打开 VAD功能。
具体来说, 被叫侧 VOIP设备判断本次呼叫回铃音是否为彩铃的 方法艮多, 例如:
1、 基于被叫用户 "签约数据", 被叫交换机可以直接判断; 例如: 在 HLR (归属位置寄存器)到 MSC的消息中 SRI-ACK 中携带的彩铃 SS-CODE ( 254 )业务码标志该呼叫为彩铃呼叫。 不排 除使用其他消息获得用户彩铃签约数据。
2、 基于 "业务码 +被叫号码", 被叫交换机可以直接判断。
例如: MSC 向彩铃平台发起呼叫时, 被叫号码是 "12531+被叫 用户 MSISDN (移动用户国际号码)", 其中 12531作为前缀示例, 实 际应用应该可配置。 此时, 在 MSC到彩铃平台之间的交换机可以根 据 12531前缀判断是否为彩铃业务。
本实施例的系统结构图如图 1所示,本发明实施例提供的系统包 括主叫用户终端、 主叫交换机、 被叫交换机, 彩铃中心和被叫用户终 端, 其中:
交换机的原理框图如图 2所示 (这里只描述与本发明相关部分, 其他部分不体现), 包括呼叫识别单元、 判断单元和控制单元, 判断 单元用来判断本次呼叫的回铃音是否是彩铃, 当判断单元判断本次呼 叫的回铃音是彩铃时, 指示控制单元关闭呼叫路径中编码器的 VAD 功能,控制单元用来根据判断单元的指示关闭呼叫路径中的编码器的 VAD 功能; 识别单元用来识别主叫和被叫是否处于通话阶段, 当识 别单元识别主叫和被叫处于通话阶段时,指示控制单元开启呼叫路径 中编码器的 VAD功能, 控制单元用来根据识别单元的指示开启呼叫 路径中的编码器的 VAD功能。
本实施例的流程图如图 3 所示, 具体包括如下的步骤(以 SS7 信令为例):
步骤 S301、 主叫交换机根据主叫用户终端的呼叫请求向被叫交 换机发送 IAM ( Initial Address Message, 初始地址消息);
步骤 S302、 被叫交换机向主叫交换机响应 ACM ( Address Complete Message, 地址全消息);
步骤 S303、 被叫交换机收到 IAM消息后, 根据被叫用户的签约 数据或者本次呼叫的业务码和被叫号码判断被叫用户是否签署了彩 铃业务, 如果是, 转步骤 S304, 否则转步骤 S306;
步骤 S304、 被叫交换机关闭呼叫路径中编码器的 VAD功能, 向 主叫播放彩铃;
步骤 S305、 用户摘机, 被叫交换机向主叫交换机发送 ANM (应 答 ) 消息, 然后执行步骤 S307。
步骤 S306、 被叫交换机向主叫交换机发送 ANM (应答) 消息; 步骤 S307、 主叫用户和被叫用户通话, 同时开启呼叫路径中编 码器的 VAD功能。
根据局间信令不同,具体区分回铃音阶段和通话阶段的信令参数 也不同, 但应用本实施例提供的技术方案的原理基本一致, 这些也是 本领域内人所皆知的。
本实施例中, 判断回铃音是否为彩铃由被叫交换机完成, 但具体 应用本发明实施例提供的技术方案时,由于彩铃业务除了可以由被叫 侧设备实现之外, 在呼叫路径中的其他设备也可以实现, 因此, 对判 断回铃音是否为彩铃并不局限于由被叫交换机完成, 例如, 可以是主 叫交换机,或主叫和被叫之间的一个或者多个其他交换机或者其他的 设备。
本发明的另一个实施例:
被叫侧 VOIP设备不判断用户是否签署了彩铃业务, VOIP设备 在回铃阶段先关闭呼叫路径中编码器的 VAD功能, 并检测是否收到 了被叫的标准回铃音, 如果收到了被叫侧设备播放的标准回铃音, 说 明没有彩铃, 则自动打开呼叫路径中编码器的 VAD功能; 如果没有 检测到标准回铃音, 说明是彩铃, 则在通话阶段才打开呼叫路径中编 码器的 VAD功能。 由于每个国家的标准回铃音都是特定频率的几种 信号音的组合,因此被叫交换机能够非常容易检测到语音信号是否为 标准回铃音。
本实施例的系统结构图如图 1所示,本发明实施例提供的系统包 括主叫用户终端、 主叫交换机、 被叫交换机, 彩铃中心和被叫用户终 端, 其中:
被叫交换机的原理框图如图 4所示, 包括呼叫识别单元、控制单 元和检测单元,检测单元用来检测本次呼叫的回铃音是否为标准回铃 音并将检测结果传送到控制单元,控制单元根据检测单元传送的判断 结果决定对呼叫路径中编码器的 VAD功能的开启和关闭, 识别单元 用来识别主叫和被叫处在回铃阶段还是通话阶段, 当识别单元识别主 叫和被叫处于回铃阶段时,控制单元先关闭呼叫路径中的 VAD功能, 检测单元开始检测,如果检测单元检测到本次呼叫的回铃音不是标准 回铃音, 控制单元一直到通话阶段才开启呼叫路径中编码器的 VAD 功能; 如果检测单元检测到本次呼叫的回铃音是标准回铃音, 则控制 单元直接将呼叫路径中编码器的 VAD功能开启。 本实施例流程图如图 5所示, 具体包括如下步骤: 步骤 S501、 主叫交换机根据主叫用户终端的呼叫请求向被叫交 换机发送 IAM (初始地址) 消息;
步骤 S502、 被叫交换机向主叫交换机响应 ACM (地址全消息 ), 关闭呼叫路径中编码器的 VAD功能;
步骤 S503、 被叫交换机收到 IAM消息后, 检测本次呼叫的回铃 音, 如果检测到本次呼叫的回铃音是标准回铃音, 转步骤 S504, 否 则转步骤 S506;
步骤 S504、 开启呼叫路径中编码器的 VAD功能, 被叫交换机向 主叫交换机发送 ANM (应答) 消息;
步骤 S505、 主叫用户和被叫用户通话;
步骤 S506、 被叫交换机向主叫交换机发送 ANM (应答) 消息, 如果呼叫路径中编码器的 VAD功能尚未开启, 被叫交换机开启呼叫 路径中编码器的 VAD功能;
步骤 S507、 主叫用户和被叫用户通话。
当网络中多数用户都没有签署彩铃业务时,这种检测回铃音的方 法是很有效的, 可以节省大量带宽。
本实施例中, 检测回铃音是否为标准回铃音由被叫交换机完成, 但具体应用本发明实施例提供的技术方案时,由于彩铃业务除了可以 由被叫侧设备实现之外,在呼叫路径中的其他设备也可以实现,因此, 对回铃音的检测并不局限于由被叫交换机完成, 例如, 可以是主叫交 换机, 或主叫和被叫之间的一个或者多个其他交换机。
本发明的再一个实施例:
当被叫侧 VOIP设备不判断被叫用户是否申请了彩铃业务时, 并 且不检测是否收到了被叫的回铃声,而是直接关闭其呼叫路径中编码 器的 VAD功能, 同时启动一个可配置的定时器 X, 当定时器 X配置 的时间届满时或者主叫和被叫进入通话状态, 被叫侧 VOIP设备打开 呼叫路径中编码器的 VAD功能, 定时器 X配置的时间可以是该网络 被叫无应答挂机的最长等待时间,或者用户振铃到摘机的平均时长或 用户指定的时间。 本实施例的系统结构图如图 1所示, 本发明实施例 提供的系统包括主叫用户终端、 主叫交换机、 被叫交换机, 彩铃中心 和被叫用户终端, 其中:
被叫交换机的原理框图如图 6所示, 包括控制单元和定时器, 控 制单元在呼叫开始即关闭呼叫路径中编码器的 VAD功能并启动定时 器, 在定时器设定的时间届满时或者主叫和被叫进入通话状态时, 控 制单元开启呼叫路径中编码器的 VAD功能。
本实施例的流程图如图 7所示, 具体包括如下的步骤: 步骤 S701、 主叫交换机根据主叫用户终端的呼叫请求向被叫交 换机发送 IAM (初始地址) 消息;
步骤 S702、被叫交换机向主叫交换机响应 ACM地址全消息, 关 闭呼叫路径中编码器的 VAD功能;
步骤 S703、 被叫交换机收到 IAM消息后, 启动定时器; 步骤 S704、 定时器设定定时时间届满或者主叫和被叫进入通话 状态, 开启呼叫路径中编码器的 VAD功能;
步骤 S705、 被叫交换机向主叫交换机发送 ANM (应答) 消息; 步骤 S706、 主叫用户和被叫用户通话。
具体应用发明方案时,定时器设置的时间届满可能发生在被叫交 换机向主叫交换机发送 ANM (应答) 消息之前, 也可能发生在被叫 交换机向主叫交换机发送 ANM (应答) 消息之后, 取决于定时器设 置的时间长短以及被叫从响铃到应答的时间。
如上所述的,本发明实施例提供的技术方案通过对呼叫路径中编 码器的 VAD功能的分阶段控制: 在回铃音阶段关闭 VAD功能、在通 话阶段开启 VAD功能, 实现了在传送彩铃音时不会受 VAD影响,在 传送通话语音时尽可能占用少的带宽的目的,满足了彩铃业务传送时 对音乐效果的要求, 同时不会过多占用带宽。
本领域普通技术人员可以理解实现上述实施例方法中的全部或 部分流程, 是可以通过计算机程序来指令相关的硬件来完成, 所述的 程序可存储于一计算机可读取存储介质中, 该程序在执行时, 可包括 如上述各方法的实施例的流程。 其中, 所述的存储介质可为磁碟、 光 盘、 只读存储记忆体(Read-Only Memory, ROM )或随机存储记忆 体 ( Random Access Memory, RAM )等。
以上所述仅为本发明的较佳实施例而已, 并不用以限制本发明, 凡在本发明的精神和原则之内所作的任何修改、 等同替换和改进等 , 均应包含在本发明的保护范围之内。

Claims

权利要求
1、 一种语音激活检测控制的方法, 其特征在于, 所述的方法包 括如下的步骤:
当被叫回铃响应主叫的呼叫请求时,判断所述的回铃音是否是个 性化回铃音, 如果不是, 则结束流程, 否则关闭呼叫路径中的编码器 的语音激活检测 VAD功能, 当所述的被叫与所述的主叫进入通话状 态时, 开启所述的编码器的 VAD功能。
2、 根据权利要求 1所述的方法, 其特征在于, 所述的判断本次 呼叫的回铃音是否是个性化回铃音是基于被叫用户的签约数据来判 断的。
3、 根据权利要求 1所述的方法, 其特征在于, 所述的判断本次 呼叫的回铃音是否是个性化回铃音是基于本次呼叫的业务码和被叫 号码来判断的。
4、 一种语音激活检测控制的方法, 其特征在于, 所述的方法包 括如下的步骤:
当被叫回铃响应主叫的呼叫请求时, 关闭呼叫路径中编码器的 VAD 功能, 并检测是否收到被叫的标准回铃音, 如果检测到标准回 铃音, 则开启所述的编码器的 VAD功能, 否则, 当所述的被叫与所 述的主叫进入通话状态时, 开启所述的编码器的 VAD功能。
5、 一种语音激活检测控制的方法, 其特征在于, 所述的方法包 括如下的步骤:
当被叫回铃响应主叫的呼叫请求时, 关闭呼叫路径中编码器的 VAD功能, 并启动定时器开始计时;
当所述的定时器设置的定时时间届满,或者当主叫和被叫进入通 话状态时, 所述呼叫路径中编码器的 VAD功能开启。
6、 根据权利要求 5所述的方法, 其特征在于, 所述的定时器设 置的定时时间为被叫无应答挂机的最长等待时间或者或用户振铃到 摘机的平均时长或用户指定的时间。
7、 一种语音激活检测控制设备, 其特征在于, 所述的设备包括 识别单元、 判断单元和控制单元, 所述的判断单元用来判断本次呼叫 的回铃音是否是个性化回铃音, 当所述的判断单元判断本次呼叫的回 铃音是个性化回铃音时,指示所述的控制单元关闭呼叫路径中编码器 的 VAD功能; 所述的识别单元用来识别主叫和被叫是否处于通话阶 段, 当所述识别单元识别所述的主叫和被叫处于通话阶段时, 指示所 述的控制单元开启所述呼叫路径中编码器的 VAD功能。
8、 一种语音激活检测控制设备, 其特征在于, 所述的设备包括 定时装置和控制单元, 当所述的被叫侧设备向主叫侧设备发送回铃音 时,所述控制单元用来启动定时器并且关闭呼叫路径中编码器的 VAD 功能; 当所述的定时器设置的定时时间届满时, 或者被叫和主叫进入 通话态时, 所述的控制单元用来开启所述呼叫路径中编码器的 VAD 功能。
9、 根据权利要求 8所述的控制设备, 其特征在于, 所述的定时 器设置的定时时间为被叫无应答挂机的最长等待时间或者或用户振 铃到摘机的平均时长或用户指定的时间。
10、 一种语音激活检测控制设备, 其特征在于, 所述的设备包括 识别单元、 控制单元和检测单元, 其中:
所述的识别单元用来识别主叫和被叫处于回铃音阶段还是通话 阶段, 并将检测结果传输到所述的控制单元;
所述的检测单元检测本次呼叫的回铃音是否为标准回铃音,并将 所述的检测结果传输到控制单元;
当所述的识别单元识别所述的主叫和被叫处于回铃音阶段时,所 述的控制单元用来关闭所述的呼叫路径中的编码器的 VAD功能; 当 所述的检测单元检测本次呼叫的回铃音为标准回铃音时,所述的控制 单元用来开启所述的呼叫路径中的编码器的 VAD功能。
11、 根据权利要求 10所述的控制设备, 其特征在于, 当所述的 检测单元检测本次呼叫的回铃音为个性化回铃音时,所述的控制单元 还用来在通话阶段开启所述的呼叫路径中的编码器的 VAD功能。
PCT/CN2008/071995 2007-08-17 2008-08-14 Procédé de commande de détection d'activité vocale et dispositif de commande apparenté WO2009024066A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP08783986.6A EP2099253B1 (en) 2007-08-17 2008-08-14 Method for voice activity detection controlling and controlling device thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200710076520.0 2007-08-17
CN2007100765200A CN101159891B (zh) 2007-08-17 2007-08-17 语音激活检测控制的方法及其控制设备

Publications (1)

Publication Number Publication Date
WO2009024066A1 true WO2009024066A1 (fr) 2009-02-26

Family

ID=39307786

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2008/071995 WO2009024066A1 (fr) 2007-08-17 2008-08-14 Procédé de commande de détection d'activité vocale et dispositif de commande apparenté

Country Status (3)

Country Link
EP (1) EP2099253B1 (zh)
CN (1) CN101159891B (zh)
WO (1) WO2009024066A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115985347A (zh) * 2023-02-22 2023-04-18 南方电网数字电网研究院有限公司 基于深度学习的语音端点检测方法、装置和计算机设备

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101159891B (zh) * 2007-08-17 2010-09-08 华为技术有限公司 语音激活检测控制的方法及其控制设备
US8989058B2 (en) * 2011-09-28 2015-03-24 Marvell World Trade Ltd. Conference mixing using turbo-VAD
US9532191B2 (en) * 2012-05-18 2016-12-27 Kirusa, Inc. Multi-modal transmission of early media notifications

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001009878A1 (en) 1999-07-29 2001-02-08 Conexant Systems, Inc. Speech coding with voice activity detection for accommodating music signals
WO2002003376A2 (en) 2000-06-30 2002-01-10 Ericsson Inc. Ringback detection circuit
CN1529526A (zh) * 2003-10-17 2004-09-15 中国移动通信集团公司 在移动通讯系统中提供个性化回铃音的方法
WO2005043926A2 (en) 2003-11-04 2005-05-12 Widerthan.Com Co., Ltd. Method for providing ringback tone substitute multimedia
WO2006009362A1 (en) * 2004-07-16 2006-01-26 Sk Telecom Co., Ltd. Terminal for multimedia ring back tone service and method for controlling terminal
WO2006026221A2 (en) 2004-08-25 2006-03-09 Motorola, Inc. Speakerphone having improved outbound audio quality
CN1867003A (zh) 2005-05-16 2006-11-22 华为技术有限公司 一种实现多媒体回铃音业务的系统及方法
WO2007078186A1 (en) 2006-01-06 2007-07-12 Realnetworks Asiapacific Co., Ltd. Method of processing audio signals for improving the quality of output audio signal which is transferred to subscriber's terminal over network and audio signal pre-processing apparatus of enabling the method
CN101159891A (zh) * 2007-08-17 2008-04-09 华为技术有限公司 语音激活检测控制的方法及其控制设备

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1288935C (zh) * 2003-08-23 2006-12-06 华为技术有限公司 一种实现主叫用户收听自制回铃音的方法
CN1220398C (zh) * 2003-07-21 2005-09-21 中国移动通信集团公司 一种在移动智能网系统中提供个性化回铃音的方法

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001009878A1 (en) 1999-07-29 2001-02-08 Conexant Systems, Inc. Speech coding with voice activity detection for accommodating music signals
WO2002003376A2 (en) 2000-06-30 2002-01-10 Ericsson Inc. Ringback detection circuit
CN1529526A (zh) * 2003-10-17 2004-09-15 中国移动通信集团公司 在移动通讯系统中提供个性化回铃音的方法
WO2005043926A2 (en) 2003-11-04 2005-05-12 Widerthan.Com Co., Ltd. Method for providing ringback tone substitute multimedia
WO2006009362A1 (en) * 2004-07-16 2006-01-26 Sk Telecom Co., Ltd. Terminal for multimedia ring back tone service and method for controlling terminal
WO2006026221A2 (en) 2004-08-25 2006-03-09 Motorola, Inc. Speakerphone having improved outbound audio quality
CN1867003A (zh) 2005-05-16 2006-11-22 华为技术有限公司 一种实现多媒体回铃音业务的系统及方法
WO2007078186A1 (en) 2006-01-06 2007-07-12 Realnetworks Asiapacific Co., Ltd. Method of processing audio signals for improving the quality of output audio signal which is transferred to subscriber's terminal over network and audio signal pre-processing apparatus of enabling the method
CN101159891A (zh) * 2007-08-17 2008-04-09 华为技术有限公司 语音激活检测控制的方法及其控制设备

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115985347A (zh) * 2023-02-22 2023-04-18 南方电网数字电网研究院有限公司 基于深度学习的语音端点检测方法、装置和计算机设备

Also Published As

Publication number Publication date
EP2099253A4 (en) 2012-09-12
CN101159891B (zh) 2010-09-08
EP2099253A1 (en) 2009-09-09
EP2099253B1 (en) 2020-10-07
CN101159891A (zh) 2008-04-09

Similar Documents

Publication Publication Date Title
JP4430107B2 (ja) マルチメディアリングバックトーンサービスのための端末機及び端末機の制御方法
JP4374053B2 (ja) 発信側交換機を用いてマルチメディアリングバックトーンサービスを提供する方法及びシステム
KR101037341B1 (ko) 불연속 전송 기능을 향상시키는 방법 및 시스템
KR20080065302A (ko) 호를 발신할 때 통화 연결음을 재생하는 통신 단말기 및 방법
JP2005526466A5 (zh)
JP2008506329A (ja) マルチメディアリングバックトーンサービスのための端末コーデック設定方法及びシステム
JP4374054B2 (ja) 着信側交換機を用いてマルチメディアリングバックトーンサービスを提供する方法及びシステム
JP3873048B2 (ja) リングバックトーン伝送方法、端末機、リングバックトーン生成方法、およびリングバックトーンを生成するためのシステム
KR20050088397A (ko) 음성 호출 경보를 제공하는 방법 및 장치
WO2009024066A1 (fr) Procédé de commande de détection d'activité vocale et dispositif de commande apparenté
US8085906B2 (en) Method, system and apparatus for providing alternative multimedia ring back tone substitute service by using intelligent network
KR100559011B1 (ko) 가입자 기반 링백톤 서비스의 듀얼 코덱을 이용한 음질개선 방법
US20040001518A1 (en) System and method for emulating ringback transparently
KR100747693B1 (ko) 이동통신 단말기의 능력에 따른 멀티미디어 링백톤 대체음서비스 제공 방법, 시스템 및 장치
KR101002463B1 (ko) 휴대 단말기의 자체 링백톤 설정 방법
CN101304605A (zh) 再应答呼叫实现方法
JPWO2005119940A1 (ja) 情報提供システム及び方法並びに情報提供用プログラム
KR101110151B1 (ko) 통화 대기음 서비스 방법
WO2006045236A1 (fr) Procede permettant de joindre un fond sonore a une communication mobile
WO2009074071A1 (fr) Commutateur et procede de commande de tonalites de retour d'appel personnalisee
KR100273447B1 (ko) 호 제어 방법
CN105992199B (zh) 一种语音通信明密识别方法及系统
CN102083231A (zh) 一种用于消除彩铃播放杂音的装置及方法
KR100608738B1 (ko) 타이머를 이용한 휴대 단말기의 자체 링백톤 설정 방법
JP2000307720A (ja) ネットワーク接続装置における音声符号化方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08783986

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2008783986

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE