WO2007109960A1 - Method, system and data signal detector for realizing dada service - Google Patents

Method, system and data signal detector for realizing dada service Download PDF

Info

Publication number
WO2007109960A1
WO2007109960A1 PCT/CN2007/000423 CN2007000423W WO2007109960A1 WO 2007109960 A1 WO2007109960 A1 WO 2007109960A1 CN 2007000423 W CN2007000423 W CN 2007000423W WO 2007109960 A1 WO2007109960 A1 WO 2007109960A1
Authority
WO
WIPO (PCT)
Prior art keywords
data signal
voice
signal frame
signal
data
Prior art date
Application number
PCT/CN2007/000423
Other languages
French (fr)
Chinese (zh)
Inventor
Tong Jin
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2007109960A1 publication Critical patent/WO2007109960A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Definitions

  • the present invention relates to media stream delivery technologies, and more particularly to a method, system and data signal detector for implementing data services.
  • the media streams processed by the gateway mainly include: a voice stream, a data stream, a video stream, and the like.
  • the data stream mainly refers to the signals sent by the data devices such as fax machines, digital modems (modems), and text phones during the interaction.
  • the way in which the industry transmits data streams through gateways mainly includes VBD (Voice-Band Data) and Relay (forward).
  • the VBD mode refers to the low-loss codec processing of the data stream as a normal voice stream by using a codec with a relatively small signal impairment.
  • the proposed codec standard is ITU-T (International Telecommunication Union-Telecommunication Standard) Divisional) G.711 and ITU-T G.726.
  • ITU-T International Telecommunication Union-Telecommunication Standard
  • G.711 International Telecommunication Union-Telecommunication Standard
  • ITU-T G.726 ITU-T (International Telecommunication Union-Telecommunication Standard) Divisional) G.711 and ITU-T G.726.
  • the advantage of this method is that it is simple to implement, and does not care about the meaning of the specific data signal. It only needs to process the data signal as voice, and the processing power of the gateway is very small.
  • the processing of the voice signal usually adopts voice detection and silence compression technology to solve the problem of bandwidth occupation.
  • the voice signal is composed of a voice signal and a non-voice signal.
  • the voice signal refers to a signal during a speech period, and the non-voice signal refers to a signal. No one speaks, only the signal during background noise. More than 30 ⁇ 50% of the typical speech signal is the background noise signal without voice signal.
  • voice detection and silence compression techniques in speech signal coding can greatly reduce the code rate without affecting the speech quality.
  • the entire system includes three modules: a sender gateway, a communication channel, and a receiver gateway.
  • the sender gateway has a voice activity detector (VAD), a voice coder and a non-voice coder.
  • the receiver gateway has a voice decoder and a non-voice decoder, and the communication channel is generally an IP network.
  • the VAD analyzes each input voice frame, and divides the voice frame into a voice signal frame and a non-voice signal frame according to whether or not the voice signal is carried. If the input speech frame is a speech signal frame, the VAD controls the speech frame to enter the speech encoder for speech encoding. Otherwise, the VAD controls the speech frame to enter the non-speech encoder for silent compression encoding.
  • the information output by the voice encoder is called a voice packet, and the information output by the non-voice encoder is called a mute packet.
  • the background noise feature information in the mute packet is only used to recover the background noise, so the code rate is very low, and the voice packet is not available. One tenth of the code rate.
  • the receiving gateway After receiving the voice packet, the receiving gateway sends the voice decoder to the voice decoding to output the voice signal, and after receiving the silence packet, it sends the non-voice decoder to reconstruct the background noise excitation signal, so that the reconstructed noise signal is more natural.
  • Voice detection and muting compression technology can greatly reduce the bandwidth occupied by the communication channel, but it has not been applied in the VBD mode of data services, and it is also explicitly stipulated in the ITU-T V.152 protocol that VAD is prohibited in VBD mode. . Because the basic function of VAD is the judgment of voice and non-speech, the so-called voice is the sound signal emitted by the human vocal organ. However, the data signals and voice signals sent by the data device have different characteristics. If the VAD is applied to the detection of the data signal, the data signal is detected as background noise and sent to the non-voice encoder, causing signal damage.
  • the data signals sent by the data device in the existing VBD mode are all used as voice signal input voice encoders for low-loss codec processing, and the principle block diagram is as shown in FIG. 2.
  • An embodiment of the present invention provides a method, a system, and a data signal detector for implementing data services by using a voiceband data VBD, so as to solve the problem of occupying a wide bandwidth when implementing data services through VBD in the prior art.
  • the present invention adopts the following embodiment scheme:
  • a method for implementing a data service includes the following steps:
  • the data signal detector divides the input signal frame into a data signal frame and a non-data signal frame, and the voice signal encoder performs voice coding on the data signal frame, and the non-voice encoder performs the non-data signal frame on the frame.
  • Silent compression coding
  • the data signal frame encoded by the voice decoder voice is decoded after the voice is decoded, and the non-voice decoder reconstructs the silence signal after decoding the silence compressed and encoded non-data signal frame.
  • an energy decision threshold may be set, and by the energy decision threshold, the data signal detector further divides the input signal frame into a data signal frame and a non-data signal frame, and divides the input signal frame into equal intervals. Or a plurality of sub-signal frames that are not equally spaced, each sub-signal frame corresponding to a signal window, respectively calculating the signal energy in each signal window and comparing with the energy decision threshold, if the signal energy in each signal window is If the energy decision is less than the threshold value, the signal frame is a non-data signal frame, otherwise it is a data signal frame.
  • the method further includes: buffering a T2 time before encoding the signal frame; the T2 is greater than a data signal loss when the non-voice encoder performs silence compression coding on the non-data signal frame and then performs voice coding on the data signal frame to the voice encoder. Time period.
  • the method further includes: the data signal detector determines that the input signal enters the non-data signal stage and delays the T1 + T2 time, and then the non-voice encoder performs silence compression encoding on the non-data signal frame; the T1 is greater than the speech coding. The time period during which the data signal is lost after the data signal frame is voice-coded to the non-voice encoder to perform non-data signal frame muting compression coding.
  • the system for implementing data services includes a sender gateway, a receiver gateway, and a communication channel connecting the sender gateway and the receiver gateway, where the sender gateway includes a voice encoder, a non-voice encoder, and a data signal.
  • data signal detector will signal frame Dividing into a data signal frame and a non-data signal frame, and controlling the data signal frame to enter the voice encoder for voice coding, and the non-data signal frame to enter the non-voice encoder for silent compression coding;
  • the receiver gateway includes a received voice decoder and The non-voice decoder, the voice signal encoded by the voice decoder is decoded by the voice signal to output a valid data signal, and the non-voice decoder reconstructs the silence signal after decoding the silenced and encoded non-data signal frame.
  • the system further includes a buffer disposed between the sender gateway signal input end and the voice encoder and the non-voice encoder, configured to output the input signal frame after buffering for T2 time before encoding; the T2 is greater than The period during which the non-voice encoder performs silence compression encoding on the non-data signal frame and then loses the data signal when the voice encoder switches the voice signal to the data signal frame.
  • the data signal detector determines that the input signal enters the non-data signal stage and delays the T1 + T2 time, and then the non-voice encoder performs silent compression coding on the non-data signal frame; the T2 is greater than the non-voice coding.
  • the time period during which the data signal is lost when the non-data signal frame is muted and compressed.
  • the embodiment of the present invention further provides a data signal detector, comprising: a determining unit and a control unit, wherein the determining unit is configured to determine whether the input signal frame carries a valid data signal; and the control unit is configured to: according to the result of the determining The signal frame is output as a data signal frame or a non-data signal frame.
  • a data signal detector comprising: a determining unit and a control unit, wherein the determining unit is configured to determine whether the input signal frame carries a valid data signal; and the control unit is configured to: according to the result of the determining The signal frame is output as a data signal frame or a non-data signal frame.
  • the VBD can be greatly reduced.
  • the bandwidth of the communication channel is occupied, and the availability of the VBD mode is improved.
  • FIG. 1 is a block diagram of a communication system using voice detection and silence compression technology
  • FIG. 2 is a block diagram of a communication system of the existing VBD mode
  • Figure 3 is a block diagram of a first embodiment of the system of the present invention.
  • 4 is a schematic diagram of a signal passing through a data signal detector
  • 5 is a schematic diagram of a signal for buffering T2 time before encoding a data signal in the system embodiment of FIG. 3;
  • FIG. 6 is a schematic diagram of a signal for determining that the delay is delayed by T1+T2 after entering the mute;
  • FIG. 7 is a block diagram of a second embodiment of the system according to the present invention.
  • FIG. 3 A block diagram of a system embodiment of the present invention is shown in FIG. 3.
  • the entire system includes three modules: a sender gateway, a communication channel, and a receiver gateway.
  • the sender gateway has a data signal detector, a low-loss voice coder and a non-voice coder.
  • the receiver gateway has a voice decoder and a non-voice decoder, and the communication channel is an IP network.
  • the data signal After the data signal enters the sender gateway, it is first divided into equally spaced signal frames.
  • the frame length is determined according to the encoding protocol used by the encoder, generally between 5 and 30 milliseconds, and then the data signal detector for each input signal frame. For analysis, if the signal frame carries a valid data signal, the signal frame is used as a data signal frame, otherwise the signal frame is used as a non-data signal frame. If the signal frame is a data signal frame, the data signal detector controls the signal frame to enter the speech coder for low-loss speech coding. Otherwise, the data signal detector controls the signal frame to enter the non-speech encoder for silent compression coding.
  • the receiving gateway After receiving the voice packet, the receiving gateway sends the voice decoder to decode the voice and outputs a valid data signal. After receiving the voice packet, the gateway sends the non-voice decoder to reconstruct the mute signal.
  • the signal from the data device the signal energy during the valid data period is relatively high and stable; while during the mute, it does not mix a lot of background noise like the speech signal, but rather pure silence, even if there is some noise or electrical echo, The energy is also very small. Therefore, a reasonable energy decision threshold can be set to distinguish the valid data from the silence by the amount of signal energy.
  • a signal frame can be divided into many parts, that is, multiple sub-signal frames, and each sub-signal frame is called a signal window. Calculating the signal energy in each signal window separately. When the signal energy of all signal windows in a signal frame is lower than the set energy decision threshold, this is considered as The frame is a silent frame.
  • the size of the signal window and the energy decision threshold can be set according to the specific conditions of different gateway devices. The size of the signal window can be the same or different.
  • determining that the data signal enters the mute and exits the mute has a corresponding decision threshold, which causes an effective data signal that loses the length of the t1 time when entering the mute phase, and exits.
  • the valid data signal of the t2 time length is lost.
  • the mute process when the signal energy exceeds the set decision threshold, the mute is considered to be exited, resulting in loss of signal between the true exit of the signal and the decision to exit the mute.
  • the incomplete data signal is reconstructed. For data devices, such incomplete data signals are unacceptable.
  • the receiving data device may think that the signal is hopping or the cycle is incomplete and anomalies occur, even leading to data. Business failed.
  • the problem of losing the signal when exiting the mute phase in the present embodiment is to buffer the T2 before the data signal is sent to the encoder (T2>t2).
  • T2>t2 the technical solution of time ensures that when the data signal detector judges to exit the mute and control the data signal to be sent to the speech encoder, the valid data signal has not yet reached the encoder.
  • the reconstructed data signal after buffering is as shown in FIG.
  • the data signal detector judges that the moment of entering the silence is earlier than the time of actually entering the silence, the problem of losing the signal when entering the silent phase in this embodiment is adopted after the data signal detector determines to enter the silence, and the delay T1 (Tl> T1)
  • T1 T1
  • the data signal has a buffer of T2, which is equivalent to the time when the data signal detector judges to be muted earlier than the time when the signal sent to the encoder enters silence, tl+T2.
  • VBD mode communication system block diagram for processing data signal detection and silence compression techniques using the above technical solution is shown in FIG.
  • a buffer is provided between the sender gateway signal input terminal and the voice encoder and the non-voice encoder for buffering the input signal frame before encoding.
  • Output after T2 time to ensure that the data signal detector judges to exit the mute and When the control data signal is sent to the voice encoder, the valid data signal has not yet reached the encoder.
  • the T2 is greater than a period of time during which the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame.
  • the data signal detector determines that the input signal enters the non-data signal stage and delays the T1 + T2 time, and then the non-voice encoder performs silence compression coding on the non-data signal frame to Ensure that the valid data signals are all sent through the voice encoder; the T2 is greater than the time period during which the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame;
  • the T1 is greater than a time period during which the voice signal is lost after the voice encoder performs voice encoding on the data signal frame and the non-voice encoder performs silent compression coding switching on the non-data signal frame.
  • the data signal detector is typically configured to include a determining unit and a control unit, and the determining unit is configured to determine whether the input signal frame carries a valid data signal; And a unit, configured to output the signal frame as a data signal frame or a non-data signal frame to a voice encoder or a non-voice encoder according to the result of the judgment.
  • the time of T1 and T2 can be determined by the performance of the data signal detection algorithm used by the system, generally in the leap second stage, so the delay introduced by the data signal buffer T2 time hardly generates data traffic. Any impact. It can be seen that through the above two steps, the data signal loss caused by the data signal detector judging to enter the mute and exit the mute can be effectively avoided.
  • the signal outputted by the data device is divided into a valid data signal and a non-effective data signal, such as a mute signal, and processed by different coding modes for different signals, which can be greatly reduced in data service applications.
  • a non-effective data signal such as a mute signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

A method for realizing data service in a voice band data manner comprises the following steps: a. a data signal detector divides the input signal frames into data signal frames and non-data signal frames, then a voice coder proceeds a voice coding on the data signal frames, while the non-voice coder proceeds a mute compression coding on the non-data signal frames; b. a voice decoder proceeds a voice decoding on the data signal frames which are voice coded to output available dada signals, while a non-voice decoder proceeds a decoding on the non-data signal frames which are mute compression coded to reconstruct mute signals. A system and a signal detector use this method.

Description

实现数据业务的方法、 系统和数据信号检测器 技术领域  Method, system and data signal detector for implementing data services
本发明涉及媒体流传递技术, 尤其涉及一种实现数据业务的方法、 系 统和一种数据信号检测器。  The present invention relates to media stream delivery technologies, and more particularly to a method, system and data signal detector for implementing data services.
背景技术 Background technique
网络技术的飞速发展使得利用分组技术来传输多媒体成为可能, 传统 通讯技术和分组技术融合的趋势越来越明显, 由于传统的通讯网络和现代 分组通讯网络中传输的媒体流编码方式不同, 因此在传统网絡和分组网络 的结合点需要编解码器进行媒体流编码方式的转换, 实现这种转换的设备 为网关。  The rapid development of network technology makes it possible to use packet technology to transmit multimedia. The convergence of traditional communication technologies and packet technologies is becoming more and more obvious. Because traditional communication networks and media packets transmitted in modern packet communication networks are encoded differently, The combination of the traditional network and the packet network requires the codec to perform the media stream coding mode conversion, and the device that implements the conversion is the gateway.
目前网关处理的媒体流主要包括: 语音流、 数据流、 视频流等。 其中 的数据流主要是指传真机、 数字 modem (调制解调器)、 文本电话等数据 设备在交互过程中发出的信号。 目前业界通过网关传递数据流的方式主要 有 VBD ( Voice-Band Data话带数据 )方式和 Relay (转发)方式。  Currently, the media streams processed by the gateway mainly include: a voice stream, a data stream, a video stream, and the like. The data stream mainly refers to the signals sent by the data devices such as fax machines, digital modems (modems), and text phones during the interaction. At present, the way in which the industry transmits data streams through gateways mainly includes VBD (Voice-Band Data) and Relay (forward).
其中, VBD方式是指通过对信号损伤比较小的编解码方式,把数据流 作为普通的语音流进行低损的编解码处理, 建议采用的编解码标准是 ITU-T (国际电信联盟一电信标准分部 )规定的 G.711和 ITU-T G.726。这 种方式的优点是实现简单, 不用关心具体数据信号的含义, 只要把数据信 号当作语音进行处理就可以了, 对网关的处理能力消耗非常小。  The VBD mode refers to the low-loss codec processing of the data stream as a normal voice stream by using a codec with a relatively small signal impairment. The proposed codec standard is ITU-T (International Telecommunication Union-Telecommunication Standard) Divisional) G.711 and ITU-T G.726. The advantage of this method is that it is simple to implement, and does not care about the meaning of the specific data signal. It only needs to process the data signal as voice, and the processing power of the gateway is very small.
随着数据业务的不断发展,数据业务的种类层出不穷, VBD方式以其 实现简单、 应用范围宽的特点, 越来越受到大家的重视, 但是 VBD方式 占用带宽大的问题却一直没有得到很好的解决, 这对于很多带宽有限的应 用场合是难以接受的。  With the continuous development of data services, the types of data services are endless. The VBD method has attracted more and more attention because of its simple implementation and wide application range. However, the problem of large bandwidth occupied by VBD has not been well received. Solution, this is unacceptable for many bandwidth-constrained applications.
现有技术中对语音信号的处理通常采用话音检测和静音压缩技术来 解决带宽占用的问题, 语音信号由话音信号和非话音信号组成, 话音信号 是指有人说话期间的信号, 非话音信号是指没有人说话, 只有背景噪声期 间的信号, 典型的语音信号中有 30~50%以上的时间是不含话音信号的背 景噪声信号。 利用这一特点, 在语音信号编码中采取话音检测和静音压缩 技术可以大大降低码率, 而且不影响语音质量。 附图 1为采取话音检测和静音压缩技术的通信系统原理框图: 整个系 统包括 3个模块: 发送方网关、 通信信道和接收方网关。 其中发送方网关 中有话音检测器(Voice Activity Detector, VAD )、 话音编码器和非话音 编码器, 接收方网关中有话音解码器和非话音解码器, 通信信道一般为 IP 网络。 In the prior art, the processing of the voice signal usually adopts voice detection and silence compression technology to solve the problem of bandwidth occupation. The voice signal is composed of a voice signal and a non-voice signal. The voice signal refers to a signal during a speech period, and the non-voice signal refers to a signal. No one speaks, only the signal during background noise. More than 30~50% of the typical speech signal is the background noise signal without voice signal. By using this feature, voice detection and silence compression techniques in speech signal coding can greatly reduce the code rate without affecting the speech quality. 1 is a block diagram of a communication system employing voice detection and silence compression techniques: The entire system includes three modules: a sender gateway, a communication channel, and a receiver gateway. The sender gateway has a voice activity detector (VAD), a voice coder and a non-voice coder. The receiver gateway has a voice decoder and a non-voice decoder, and the communication channel is generally an IP network.
语音信号进入发送方网关后, VAD对每一个输入的语音帧进行分析, 根据是否携带话音信号, 将语音帧分为话音信号帧和非话音信号帧。 若输 入语音帧为话音信号帧, 则 VAD控制该语音帧进入话音编码器进行话音 编码, 否则, VAD控制该语音帧进入非话音编码器进行静音压缩编码。 话 音编码器输出的信息被称为语音包, 非话音编码器输出的信息被称为静音 包,静音包中的背景噪声特征信息只用来恢复背景噪声,所以码率非常低, 不到语音包码率的十分之一。  After the voice signal enters the sender gateway, the VAD analyzes each input voice frame, and divides the voice frame into a voice signal frame and a non-voice signal frame according to whether or not the voice signal is carried. If the input speech frame is a speech signal frame, the VAD controls the speech frame to enter the speech encoder for speech encoding. Otherwise, the VAD controls the speech frame to enter the non-speech encoder for silent compression encoding. The information output by the voice encoder is called a voice packet, and the information output by the non-voice encoder is called a mute packet. The background noise feature information in the mute packet is only used to recover the background noise, so the code rate is very low, and the voice packet is not available. One tenth of the code rate.
接收方网关收到语音包后 , 送进话音解码器进行话音解码后输出话音 信号, 收到静音包后送入非话音解码器重构背景噪声的激励信号, 使重构 噪声信号更为自然。  After receiving the voice packet, the receiving gateway sends the voice decoder to the voice decoding to output the voice signal, and after receiving the silence packet, it sends the non-voice decoder to reconstruct the background noise excitation signal, so that the reconstructed noise signal is more natural.
话音检测和静音压缩技术可以大幅度降低占用通信信道的带宽, 但是 目前在数据业务的 VBD方式中还没有得到应用,而且在 ITU-T V.152协议 中也明确规定在 VBD模式下禁止使用 VAD。 因为 VAD的基本功能是对 话音和非话音的判断,所谓话音,就是通过人的发声器官发出的声音信号。 但是数据设备发出的数据信号和话音有着不同的特征, 如果把 VAD应用 到数据信号的检测中 , 往往会把数据信号检测成背景噪声送入非话音编码 器, 造成信号的损伤。  Voice detection and muting compression technology can greatly reduce the bandwidth occupied by the communication channel, but it has not been applied in the VBD mode of data services, and it is also explicitly stipulated in the ITU-T V.152 protocol that VAD is prohibited in VBD mode. . Because the basic function of VAD is the judgment of voice and non-speech, the so-called voice is the sound signal emitted by the human vocal organ. However, the data signals and voice signals sent by the data device have different characteristics. If the VAD is applied to the detection of the data signal, the data signal is detected as background noise and sent to the non-voice encoder, causing signal damage.
因此, 现有的 VBD方式下数据设备发出的数据信号全部作为话音信 号输入话音编码器进行低损的编解码处理, 其原理框图如附图 2所示。  Therefore, the data signals sent by the data device in the existing VBD mode are all used as voice signal input voice encoders for low-loss codec processing, and the principle block diagram is as shown in FIG. 2.
但是在很多数据业务中, 有很多时间数据设备发出的只是静音, 而不 是有效的数据信号。 比如现在应用最广泛的支持最高速率为 14400bps 的 传真机, 其采用的传真过程是半双工的, 也就是说整个传真过程有 50%以 上的时间传真机发出的信号为静音。 对于这些设备发出的信号, 现有技术 统一按照数据信号对待, 浪费了较多的带宽。 发明内容 But in many data services, there is a lot of time when data devices send out silent, not valid data signals. For example, the most widely used fax machine supporting a maximum speed of 14400bps is a half-duplex fax process, which means that more than 50% of the time of the fax process is muted by the fax machine. For the signals sent by these devices, the prior art uniformly treats the data signals, and wastes more bandwidth. Summary of the invention
本发明实施例在于提供一种通过话带数据 VBD实现数据业务的方法、 系统和数据信号检测器, 以解决现有技术中通过 VBD实现数据业务时占 用过宽带宽的问题  An embodiment of the present invention provides a method, a system, and a data signal detector for implementing data services by using a voiceband data VBD, so as to solve the problem of occupying a wide bandwidth when implementing data services through VBD in the prior art.
为实现上述目的, 本发明采用如下的实施例方案:  To achieve the above object, the present invention adopts the following embodiment scheme:
一种实现数据业务的方法, 包括如下步骤:  A method for implementing a data service includes the following steps:
a、 数据信号检测器将输入的信号帧分成数据信号帧和非数据信号帧, 由话音编码器对所述的数据信号帧进行话音编码, 由非话音编码器对所述 的非数据信号帧进行静音压缩编码;  a. The data signal detector divides the input signal frame into a data signal frame and a non-data signal frame, and the voice signal encoder performs voice coding on the data signal frame, and the non-voice encoder performs the non-data signal frame on the frame. Silent compression coding;
b、 话音解码器对话音编码后的数据信号帧进行话音解码后输出有效 数据信号, 非话音解码器对静音压缩编码后的非数据信号帧解码后重构静 音信号。  b. The data signal frame encoded by the voice decoder voice is decoded after the voice is decoded, and the non-voice decoder reconstructs the silence signal after decoding the silence compressed and encoded non-data signal frame.
其中, 可以设定能量判决阈值, 以及通过所述能量判决阈值, 所述的 数据信号检测器将输入的信号帧区分为数据信号帧和非数据信号帧更进 一步, 将输入的信号帧分成等间隔的或者非等间隔的多个子信号帧, 每个 子信号帧对应一个信号窗, 分别计算每一个信号窗中的信号能量后与所述 的能量判决阈值比较, 如果每一个信号窗中的信号能量都小于所述能量判 决阔值, 则所述的信号帧为非数据信号帧, 否则为数据信号帧。  Wherein, an energy decision threshold may be set, and by the energy decision threshold, the data signal detector further divides the input signal frame into a data signal frame and a non-data signal frame, and divides the input signal frame into equal intervals. Or a plurality of sub-signal frames that are not equally spaced, each sub-signal frame corresponding to a signal window, respectively calculating the signal energy in each signal window and comparing with the energy decision threshold, if the signal energy in each signal window is If the energy decision is less than the threshold value, the signal frame is a non-data signal frame, otherwise it is a data signal frame.
所述方法还包括: 对信号帧进行编码之前緩存 T2时间; 所述 T2大于 非话音编码器对非数据信号帧进行静音压缩编码后向话音编码器对数据 信号帧进行话音编码切换时数据信号丢失的时间段。  The method further includes: buffering a T2 time before encoding the signal frame; the T2 is greater than a data signal loss when the non-voice encoder performs silence compression coding on the non-data signal frame and then performs voice coding on the data signal frame to the voice encoder. Time period.
所述方法还包括: 数据信号检测器判断输入信号进入非数据信号阶段 后延迟 Tl + T2时间, 然后由非话音编码器对所述的非数据信号帧进行静 音压缩编码; 所述 T1 大于话音编码器对数据信号帧进行话音编码后向非 话音编码器对非数据信号帧进行静音压缩编码切换时数据信号丟失的时 间段。  The method further includes: the data signal detector determines that the input signal enters the non-data signal stage and delays the T1 + T2 time, and then the non-voice encoder performs silence compression encoding on the non-data signal frame; the T1 is greater than the speech coding. The time period during which the data signal is lost after the data signal frame is voice-coded to the non-voice encoder to perform non-data signal frame muting compression coding.
本发明实施例提供的实现数据业务的系统, 包括发送方网关、 接收方 网关以及连接发送方网关和接收方网关的通信信道, 所述发送方网关包括 话音编码器、 非话音编码器和数据信号监测器, 数据信号检测器将信号帧 分成数据信号帧和非数据信号帧, 并控制数据信号帧进入话音编码器进行 话音编码、 非数据信号帧进入非话音编码器进行静音压缩编码; 所述接收 方网关包括对接收的话音解码器和非话音解码器, 话音解码器对话音编码 后的数据信号帧进行话音解码后输出有效数据信号, 非话音解码器对静音 压缩编码后的非数据信号帧解码后重构静音信号。 The system for implementing data services provided by the embodiment of the present invention includes a sender gateway, a receiver gateway, and a communication channel connecting the sender gateway and the receiver gateway, where the sender gateway includes a voice encoder, a non-voice encoder, and a data signal. Monitor, data signal detector will signal frame Dividing into a data signal frame and a non-data signal frame, and controlling the data signal frame to enter the voice encoder for voice coding, and the non-data signal frame to enter the non-voice encoder for silent compression coding; the receiver gateway includes a received voice decoder and The non-voice decoder, the voice signal encoded by the voice decoder is decoded by the voice signal to output a valid data signal, and the non-voice decoder reconstructs the silence signal after decoding the silenced and encoded non-data signal frame.
所述系统还包括设置在发送方网关信号输入端和所述话音编码器和 非话音编码器之间的緩存器, 用于将输入的信号帧在编码之前缓存 T2时 间后输出; 所述 T2大于非话音编码器对非数据信号帧进行静音压缩编码 后向话音编码器对数据信号帧进行话音编码切换时数据信号丢失的时间 段。  The system further includes a buffer disposed between the sender gateway signal input end and the voice encoder and the non-voice encoder, configured to output the input signal frame after buffering for T2 time before encoding; the T2 is greater than The period during which the non-voice encoder performs silence compression encoding on the non-data signal frame and then loses the data signal when the voice encoder switches the voice signal to the data signal frame.
所述系统中, 数据信号检测器判断输入信号进入非数据信号阶段后延 迟 Tl + T2时间, 然后由非话音编码器对所述的非数据信号帧进行静音压 缩编码; 所述 T2大于非话音编码器对非数据信号帧进行静音压缩编码后 向话音编码器对数据信号帧进行话音编码切换时数据信号丟失的时间段; 所述 T1 大于话音编码器对数据信号帧进行话音编码后向非话音编码 器对非数据信号帧进行静音压缩编码切换时数据信号丟失的时间段。  In the system, the data signal detector determines that the input signal enters the non-data signal stage and delays the T1 + T2 time, and then the non-voice encoder performs silent compression coding on the non-data signal frame; the T2 is greater than the non-voice coding. The time period during which the data signal is lost after the voice data is encoded by the voice encoder and the voice signal is switched by the voice encoder; the T1 is greater than the voice encoder to perform voice coding on the data signal frame and then to the non-voice code. The time period during which the data signal is lost when the non-data signal frame is muted and compressed.
本发明实施例还提供一种数据信号检测器, 包括判断单元和控制单 元, 所述判断单元, 用于判断输入的信号帧是否携带有效数据信号; 所述 控制单元, 用于根据判断的结果将所述信号帧作为数据信号帧或非数据信 号帧输出。  The embodiment of the present invention further provides a data signal detector, comprising: a determining unit and a control unit, wherein the determining unit is configured to determine whether the input signal frame carries a valid data signal; and the control unit is configured to: according to the result of the determining The signal frame is output as a data signal frame or a non-data signal frame.
按照本发明提供的上述实施例 , 通过将数据设备输出的信号区分为数 据信号和非数据信号信号并针对不同的信号采用不同的编码方式进行处 理, 在数据业务应用中, 能够大幅度降低通过 VBD方式实现数据业务时 占用通信信道的带宽, 提高了 VBD方式的可用性。  According to the above embodiment provided by the present invention, by dividing the signal output by the data device into a data signal and a non-data signal signal and processing different signals for different signals, in the data service application, the VBD can be greatly reduced. When the data service is implemented, the bandwidth of the communication channel is occupied, and the availability of the VBD mode is improved.
附图说明 DRAWINGS
图 1为现有的采用语音检测和静音压缩技术的通信系统框图; 图 2为现有的 VBD方式的通信系统框图;  1 is a block diagram of a communication system using voice detection and silence compression technology; FIG. 2 is a block diagram of a communication system of the existing VBD mode;
图 3为本发明所述系统第一实施例框图;  Figure 3 is a block diagram of a first embodiment of the system of the present invention;
图 4为现有的经过数据信号检测器的信号示意图; 图 5为图 3所述系统实施例中对数据信号进行编码前緩存 T2时间的 信号示意图; 4 is a schematic diagram of a signal passing through a data signal detector; 5 is a schematic diagram of a signal for buffering T2 time before encoding a data signal in the system embodiment of FIG. 3;
图 6为判断进入静音后延迟 T1+T2时间进行切换的信号示意图; 图 7为本发明所述系统第二实施例框图。  6 is a schematic diagram of a signal for determining that the delay is delayed by T1+T2 after entering the mute; FIG. 7 is a block diagram of a second embodiment of the system according to the present invention.
具体实施方式 detailed description
本发明所述系统实施例框图如附图 3所示, 整个系统包括 3个模块: 发送方网关、 通信信道和接收方网关。 发送方网关中有数据信号检测器、 低损的话音编码器和非话音编码器,接收方网关中有话音解码器和非话音 解码器, 通信信道为 IP网络。  A block diagram of a system embodiment of the present invention is shown in FIG. 3. The entire system includes three modules: a sender gateway, a communication channel, and a receiver gateway. The sender gateway has a data signal detector, a low-loss voice coder and a non-voice coder. The receiver gateway has a voice decoder and a non-voice decoder, and the communication channel is an IP network.
数据信号进入发送方网关后, 首先被分成等间隔的信号帧, 帧长根据 编码器采用的编码协议而定, 一般在 5〜30毫秒之间, 然后数据信号检测 器对每一个输入的信号帧进行分析, 如果信号帧携带有效数据信号, 将信 号帧作为数据信号帧, 否则将信号帧作为非数据信号帧。 若信号帧为数据 信号帧, 则数据信号检测器控制该信号帧进入话音编码器进行低损的话音 编码, 否则, 数据信号检测器控制该信号帧进入非话音编码器进行静音压 缩编码。  After the data signal enters the sender gateway, it is first divided into equally spaced signal frames. The frame length is determined according to the encoding protocol used by the encoder, generally between 5 and 30 milliseconds, and then the data signal detector for each input signal frame. For analysis, if the signal frame carries a valid data signal, the signal frame is used as a data signal frame, otherwise the signal frame is used as a non-data signal frame. If the signal frame is a data signal frame, the data signal detector controls the signal frame to enter the speech coder for low-loss speech coding. Otherwise, the data signal detector controls the signal frame to enter the non-speech encoder for silent compression coding.
接收方网关收到语音包后 , 送进话音解码器进行话音解码后输出有效 数据信号, 收到静音包后送入非话音解码器重构静音信号。  After receiving the voice packet, the receiving gateway sends the voice decoder to decode the voice and outputs a valid data signal. After receiving the voice packet, the gateway sends the non-voice decoder to reconstruct the mute signal.
其中数据信号检测器对输入的信号帧进行分析使用的检测算法本实 施例中采用小窗能量检测法:  The detection algorithm used by the data signal detector to analyze the input signal frame uses the small window energy detection method in this embodiment:
数据设备发出的信号, 在有效数据期间的信号能量比较高而且比较稳 定; 而在静音期间, 不会像语音信号一样夹杂很多背景噪声, 而是比较纯 的静音, 即使有一些噪声或者电学回声, 能量也非常小。 所以可以设定一 个合理的能量判决阈值 , 通过信号能量的大小将有效数据和静音区分开 来。  The signal from the data device, the signal energy during the valid data period is relatively high and stable; while during the mute, it does not mix a lot of background noise like the speech signal, but rather pure silence, even if there is some noise or electrical echo, The energy is also very small. Therefore, a reasonable energy decision threshold can be set to distinguish the valid data from the silence by the amount of signal energy.
考虑到数据信号能量变化快, 有可能在信号帧最后几个毫秒发生能量 突变的特点, 可以将一个信号帧分为很多份, 即多个子信号帧, 每一个子 信号帧被称为一个信号窗, 分别计算每一个信号窗中的信号能量, 当一个 信号帧中所有信号窗的信号能量都低于设定的能量判决阈值时, 才认为此 帧为静音帧, 信号窗的大小和能量判决阈值可以根据不同网关设备的具体 情况合理设定, 信号窗的大小可以相同, 也可以不同。 Considering that the energy of the data signal changes rapidly, it is possible that the energy mutation occurs in the last few milliseconds of the signal frame. A signal frame can be divided into many parts, that is, multiple sub-signal frames, and each sub-signal frame is called a signal window. Calculating the signal energy in each signal window separately. When the signal energy of all signal windows in a signal frame is lower than the set energy decision threshold, this is considered as The frame is a silent frame. The size of the signal window and the energy decision threshold can be set according to the specific conditions of different gateway devices. The size of the signal window can be the same or different.
但是, 如附图 4所示, 数据信号检测器中, 判断数据信号进入静音和 退出静音都有一个相应的判决阈值, 这将造成在进入静音阶段时丢失 tl 时间长度的有效数据信号,在退出静音阶段时丢失 t2时间长度的有效数据 信号, 以退出静音过程为例, 当信号能量超过设定的判决阈值后才认为退 出静音, 造成从信号真正退出静音到判决退出静音之间的信号丢失, 导致 重构有效数据信号的不完整, 对于数据设备来说, 这种不完整的数据信 号是不能接受的, 接收方数据设备可能会认为信号发生跳变或者周期不完 整而发生异常, 甚至导致数据业务失败。  However, as shown in FIG. 4, in the data signal detector, determining that the data signal enters the mute and exits the mute has a corresponding decision threshold, which causes an effective data signal that loses the length of the t1 time when entering the mute phase, and exits. In the silent phase, the valid data signal of the t2 time length is lost. Taking the mute process as an example, when the signal energy exceeds the set decision threshold, the mute is considered to be exited, resulting in loss of signal between the true exit of the signal and the decision to exit the mute. The incomplete data signal is reconstructed. For data devices, such incomplete data signals are unacceptable. The receiving data device may think that the signal is hopping or the cycle is incomplete and anomalies occur, even leading to data. Business failed.
因为数据信号检测器判断退出静音的时刻比真正退出静音的时刻晚 t2, 所以在本实施例中解决退出静音阶段时丢失信号的问题采用将数据信 号送入编码器之前缓存 T2 ( T2>t2 ) 时间的技术方案, 保证在数据信号检 测器判断退出静音并控制数据信号送入话音编码器时, 有效数据信号尚未 到达编码器。 緩存后重构数据信号如附图 5所示。  Since the data signal detector judges that the time to exit the mute is later than the time when the mute is actually exited, the problem of losing the signal when exiting the mute phase in the present embodiment is to buffer the T2 before the data signal is sent to the encoder (T2>t2). The technical solution of time ensures that when the data signal detector judges to exit the mute and control the data signal to be sent to the speech encoder, the valid data signal has not yet reached the encoder. The reconstructed data signal after buffering is as shown in FIG.
同时因为数据信号检测器判断进入静音的时刻比真正进入静音的时 刻早 tl , 所以本实施例中解决进入静音阶段时丢失信号的问题采用在数据 信号检测器判断进入静音后, 延迟 T1 ( Tl>tl ) 时间再将数据信号由话音 编码器切换到非话音编码器的技术方案。 考虑到为了解决退出静音阶段时 丢失信号的问题, 数据信号有一个 T2的緩存, 相当于数据信号检测器判 断进入静音的时刻比送入编码器的信号进入静音的时刻早 tl+T2。 所以在 数据信号检测器判断进入静音后,延迟 T1+T2时间再将数据信号由话音编 码器切换到非话音编码器, 以保证有效数据信号全部通过话音编码器发 出, 延迟后重构数据信号如附图 6所示。  At the same time, because the data signal detector judges that the moment of entering the silence is earlier than the time of actually entering the silence, the problem of losing the signal when entering the silent phase in this embodiment is adopted after the data signal detector determines to enter the silence, and the delay T1 (Tl> T1) The technical solution of switching the data signal from the speech encoder to the non-speech encoder. Considering the problem of losing the signal when exiting the mute phase, the data signal has a buffer of T2, which is equivalent to the time when the data signal detector judges to be muted earlier than the time when the signal sent to the encoder enters silence, tl+T2. Therefore, after the data signal detector judges to be muted, delay the T1+T2 time and then switch the data signal from the voice encoder to the non-voice encoder to ensure that all valid data signals are sent through the voice encoder, and the data signal is reconstructed after delay. Figure 6 shows.
采用上述的技术方案处理数据信号检测和静音压缩技术的 VBD方式 通信系统框图如附图 7所示。  The VBD mode communication system block diagram for processing data signal detection and silence compression techniques using the above technical solution is shown in FIG.
需要指出的是, 在图 7所述的系统中, 设置在发送方网关信号输入端 和所述话音编码器和非话音编码器之间的緩存器, 用于将输入的信号帧在 编码之前緩存 T2时间后输出, 以保证在数据信号检测器判断退出静音并 控制数据信号送入话音编码器时,有效数据信号尚未到达编码器。所述 T2 大于非话音编码器对非数据信号帧进行静音压缩编码后向话音编码器对 数据信号帧进行话音编码切换时数据信号丟失的时间段。 It should be noted that in the system described in FIG. 7, a buffer is provided between the sender gateway signal input terminal and the voice encoder and the non-voice encoder for buffering the input signal frame before encoding. Output after T2 time to ensure that the data signal detector judges to exit the mute and When the control data signal is sent to the voice encoder, the valid data signal has not yet reached the encoder. The T2 is greater than a period of time during which the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame.
在图 7所述的系统中, 所述数据信号检测器判断输入信号进入非数据 信号阶段后延迟 T1 + T2时间, 然后由非话音编码器对所述的非数据信号 帧进行静音压缩编码, 以保证有效数据信号全部通过话音编码器发出; 所 述 T2 大于非话音编码器对非数据信号帧进行静音压缩编码后向话音编码 器对数据信号帧进行话音编码切换时数据信号丢失的时间段;  In the system of FIG. 7, the data signal detector determines that the input signal enters the non-data signal stage and delays the T1 + T2 time, and then the non-voice encoder performs silence compression coding on the non-data signal frame to Ensure that the valid data signals are all sent through the voice encoder; the T2 is greater than the time period during which the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame;
所述 T1 大于话音编码器对数据信号帧进行话音编码后向非话音编码 器对非数据信号帧进行静音压缩编码切换时数据信号丢失的时间段。  The T1 is greater than a time period during which the voice signal is lost after the voice encoder performs voice encoding on the data signal frame and the non-voice encoder performs silent compression coding switching on the non-data signal frame.
另外, 在图 7所述的系统中, 所述数据信号检测器的典型构成是, 包 括判断单元和控制单元, 所述判断单元, 用于判断输入的信号帧是否携带 有效数据信号; 所述控制单元, 用于根据判断的结果将所述信号帧作为数 据信号帧或非数据信号帧输出到话音编码器或非话音编码器。  In addition, in the system of FIG. 7, the data signal detector is typically configured to include a determining unit and a control unit, and the determining unit is configured to determine whether the input signal frame carries a valid data signal; And a unit, configured to output the signal frame as a data signal frame or a non-data signal frame to a voice encoder or a non-voice encoder according to the result of the judgment.
在图 6所述的系统中, T1和 T2的时间可以由系统采用的数据信号 检测算法性能决定, 一般在亳秒级, 所以将数据信号缓存 T2时间引入的 时延几乎不会对数据业务产生任何影响。 可以看出, 通过以上两个步骤, 可以有效避免因为数据信号检测器判断进入静音和退出静音时造成的数 据信号丟失问题。  In the system described in FIG. 6, the time of T1 and T2 can be determined by the performance of the data signal detection algorithm used by the system, generally in the leap second stage, so the delay introduced by the data signal buffer T2 time hardly generates data traffic. Any impact. It can be seen that through the above two steps, the data signal loss caused by the data signal detector judging to enter the mute and exit the mute can be effectively avoided.
上述实施例中, 通过将数据设备输出的信号区分为有效数据信号和非 有效数据信号, 例如静音信号, 并针对不同的信号采用不同的编码方式进 行处理, 在数据业务应用中, 能够大幅度降低通过 VBD方式实现数据业 务时占用通信信道的带宽, 提高 VBD方式的可用性。  In the above embodiment, the signal outputted by the data device is divided into a valid data signal and a non-effective data signal, such as a mute signal, and processed by different coding modes for different signals, which can be greatly reduced in data service applications. When the data service is implemented by the VBD method, the bandwidth of the communication channel is occupied, and the availability of the VBD mode is improved.

Claims

权 利 要 求 Rights request
1、 一种通过话带数据方式实现数据业务的方法, 其特征在于, 所述 方法包括如下步骤:  A method for implementing data services by means of voiceband data, characterized in that the method comprises the following steps:
a、 数据信号检测器将输入的信号帧分成数据信号帧和非数据信号帧, 由话音编码器对所述的数据信号帧进行话音编码, 由非话音编码器对所述 的非数据信号帧进行静音压缩编码;  a. The data signal detector divides the input signal frame into a data signal frame and a non-data signal frame, and the voice signal encoder performs voice coding on the data signal frame, and the non-voice encoder performs the non-data signal frame on the frame. Silent compression coding;
b、 话音解码器对话音编码后的数据信号帧进行话音解码后输出有效 数据信号, 非话音解码器对静音压缩编码后的非数据信号帧解码后重构静 音信号。  b. The data signal frame encoded by the voice decoder voice is decoded after the voice is decoded, and the non-voice decoder reconstructs the silence signal after decoding the silence compressed and encoded non-data signal frame.
2、 才艮据权利要求 1所述的方法, 其特征在于, 设定能量判决阁值, 以及通过所述能量判决阔值, 所述的数据信号检测器将输入的信号帧区分 为数据信号帧和非数据信号帧。 ' 2. The method according to claim 1, wherein the energy decision value is set, and the data signal detector divides the input signal frame into data signal frames by the energy decision threshold And non-data signal frames. '
3、 根据权利要求 1所述的方法, 其特征在于, 将输入的信号帧分成 等间隔的或非等间隔的多个子信号帧, 每个子信号帧对应一个信号窗, 分 别计算每一信号窗中的信号能量后与所述的能量判决阈值比较, 如果每一 信号窗中的信号能量都小于所述能量判决阈值, 则所述的信号帧为非数据 信号帧, 否则为数据信号帧。 3. The method according to claim 1, wherein the input signal frame is divided into equally spaced or non-equally spaced plurality of sub-signal frames, each sub-signal frame corresponding to one signal window, and each signal window is calculated separately. The signal energy is then compared with the energy decision threshold. If the signal energy in each signal window is less than the energy decision threshold, the signal frame is a non-data signal frame, otherwise it is a data signal frame.
4、 根据权利要求 1、 2、 或 3所述的方法, 其特征在于还包括: 对信 号帧进行编码之前緩存 T2时间;所述 T2大于非话音编码器对非数据信号 帧进行静音压缩编码后向话音编码器对数据信号帧进行话音编码切换时 数据信号丢失的时间段。  4. The method according to claim 1, 2, or 3, characterized by further comprising: buffering a T2 time before encoding the signal frame; wherein the T2 is greater than a non-voice encoder to perform silence compression coding on the non-data signal frame. The period of time during which the data signal is lost when the voice encoder switches the voice signal to the data signal frame.
5、 根据权利要求 4所述的方法, 其特征在于还包括: 数据信号检测 器判断输入信号进入非数据信号阶段后延迟 Tl + T2时间, 然后由非话音 编码器对所述的非数据信号帧进行静音压缩编码; 所述 T2大于非话音编 码器对非数据信号帧进行静音压缩编码后向话音编码器对数据信号帧进 行话音编码切换时数据信号丢失的时间段;  5. The method according to claim 4, further comprising: the data signal detector determining that the input signal enters the non-data signal stage and delaying the T1 + T2 time, and then the non-voice encoder to the non-data signal frame Performing silence compression coding; the T2 is greater than a time period when the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame;
所述 T1 大于话音编码器对数据信号帧进行话音编码后向非话音编码 器对非数据信号帧进行静音压缩编码切换时数据信号丢失的时间段。  The T1 is greater than a time period during which the voice signal is lost after the voice encoder performs voice encoding on the data signal frame and the non-voice encoder performs silent compression coding switching on the non-data signal frame.
6、 一种通过 VBD方式实现数据业务的系统, 包括发送方网关、 接收 方网关以及连接发送方网关和接收方网关的通信信道, 其特征在于, 所述 发送方网关包括话音编码器、 非话音编码器和数据信号检测器, 所述数据 信号检测器将信号帧分成数据信号帧和非数据信号帧, 并控制数据信号帧 进入话音编码器进行话音编码、 非数据信号帧进入非话音编码器进行静音 压缩编码; 所述接收方网关包括话音解码器和非话音解码器, 所述话音解 码器对话音编码后的数据信号帧进行话音解码后输出有效数据信号, 所述 非话音解码器对静音压缩编码后的非数据信号帧解码后重构静音信号。 6. A system for implementing data services by means of VBD, including a sender gateway, receiving a party gateway and a communication channel connecting the sender gateway and the receiver gateway, wherein the sender gateway includes a voice coder, a non-voice coder, and a data signal detector, and the data signal detector divides the signal frame into data a signal frame and a non-data signal frame, and control the data signal frame to enter the voice encoder for voice coding, and the non-data signal frame to enter the non-voice encoder for silent compression coding; the receiver gateway includes a voice decoder and a non-voice decoder, The voice signal encoded by the voice decoder is subjected to voice decoding to output a valid data signal, and the non-voice decoder decodes the silenced and encoded non-data signal frame to reconstruct the silence signal.
7、 如权利要求 6所述的系统, 其特征在于还包括设置在发送方网关 信号输入端和所述话音编码器和非话音编码器之间的缓存器, 用于将输入 的信号帧在编码之前缓存 T2时间后输出;所述 T2大于非话音编码器对非 数据信号帧进行静音压缩编码后向话音编码器对数据信号帧进行话音编 码切换时数据信号丟失的时间段。  7. The system of claim 6 further comprising a buffer disposed between the sender gateway signal input and the voice encoder and the non-voice encoder for encoding the input signal frame The T2 time is buffered before outputting; the T2 is greater than a period of time when the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame.
8、 根据权利要求 6或 7所述的系统, 其特征在于: 所述数据信号检 测器判断输入信号进入非数据信号阶段后延迟 Tl + T2时间, 然后由非话 音编码器对所述的非数据信号帧进行静音压缩编码; 所述 T2大于非话音 编码器对非数据信号帧进行静音压缩编码后向话音编码器对数据信号帧 进行话音编码切换时数据信号丟失的时间段;  8. The system according to claim 6 or 7, wherein: the data signal detector determines that the input signal enters the non-data signal stage and delays the time T1 + T2, and then the non-voice encoder performs the non-data. The signal frame is subjected to silent compression coding; the T2 is greater than a time period during which the non-voice encoder performs silence compression coding on the non-data signal frame and then loses the data signal to the voice encoder when the voice signal is switched on the data signal frame;
所述 T1 大于话音编码器对数据信号帧进行话音编码后向非话音编码 器对非数据信号帧进行静音压缩编码切换时数据信号丢失的时间段。  The T1 is greater than a time period during which the voice signal is lost after the voice encoder performs voice encoding on the data signal frame and the non-voice encoder performs silent compression coding switching on the non-data signal frame.
9、 一种数据信号检测器, 其特征在于包括判断单元和控制单元, 所 述判断单元, 用于判断输入的信号帧是否携带有效数据信号; 所述控制单 元, 用于根据判断的结果将所述信号帧作为数据信号帧或非数据信号帧输 出。  A data signal detector, comprising: a determining unit and a control unit, wherein the determining unit is configured to determine whether the input signal frame carries a valid data signal; and the control unit is configured to: according to the result of the determining The signal frame is output as a data signal frame or a non-data signal frame.
PCT/CN2007/000423 2006-03-24 2007-02-07 Method, system and data signal detector for realizing dada service WO2007109960A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN 200610034664 CN101043759B (en) 2006-03-24 2006-03-24 Method for realizing data service through voice band data VBD mode and system thereof
CN200610034664.5 2006-03-24

Publications (1)

Publication Number Publication Date
WO2007109960A1 true WO2007109960A1 (en) 2007-10-04

Family

ID=38540793

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/000423 WO2007109960A1 (en) 2006-03-24 2007-02-07 Method, system and data signal detector for realizing dada service

Country Status (2)

Country Link
CN (1) CN101043759B (en)
WO (1) WO2007109960A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BRPI0913549A2 (en) * 2008-09-18 2020-05-26 Koninklijke Philips Electronics N.V. METHOD TO CONTROL AT LEAST ONE DEVICE USING A TELECOMMUNICATIONS SYSTEM, TELECOMMUNICATIONS SYSTEM AND COMPUTER PROGRAM
CN102244825A (en) * 2011-06-10 2011-11-16 中兴通讯股份有限公司 Multimedia stream playing method and device
CN102355484B (en) * 2011-08-05 2018-12-04 广州华多网络科技有限公司 A kind of method of audio data transmission
CN102523356B (en) * 2011-12-21 2015-04-15 上海会畅通讯股份有限公司 Method and device for detecting sound of meeting passages
CN102800318B (en) * 2012-06-27 2016-02-17 优视科技有限公司 The transmission of audio data stream, receiving trap and method
CN113436637A (en) * 2021-06-20 2021-09-24 杭州登虹科技有限公司 Compression algorithm of audio flow
CN114898506B (en) * 2022-05-12 2023-06-27 南京百米需供应链管理有限公司 Intelligent cabinet control system and control method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1290094A (en) * 2000-11-03 2001-04-04 国家数字交换系统工程技术研究中心 Multi-channel 64Kbps squelch compression method for packet switch network
US6249757B1 (en) * 1999-02-16 2001-06-19 3Com Corporation System for detecting voice activity
CN1332441A (en) * 2000-07-04 2002-01-23 朗迅科技公司 Method and apparatus for recognizing speech from speech band data in communication network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL129752A (en) * 1999-05-04 2003-01-12 Eci Telecom Ltd Telecommunication method and system for using same
CN1136745C (en) * 2001-07-18 2004-01-28 华为技术有限公司 Method for implementing multi-language coding-decoding in universal mobile communication system
CN1617606A (en) * 2003-11-12 2005-05-18 皇家飞利浦电子股份有限公司 Method and device for transmitting non voice data in voice channel

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6249757B1 (en) * 1999-02-16 2001-06-19 3Com Corporation System for detecting voice activity
CN1332441A (en) * 2000-07-04 2002-01-23 朗迅科技公司 Method and apparatus for recognizing speech from speech band data in communication network
CN1290094A (en) * 2000-11-03 2001-04-04 国家数字交换系统工程技术研究中心 Multi-channel 64Kbps squelch compression method for packet switch network

Also Published As

Publication number Publication date
CN101043759A (en) 2007-09-26
CN101043759B (en) 2010-12-08

Similar Documents

Publication Publication Date Title
JP4426454B2 (en) Delay trade-off between communication links
US20040032860A1 (en) Quality of voice calls through voice over IP gateways
US7283585B2 (en) Multiple data rate communication system
US7773511B2 (en) Generic on-chip homing and resident, real-time bit exact tests
JP5295120B2 (en) Jitter buffer control
AU2007349607B2 (en) Method of transmitting data in a communication system
WO2007109960A1 (en) Method, system and data signal detector for realizing dada service
US20020075857A1 (en) Jitter buffer and lost-frame-recovery interworking
US6725191B2 (en) Method and apparatus for transmitting voice over internet
JP2006238445A (en) Method and apparatus for handling network jitter in voice-over ip communication network using virtual jitter buffer and time scale modification
US6697342B1 (en) Conference circuit for encoded digital audio
US20100104049A1 (en) Dual-rate single band communication system
US20110135038A1 (en) Multiple data rate communication system
Johansson et al. Bandwidth efficient AMR operation for VoIP
US7233605B1 (en) Method and apparatus for minimizing delay induced by DTMF processing in packet telephony systems
Smith et al. Speaker selection for tandem-free operation VoIP conference bridges
US7313233B2 (en) Tone clamping and replacement
JP4078795B2 (en) Speech encoding / decoding device, speech encoding device, and encoding method
CN1474581A (en) Method for realizing voice coding and decoding changeover and intercommunication
Beritelli et al. New speech processing issues in IP telephony
PS et al. Steganography in Audio Files by Entropy using FEC as Reed-Solomon of VOIP Streams
CN1801773A (en) Method and system for transmitting data on gateway
JPH10135977A (en) Cell transmission system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07702300

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07702300

Country of ref document: EP

Kind code of ref document: A1