CN101414462A - Audio encoding method and multi-point audio signal mixing control method and corresponding equipment - Google Patents


Publication number
CN101414462A
Authority
CN
China
Prior art keywords
side information
audio
mixing
code stream
core code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2007101803163A
Other languages
Chinese (zh)
Inventor
张清
李伟
许丽净
杜正中
许剑峰
胡晨
杨毅
苗磊
齐峰岩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to CNA2007101803163A
Publication of CN101414462A
Legal status: Pending

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The embodiment of the invention discloses a transmission coding method for voice audio signals. Audio mixing side information is computed at a terminal, identified in a data packet, and transmitted to a multi-point audio processing device, so that the device can directly control how the signals are mixed according to the audio mixing side information. The embodiment of the invention further provides an audio mixing control method for multi-point voice audio signals, a voice audio signal encoder, and a multi-point audio processing device. The method reduces the computational complexity of the multi-point audio processing device, improves its audio mixing control capability, and lets the device flexibly implement a variety of audio mixing control strategies according to how the audio mixing side information is configured in the terminal data packets.

Description

Audio coding method, multipoint audio signal mixing control method and corresponding devices
Technical Field
The invention relates to the technical field of voice audio signal processing, and in particular to a transmission coding method for a voice audio signal, a method for performing mixing control on multipoint voice audio signals, a voice audio signal encoder, and a multipoint audio processing device.
Background
With the continuous development of communication technology, the application of communication service is increasing day by day, and the centralized conference technology based on multipoint audio interaction has important application value.
In a centralized conference, each terminal is usually connected by unicast to a multipoint audio processing device that serves as the interaction core, as shown in FIG. 1; a Multipoint Control Unit (MCU) generally acts as the multipoint audio processing device. The MCU receives the voice data packets of all terminals in real time, mixes them according to a synthesis strategy, and transmits the result to the terminals in real time.
Obviously, the effect of mixing audio data of n terminals by the MCU has a great influence on the quality of output voice audio signals, and the current mixing modes generally adopted by the MCU include the following two modes.
I. Simple addition.
Referring to FIG. 2, the MCU decodes each channel of audio data and then mixes the decoded signals. The mixing in FIG. 2 simply sums the decoded data of every channel except the local channel; the sum is then encoded by the encoder and transmitted to each terminal.
In this mode, the sum can overflow because digital voice audio samples are quantized between fixed upper and lower limits. If overflow is detected and the result is saturated to those limits, the original time-domain characteristics of the voice signal are damaged and noise is introduced, causing popping sounds, discontinuous speech, and similar artifacts. Moreover, overflow becomes more frequent as more terminal channels take part in the mix, so the method has an upper limit on the number of terminal channels. Experiments show that, in general, when 4 terminal channels take part in mixing, the result contains so much noise and discontinuity that the speech cannot be made out.
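The overflow problem described above can be reproduced with a few lines of Python. This is only an illustrative sketch: it assumes 16-bit PCM samples, and the function name `mix_by_simple_addition` and the sample values are hypothetical, not taken from the patent.

```python
INT16_MIN, INT16_MAX = -32768, 32767

def mix_by_simple_addition(channels):
    """Sum equal-length 16-bit PCM frames sample by sample, saturating to int16.

    Returns the mixed frame and the number of samples that had to be clipped.
    """
    mixed, clipped = [], 0
    for samples in zip(*channels):
        total = sum(samples)
        if total > INT16_MAX or total < INT16_MIN:
            clipped += 1
            total = max(INT16_MIN, min(INT16_MAX, total))  # saturation
        mixed.append(total)
    return mixed, clipped

# Hypothetical frame peaking at roughly a third of full scale.
frame = [10000, -12000, 9000, -11000]
two_way = mix_by_simple_addition([frame] * 2)   # fits in int16, nothing clipped
four_way = mix_by_simple_addition([frame] * 4)  # every sample saturates
```

With two such channels the sum still fits; with four, every sample is clipped, which is the kind of waveform damage the text attributes to saturation.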
II. Weighted mixing.
Referring to FIG. 3, after decoding each channel of audio data, the MCU performs a mixing calculation to obtain a weight for each channel, and then mixes and encodes the channels according to the weights. The weights are derived from information such as the energy of each signal, and the specific processing strategy can be chosen according to actual conditions. For example, the few signals with the highest energy can be selected for the mixed output.
In this mode, the MCU must perform complex calculations on every channel of audio data to obtain the basis for the weights, which increases its computational burden. The computational complexity becomes very high as the number of terminals grows, so the real-time and efficiency requirements of conference communication are not well met.
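A minimal sketch of this MCU-side weighting follows, assuming the "select the loudest channels" strategy mentioned above. The helper names and the equal split of weight among the selected channels are assumptions for illustration; the patent does not fix a particular rule.

```python
def frame_energy(frame):
    """Mean squared amplitude of one decoded frame."""
    return sum(s * s for s in frame) / len(frame)

def weights_from_energy(frames, keep=2):
    """Give equal weight to the `keep` most energetic channels, zero to the rest."""
    # The MCU has to touch every sample of every decoded channel here.
    energies = [frame_energy(f) for f in frames]
    loudest = sorted(range(len(frames)), key=lambda i: energies[i], reverse=True)[:keep]
    return [1.0 / keep if i in loudest else 0.0 for i in range(len(frames))]

def weighted_mix(frames, weights):
    """Weighted sample-by-sample sum of the decoded frames."""
    return [round(sum(w * s for w, s in zip(weights, samples)))
            for samples in zip(*frames)]

frames = [[100, -100], [8000, -8000], [4000, -4000]]  # hypothetical decoded frames
weights = weights_from_energy(frames, keep=2)
mixed = weighted_mix(frames, weights)
```

The energy pass runs over all decoded samples of all channels for every frame, which is the per-terminal cost the text identifies as the drawback of this mode.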
Disclosure of Invention
The embodiment of the invention provides a transmission coding method for a voice audio signal, a method for performing mixing control on multipoint voice audio signals, a voice audio signal encoder, and a multipoint audio processing device, which improve the mixing control capability of the multipoint audio processing device while reducing its computational complexity as far as possible.
A method of transmission coding of a speech audio signal, comprising: generating a core code stream according to the voice audio signal; calculating side information of the voice audio signal; and generating a data packet carrying the core code stream and the side information, wherein the side information is marked as audio mixing side information in the data packet, so that a receiving end of the data packet can perform corresponding audio mixing processing on the core code stream according to the audio mixing side information.
A method of performing mixing control on a multi-point voice audio signal, comprising: acquiring more than one path of data packets, wherein at least one path of data packets carries core code stream and audio mixing side information; analyzing the data packet carrying the core code stream and the audio mixing side information to obtain the core code stream and the audio mixing side information; and selecting a corresponding sound mixing processing mode according to the sound mixing side information to perform sound mixing processing on the core code stream.
A speech audio signal encoder comprising: the coding unit is used for generating a core code stream according to the voice audio signal; a calculation unit for calculating side information of the voice audio signal; and the packaging unit is used for generating a data packet carrying the core code stream and the side information, and the side information is marked as audio mixing side information in the data packet, so that a receiving end of the data packet can perform corresponding audio mixing processing on the core code stream according to the audio mixing side information.
A multipoint audio processing device comprising: a packet processing unit, configured to acquire more than one path of data packets, wherein at least one path of data packets carries a core code stream and audio mixing side information, to analyze each path of data packets to obtain a core code stream, and to analyze the data packets carrying the core code stream and the audio mixing side information to obtain the core code stream and the audio mixing side information; an audio mixing processing unit, configured to select, according to the audio mixing side information, a corresponding audio mixing processing mode for the corresponding core code stream, to perform audio mixing processing with the core code streams of the other paths, and to output the mixed voice audio signal corresponding to each input path; and an encoding output unit, configured to encode and output each path of voice audio signal output by the audio mixing processing unit.
In the technical scheme of the embodiment of the invention, the multi-point audio processing equipment can directly control the audio mixing mode of the signal according to the audio mixing side information by calculating the audio mixing side information at the terminal and identifying the audio mixing side information in the data packet and then transmitting the audio mixing side information to the multi-point audio processing equipment, thereby not only reducing the calculation complexity of the multi-point audio processing equipment, but also enriching the audio mixing control capability of the multi-point audio processing equipment, and flexibly realizing diversified audio mixing control strategies according to the configuration of the audio mixing side information in the terminal data packet.
Drawings
FIG. 1 is a schematic diagram of a prior art centralized conferencing architecture;
FIG. 2 is a schematic diagram of a conventional mixing method;
FIG. 3 is a schematic diagram of another mixing method;
FIG. 4 is a flowchart illustrating a method for encoding a speech audio signal according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of a logic structure of a speech audio signal encoder according to an embodiment of the present invention;
FIG. 6 is a flowchart illustrating a method for performing mixing control on a multi-point speech audio signal according to an embodiment of the present invention;
FIG. 7 is a schematic diagram of a logical structure of a multipoint audio processing device according to an embodiment of the present invention.
Detailed Description
The embodiment of the invention provides a transmission coding method of a voice audio signal, which is characterized in that the audio mixing side information is calculated at a terminal and is identified in a data packet and then transmitted to a multipoint audio processing device, so that the multipoint audio processing device can directly control the audio mixing mode of the signal according to the audio mixing side information. The embodiment of the invention also provides a corresponding method for performing sound mixing control on the multipoint voice audio signal, a voice audio signal encoder and multipoint audio processing equipment, which are respectively described in detail below.
The transmission coding method of the embodiment of the invention is the method by which a terminal encodes, for transmission, the voice audio signal to be sent to the multipoint audio processing device. With reference to FIG. 4, the method comprises the following steps:
A1. Generate a core code stream from the voice audio signal.
The core code stream can be encoded with any common audio coding scheme, such as the ITU standards G.711, G.722, and G.729.
A2. Calculate the side information of the voice audio signal.
The side information may be one or several parameters related to the core code stream, for example one or several of the following: voice activity detection (VAD) information, which indicates silence; a voicing score; the mean voice frame energy; sound activity detection (SAD) information; and so on. Each parameter used as side information may be calculated with existing methods; this embodiment places no limit on the calculation.
A3. Generate a data packet carrying the core code stream and the side information, and identify the side information as audio mixing side information in the data packet, so that a receiving end of the data packet can perform corresponding audio mixing processing on the core code stream according to the audio mixing side information.
The side information may be identified in any way that the receiving end can recognize, for example by placing it at a specific position or by describing it with additional identifying information. As for how it is carried, the side information may be included in the encoded code stream or placed in the packet header defined by the data packet format. Based on these two carrying modes, the step of generating the data packet carrying the core code stream and the side information may be performed in either of two ways: first generate an encoded code stream comprising the core code stream and the side information, identify the side information as audio mixing side information in the encoded code stream, and then encapsulate the encoded code stream into a data packet; or encapsulate the core code stream directly into a data packet as the encoded code stream, place the side information in the packet header, and identify it there as audio mixing side information.
The identification and placement of side information are illustrated below by example. In the simplest case, a single coded bit carries the side information, and the bit is placed at a specific position in the encoded code stream or in a specific field of the packet header to identify it as audio mixing side information. For example, when VAD is used, the audio mixing side information may be set to "0" when silence is detected and to "1" otherwise. When the voicing score is used, the audio mixing side information may be set to "0" when the voice audio signal is judged to be a male voice and to "1" when it is judged to be a female voice; alternatively, the value of the voicing score itself may be transmitted to the MCU as audio mixing side information, and the MCU makes the selection according to a set score threshold. When the voice frame energy is used, the audio mixing side information is set to "1" when the voice frame energy is judged to be higher than a set threshold and to "0" otherwise.
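The header-based carrying mode with a single side-information bit could be sketched as follows. The packet layout is purely hypothetical (one flags byte plus a 16-bit payload length); the patent does not define field positions, so `HAS_SIDE_INFO`, `SIDE_INFO_BIT`, and the function names are assumptions made for this example.

```python
import struct

# Hypothetical header: 1 flags byte + 2-byte payload length, network byte order.
HEADER = struct.Struct("!BH")
HAS_SIDE_INFO = 0x80   # assumed marker: packet carries audio mixing side information
SIDE_INFO_BIT = 0x01   # assumed position of the one-bit side information value

def build_packet(core_stream: bytes, side_bit=None) -> bytes:
    """Put the core code stream in the payload and the side bit in the header."""
    flags = 0
    if side_bit is not None:
        flags = HAS_SIDE_INFO | (SIDE_INFO_BIT if side_bit else 0)
    return HEADER.pack(flags, len(core_stream)) + core_stream

def parse_packet(packet: bytes):
    """Return (core_stream, side_bit); side_bit is None if none was carried."""
    flags, length = HEADER.unpack_from(packet)
    core_stream = packet[HEADER.size:HEADER.size + length]
    side_bit = (flags & SIDE_INFO_BIT) if flags & HAS_SIDE_INFO else None
    return core_stream, side_bit
```

Keeping a separate "present" marker lets the receiving end tell a terminal that reports silence (bit 0) apart from a terminal that sends no side information at all.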
An embodiment of a voice audio signal encoder for performing the above transmission coding method is given below. The encoder of this embodiment can be used for terminal audio encoding in a centralized conference system. Its logical structure, shown in FIG. 5, includes:
an encoding unit 101, configured to generate a core code stream according to a voice audio signal;
a calculating unit 102, configured to calculate side information of the voice audio signal;
the encapsulating unit 103 is configured to generate a data packet carrying the core code stream generated by the encoding unit 101 and the side information generated by the computing unit 102, and identify the side information as audio mixing side information in the data packet, so that a receiving end of the data packet can perform corresponding audio mixing processing on a corresponding core code stream according to the audio mixing side information.
According to a specific adopted encapsulation manner, the encapsulation unit 103 may place the side information in the encoded code stream, or may place the side information in the header of the data packet.
In the terminal transmission coding scheme provided in the above embodiment, the terminal performs the calculation of the audio mixing side information and carries the audio mixing side information in the data packet, which greatly reduces the calculation complexity of the receiving end (i.e., the multipoint audio processing device), and provides a basis for the multipoint audio processing device to perform flexible audio mixing control.
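As a rough illustration of how the three units could cooperate on the terminal, the sketch below derives a one-bit side information value from the mean frame energy. The threshold value, the dictionary used as a packet, and `toy_core_encoder` (a stand-in that merely serializes samples, not a real codec such as G.711) are all assumptions for this example.

```python
import struct

def mean_frame_energy(frame):
    """Mean squared amplitude of one frame of 16-bit PCM samples."""
    return sum(s * s for s in frame) / len(frame)

def energy_side_bit(frame, threshold=1.0e6):
    """Return 1 when the mean frame energy exceeds the threshold, else 0."""
    return 1 if mean_frame_energy(frame) > threshold else 0

def toy_core_encoder(frame):
    """Stand-in for a real codec: serializes the samples as big-endian int16."""
    return struct.pack(f"!{len(frame)}h", *frame)

def encode_for_transmission(frame, threshold=1.0e6):
    core_stream = toy_core_encoder(frame)          # role of encoding unit 101
    side_bit = energy_side_bit(frame, threshold)   # role of calculating unit 102
    # Role of encapsulating unit 103: label the value as mixing side information.
    return {"mixing_side_info": side_bit, "core_stream": core_stream}

loud = encode_for_transmission([2000, -2000])
quiet = encode_for_transmission([10, -10])
```

The energy computation runs once, on the terminal that already holds the raw samples, so the receiving end only has to read one bit per frame.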
The mixing control method of the embodiment of the present invention is the method by which a multipoint audio processing device controls the mixing of the voice audio signals sent by multiple terminals. With reference to FIG. 6, the method includes the following steps:
B1. Acquire more than one path of data packets, wherein at least one path of data packets carries a core code stream and audio mixing side information.
A data packet sent by a terminal carries a core code stream obtained by encoding the voice audio signal; if the terminal performs transmission coding according to the scheme of the embodiment of the invention, the data packet also carries audio mixing side information. This embodiment does not exclude terminals that lack the ability to add audio mixing side information. The encoded code stream of such a terminal can enter mixing in any of the existing ways: for example, it can be decoded and mixed directly, or decoded and then passed through a mixing calculation to obtain a weight before mixing. The following description therefore does not detail the handling of code streams without audio mixing side information, and instead describes the processing of one path of data packets that carries it. Of course, every input path may carry audio mixing side information, in which case each path is processed as described below.
B2. Analyze the data packets carrying the core code stream and the audio mixing side information to obtain the core code stream and the audio mixing side information.
The receiving end obtains the core code stream and the audio mixing side information from the data packet according to the identification and placement conventions agreed with the encoding terminal. Depending on the encapsulation mode used by the encoding terminal, the receiving end may obtain the audio mixing side information from the encoded code stream and/or from the packet header.
B3. Select a corresponding mixing processing mode according to the audio mixing side information, and mix the corresponding core code stream accordingly.
The receiving end can formulate a sound mixing processing mode meeting the application requirements based on the parameter type, the representation mode, the parameter content and the like of the specific sound mixing side information adopted by the coding terminal. For example, the receiving end may adopt an existing processing method of performing weighted mixing according to the mixing calculation result, but in this embodiment, the mixing calculation result is not required to be calculated by the receiving end, but is calculated by the encoding terminal, and is directly obtained from the received mixing side information by the receiving end.
Two mixing processing modes are given below by way of example. They can be used separately or in combination, one before decoding and one after.
Mode I. Determine from the audio mixing side information whether the corresponding core code stream needs to be decoded. If so, decode the core code stream and mix the decoded signal; if not, do not decode the core code stream of that path, and leave its signal out of the mixing operation.
For example, if VAD information is used as the audio mixing side information, the core code stream of a path need not be decoded when that path is determined to be silent. Not every code stream has to be decoded, fewer paths need to be mixed, the computational complexity of the multipoint audio processing device falls, and its efficiency improves.
Mode II. Decode the core code stream (when combined with Mode I, only the code streams that Mode I has decided to decode are decoded); then select a corresponding mixing strategy according to the audio mixing side information and mix the decoded signals accordingly.
For example, an existing weighted mixing method may be adopted: the weight of the signal decoded from the core code stream is determined from the audio mixing side information, and the decoded signal is mixed according to that weight. For instance, if the voicing score is used as the audio mixing side information and the mixing rule emphasizes female voices, the mixing weight of a path can be increased when its signal is judged to be a female voice and decreased when it is judged to be a male voice.
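The two modes above (deciding whether to decode, then weighting what was decoded) can be combined in a short sketch. The tuple layout, the concrete weight values, and the normalization of weights are assumptions for illustration only; `decode` is a caller-supplied stand-in for a real decoder.

```python
def mix_with_side_info(paths, decode, emphasized_weight=2.0, normal_weight=1.0):
    """Mix several paths under control of their side information.

    paths: list of (core_stream, vad_bit, emphasis_bit) tuples.
    Returns the mixed frame and the number of paths that were decoded.
    """
    decoded, weights = [], []
    for core_stream, vad_bit, emphasis_bit in paths:
        if not vad_bit:
            continue  # first mode: a silent path is never decoded
        decoded.append(decode(core_stream))
        # second mode: the weight comes straight from the side information
        weights.append(emphasized_weight if emphasis_bit else normal_weight)
    if not decoded:
        return [], 0
    total = sum(weights)
    mixed = [round(sum(w * s for w, s in zip(weights, samples)) / total)
             for samples in zip(*decoded)]
    return mixed, len(decoded)

# Hypothetical input: already-"decoded" lists stand in for core code streams.
paths = [([3000, -3000], 1, 1), ([600, -600], 1, 0), ([9999, 9999], 0, 0)]
mixed, decoded_count = mix_with_side_info(paths, decode=list)
```

No energy or voicing analysis happens at the receiving end here: both the gate and the weight are read from the bits the terminals supplied.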
An embodiment of a multipoint audio processing device for executing the above mixing control method is given below. The device of this embodiment can serve as the MCU of a centralized conference system. Its logical structure, shown in FIG. 7, includes:
a packet processing unit 201, configured to obtain n data packets from terminals 1 to n, where at least one data packet carries a core code stream and audio mixing side information; and analyzing each path of data packet to obtain a core code stream, and analyzing the data packet carrying the core code stream and the audio mixing side information to obtain the core code stream and the audio mixing side information.
In this embodiment, the situation that some path data packets do not have the audio mixing side information is not excluded, and for these path data packets, after obtaining the code stream carried by them, the audio mixing processing unit may enter the audio mixing process according to the existing execution modes, for example, may not perform decoding selection according to the audio mixing side information, but directly perform decoding; after decoding, the audio mixing is directly carried out, or the audio mixing is carried out after weight information is obtained through audio mixing calculation. For simplicity, it is assumed in fig. 7 that each data packet is a data packet carrying a core stream and audio mixing side information.
The audio mixing processing unit 202 is configured to select a corresponding audio mixing processing mode for the corresponding core code stream according to the audio mixing side information of each path, and perform audio mixing processing with the core code streams of other paths; and outputting the voice audio signals which are subjected to the sound mixing processing and correspond to the input channels. In order to reduce the computational complexity as much as possible by using the mixing side information, the mixing processing unit 202 may have the following structure, including:
the selecting unit 2021 is configured to determine whether to output a corresponding core code stream for decoding according to the audio mixing side information of each channel;
the decoding unit 2022 may be composed of a plurality of decoders, and is configured to decode an input core code stream and output a decoded signal;
a mixing unit 2023, configured to mix the decoded signals output by the decoding unit 2022 and to output the mixed voice audio signals corresponding to the inputs of terminals 1 to n, marked as audio mixing signals 1 to n in FIG. 7. When mixing multiple signals, the mixing unit 2023 may additionally select a mixing strategy according to the audio mixing side information of those signals; the specific rules may follow an existing weighted mixing method or be designed for the application. If the input signal of a terminal does not enter the mix, this only means that the signal does not contribute to the mixing; the audio mixing signal corresponding to that input is still output in the same way as for the other paths.
The encoding output unit 203, which may be composed of several encoders, is configured to encode and output the audio signals (i.e., the audio mixing signals 1 to n) output by the audio mixing unit 2023.
The audio mixing control scheme for the multi-point audio processing device provided in the above embodiment directly uses the audio mixing side information carried in the data packet sent by the terminal to control the audio mixing mode of the voice audio signal of the terminal, thereby greatly reducing the computational complexity of the multi-point audio processing device and improving the flexibility of audio mixing control.
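One possible way for a mixing unit to produce the n output signals is sketched below. It assumes that each terminal receives the sum of the other active paths (as in the simple-addition mode of the background), that a path kept out of the mix is represented by `None`, and that results are saturated to 16 bits; computing one total and subtracting the listener's own signal is a design choice of this sketch, not something the patent specifies.

```python
def mix_outputs(decoded_by_terminal, frame_len):
    """Return one mixed frame per terminal.

    decoded_by_terminal maps terminal id -> decoded frame, or None when the
    selecting unit kept that path out of the mix.
    """
    active = {t: f for t, f in decoded_by_terminal.items() if f is not None}
    # Sum all active paths once, then remove each listener's own signal.
    if active:
        total = [sum(samples) for samples in zip(*active.values())]
    else:
        total = [0] * frame_len
    outputs = {}
    for terminal in decoded_by_terminal:
        own = active.get(terminal, [0] * frame_len)
        outputs[terminal] = [max(-32768, min(32767, t - o))
                             for t, o in zip(total, own)]
    return outputs

# Terminal 3 is silent and was not decoded, but it still receives a mix.
outputs = mix_outputs({1: [100, 200], 2: [10, 20], 3: None}, frame_len=2)
```

A path that does not contribute to the mix still gets an output, consistent with the description of the mixing unit 2023.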
Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by associated hardware instructed by a program, which may be stored in a computer-readable storage medium, and the storage medium may include: ROM, RAM, magnetic or optical disks, and the like.
The transmission coding method for a voice audio signal, the method for performing mixing control on multipoint voice audio signals, the voice audio signal encoder, and the multipoint audio processing device provided by the embodiment of the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the invention, and the description of the embodiments is intended only to help in understanding the method and its core idea. A person skilled in the art may, following the idea of the invention, vary the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the invention.

Claims (11)

1. A method for encoding a speech audio signal for transmission, comprising:
generating a core code stream according to the voice audio signal;
calculating side information of the voice audio signal;
and generating a data packet carrying the core code stream and the side information, wherein the side information is marked as audio mixing side information in the data packet, so that a receiving end of the data packet can perform corresponding audio mixing processing on the core code stream according to the audio mixing side information.
2. The method according to claim 1, wherein the step of generating the data packet carrying the core stream and the side information specifically comprises:
generating an encoding code stream comprising the core code stream and side information, identifying the side information as audio mixing side information in the encoding code stream, and packaging the encoding code stream into the data packet; or,
and packaging the core code stream into a data packet, placing the side information into a packet header of the data packet, and identifying the side information as audio mixing side information in the packet header of the data packet.
3. The method for transmission coding of a speech audio signal according to claim 1 or 2, characterized in that the side information comprises: voice activity detection (silence detection) information, and/or a voicing score, and/or a voice frame energy mean, and/or sound activity detection information.
4. A method for performing mixing control on a multi-point speech audio signal, comprising:
acquiring more than one path of data packets, wherein at least one path of data packets carries core code stream and audio mixing side information;
analyzing the data packet carrying the core code stream and the audio mixing side information to obtain the core code stream and the audio mixing side information;
and selecting a corresponding sound mixing processing mode according to the sound mixing side information to perform sound mixing processing on the core code stream.
5. The method of claim 4, wherein the step of selecting a corresponding mixing processing mode according to the mixing side information to perform mixing processing on the core code stream comprises:
judging whether the core code stream needs to be decoded or not according to the audio mixing side information;
if yes, decoding the core code stream, and performing sound mixing processing on the decoded signal;
and if not, not decoding the core code stream.
6. The method of claim 4 or 5, wherein the step of selecting a corresponding mixing processing mode according to the mixing side information to perform mixing processing on the core code stream comprises:
decoding the core code stream;
and selecting a corresponding sound mixing strategy according to the sound mixing side information to perform sound mixing processing on the signal after the core code stream is decoded.
7. The method of claim 6, wherein the step of selecting a corresponding mixing strategy according to the mixing side information to perform mixing processing on the decoded signal of the core code stream specifically comprises:
and determining the weight of the decoded signal of the core code stream according to the audio mixing side information, and performing audio mixing processing on the decoded signal of the core code stream according to the weight.
8. A speech audio signal encoder, characterized by comprising:
the coding unit is used for generating a core code stream according to the voice audio signal;
a calculation unit for calculating side information of the voice audio signal;
and the packaging unit is used for generating a data packet carrying the core code stream and the side information, and the side information is marked as audio mixing side information in the data packet, so that a receiving end of the data packet can perform corresponding audio mixing processing on the core code stream according to the audio mixing side information.
9. A multipoint audio processing device, comprising:
a packet processing unit, configured to acquire more than one path of data packets, wherein at least one path of data packets carries a core code stream and audio mixing side information; to analyze each path of data packets to obtain a core code stream; and to analyze the data packets carrying the core code stream and the audio mixing side information to obtain the core code stream and the audio mixing side information;
the audio mixing processing unit is used for selecting a corresponding audio mixing processing mode for the corresponding core code stream according to the audio mixing side information and carrying out audio mixing processing with the core code streams of other paths; outputting voice audio signals which are subjected to sound mixing processing and correspond to each path of input;
and the coding output unit is used for coding and outputting each path of voice audio signal output by the sound mixing processing unit.
10. The multipoint audio processing device according to claim 9, wherein said mixing processing unit comprises:
a selection unit, configured to judge, according to the audio mixing side information, whether the corresponding core code stream needs to be output for decoding;
a decoding unit, configured to decode the input core code stream and output a decoded signal; and
a sound mixing unit, configured to perform sound mixing processing on the decoded signals output by the decoding unit, and to output the mixed voice audio signals corresponding to the respective inputs of the mixing processing unit.
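The selection, decoding, and mixing units of claim 10 can be sketched as a small pipeline. Several details here are assumptions and not taken from the claims: the side information is treated as a single energy-like number per path, a path is decoded when that number reaches a threshold, paths without side information are always decoded, all decoded frames have equal length, and each output is the sum of every decoded path except the listener's own. The `decode` callable stands in for a real codec.

```python
from typing import Callable, Dict, List, Optional

Frame = List[float]


def select_for_decoding(side_info: Dict[str, Optional[float]],
                        threshold: float) -> List[str]:
    """Selection unit: decide which paths' core code streams are decoded."""
    return [path for path, info in side_info.items()
            if info is None or info >= threshold]


def mix_for_each_input(decoded: Dict[str, Frame],
                       all_paths: List[str]) -> Dict[str, Frame]:
    """Mixing unit: one output per input path (sum of the other decoded paths)."""
    length = len(next(iter(decoded.values()))) if decoded else 0
    outputs: Dict[str, Frame] = {}
    for target in all_paths:
        mixed = [0.0] * length
        for path, frame in decoded.items():
            if path == target:
                continue  # assumption: a participant does not receive its own signal
            for i, sample in enumerate(frame):
                mixed[i] += sample
        outputs[target] = mixed
    return outputs


def process(core_streams: Dict[str, bytes],
            side_info: Dict[str, Optional[float]],
            threshold: float,
            decode: Callable[[bytes], Frame]) -> Dict[str, Frame]:
    """Run selection, then decoding of the selected paths only, then mixing."""
    selected = select_for_decoding(side_info, threshold)
    decoded = {path: decode(core_streams[path]) for path in selected}
    return mix_for_each_input(decoded, list(core_streams))
```

Because unselected paths are never passed to `decode`, the decoding cost depends on the number of selected paths and not on the total number of inputs.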
11. The multipoint audio processing device of claim 10, wherein the sound mixing unit is further configured to select a corresponding sound mixing strategy according to the audio mixing side information and to perform sound mixing processing on the decoded signal of the core code stream corresponding to that audio mixing side information.
CNA2007101803163A 2007-10-15 2007-10-15 Audio encoding method and multi-point audio signal mixing control method and corresponding equipment Pending CN101414462A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2007101803163A CN101414462A (en) 2007-10-15 2007-10-15 Audio encoding method and multi-point audio signal mixing control method and corresponding equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2007101803163A CN101414462A (en) 2007-10-15 2007-10-15 Audio encoding method and multi-point audio signal mixing control method and corresponding equipment

Publications (1)

Publication Number Publication Date
CN101414462A true CN101414462A (en) 2009-04-22

Family

ID=40594962

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2007101803163A Pending CN101414462A (en) 2007-10-15 2007-10-15 Audio encoding method and multi-point audio signal mixing control method and corresponding equipment

Country Status (1)

Country Link
CN (1) CN101414462A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102202038A (en) * 2010-03-24 2011-09-28 华为技术有限公司 Method and system for realizing voice energy display, conference server and terminal
CN102968995A (en) * 2012-11-16 2013-03-13 新奥特(北京)视频技术有限公司 Sound mixing method and device of audio signal
CN103050124A (en) * 2011-10-13 2013-04-17 华为终端有限公司 Sound mixing method, device and system
CN103500580A (en) * 2013-09-23 2014-01-08 广东威创视讯科技股份有限公司 Audio mixing processing method and system
CN103686059A (en) * 2013-09-23 2014-03-26 广东威创视讯科技股份有限公司 A distributed-type audio mixing processing method and a system
CN104167210A (en) * 2014-08-21 2014-11-26 华侨大学 Lightweight class multi-side conference sound mixing method and device
CN105847825A (en) * 2015-01-16 2016-08-10 杭州海康威视数字技术股份有限公司 Encoding, index storage and access methods for video encoding code stream and corresponding apparatus
CN105976830A (en) * 2013-01-11 2016-09-28 华为技术有限公司 Audio signal coding and decoding method and audio signal coding and decoding device
US9900720B2 (en) 2013-03-28 2018-02-20 Dolby Laboratories Licensing Corporation Using single bitstream to produce tailored audio device mixes
CN109920445A (en) * 2019-03-04 2019-06-21 北京佳讯飞鸿电气股份有限公司 A kind of sound mixing method, device and equipment
CN110070878A (en) * 2019-03-26 2019-07-30 苏州科达科技股份有限公司 The coding/decoding method and electronic equipment of audio code stream
CN110995946A (en) * 2019-12-25 2020-04-10 苏州科达科技股份有限公司 Sound mixing method, device, equipment, system and readable storage medium
CN113038060A (en) * 2019-12-25 2021-06-25 中国电信股份有限公司 Multi-channel audio processing method and system
CN113674751A (en) * 2021-07-09 2021-11-19 北京字跳网络技术有限公司 Audio processing method and device, electronic equipment and storage medium
CN114710475A (en) * 2022-04-11 2022-07-05 三星电子(中国)研发中心 Streaming media audio fusion method and device

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102202038B (en) * 2010-03-24 2015-05-06 华为技术有限公司 Method and system for realizing voice energy display, conference server and terminal
CN102202038A (en) * 2010-03-24 2011-09-28 华为技术有限公司 Method and system for realizing voice energy display, conference server and terminal
CN103050124A (en) * 2011-10-13 2013-04-17 华为终端有限公司 Sound mixing method, device and system
WO2013053336A1 (en) * 2011-10-13 2013-04-18 华为终端有限公司 Sound mixing method, device and system
CN103050124B (en) * 2011-10-13 2016-03-30 华为终端有限公司 Sound mixing method, Apparatus and system
US9456273B2 (en) 2011-10-13 2016-09-27 Huawei Device Co., Ltd. Audio mixing method, apparatus and system
CN102968995A (en) * 2012-11-16 2013-03-13 新奥特(北京)视频技术有限公司 Sound mixing method and device of audio signal
CN102968995B (en) * 2012-11-16 2018-10-02 新奥特(北京)视频技术有限公司 A kind of sound mixing method and device of audio signal
CN105976830A (en) * 2013-01-11 2016-09-28 华为技术有限公司 Audio signal coding and decoding method and audio signal coding and decoding device
CN105976830B (en) * 2013-01-11 2019-09-20 华为技术有限公司 Audio-frequency signal coding and coding/decoding method, audio-frequency signal coding and decoding apparatus
US10373629B2 (en) 2013-01-11 2019-08-06 Huawei Technologies Co., Ltd. Audio signal encoding and decoding method, and audio signal encoding and decoding apparatus
US9900720B2 (en) 2013-03-28 2018-02-20 Dolby Laboratories Licensing Corporation Using single bitstream to produce tailored audio device mixes
CN103500580A (en) * 2013-09-23 2014-01-08 广东威创视讯科技股份有限公司 Audio mixing processing method and system
CN103500580B (en) * 2013-09-23 2017-04-12 广东威创视讯科技股份有限公司 Audio mixing processing method and system
CN103686059B (en) * 2013-09-23 2017-04-05 广东威创视讯科技股份有限公司 Distributed mixed audio processing method and system
CN103686059A (en) * 2013-09-23 2014-03-26 广东威创视讯科技股份有限公司 A distributed-type audio mixing processing method and a system
CN104167210A (en) * 2014-08-21 2014-11-26 华侨大学 Lightweight class multi-side conference sound mixing method and device
CN105847825A (en) * 2015-01-16 2016-08-10 杭州海康威视数字技术股份有限公司 Encoding, index storage and access methods for video encoding code stream and corresponding apparatus
US10575009B2 (en) 2015-01-16 2020-02-25 Hangzhou Hikvision Digital Technology Co., Ltd. Systems, devices and methods for video coding
CN109920445A (en) * 2019-03-04 2019-06-21 北京佳讯飞鸿电气股份有限公司 A kind of sound mixing method, device and equipment
CN110070878A (en) * 2019-03-26 2019-07-30 苏州科达科技股份有限公司 The coding/decoding method and electronic equipment of audio code stream
CN110995946A (en) * 2019-12-25 2020-04-10 苏州科达科技股份有限公司 Sound mixing method, device, equipment, system and readable storage medium
CN113038060A (en) * 2019-12-25 2021-06-25 中国电信股份有限公司 Multi-channel audio processing method and system
CN110995946B (en) * 2019-12-25 2021-08-20 苏州科达科技股份有限公司 Sound mixing method, device, equipment, system and readable storage medium
CN113038060B (en) * 2019-12-25 2022-11-18 中国电信股份有限公司 Multi-channel audio processing method and system
CN113674751A (en) * 2021-07-09 2021-11-19 北京字跳网络技术有限公司 Audio processing method and device, electronic equipment and storage medium
CN114710475A (en) * 2022-04-11 2022-07-05 三星电子(中国)研发中心 Streaming media audio fusion method and device
CN114710475B (en) * 2022-04-11 2024-03-15 三星电子(中国)研发中心 Streaming media audio fusion method and device

Similar Documents

Publication Publication Date Title
CN101414462A (en) Audio encoding method and multi-point audio signal mixing control method and corresponding equipment
CN103988486B (en) The method of active channel is selected in the audio mixing of multiparty teleconferencing
US9456273B2 (en) Audio mixing method, apparatus and system
CN100393085C (en) Audio signal quality enhancement in a digital network
CN110995946B (en) Sound mixing method, device, equipment, system and readable storage medium
CN102741831B (en) Scalable audio frequency in multidrop environment
CN111583942B (en) Method and device for controlling coding rate of voice session and computer equipment
CN112104836A (en) Audio mixing method, system, storage medium and equipment for audio server
JP2006504300A (en) Method and apparatus for DTMF search and speech mixing in CELP parameter domain
CN110070878B (en) Decoding method of audio code stream and electronic equipment
CN101414463B (en) Method, apparatus and system for encoding mixed sound
JP4033840B2 (en) Audio mixing method, audio mixing apparatus, audio mixing program, and recording medium recording the same
CN112767955B (en) Audio encoding method and device, storage medium and electronic equipment
CN104167210A (en) Lightweight class multi-side conference sound mixing method and device
US7020613B2 (en) Method and apparatus of mixing audios
US6898272B2 (en) System and method for testing telecommunication devices
US20130066641A1 (en) Encoder Adaption in Teleconferencing System
CN105009208A (en) Methods and apparatuses for dtx hangover in audio coding
CN101478616A (en) Instant voice communication method
CN102324235A (en) Sound mixing encoding method, device and system
US20220124201A1 (en) Multipoint control method, apparatus and program
US11810581B2 (en) Multipoint control method, apparatus and program
US11949817B2 (en) Multipoint control method, apparatus and program
US11837241B2 (en) Encoding and decoding method, decoding method, apparatuses therefor and program
US20220116502A1 (en) Multipoint control method, apparatus and program

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090422