CN104486518B - A kind of videoconference distribution sound mixing method under bandwidth limited network environment - Google Patents
A kind of videoconference distribution sound mixing method under bandwidth limited network environment Download PDFInfo
- Publication number
- CN104486518B CN104486518B CN201410729699.5A CN201410729699A CN104486518B CN 104486518 B CN104486518 B CN 104486518B CN 201410729699 A CN201410729699 A CN 201410729699A CN 104486518 B CN104486518 B CN 104486518B
- Authority
- CN
- China
- Prior art keywords
- mixer
- voice
- domain
- audio mixing
- meeting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Landscapes
- Telephonic Communication Services (AREA)
Abstract
The invention discloses the videoconference distribution sound mixing method under a kind of bandwidth limited network environment, the exit deployment mixer of network domains where node of attending a meeting, it is top mixer to use static schema or dynamic mode to specify certain mixer, remaining mixer is secondary mixer, secondary mixer is responsible for the stereo process in this domain, and audio mixing result is sent to top mixer, top mixer attended a meeting node voice by this domain and secondary audio mixing result carries out stereo process again as audio mixing input, this domain is sent the result to attend a meeting node and secondary mixer, the attend a meeting voice of node of top audio mixing result that secondary mixer will be received and this domain carries out the stereo process in this domain as input.The inventive method greatly reduces the Media Stream quantity transmitted between heterogeneous networks domain, when personnel participating in the meeting is increased, the expense of bandwidth limited link between additional voice Media Stream quantity, effectively save domain will not be increased between network domains, and mixer load is reduced, facilitates meeting to extend.
Description
Technical field
The present invention relates to the videoconference distribution sound mixing method under a kind of bandwidth limited network environment.
Background technology
TeleConference Bridge is held a meeting there is provided a kind of convenient solution for strange land.Current telephony conference system master
There is end subscriber mixed mode and concentrate mixed mode.End subscriber mixed mode is to be responsible for selection algorithm by server, and terminal is born
Duty voice mixing treatment, the pattern advantage can be reduce server treatment load, have the disadvantage it is higher to terminal capability requirement,
Speech data redundancy in network, efficiency of bandwidth use is low, easily causes localized network congestion;Mixed mode is concentrated there was only server
Center mixer, all Media Streams are sent to after mixer audio mixing and N bars Media Stream (N is number of terminals of attending a meeting) are sent into each ginseng again
Meeting terminal, the pattern has the convenient advantage of Media Stream mixed processing, has the disadvantage that server performance requirement is higher, it is impossible to avoid office
Portion's network congestion, efficiency of bandwidth use is not high.Both patterns have high requirements to bandwidth, with the increase of number of participants, mix
The performance and bandwidth of sound device turn into unvanquishable bottleneck.
In actual use, the member for participating in videoconference often has concentration of local wide area characteristic distributions, concentrates
Refer to that concentration is compared in same region, same level members network site, often just in same LAN, bandwidth is filled between member
Foot, forms relatively-stationary member's group;Distribution refers between different geographical, different stage member's group or same rank is different
The diverse location of network is distributed between member's group, the communication connection between them is mainly by radio communications such as wide area network or satellites
Net, limited bandwidth is costly.End subscriber mixed mode and concentration mixed mode videoconference do not adapt to this network and open up
The characteristics of flutterring.
The content of the invention
In order to overcome the disadvantages mentioned above of prior art, the invention provides the phone meeting under a kind of bandwidth limited network environment
The distributed sound mixing method of view, overcomes the low drawn game of bandwidth utilization rate that audio medium stream causes when mixer is converged and is distributed
The problem of portion's network congestion, it is to avoid transmission in a network repeats voice, improves efficiency of bandwidth use, and reduce center mixer
Performance cost.
The technical solution adopted for the present invention to solve the technical problems is:A kind of phone meeting under bandwidth limited network environment
The distributed sound mixing method of view, the exit deployment mixer of network domains where node of attending a meeting, using static schema or dynamic analog
Formula specifies certain mixer for top mixer, and remaining mixer is secondary mixer, and secondary mixer is responsible for the audio mixing in this domain
Treatment, and audio mixing result is sent into top mixer, top mixer attends a meeting node voice and secondary audio mixing result this domain
Stereo process being carried out again as audio mixing input, this domain being sent the result to and is attended a meeting node with secondary mixer, secondary mixer is incited somebody to action
The attend a meeting voice of node of the top audio mixing result that receives and this domain carries out the stereo process in this domain as input;Described mixer
Including receiving sending module, voice flow selecting module and mix module, wherein:The reception sending module, when receiving, voice is defeated
It is fashionable, decoded according to voice coding modes, decoded result feeding voice flow selecting module treatment;When voice is sent, according to mesh
End coded system voice is encoded, re-send to destination;The voice flow selecting module, calculates each road voice flow letter
Number energy size, mute signal is filtered by Jing Yin detection, maximum audio mixing number according to configuration and ensures that speech is continuous former
Then filter out satisfactory voice flow;The mix module is processed input voice flow according to Mixed Audio Algorithm.
Compared with prior art, the positive effect of the present invention is:Greatly reduce the media transmitted between heterogeneous networks domain
Stream quantity, when personnel participating in the meeting is increased, due to the preliminary stereo process in this domain, will not increase additional voice matchmaker between network domains
The expense of bandwidth limited link between body stream quantity, effectively save domain, and mixer load is reduced, facilitate meeting to extend.This hair
It is bright to can be used for any network topology structure with integrated distribution formula feature.
Brief description of the drawings
Examples of the present invention will be described by way of reference to the accompanying drawings, wherein:
Fig. 1 is bandwidth limited network topological diagram;
Fig. 2 is distributed audio mixing configuration diagram;
Fig. 3 is the flow chart of the inventive method.
Specific embodiment
Bandwidth limited network topological diagram is as shown in figure 1, integrated distribution formula network is divided into multiple according to terminal distributing position
Network domains, each network domains dispose local concentration audio mixing server, and each audio mixing server has the relation of equity.Network domains it
Between be the wireless communication networks such as wide area network or satellite, limited bandwidth is costly.
When Meeting Held, need to first select certain mixer for top mixer, be responsible for the audio mixing of top.It is distributed
Sound mixing method has two ways to select top mixer, i.e. static schema and dynamic mode, and static schema is by configuration file
Top mixer is specified, will not be changed in configuration modification perclimax mixer;Dynamic mode refer to when videoconference is held according to
The tactful top mixer of dynamic select, under dynamic mode, decision factor is including mixer loading condition, network topological information etc..
Dynamic mode is proposed with the case of it can obtain the network resource informations such as network topology and loading condition, it is possible to achieve audio mixing
The load balancing of device;The reasonable efficient profit of Internet resources can be ensured when can not obtain network resource information by static schema
With.Once top mixer determines, both of which has no Different Effects to the present invention, using identical distribution sound mixing method.
After determining top mixer, remaining mixer is secondary mixer, as shown in Fig. 2 with the audio mixing in network domains 1
Device is confirmed as the distributed audio mixing framework illustrated in conference process as a example by top mixer.
Distributed audio mixing framework is by top mixer (Top-Mixer), secondary mixer (Sub-Mixer), client
(UAC) constitute.The secondary mixer of client connection or top mixer, secondary mixer connect top mixer.
Top mixer and secondary mixer have identical functional module and handling process, and difference is the voice flow of input
Difference, top mixer input is this domain client voice flow and multiple secondary mixer audio mixing streams, and secondary mixer input is
This domain client voice flow and unique top mixer audio mixing stream.Under dynamic designated mode, while the multiple phones held
Meeting can specify different top mixers, mixer can have simultaneously top mixer and secondary mixer role or
Changed between two kinds of roles of top mixer and secondary mixer.
Described mixer includes receiving sending module, voice flow selecting module and mix module.
Sending module is received, when phonetic entry is received, is decoded according to voice coding modes, decoded result feeding voice flow
Selecting module treatment;When voice is sent, voice is encoded according to destination coded system, re-send to destination.
Voice flow selecting module, the module calculates each road voice flow signal energy size, filters Jing Yin by Jing Yin detection
Signal, maximum audio mixing number and the continuous principle of guarantee speech according to configuration filter out satisfactory voice flow, specifically
Screening technique is as follows:
When current mixer has been maxed out audio mixing way n, if the energy value of the (n+1)th road voice signal is more than preceding
Certain energy value all the way in n roads, then control the preceding n roads made a speech to continue to make a speech, and the (n+1)th tunnel wouldn't be allowed to make a speech;Until
When certain stops speech all the way in preceding n roads through making a speech, the (n+1)th tunnel is chosen to add audio mixing, it is allowed to make a speech on the road.
Mix module, the module is processed input voice flow according to Mixed Audio Algorithm, and commonly using Mixed Audio Algorithm is included directly
Clamp method, equalization audio mixing, alignment audio mixing etc., in order to avoid echo, the road need to be removed when audio mixing result is sent to destination
The voice of itself.The selection of Mixed Audio Algorithm, without different influences, performs identical distribution mix process to the present invention, therefore to tool
Body algorithm is no longer described in detail.
Distributed sound mixing method flow of the invention is as shown in figure 3, comprise the following steps:
When step one, reception sending module receive this domain client and other mixer voice flows, according to the mark of voice flow
Knowledge is put into corresponding pending queue;
Step 2, reception sending module are decoded according to the coded system of voice flow, and decoded result are sent into language
Sound stream selecting module;
Step 3, voice flow selecting module calculate decoded voice flow energy value, according to energy value size and guarantee language
Sound principle of continuity selects satisfactory voice to stream to mix module;
Step 4, mix module carry out stereo process to the voice flow for selecting, then after the road voice of itself is removed
Audio mixing result is sent to reception sending module;
Step 5, reception sending module are carried out according to the coded system that purpose client or purpose mixer are supported to voice
After coding, the speech data after coding is sent to corresponding purpose client or purpose mixer.
It is a feature of the present invention that the voice flow of mixer is input into destination except comprising general local VoIP client
End, also comprising other distal end mixers, the output result after the voice flow and distal end mixer audio mixing of local client can be made
For the input of current mixer carries out stereo process again;Certain mixer can be provided simultaneously with top mixer and secondary mixer
Role, or changed between top mixer and secondary two kinds of roles of mixer.When Meeting Held, in static schema
Under, a plurality of voice flow reaches this domain mixer and carries out audio mixing, if the mixer is configured to secondary mixer, then the knot of audio mixing
The top mixer that fruit will be sent to configuration carries out audio mixing again, if the mixer is configured to top mixer, the knot of audio mixing
Fruit is directly returned by source port, and in a dynamic mode, a plurality of voice flow reaches this domain mixer and carries out audio mixing, meanwhile, root
It is secondary mixer or top audio mixing in this conversation procedure according to load condition, the network topological information dynamic decision mixer
Device, the result of decision of this time is to the dynamic decision in next conversation procedure without absolute effect.In conference process, top and secondary
Only comprising up all the way and downlink voice stream all the way between network domains, the voice between secondary network domain is mixed by top mixer
Sound is forwarded, it is not necessary to is increased additional voice stream, is greatlyd save bandwidth cost;Secondary mixer is responsible for the stereo process in place domain,
Top mixer is responsible for the stereo process of place domain and secondary mixer, and this has just disperseed mixer load, reduces to mixer
The requirement of performance.
Operation principle of the invention is:The present invention proposes a kind of distributed sound mixing method using distributed thought, is attending a meeting
The exit deployment mixer of network domains where node, certain mixer is specified using configuration file and based on tactful two ways
It is top mixer, remaining mixer is secondary mixer, secondary mixer is responsible for the stereo process in this domain, and by audio mixing result
It is sent to top mixer, top mixer attended a meeting node voice by this domain and secondary audio mixing result is carried out again as audio mixing input
Stereo process, sends the result to this domain and attends a meeting node and secondary mixer, the top audio mixing result that secondary mixer will be received
With the attend a meeting voice of node of this domain the stereo process in this domain is carried out as input.
Claims (4)
1. a kind of distributed sound mixing method of videoconference under bandwidth limited network environment, it is characterised in that:In the node institute that attends a meeting
Mixer is disposed in the exit of network domains, it is top mixer, remaining mixer to use dynamic mode to specify certain mixer
It is secondary mixer, secondary mixer is responsible for the stereo process in this domain, and audio mixing result is sent into top mixer, top mixed
Sound device attended a meeting node voice by this domain and secondary audio mixing result carries out stereo process again as audio mixing input, sends the result to this
Domain is attended a meeting node and secondary mixer, the top audio mixing result and this domain that secondary mixer will be received attend a meeting node voice as
Input carries out the stereo process in this domain;Described mixer includes receiving sending module, voice flow selecting module and mix module,
Wherein:The reception sending module, when phonetic entry is received, decodes according to voice coding modes, decoded result feeding voice
Stream selecting module treatment;When voice is sent, voice is encoded according to destination coded system, re-send to destination;
The voice flow selecting module, calculates each road voice flow signal energy size, and mute signal is filtered by Jing Yin detection, according to matching somebody with somebody
The maximum audio mixing number and the continuous principle of guarantee speech put filter out satisfactory voice flow;The mix module is according to mixed
Sound algorithm is processed input voice flow;Under dynamic designated mode, while the multiple videoconferences held can be specified not
With top mixer, mixer can have simultaneously top mixer and secondary mixer role or in top mixer and
Changed between two kinds of roles of secondary mixer.
2. the distributed sound mixing method of videoconference under a kind of bandwidth limited network environment according to claim 1, it is special
Levy and be:The voice flow selecting module is to the screening technique of voice flow:Audio mixing road is had been maxed out in current mixer
During number n, if the energy value of the (n+1)th road voice signal is more than certain energy value all the way in preceding n roads, before controlling to make a speech
N roads continue to make a speech, and the (n+1)th tunnel wouldn't be allowed to make a speech;When certain stops speech all the way in the preceding n roads made a speech, n-th is chosen
+ 1 tunnel adds audio mixing, it is allowed to make a speech on the road.
3. the distributed sound mixing method of videoconference under a kind of bandwidth limited network environment according to claim 1, it is special
Levy and be:The sound mixing method of the mixer comprises the following steps:
When step one, reception sending module receive this domain client and other mixer voice flows, the mark according to voice flow is put
Enter corresponding pending queue;
Step 2, reception sending module are decoded according to the coded system of voice flow, and decoded result are sent into voice flow
Selecting module;
Step 3, voice flow selecting module calculate decoded voice flow energy value, are connected according to energy value size and guarantee voice
Continuous principle selects satisfactory voice to stream to mix module;
Step 4, mix module carry out stereo process to the voice flow for selecting, then will be mixed after the road voice of itself is removed
Sound result is sent to reception sending module;
Step 5, reception sending module are encoded according to the coded system that purpose client or purpose mixer are supported to voice
Afterwards, the speech data after coding is sent to corresponding purpose client or purpose mixer.
4. the distributed sound mixing method of videoconference under a kind of bandwidth limited network environment according to claim 1, it is special
Levy and be:The Mixed Audio Algorithm includes directly clamp method, equalization audio mixing and alignment audio mixing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410729699.5A CN104486518B (en) | 2014-12-03 | 2014-12-03 | A kind of videoconference distribution sound mixing method under bandwidth limited network environment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410729699.5A CN104486518B (en) | 2014-12-03 | 2014-12-03 | A kind of videoconference distribution sound mixing method under bandwidth limited network environment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104486518A CN104486518A (en) | 2015-04-01 |
CN104486518B true CN104486518B (en) | 2017-06-30 |
Family
ID=52761020
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410729699.5A Active CN104486518B (en) | 2014-12-03 | 2014-12-03 | A kind of videoconference distribution sound mixing method under bandwidth limited network environment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104486518B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104869004B (en) * | 2015-05-15 | 2017-07-25 | 百度在线网络技术(北京)有限公司 | Audio data processing method and device |
CN106559639B (en) * | 2015-09-29 | 2020-08-14 | 深圳市潮流网络技术有限公司 | Method and system for processing relay video of teleconference |
EP3425891B1 (en) | 2016-02-29 | 2021-01-06 | Audio-Technica Corporation | Conference system |
CN106060450B (en) * | 2016-07-13 | 2019-03-29 | 苏州科达科技股份有限公司 | A kind of sound mixing method and device of video conference |
CN107911361B (en) * | 2017-11-14 | 2020-05-08 | 网易(杭州)网络有限公司 | Voice management method and device supporting multiple sessions, terminal equipment and storage medium |
CN108712584B (en) * | 2018-05-16 | 2020-04-17 | 中国电子科技集团公司第二十八研究所 | Distributed sound mixing method for teleconference |
JP7116240B2 (en) * | 2019-02-19 | 2022-08-09 | 株式会社ソニー・インタラクティブエンタテインメント | Audio output control system, relay device, communication device, audio output control method and program |
CN111585776B (en) * | 2020-05-26 | 2021-06-11 | 腾讯科技(深圳)有限公司 | Data transmission method, device, equipment and computer readable storage medium |
CN113271432B (en) * | 2021-06-30 | 2022-11-18 | 北京二六三企业通信有限公司 | Method and apparatus for transmitting and receiving speaker list |
CN114173011B (en) * | 2021-11-29 | 2024-03-19 | 河北远东通信系统工程有限公司 | Mixing control method for collaborative command media engine |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6728356B1 (en) * | 2000-04-14 | 2004-04-27 | Lucent Technologies Inc. | Method and apparatus for providing telephony services by switch-based processing of media streams |
CN1543181A (en) * | 2003-04-30 | 2004-11-03 | 华为技术有限公司 | A distributed mix processing method |
CN101252452A (en) * | 2007-03-31 | 2008-08-27 | 红杉树(杭州)信息技术有限公司 | Distributed type tone mixing system in multimedia conference |
-
2014
- 2014-12-03 CN CN201410729699.5A patent/CN104486518B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6728356B1 (en) * | 2000-04-14 | 2004-04-27 | Lucent Technologies Inc. | Method and apparatus for providing telephony services by switch-based processing of media streams |
CN1543181A (en) * | 2003-04-30 | 2004-11-03 | 华为技术有限公司 | A distributed mix processing method |
CN101252452A (en) * | 2007-03-31 | 2008-08-27 | 红杉树(杭州)信息技术有限公司 | Distributed type tone mixing system in multimedia conference |
Non-Patent Citations (1)
Title |
---|
混音技术及其在IP电话会议系统中的应用;张传永;《中国优秀硕士学位论文全文数据库》;中国学术期刊(光盘版)电子杂志社;20110715(第7期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN104486518A (en) | 2015-04-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104486518B (en) | A kind of videoconference distribution sound mixing method under bandwidth limited network environment | |
CN1849824B (en) | System and method for performing distributed video conferencing | |
US8634295B2 (en) | System and method for voice and data communication | |
CN102238487B (en) | Multimedia cluster scheduling system and method thereof | |
CN102365857B (en) | Method and apparatus for the efficient transmission of multimedia streams for teleconferencing | |
US8659634B2 (en) | Method and system for implementing three-party video call by mobile terminals | |
CN111541860B (en) | Real-time audio transmission system and using method thereof | |
CN105357208A (en) | Multi-party network audio session method and system | |
CN101594529A (en) | Processing system for video and the method that is used for processing video data | |
CN104363407B (en) | A kind of video conferencing system means of communication and related device | |
EP1675413A3 (en) | Method and system for call path reconfiguration | |
CN103327287B (en) | A kind of conference signal playing method and device, video conference terminal, mobile device | |
CN107800902A (en) | The sound mixing method and system of multi-path voice | |
CN101771854A (en) | Method for initiatively establishing video conference by wireless terminal and system therefor | |
CN102893603A (en) | Video conferencing processing method, apparatus and communication system | |
CN105338288A (en) | Multi-people network video session method and system | |
CN104158673B (en) | Conference model system of selection and server | |
CN106130747A (en) | A kind of method of Conference control, system and mobile terminal | |
CN108712584A (en) | A kind of distributed sound mixing method for videoconference | |
WO2014166366A1 (en) | Method and device for performing capability negotiation in a long term evolution cluster network | |
CN105704423A (en) | Voice output method and device | |
CN105025247A (en) | Video conference forwarding system | |
CN101707593B (en) | Conference system based on tree-shaped servers, PC client sides and telephone terminals | |
CN107294923A (en) | Adjust the method and device of video coding rate | |
CN113810650B (en) | Audio mixing method for realizing multiparty call by vehicle-mounted audio and video monitoring system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |