CN101335867A - Voice excited control method of meeting television system - Google Patents
Voice excited control method of meeting television system
- Publication number
- CN101335867A
- Authority
- CN
- China
- Prior art keywords
- meeting
- terminal
- voice
- place
- place terminal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Abstract
The invention discloses a voice-activated control method for a video conferencing system that includes at least one meeting-place terminal. The method comprises the following steps: A. sampling the voice of each meeting-place terminal at a preset sampling rate within a preset sampling period, and taking the volume of the sampling point with the largest absolute value in that sampling period as the volume reference value of the sampling period; B. taking a preset number of consecutive sampling periods as one collection period, averaging the volume reference values within the collection period, and taking the average as the volume representative value of the corresponding meeting-place terminal; C. taking the meeting-place terminal with the largest volume representative value as the current voice-activated terminal. Because the meeting-place terminal with the largest volume representative value is used as the voice-activated terminal for audio mixing, the speaker is not switched frequently because of incidental sounds, and the whole video conferencing system runs more stably.
Description
Technical field
The present invention relates to the field of video conferencing, and in particular to a voice-activated control method for a video conferencing system.
Background art
With the development of telecommunication technology, video conferencing services are being used more and more widely. In a video conferencing system, the parties speaking in a conference must be identified and their sound mixed so that participants perceive the conversation as natural; at the same time, the speaker's image must be broadcast to the other participants.
A video conferencing system is built around a video conferencing multipoint control unit (MCU), which handles image switching and audio mixing for all meeting-place terminals. Because a video conference usually has many participants, the whole conference must be controlled and managed. This conference control includes, for example, switching the speaker's meeting place, controlling audio mixing, and choosing which meeting place the speaker sees. Depending on who exercises control, conference control falls into three modes: chairman control mode, director control mode, and voice-activated control mode. The three modes are briefly described below.
Chairman control mode: the conference comprises a main meeting place and sub meeting places. The meeting place where the chairman is located is the main meeting place, and the others are sub meeting places. The chairman can actively talk with a sub meeting place or call on it to speak; a sub meeting place that wants to speak must apply to the chairman and may speak once the chairman approves.
Director control mode: conference control is performed by the director (at the operating console of the MCU). The director can designate one meeting place as the speaker, and the other meeting places listen to and watch that speaker.
Voice-activated mode: when several meeting places speak at the same time, the meeting place with the loudest sound is taken as the speaker, and its sound or image is broadcast to the other meeting places.
For discussion-style conferences, switching the speaker's meeting place automatically in voice-activated mode is the more convenient approach; in real-life discussions, people generally tend to listen to the loudest party. In addition, a video conferencing system usually guarantees that the chairman can always speak, but does not accept speech from too many meeting places at once. Typically two or three meeting places can speak simultaneously, for example the chairman plus the speakers selected by the voice-activated mode, and their voices are mixed so that the other meeting places can clearly hear the chairman or the speakers. In voice-activated mode, the multipoint control unit determines the meeting place with the loudest sound and adds that meeting place's sound to the audio mix. Moreover, mirroring real life, people naturally look toward a sound when they hear it. Therefore, when the loudest meeting place is added to the audio mix, the video is also switched to that meeting place and its image is broadcast to the other meeting places. Correspondingly, a meeting place that was judged loudest in the previous round may be dropped from the audio mix after the new judgment.
A current implementation of the voice-activated mode typically works as follows: a sampling period is set, usually 20 milliseconds, and the audio is sampled within it. The volume of the sampling point with the largest absolute value is taken as the volume reference value of a meeting-place terminal. The volume reference values of all meeting-place terminals are compared, the meeting-place terminal with the largest volume reference value is taken as the speaker terminal, its sound is added to the audio mix, and the speaker's image is switched in and broadcast to the other meeting-place terminals.
Figure 1 shows an example video conferencing system comprising a multipoint control unit and four meeting-place terminals. In one example, meeting-place terminals A and B speak at the same time, and the audio of both terminals reaches the multipoint control unit over wired or wireless communication links. The multipoint control unit samples using a preset sampling rate and sampling period. Sampling yields a number of audio sampling points for each meeting-place terminal; the point with the largest absolute volume among them is selected, and its volume value is taken as that terminal's volume reference value for the sampling period.
In the prior art, once the volume reference values have been determined, they are compared. For example, if meeting-place terminal A has the larger volume reference value in a sampling period, then after that sampling period terminal A becomes the voice-activated terminal: its voice is added to the audio mix and, at the same time, the image of meeting place A is switched in and broadcast to the other meeting places. Suppose that in the next sampling period meeting-place terminals A, B, and C speak at the same time. The same procedure is repeated and, say, terminal B is determined to be the voice-activated terminal. Then, in the sampling period after that, the voice of terminal B is added to the audio mix and, at the same time, the image of meeting-place terminal B is switched in and broadcast to the other meeting places.
However, this approach is very likely to cause frequent speaker switching. For example, suppose meeting-place terminal A has been selected as the speaker from among terminals A, B, and C, so the other meeting-place terminals hear the voice and watch the image of terminal A. This should be a stable state. At some moment, however, a cough may come from meeting-place terminal D, or a loud crash may occur at meeting-place terminal C when an object is knocked over. Within a single sampling period, such sudden sounds can easily exceed the normal speech of terminal A and cause an unnecessary switch of the speaker terminal in the next sampling period. This rapid, unnecessary switching between speakers makes the whole system unstable and inefficient.
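The switching problem described above can be reproduced with a minimal sketch of the prior-art rule. The function names and the sample data are hypothetical and are not taken from the patent; three samples per sampling period stand in for the real 160 to keep the example short.

```python
from typing import Dict, List, Sequence


def peak_volume(samples: Sequence[int]) -> int:
    """Volume reference value: largest absolute sample value in one sampling period."""
    return max((abs(s) for s in samples), default=0)


def prior_art_selection(periods: List[Dict[str, Sequence[int]]]) -> List[str]:
    """For every sampling period, pick the terminal with the largest peak."""
    chosen = []
    for period in periods:
        peaks = {name: peak_volume(samples) for name, samples in period.items()}
        chosen.append(max(peaks, key=peaks.get))
    return chosen


# Hypothetical data: terminal A speaks steadily, terminal D coughs in period 3.
periods = [
    {"A": [5000, -8000, 6000], "D": [10, -20, 15]},
    {"A": [7000, -7500, 4000], "D": [12, -9, 30]},
    {"A": [6000, -8000, 5000], "D": [20000, -30000, 9000]},  # cough at D
    {"A": [8000, -6500, 7000], "D": [15, -10, 5]},
]
selection = prior_art_selection(periods)
print(selection)  # ['A', 'A', 'D', 'A']
```

In this made-up data, the selection jumps from A to D and back again within 80 milliseconds, which is the kind of unnecessary switch the passage describes.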
Summary of the invention
In view of this, the technical problem addressed by the present invention is to provide a voice-activated control method for video conferencing that makes the system run more stably.
To achieve this, the invention adopts the following technical solution:
A voice-activated control method for a video conferencing system, the video conferencing system comprising at least one meeting-place terminal, the method comprising the following steps:
A. sampling the voice of each meeting-place terminal at a preset sampling rate within a preset sampling period, and taking the volume of the sampling point with the largest absolute value in the sampling period as the volume reference value of that sampling period;
B. taking a threshold number of consecutive sampling periods as one collection period, averaging the volume reference values within the collection period, and taking the average as the volume representative value of the corresponding meeting-place terminal;
C. taking the meeting-place terminal with the largest volume representative value as the current voice-activated terminal.
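Steps A to C can be written compactly as formulas. The notation is an assumption introduced here for illustration and does not appear in the patent: x_{i,k}[n] is the n-th decoded sample of meeting-place terminal i in sampling period k, N is the number of samples per sampling period, and M is the threshold number of sampling periods in one collection period.

```latex
\begin{aligned}
R_{i,k} &= \max_{0 \le n < N} \bigl\lvert x_{i,k}[n] \bigr\rvert
  && \text{(step A: volume reference value)} \\
V_i &= \frac{1}{M} \sum_{k=1}^{M} R_{i,k}
  && \text{(step B: volume representative value)} \\
i^{*} &= \arg\max_{i} V_i
  && \text{(step C: current voice-activated terminal)}
\end{aligned}
```

The patent does not say how a tie between two terminals with equal V_i is resolved, so the arg max above leaves that case open.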
Further, the voice-activated control method sets a unit time comprising a plurality of consecutive collection periods; if a meeting-place terminal is selected as the voice-activated terminal in every collection period within the unit time, that meeting-place terminal is taken as the current speaker terminal.
The unit time is 1 second.
The preset sampling rate in step A is 8 kHz, and the preset sampling period is 20 milliseconds.
The threshold number of consecutive sampling periods in step B is 4.
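The timing implied by these preferred values can be worked out directly. The symbols are assumptions made here for illustration: f_s is the sampling rate, T_s the sampling period, M the threshold, T_c the resulting collection period, and T_u the unit time.

```latex
\begin{aligned}
N   &= f_s \, T_s = 8000\ \text{Hz} \times 0.020\ \text{s} = 160\ \text{samples per sampling period} \\
T_c &= M \, T_s = 4 \times 20\ \text{ms} = 80\ \text{ms} \\
\frac{T_u}{T_c} &= \frac{1000\ \text{ms}}{80\ \text{ms}} = 12.5
\end{aligned}
```

A 1-second unit time therefore does not contain a whole number of 80-millisecond collection periods. The patent does not state how the fractional period is handled, so an implementation would have to choose, for example, 12 complete collection periods per unit time.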
The beneficial effect of the invention is as follows: a collection period is set, the average of the volume reference values within the collection period is taken as the volume representative value of a meeting-place terminal, and the meeting-place terminal with the largest volume representative value is used as the terminal for audio mixing. This prevents sudden sounds from causing frequent speaker switching and makes the whole video conferencing system run more stably.
Description of drawings
Fig. 1 is a schematic block diagram of a video conferencing system;
Fig. 2 is a flow chart of the control method of a specific embodiment of the invention.
Embodiment
The invention is described in detail below with reference to the accompanying drawings and an embodiment.
Taking G.711-encoded audio as an example (G.711 is a pulse-code-modulation speech codec standard), and as shown in Figure 2, the voice-activated control method of this embodiment comprises the following steps:
1. First, the sampling rate is set to 8 kHz and the sampling period to 20 milliseconds. Every 20 milliseconds, the audio stream that each participating meeting-place terminal sends to the multipoint control unit is decoded, which yields 160 sampling points for each meeting-place terminal.
2. For each meeting-place terminal, the volume with the largest absolute value among the 160 sampling points is taken as that terminal's volume reference value.
3. A threshold number of consecutive sampling periods forms one collection period. In this example the threshold is 4: the volume reference values of 4 consecutive sampling periods are collected, and the 4 volume reference values of the collection period are averaged to give the volume representative value of each meeting-place terminal.
4. Every 80 milliseconds, once the volume representative values are available, the volume representative values of all meeting-place terminals are compared to find the largest one, and the corresponding meeting-place terminal is taken as the current voice-activated terminal and takes part in the audio mix of the next collection period. Note that the meeting-place terminal added to the audio mix is still called the voice-activated terminal here, but it is the terminal with the largest volume representative value in a collection period, not the terminal with the largest volume reference value in a sampling period as in the prior art.
5. A unit time comprising a plurality of consecutive collection periods is set. The unit time can be chosen freely; for example, it can be 1 second. If a meeting-place terminal is selected as the voice-activated terminal in every collection period of a unit time, that is, in every 80-millisecond interval, that meeting-place terminal is switched in as the current speaker terminal, and its image is broadcast to the other meeting-place terminals.
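Steps 1 to 4 above can be sketched as a small helper that receives one decoded sampling period at a time. This is an illustration under stated assumptions, not the patent's implementation: the class and method names are hypothetical, the input is assumed to be linear PCM that has already been decoded from G.711, and terminals that did not deliver all 4 sampling periods are simply skipped for that collection period.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Sequence

SAMPLE_RATE_HZ = 8000
SAMPLING_PERIOD_MS = 20
SAMPLES_PER_PERIOD = SAMPLE_RATE_HZ * SAMPLING_PERIOD_MS // 1000  # 160 samples
PERIODS_PER_COLLECTION = 4  # 4 x 20 ms = one 80 ms collection period


class VoiceActivationSelector:
    """Hypothetical helper that picks the voice-activated terminal per collection period."""

    def __init__(self) -> None:
        self._reference_values: Dict[str, List[int]] = defaultdict(list)

    def add_period(self, terminal: str, samples: Sequence[int]) -> None:
        """Step 2: store the largest absolute sample as the reference value."""
        if len(samples) != SAMPLES_PER_PERIOD:
            raise ValueError(f"expected {SAMPLES_PER_PERIOD} samples, got {len(samples)}")
        self._reference_values[terminal].append(max(abs(s) for s in samples))

    def end_collection_period(self) -> Optional[str]:
        """Steps 3 and 4: average the reference values and pick the largest average."""
        complete = {
            terminal: values
            for terminal, values in self._reference_values.items()
            if len(values) == PERIODS_PER_COLLECTION
        }
        self._reference_values.clear()
        if not complete:
            return None
        representative = {
            terminal: sum(values) / PERIODS_PER_COLLECTION
            for terminal, values in complete.items()
        }
        return max(representative, key=representative.get)


# Hypothetical data: A speaks steadily, D is quiet except for one loud burst.
selector = VoiceActivationSelector()
for k in range(PERIODS_PER_COLLECTION):
    selector.add_period("A", [8000] + [0] * (SAMPLES_PER_PERIOD - 1))
    burst = 30000 if k == 2 else 20
    selector.add_period("D", [-burst] + [0] * (SAMPLES_PER_PERIOD - 1))
winner = selector.end_collection_period()
print(winner)  # A
```

With these made-up numbers, A averages 8000 and D averages 7515, so A remains the voice-activated terminal even though D has the largest single peak.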
As noted above, prior-art video conferencing systems often cause unnecessary switching in voice-activated mode. The invention instead uses delayed processing: a collection period is set, and the average of the volume reference values, as the volume representative value, serves as the criterion for switching the audio mix. Noise such as the cough or the crash of a falling object mentioned above is generally sudden and short-lived, so averaging suppresses it, and the voice of the correct voice-activated terminal is added to the audio mix.
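How much the averaging helps can be estimated with a short calculation. The scenario and symbols are assumptions made here for illustration: terminal A speaks steadily with volume representative value V_A, while terminal D has a quiet-level reference value q in three sampling periods and a burst peak B in the fourth.

```latex
\begin{aligned}
V_D &= \frac{B + 3q}{4} \\
V_D > V_A \;&\Longleftrightarrow\; B > 4V_A - 3q \\
\text{e.g. } V_A = 8000,\ q = 20 \;&\Longrightarrow\; B > 31940
\end{aligned}
```

Under the prior-art rule, a burst only needs to exceed A's peak in a single sampling period (here about 8000) to trigger a switch. With a threshold of 4, the burst must be roughly four times louder, so averaging attenuates isolated bursts rather than removing them outright.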
Further, a set unit time provides a longer delay: only if a meeting-place terminal is the voice-activated terminal in every collection period within the unit time is it taken as the speaker terminal after that unit time, and its image broadcast to the other meeting-place terminals. This further ensures stable operation of the video conferencing system.
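The unit-time rule can be sketched as a small state holder. The names are hypothetical, and two behaviors are assumptions that the patent does not specify: unit times are treated as consecutive, non-overlapping windows, and the previous speaker terminal is kept when no terminal wins every collection period.

```python
from typing import List, Optional


class SpeakerConfirmer:
    """Hypothetical helper that confirms a speaker terminal once per unit time."""

    def __init__(self, periods_per_unit: int, current: Optional[str] = None) -> None:
        if periods_per_unit < 1:
            raise ValueError("periods_per_unit must be at least 1")
        self.periods_per_unit = periods_per_unit
        self.current = current
        self._winners: List[str] = []

    def add_winner(self, terminal: str) -> Optional[str]:
        """Record one collection period's voice-activated terminal.

        Returns the speaker terminal in effect after this collection period.
        """
        self._winners.append(terminal)
        if len(self._winners) == self.periods_per_unit:
            # Switch only if the same terminal won every collection period.
            if len(set(self._winners)) == 1:
                self.current = self._winners[0]
            self._winners.clear()
        return self.current


# Hypothetical run with 12 collection periods per unit time.
confirmer = SpeakerConfirmer(periods_per_unit=12, current="A")
for w in ["B"] * 5 + ["D"] + ["B"] * 6:  # D interrupts once: no switch
    after_first_unit = confirmer.add_winner(w)
for w in ["B"] * 12:  # B wins every collection period: switch to B
    after_second_unit = confirmer.add_winner(w)
print(after_first_unit, after_second_unit)  # A B
```

Because the image is switched only after a full unit time of uninterrupted wins, a single stray collection period is not enough to change the broadcast image.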
The video conferencing system of the invention avoids the frequent, unnecessary switching found in prior-art video conferencing systems, performs audio mixing and image switching for the speaker terminal more accurately and stably, and is therefore more stable and efficient.
The above is a further detailed description of the invention in connection with a specific preferred embodiment, and the specific implementation of the invention should not be regarded as limited to this description. A person of ordinary skill in the art to which the invention belongs may make simple deductions or substitutions without departing from the inventive concept, and all of these should be regarded as falling within the scope of protection of the invention.
Claims (5)
1. A voice-activated control method for a video conferencing system, the video conferencing system comprising at least one meeting-place terminal, characterized in that the method comprises the following steps:
A. sampling the voice of each meeting-place terminal at a preset sampling rate within a preset sampling period, and taking the volume of the sampling point with the largest absolute value in the sampling period as the volume reference value of that sampling period;
B. taking a threshold number of consecutive sampling periods as one collection period, averaging the volume reference values within the collection period, and taking the average as the volume representative value of the corresponding meeting-place terminal;
C. taking the meeting-place terminal with the largest volume representative value as the current voice-activated terminal.
2. The voice-activated control method according to claim 1, characterized in that a unit time is set, the unit time comprising a plurality of consecutive collection periods, and if a meeting-place terminal is selected as the voice-activated terminal in every collection period within the unit time, that meeting-place terminal is taken as the current speaker terminal.
3. The voice-activated control method according to claim 2, characterized in that the unit time is 1 second.
4. The voice-activated control method according to any one of claims 1 to 3, characterized in that the preset sampling rate in step A is 8 kHz and the preset sampling period is 20 milliseconds.
5. The voice-activated control method according to any one of claims 1 to 3, characterized in that the threshold number of consecutive sampling periods in step B is 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007101236914A CN101335867A (en) | 2007-09-27 | 2007-09-27 | Voice excited control method of meeting television system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007101236914A CN101335867A (en) | 2007-09-27 | 2007-09-27 | Voice excited control method of meeting television system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101335867A true CN101335867A (en) | 2008-12-31 |
Family
ID=40198129
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2007101236914A Pending CN101335867A (en) | 2007-09-27 | 2007-09-27 | Voice excited control method of meeting television system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101335867A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102202038A (en) * | 2010-03-24 | 2011-09-28 | 华为技术有限公司 | Method and system for realizing voice energy display, conference server and terminal |
CN102281424A (en) * | 2010-06-11 | 2011-12-14 | 中兴通讯股份有限公司 | Conference site picture broadcasting method and multipoint control unit |
CN103050124A (en) * | 2011-10-13 | 2013-04-17 | 华为终端有限公司 | Sound mixing method, device and system |
CN105307012A (en) * | 2015-11-20 | 2016-02-03 | 青岛海信电器股份有限公司 | Television volume adjustment method and device |
CN106060707A (en) * | 2016-05-27 | 2016-10-26 | 北京小米移动软件有限公司 | Reverberation processing method and device |
CN111785297A (en) * | 2020-07-01 | 2020-10-16 | 广州科天视畅信息科技有限公司 | Voice excitation control method and device |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102202038A (en) * | 2010-03-24 | 2011-09-28 | 华为技术有限公司 | Method and system for realizing voice energy display, conference server and terminal |
CN102281424A (en) * | 2010-06-11 | 2011-12-14 | 中兴通讯股份有限公司 | Conference site picture broadcasting method and multipoint control unit |
WO2011153926A1 (en) * | 2010-06-11 | 2011-12-15 | 中兴通讯股份有限公司 | Method for broadcasting meeting place image and multipoint control unit |
CN102281424B (en) * | 2010-06-11 | 2013-08-07 | 中兴通讯股份有限公司 | Conference site picture broadcasting method and multipoint control unit |
US9456273B2 (en) | 2011-10-13 | 2016-09-27 | Huawei Device Co., Ltd. | Audio mixing method, apparatus and system |
CN103050124A (en) * | 2011-10-13 | 2013-04-17 | 华为终端有限公司 | Sound mixing method, device and system |
WO2013053336A1 (en) * | 2011-10-13 | 2013-04-18 | 华为终端有限公司 | Sound mixing method, device and system |
CN103050124B (en) * | 2011-10-13 | 2016-03-30 | 华为终端有限公司 | Sound mixing method, Apparatus and system |
CN105307012A (en) * | 2015-11-20 | 2016-02-03 | 青岛海信电器股份有限公司 | Television volume adjustment method and device |
CN105307012B (en) * | 2015-11-20 | 2019-06-14 | 青岛海信电器股份有限公司 | A kind of television volume regulating method and device |
CN106060707A (en) * | 2016-05-27 | 2016-10-26 | 北京小米移动软件有限公司 | Reverberation processing method and device |
CN106060707B (en) * | 2016-05-27 | 2021-05-04 | 北京小米移动软件有限公司 | Reverberation processing method and device |
CN111785297A (en) * | 2020-07-01 | 2020-10-16 | 广州科天视畅信息科技有限公司 | Voice excitation control method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101179693B (en) | Mixed audio processing method of session television system | |
US5953049A (en) | Adaptive audio delay control for multimedia conferencing | |
US8175242B2 (en) | Voice conference historical monitor | |
US7428223B2 (en) | Method for background noise reduction and performance improvement in voice conferencing over packetized networks | |
US7567270B2 (en) | Audio data control | |
US8243120B2 (en) | Method and device for realizing private session in multipoint conference | |
CN101473637B (en) | Audio mixing | |
CN1929593B (en) | Spatially correlated audio in multipoint videoconferencing | |
US8379076B2 (en) | System and method for displaying a multipoint videoconference | |
CN101335867A (en) | Voice excited control method of meeting television system | |
EP2154885A1 (en) | A caption display method and a video communication system, apparatus | |
US20020093531A1 (en) | Adaptive display for video conferences | |
US20090028316A1 (en) | Method of and System for Managing Conference Calls | |
WO2002091641A3 (en) | Control unit for multipoint multimedia/audio system | |
GB2412536B (en) | Multipoint conferencing system employing ip network and its configuration method | |
WO2001090839A2 (en) | Participant-controlled conference calling system | |
JPH1075310A (en) | Multi-point video conference system | |
CN108933914B (en) | Method and system for carrying out video conference by using mobile terminal | |
CN101510988A (en) | Method and apparatus for processing and playing voice signal | |
US20010053132A1 (en) | Management method and a conference unit for use in a communication system including user terminals communicating by means of the internet protocol | |
WO2005112413A1 (en) | A method and apparatus of audio switching | |
CN112351237A (en) | Automatic switching decision algorithm for main video of video conference | |
CN101888521A (en) | Roll-call method for video conference | |
CN112019488B (en) | Voice processing method, device, equipment and storage medium | |
EP2285107A1 (en) | Method, conference control equipment and conference system for prompting call progress state |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20081231 |