CN101335867A - Voice excited control method of meeting television system - Google Patents

Voice excited control method of meeting television system Download PDF

Info

Publication number
CN101335867A
CN101335867A CNA2007101236914A CN200710123691A CN101335867A CN 101335867 A CN101335867 A CN 101335867A CN A2007101236914 A CNA2007101236914 A CN A2007101236914A CN 200710123691 A CN200710123691 A CN 200710123691A CN 101335867 A CN101335867 A CN 101335867A
Authority
CN
China
Prior art keywords
meeting
terminal
voice
place
place terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2007101236914A
Other languages
Chinese (zh)
Inventor
唐庶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHENZHEN DVISION TECHNOLOGY Co Ltd
DIWEIXIN SOFTWARE TECHNOLOGY Co Ltd SHENZHEN CITY
Original Assignee
SHENZHEN DVISION TECHNOLOGY Co Ltd
DIWEIXIN SOFTWARE TECHNOLOGY Co Ltd SHENZHEN CITY
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHENZHEN DVISION TECHNOLOGY Co Ltd, DIWEIXIN SOFTWARE TECHNOLOGY Co Ltd SHENZHEN CITY filed Critical SHENZHEN DVISION TECHNOLOGY Co Ltd
Priority to CNA2007101236914A priority Critical patent/CN101335867A/en
Publication of CN101335867A publication Critical patent/CN101335867A/en
Pending legal-status Critical Current

Links

Images

Abstract

The present invention discloses a voice excited control method of meeting television system. The meeting television system includes at least one meeting place terminal. The method includes the following steps: A. sampling voice of each meeting place terminal in a prearranged sampling frequency in the prearranged sampling period, and taking the sound volume with maximum absolute value in sampling points in the sampling period as the sound volume reference value of the sampling period; B. taking continuous number of sampling periods as an acquisition cycle, averaging these sound volume reference values in the acquisition cycle, and taking the average value as the sound volume representative value of the corresponding meeting place terminal; C. taking the meeting place terminal with maximum sound volume representative value as the current voice excited terminal. The invention takes the meeting place terminal with maximum sound volume representative value as the voice-mixing excited terminal, thereby preventing speakers from generating frequent switch caused by accidental voice, and stabilizing the running of the entire meeting television system.

Description

A kind of voice excited control method of TV conference system
Technical field
The present invention relates to the video conferencing field, relate in particular to a kind of voice excited control method of TV conference system.
Background technology
Along with development of telecom technology, video conferencing service has obtained application more and more widely.In TV conference system, need judge a plurality of sides of speaking in the meeting, and, make the participant feel more natural its sound mix; Meanwhile, also to give other participants with spokesman's broadcast of images.
TV conference system is a core with video conferencing multipoint control unit (MCU), is responsible for the image switching and the sound mix of all meeting-place terminals and handles.Because the participant of video conferencing is generally a plurality of, need carry out control and management to whole meeting, i.e. the spokesman meeting-place is for example switched in meeting control, the control audio mixing, and the speech person sees meeting-place or the like.Difference according to the control subject of implementation can be divided into the meeting control model three kinds of different patterns: chairman's control model, director's control model, voice-activated control model.Below three kinds of patterns are described slightly.
Chairman's control model: comprising main meeting-place and sub-venue in the meeting, is the main meeting-place with meeting-place, chairman place, and other are sub-venue, chairman can initiatively and a sub-venue be talked with, its speech is wanted in roll-call, the sub-venue speech needs the application to chairman, through chairman's approval, can make a speech.
Director's control model: carry out meeting control by director's (operating console of MCU), it is initiator that the director can specify a meeting-place, other meeting-place is listened to watch this initiator.
The voice-activated pattern: when the speech of a plurality of meeting-place is arranged at the same time, with the meeting-place of sound maximum as initiator, with its sound or broadcast of images to other meeting-place.
Under the situation of discussion group's pattern meeting, adopting voice-activated pattern automatic switchover spokesman meeting-place is more convenient pattern, and generally speaking, in actual life was discussed, people's ordinary practice was in a side of listening maximum.In addition, in TV conference system, can guarantee that also chairman has the power of speech usually, but speech when not accepting too many meeting-place.So may have two to three meeting-place usually and can make a speech simultaneously, for example comprise chairman, by the spokesman that the voice-activated pattern is selected, these voice will carry out audio mixing to be handled to guarantee that other meeting-place can be good at hearing chairman or spokesman's voice.Under the voice-activated pattern, judge the meeting-place of sound maximum by the meeting multipoint control unit, and the sound in this meeting-place is joined during audio mixing handles; And, the situation of simulating reality, people naturally and understandably can pay close attention to its image when uppick sound.Like this, when joining the audio mixing processing in meeting-place, also will switch to this meeting-place, give other meeting-place the broadcast of images in this meeting-place with the sound maximum.Accordingly, may have in the middle of audio mixing after withdrawing from this and judge in the last meeting-place that once is judged as the sound maximum handles.
Present voice-activated pattern implementation is normally: set a sampling period, normally 20 milliseconds, sample therein, with the volume of the maximum value in the sampled point volume reference value as a meeting-place terminal, the volume reference value that compares each meeting-place terminal,, the sound in this meeting-place is added audio mixing handle as spokesman's terminal with the meeting-place terminal of max volume reference value, and switch this spokesman's broadcast of images and give other meeting-place terminals.
As shown in Figure 1, exemplarily show a TV conference system among the figure, comprised multipoint control unit and 4 meeting-place terminals.In one example, have two meeting-place terminals of A, B and make a speech simultaneously, the audio frequency of two meeting-place terminals arrives multipoint control unit through wired or wireless communication link.Multipoint control unit adopts default sample rate and sampling period to sample.Through over-sampling, for the audio frequency of each meeting-place terminal, all produced a plurality of audio sample points, select the point of volume absolute value maximum in these sampled points, with the volume value of this point as the volume reference value of each meeting-place terminal in this sampling period.
In the prior art, after having determined volume reference value, then compare, for example in sampling period, A meeting-place terminal volume reference value is bigger, then after this sampling period, A meeting-place terminal will be as the voice-activated terminal, and its voice will join in the audio mixing processing, simultaneously, switch A can field picture, be broadcast to other meeting-place.In next sampling period, have A, B, three meeting-place terminals of C are made a speech simultaneously, repeat above-mentioned processing procedure, for example determining B meeting-place terminal is the voice-activated terminal, so, in the following down-sampling cycle, to join the voice of B meeting-place terminal in the audio mixing processing, simultaneously, switch and B meeting-place terminal image, be broadcast to other meeting-place.
Yet such processing mode causes spokesman's frequent switching most probably.For example, when select A meeting-place terminal as the spokesman in A, B, C meeting-place terminal, the voice of other terminals listen A meeting-place, meeting-place terminals are also watched the image of A meeting-place terminal.This should work as is a stable process, yet sometime, a cough sound from D meeting-place terminal may appear, the perhaps huge sound that knocked over of C meeting-place terminal implements, these precipitate sounds may surpass the normal speech of A meeting-place terminal fully and cause the unnecessary switching of spokesman's terminal in next sampling period in a sampling period.Rapid switching between this unnecessary spokesman has caused the unstable and inefficient of whole system.
Summary of the invention
In view of this, technical problem to be solved by this invention has provided a kind of voice excited control method of video conferencing, makes that system's operation is more stable.
To achieve these goals, the present invention adopts following technical scheme:
A kind of voice excited control method of TV conference system, described TV conference system comprise at least one meeting-place terminal, and this method comprises following steps:
A, with the voice of default sample rate each meeting-place terminal of sampling in the default sampling period, with the volume of maximum value in each sampled point in the sampling period volume reference value as this sampling period;
B, with a continuous threshold value sampling period be a collection period, and each volume reference value in this collection period is averaged, with the volume typical value of this mean value as corresponding meeting-place terminal;
C, with the meeting-place terminal of volume typical value maximum as the current speech stimuli terminal.
Further, described voice excited control method is set a unit interval, comprises continuous a plurality of collection period in this unit interval, all be chosen as the voice-activated terminal if any meeting-place terminal each collection period in the described unit interval, then with this meeting-place terminal as current speaker's terminal.
The described unit interval is 1 second.
Default sample rate in the described steps A is 8kHz, and the described default sampling period is 20 milliseconds.
The threshold value in the continuous threshold value sampling period among the described step B is 4.
Beneficial effect of the present invention is: by setting collection period, mean value with the volume reference value in the collection period is the volume typical value of meeting-place terminal, and be audio mixing meeting-place terminal with the meeting-place terminal of volume typical value maximum, thereby can guarantee that the spokesman can frequent switching not take place because of the sound of burst, guarantee that the operation of whole TV conference system is more stable.
Description of drawings
Fig. 1 is the schematic block diagram of a TV conference system;
Fig. 2 is the control method flow chart of the specific embodiment of the invention.
Embodiment
The contrast accompanying drawing elaborates to the present invention in conjunction with embodiment below.
Audio frequency with G.711 encode (a kind of encoding and decoding speech standard of pulse code modulation) is an example, and as shown in Figure 2, the voice excited control method of the specific embodiment of the invention comprises the steps:
1, at first setting sample rate is 8kHz, and the sampling period is 20 milliseconds.Per 20 milliseconds of sound code streams that participant meeting-place terminal is delivered to multipoint control unit are decoded, and like this corresponding to each meeting-place terminal, all obtain 160 sampled points.
2, at each meeting-place terminal, the volume of getting the maximum value of 160 sampled points is as the volume reference value of each meeting-place terminal.
3, be a collection period with a continuous threshold value sampling period, in this example, this threshold value gets 4, and promptly the volume reference value in 4 sampling periods of continuous acquisition is averaged to 4 volume reference value of this collection period, then as the volume typical value of each meeting-place terminal.
4, per 80 milliseconds get the volume typical value after, the volume typical value of all meeting-place terminals is compared, compare maximum volume typical value, with the meeting-place terminal of correspondence as the current speech stimuli terminal, during the audio mixing that participates in next collection period is handled.It should be noted that, herein, still the meeting-place terminal that will join audio mixing is called the voice-activated terminal, but its correspondence is the meeting-place terminal of volume typical value maximum in the collection period, but not the meeting-place terminal of volume reference value maximum in sampling period in the prior art.
5, setting one comprises the unit interval of continuous a plurality of collection period, this unit interval can freely be set, it for example can be 1 second, if certain meeting-place terminal each collection period in a unit interval, be in each 80 milliseconds, all be chosen to be the voice-activated terminal, then this meeting-place terminal switched to current speaker's terminal, give other meeting-place terminals the broadcast of images of this spokesman's terminal.
As previously mentioned, TV conference system of the prior art usually causes unnecessary switching under the voice-activated pattern.The present invention then adopts the delay process pattern, promptly by setting collection period, as the volume typical value, is used as the standard that audio mixing switches with volume reference value mean value.Because noise, for example aforesaid cough sound, the implements sound that falls down to the ground generally is burst, instantaneous, and so down average, these precipitate noises can conductively-closed, thereby in making that the voice of voice-activated terminal are correctly joined audio mixing handling.
Further, unit interval with a setting is reached longer delay process, having only in a meeting-place terminal each collection period in this unit interval all is the voice-activated terminal, just can be after this unit interval with it as spokesman's terminal, give other meeting-place terminals the broadcast of images of this spokesman's terminal.Thereby further guarantee the stable operation of TV conference system.
TV conference system of the present invention, frequent, the unnecessary switching of having avoided TV conference system of the prior art to exist; And the audio mixing that can more accurate, stably finish spokesman's terminal is handled and image switching, and system is stability and high efficiency more.
Above content be in conjunction with concrete preferred implementation to further describing that the present invention did, can not assert that concrete enforcement of the present invention is confined to these explanations.For the general technical staff of the technical field of the invention, without departing from the inventive concept of the premise, can also make some simple deduction or replace, all should be considered as belonging to protection scope of the present invention.

Claims (5)

1. the voice excited control method of a TV conference system, described TV conference system comprises at least one meeting-place terminal, it is characterized in that, and described method comprises following steps:
A, with the voice of default sample rate each meeting-place terminal of sampling in the default sampling period, with the volume of maximum value in each sampled point in the sampling period volume reference value as this sampling period;
B, with a continuous threshold value sampling period be a collection period, and each volume reference value in this collection period is averaged, with the volume typical value of this mean value as corresponding meeting-place terminal;
C, with the meeting-place terminal of volume typical value maximum as the current speech stimuli terminal.
2. voice excited control method as claimed in claim 1, it is characterized in that, set a unit interval, comprise continuous a plurality of collection period in this unit interval, all be chosen as the voice-activated terminal if any meeting-place terminal each collection period in the described unit interval, then with this meeting-place terminal as current speaker's terminal.
3. voice excited control method as claimed in claim 2 is characterized in that, the described unit interval is 1 second.
4. as the arbitrary described voice excited control method of claim 1 to 3, it is characterized in that the default sample rate in the described steps A is 8kHz, the described default sampling period is 20 milliseconds.
5. as the arbitrary described voice excited control method of claim 1 to 3, it is characterized in that the threshold value in the continuous threshold value sampling period among the described step B is 4.
CNA2007101236914A 2007-09-27 2007-09-27 Voice excited control method of meeting television system Pending CN101335867A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2007101236914A CN101335867A (en) 2007-09-27 2007-09-27 Voice excited control method of meeting television system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2007101236914A CN101335867A (en) 2007-09-27 2007-09-27 Voice excited control method of meeting television system

Publications (1)

Publication Number Publication Date
CN101335867A true CN101335867A (en) 2008-12-31

Family

ID=40198129

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2007101236914A Pending CN101335867A (en) 2007-09-27 2007-09-27 Voice excited control method of meeting television system

Country Status (1)

Country Link
CN (1) CN101335867A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102202038A (en) * 2010-03-24 2011-09-28 华为技术有限公司 Method and system for realizing voice energy display, conference server and terminal
CN102281424A (en) * 2010-06-11 2011-12-14 中兴通讯股份有限公司 Conference site picture broadcasting method and multipoint control unit
CN103050124A (en) * 2011-10-13 2013-04-17 华为终端有限公司 Sound mixing method, device and system
CN105307012A (en) * 2015-11-20 2016-02-03 青岛海信电器股份有限公司 Television volume adjustment method and device
CN106060707A (en) * 2016-05-27 2016-10-26 北京小米移动软件有限公司 Reverberation processing method and device
CN111785297A (en) * 2020-07-01 2020-10-16 广州科天视畅信息科技有限公司 Voice excitation control method and device

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102202038A (en) * 2010-03-24 2011-09-28 华为技术有限公司 Method and system for realizing voice energy display, conference server and terminal
CN102281424A (en) * 2010-06-11 2011-12-14 中兴通讯股份有限公司 Conference site picture broadcasting method and multipoint control unit
WO2011153926A1 (en) * 2010-06-11 2011-12-15 中兴通讯股份有限公司 Method for broadcasting meeting place image and multipoint control unit
CN102281424B (en) * 2010-06-11 2013-08-07 中兴通讯股份有限公司 Conference site picture broadcasting method and multipoint control unit
US9456273B2 (en) 2011-10-13 2016-09-27 Huawei Device Co., Ltd. Audio mixing method, apparatus and system
CN103050124A (en) * 2011-10-13 2013-04-17 华为终端有限公司 Sound mixing method, device and system
WO2013053336A1 (en) * 2011-10-13 2013-04-18 华为终端有限公司 Sound mixing method, device and system
CN103050124B (en) * 2011-10-13 2016-03-30 华为终端有限公司 Sound mixing method, Apparatus and system
CN105307012A (en) * 2015-11-20 2016-02-03 青岛海信电器股份有限公司 Television volume adjustment method and device
CN105307012B (en) * 2015-11-20 2019-06-14 青岛海信电器股份有限公司 A kind of television volume regulating method and device
CN106060707A (en) * 2016-05-27 2016-10-26 北京小米移动软件有限公司 Reverberation processing method and device
CN106060707B (en) * 2016-05-27 2021-05-04 北京小米移动软件有限公司 Reverberation processing method and device
CN111785297A (en) * 2020-07-01 2020-10-16 广州科天视畅信息科技有限公司 Voice excitation control method and device

Similar Documents

Publication Publication Date Title
CN101179693B (en) Mixed audio processing method of session television system
US5953049A (en) Adaptive audio delay control for multimedia conferencing
US8175242B2 (en) Voice conference historical monitor
US7428223B2 (en) Method for background noise reduction and performance improvement in voice conferencing over packetized networks
US7567270B2 (en) Audio data control
US8243120B2 (en) Method and device for realizing private session in multipoint conference
CN101473637B (en) Audio mixing
CN1929593B (en) Spatially correlated audio in multipoint videoconferencing
US8379076B2 (en) System and method for displaying a multipoint videoconference
CN101335867A (en) Voice excited control method of meeting television system
EP2154885A1 (en) A caption display method and a video communication system, apparatus
US20020093531A1 (en) Adaptive display for video conferences
US20090028316A1 (en) Method of and System for Managing Conference Calls
WO2002091641A3 (en) Control unit for multipoint multimedia/audio system
GB2412536B (en) Multipoint conferencing system employing ip network and its configuration method
WO2001090839A2 (en) Participant-controlled conference calling system
JPH1075310A (en) Multi-point video conference system
CN108933914B (en) Method and system for carrying out video conference by using mobile terminal
CN101510988A (en) Method and apparatus for processing and playing voice signal
US20010053132A1 (en) Management method and a conference unit for use in a communication system including user terminals communicating by means of the internet protocol
WO2005112413A1 (en) A method and apparatus of audio switching
CN112351237A (en) Automatic switching decision algorithm for main video of video conference
CN101888521A (en) Roll-call method for video conference
CN112019488B (en) Voice processing method, device, equipment and storage medium
EP2285107A1 (en) Method, conference control equipment and conference system for prompting call progress state

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20081231