JPH0484553A - Voice mixing device - Google Patents
Voice mixing deviceInfo
- Publication number
- JPH0484553A JPH0484553A JP19887690A JP19887690A JPH0484553A JP H0484553 A JPH0484553 A JP H0484553A JP 19887690 A JP19887690 A JP 19887690A JP 19887690 A JP19887690 A JP 19887690A JP H0484553 A JPH0484553 A JP H0484553A
- Authority
- JP
- Japan
- Prior art keywords
- voice
- lines
- audio
- information
- voice signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000005540 biological transmission Effects 0.000 claims abstract description 5
- 230000008685 targeting Effects 0.000 claims description 2
- 230000015572 biosynthetic process Effects 0.000 abstract description 11
- 238000003786 synthesis reaction Methods 0.000 abstract description 11
- 238000001514 detection method Methods 0.000 abstract description 8
- 230000002194 synthesizing effect Effects 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 5
- 238000012544 monitoring process Methods 0.000 description 3
- 206010002953 Aphonia Diseases 0.000 description 2
- 238000010187 selection method Methods 0.000 description 2
- 238000002592 echocardiography Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
Abstract
Description
【発明の詳細な説明】
〔産業上の利用分野〕
本発明は音声会議システムに使用される音声ミキシンク
装置に関し、特に複数地点を対象とした多地点の会議シ
ステムにおいて同時に発言を行っている複数地点の音声
情報から限定された数の音声情報のみを選択合成する音
声ミキシンク装置に関する。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to an audio mixing device used in an audio conference system, and particularly in a multi-point conference system that targets multiple points, where multiple points are simultaneously speaking. The present invention relates to an audio mixing device that selects and synthesizes only a limited number of audio information from audio information.
従来、この種の音声ミキシング装置は、複数地点の会議
端末から入力された全ての音声情報を単純に合成してい
た。Conventionally, this type of audio mixing device simply synthesized all audio information input from conference terminals at multiple locations.
複数地点の音声情報を合成する場合、会議端末か設置さ
れる各会議室等の環境にもよるか、5地点以上の会議室
からの音声を無条件に合成した場合、各会議室からのエ
コーか重畳され、合成された音声が聞きとりにくくなる
問題かあり、運用」−あるいは技術上から4地点程度ま
での合成が限度である。従来の音声ミキシング装置ては
、無条件に入力音声情報を合成していたため、多地点の
会議システム等で提供する会議サービスは特に運用面で
制限を受けていた。すなわち、会議への参加地点数を4
地点に制限するとか、エコーの問題を解決せずに品質の
悪い音声で運用せねばならないという欠点があった。When synthesizing audio information from multiple locations, it depends on the environment of the conference terminal or each conference room where it is installed, or if audio from conference rooms from five or more locations is synthesized unconditionally, the echo from each conference room However, there is a problem that the synthesized voice may be difficult to hear due to the superimposed sound, and due to operational or technical reasons, synthesis is limited to about four points. Conventional audio mixing devices unconditionally synthesize input audio information, and therefore conferencing services provided by multipoint conference systems and the like are particularly limited in terms of operation. In other words, the number of participating points in the conference is 4.
The drawbacks were that it was limited to certain locations and had to be operated with poor quality audio without solving the echo problem.
本発明の目的は、品質の良い音声が提供でき、運用面で
の利便性を向上させた多地点会議システムを横築するこ
とのできる音声ミキシング装置を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide an audio mixing device that can provide high-quality audio and can horizontally build a multipoint conference system with improved operational convenience.
本発明の音声ミキシング装置は、複数地点を対象とした
多地点の音声会議システムにおいて、前記複数地点の会
議端末からの音声情報を引込む引込手段と、前記引込手
段で引込まれた各地点の音声情報の音声レベルを一定時
間毎に検出し音声の有音部分及び無音部分を識別する識
別手段と、複数の音声情報の有音部分を検出したときに
前記複数の音声情報のうち特定のN個の音声情報のみを
ある一定時間優先選択する選択手段と、この優先選択さ
れた音声情報を混合する混合手段と、混合された音声情
報のうちそれぞれの送出元から入力された音声情報を削
除し残りの混合された音声情報を前記送信元へ送出する
送出手段とを(taえる構成である。The audio mixing device of the present invention is a multi-point audio conference system targeting a plurality of locations, and includes a pull-in means for pulling in audio information from conference terminals at the plurality of points, and a voice information of each point pulled in by the pull-in means. an identification means for detecting the sound level of the voice at regular intervals and identifying a sound part and a silent part of the sound; A selection means for preferentially selecting only audio information for a certain period of time, a mixing means for mixing the preferentially selected audio information, and a selection means for deleting the audio information input from each transmission source from the mixed audio information, It is configured to include a sending means for sending the mixed audio information to the transmission source.
次に、本発明の実施例について図面を参照して説明する
。Next, embodiments of the present invention will be described with reference to the drawings.
本発明の一実施例を小ず第1Nを参照すると、音声ミキ
シンク装置は、音声情報送受信部1〜3と、音声レベル
検出部4〜6と、優先選択部7と、制御部8と、音声情
報合成部9とを備える。Referring to No. 1N for an embodiment of the present invention, the audio mixing device includes audio information transmitting/receiving units 1 to 3, audio level detection units 4 to 6, a priority selection unit 7, a control unit 8, and an audio and an information synthesis section 9.
音声情報は交換機のネットワークを経由して音声情報送
受信部1〜・3に接続され、音声情報送受信部1〜3で
は、この入力された音声情報を音声レベル検出部4〜6
及び優先選択部7へ出力する。The voice information is connected to the voice information transmitting/receiving sections 1 to 3 via the exchange network, and the voice information transmitting and receiving sections 1 to 3 transmit the input voice information to the voice level detecting sections 4 to 6.
and output to the priority selection section 7.
音声レベル検出部4〜6では、各回線毎の音声情報の有
音部分及び無音部分を判断し、この判断結果を音声情報
の付随データとして制御部8に出力する。この付随デー
タは、回線番号1回線対応の音声の有音部分を認識した
時刻、音声の無音部分を認識した時刻及び音声の有無情
報などから構成される。The audio level detection units 4 to 6 determine whether the audio information for each line is a sound part or a silent part, and output the determination result to the control unit 8 as accompanying data of the audio information. This accompanying data is composed of the time when the active part of the voice corresponding to line number 1 was recognized, the time when the silent part of the voice was recognized, and the presence/absence information of the voice.
第2図は本発明の音声ミキシング装置を使用した音声会
議システムの一実施例を説明するための図である。10
〜12は各会議室などに設置される会議端末で13は交
換機である。複数地点に設置された会議端末のうち会議
に参加する会議端末は交換機13で交換接続され、1゛
1声ミキシンク装置]4に接続される。FIG. 2 is a diagram for explaining an embodiment of an audio conference system using the audio mixing device of the present invention. 10
12 are conference terminals installed in each conference room, and 13 is an exchange. Among the conference terminals installed at a plurality of locations, the conference terminals participating in the conference are exchange-connected by an exchange 13 and connected to a 1-voice mixing device 4.
以下に動作を説明する。会議端末からネットワークを介
した音声情報は音声情報送受信部1〜3で受信され、音
声レベル検出部4〜6及び優先選択部7へ伝達される。The operation will be explained below. Audio information from the conference terminal via the network is received by audio information transmitting/receiving units 1 to 3, and transmitted to audio level detection units 4 to 6 and priority selection unit 7.
音声レベル検出部4〜6では、一定時間ごとに音声の有
音部分及び無音部分を監視して、この監視時間が経過し
ても同一の状態が継続しているか否かにより音声の有音
部分及び無音部分の識別判断を行う。The sound level detection units 4 to 6 monitor the sound portion and silent portion of the sound at regular intervals, and detect the sound portion of the sound depending on whether the same state continues even after the monitoring time has elapsed. and identify silent parts.
音声レベル検出部4〜6から音声の有無及び認識した時
刻を受信した制御部8は、まず音声の有音部分が検出さ
れた回線数が予め設定されたN(Nは正の整数)回線よ
り少ないかどうかをチエツクし、少ない場合には優先選
択部7を制御し、音声の有音部分が検出された回線のみ
を音声情報合成部9に接続する。又、多い場合にはこれ
ら回線で音声の有音部分が認識された時刻を時系列的に
チエツクし、音声の発生した順序に従い早いものからN
同線を選択し、優先選択部7を制御し選択されたN回線
を音声情報合成部9へ接続する。なお、音声の有音部分
が検出された回線かN個以下の場合には無条件にその音
声情報を音声情報合成部9に出力する。The control unit 8, which receives the presence or absence of voice and the recognized time from the voice level detection units 4 to 6, first detects the presence or absence of voice from the preset number of lines N (N is a positive integer) on which the active part of the voice has been detected. It is checked whether the number is low, and if it is, the priority selection section 7 is controlled to connect only the line in which the voiced part of the voice is detected to the voice information synthesis section 9. In addition, if there are many, check the time when the active part of the voice was recognized on these lines in chronological order, and select N from the earliest according to the order in which the voice occurred.
The same line is selected, the priority selection unit 7 is controlled, and the selected N lines are connected to the audio information synthesis unit 9. Note that if the number of lines in which the active portion of voice is detected is N or less, the voice information is unconditionally output to the voice information synthesis section 9.
音声情報合成部9は選択された最大N回線の音声情報を
合成して音声情報送受信部1〜3へ出力し、音声情報送
受信部1〜3では、個々にこの合成された音声情報から
交換機経由で入力されたそれぞれ送出元の音声情報を削
除し、交換機経由で各地点へ出力する。The voice information synthesizing section 9 synthesizes the voice information of the selected maximum N lines and outputs it to the voice information transmitting/receiving sections 1 to 3.The voice information transmitting and receiving sections 1 to 3 individually transmit the synthesized voice information from the synthesized voice information via the exchange. The audio information input at each transmission source is deleted and output to each location via the exchange.
第3図は優先選択部7における選択方法の概念を説明す
るための図である。第3図において、時間軸はマクロ的
な時間(音声の有無を監視する監視時間は考慮しない)
を示し、a〜eはそれぞれ地点1〜地点5の音声情報回
線である。例えは、音声合成可能地点数Nを「4」と仮
定すると、時刻A及び時刻Cでは音声の有音部分が認識
された音声情報回線は、それぞれa、c、d及びbc、
eとなりNより少ないため、これら音声情報回線は音声
情報合成部9に接続され合成される。FIG. 3 is a diagram for explaining the concept of the selection method in the priority selection section 7. In Figure 3, the time axis is macro time (monitoring time for monitoring the presence or absence of audio is not taken into account)
, and a to e are audio information lines at points 1 to 5, respectively. For example, assuming that the number N of points where voice synthesis is possible is "4", the voice information lines in which the voiced part of the voice is recognized at time A and time C are a, c, d, and bc, respectively.
Since the number e is less than N, these voice information lines are connected to the voice information synthesis section 9 and synthesized.
しかし、時刻Bではa〜eの全ての音声情報回線に音声
の有音部分が認識されN以上となるため、音声の有音部
分を認識した時刻の早いものからe、b、a、及びdの
4つの音声情報回線か優先選択され、音声情報合成部9
に接続され合成される。又、時刻AとBあるいは時刻B
とCとの間隔は、音声の有音部分及び無音部分をチエツ
クする一定の時間間隔で、この一定時間内は直前に優先
選択された音声情報回線の音声情報が音声情報合成部9
に出力され合成される。However, at time B, since the active part of the voice is recognized in all the voice information lines a to e, and the number of times is greater than N, e, b, a, and d The four voice information lines are selected preferentially, and the voice information synthesis section 9
are connected and synthesized. Also, time A and B or time B
The interval between and C is a fixed time interval for checking the voiced part and the silent part of the voice, and within this fixed time, the voice information of the voice information line that has been priority-selected immediately before is transferred to the voice information synthesis section 9.
is output and synthesized.
従って、例えは、10地点が参加して会議が行われる場
合、N=4とすると、5地点以上から同時に音声情報が
入力されても、」二連したように4地点からの音声情報
のみが自動的に合成されるので、通常の会議室で発生ず
るエコー程度には十分対応可能となり、運用面でも違和
感なく会議を運営できる。Therefore, for example, if a conference is held with 10 participating locations, and if N = 4, even if audio information is input from 5 or more locations simultaneously, only the audio information from 4 locations will be input in duplicate. Since it is automatically synthesized, it can sufficiently cope with the echoes that occur in a normal conference room, and the conference can be run without any discomfort.
本発明は以上説明したように、複数地点の会議端末から
送信される音声情報から、音声の有音部分が認識された
N回線を音声の発生順序に従い自動的に抽出し合成する
ように構成したので、品質の良い音声が提供でき、運用
面での利便性を向」ニさせた多地点会議システムを構築
することができるという効果を有する。As explained above, the present invention is configured to automatically extract and synthesize N lines in which voiced portions of voices are recognized from voice information transmitted from conference terminals at multiple locations according to the order in which voices are generated. Therefore, it is possible to provide high-quality audio and construct a multipoint conference system with improved operational convenience.
第1図は本発明の一実施例を示す構成図、第2図は本発
明の音声ミキシンク装置を使用した音声会議システムの
一実施例を説明するための図、第3図は優先Ja択部に
おける選択方法の概念を説明するための図である。
1〜3・・・・・・音声情報送受信部、4〜6・・・・
・・音声レベル検出部、7・・・・・・優先選択部、8
・・・・・制御部、9・・・・・音声情報合成部、10
〜12・・・・−・会議端末、13・・・・・・交換機
、14・・・・・・音声ミキシング装置。Fig. 1 is a block diagram showing an embodiment of the present invention, Fig. 2 is a diagram illustrating an embodiment of an audio conference system using the audio mixing device of the invention, and Fig. 3 is a priority Ja selection section. FIG. 2 is a diagram for explaining the concept of a selection method in FIG. 1-3...Audio information transmitting/receiving section, 4-6...
...Audio level detection section, 7...Priority selection section, 8
...Control section, 9...Speech information synthesis section, 10
~12... Conference terminal, 13... Switchboard, 14... Audio mixing device.
Claims (1)
いて、前記複数地点の会議端末からの音声情報を引込む
引込手段と、前記引込手段で引込まれた各地点の音声情
報の音声レベルを一定時間毎に検出し音声の有音部分及
び無音部分を識別する識別手段と、複数の音声情報の有
音部分を検出したときに前記複数の音声情報のうち特定
のN個の音声情報のみをある一定時間優先選択する選択
手段と、この優先選択された音声情報を混合する混合手
段と、混合された音声情報のうちそれぞれの送出元から
入力された音声情報を削除し残りの混合された音声情報
を前記送信元へ送出する送出手段とを備えたことを特徴
とする音声ミキシング装置。In a multi-point audio conferencing system targeting multiple locations, a pull-in means pulls in audio information from conference terminals at the multiple locations, and the audio level of the audio information at each point pulled in by the pull-in means is determined at regular intervals. an identification means for detecting and identifying a voiced part and a silent part of a voice; and when a voiced part of a plurality of voice information is detected, priority is given to only specific N pieces of voice information among the plurality of voice information for a certain period of time; a selection means for selecting; a mixing means for mixing the preferentially selected audio information; and a mixing means for deleting the audio information input from each transmission source from among the mixed audio information and transmitting the remaining mixed audio information. 1. An audio mixing device comprising: a sending means for sending the audio to the original source.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP19887690A JPH0484553A (en) | 1990-07-26 | 1990-07-26 | Voice mixing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP19887690A JPH0484553A (en) | 1990-07-26 | 1990-07-26 | Voice mixing device |
Publications (1)
Publication Number | Publication Date |
---|---|
JPH0484553A true JPH0484553A (en) | 1992-03-17 |
Family
ID=16398394
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP19887690A Pending JPH0484553A (en) | 1990-07-26 | 1990-07-26 | Voice mixing device |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPH0484553A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007096555A (en) * | 2005-09-28 | 2007-04-12 | Nec Corp | Voice conference system, terminal, talker priority level control method used therefor, and program thereof |
JP2009100154A (en) * | 2007-10-16 | 2009-05-07 | Yamaha Corp | Remote conference system, and multiple-point voice connection apparatus |
WO2009060798A1 (en) * | 2007-11-08 | 2009-05-14 | Yamaha Corporation | Voice communication device |
JP2013026851A (en) * | 2011-07-21 | 2013-02-04 | Hitachi Ltd | Private branch exchange |
JP2014520423A (en) * | 2011-05-16 | 2014-08-21 | アルカテル−ルーセント | Method and apparatus for providing bi-directional communication between segments of a home network |
-
1990
- 1990-07-26 JP JP19887690A patent/JPH0484553A/en active Pending
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007096555A (en) * | 2005-09-28 | 2007-04-12 | Nec Corp | Voice conference system, terminal, talker priority level control method used therefor, and program thereof |
JP2009100154A (en) * | 2007-10-16 | 2009-05-07 | Yamaha Corp | Remote conference system, and multiple-point voice connection apparatus |
WO2009060798A1 (en) * | 2007-11-08 | 2009-05-14 | Yamaha Corporation | Voice communication device |
CN101855867A (en) * | 2007-11-08 | 2010-10-06 | 雅马哈株式会社 | Voice communication device |
JP2014520423A (en) * | 2011-05-16 | 2014-08-21 | アルカテル−ルーセント | Method and apparatus for providing bi-directional communication between segments of a home network |
US9749118B2 (en) | 2011-05-16 | 2017-08-29 | Alcatel Lucent | Method and apparatus for providing bidirectional communication between segments of a home network |
JP2013026851A (en) * | 2011-07-21 | 2013-02-04 | Hitachi Ltd | Private branch exchange |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6332153B1 (en) | Apparatus and method for multi-station conferencing | |
US6100882A (en) | Textual recording of contributions to audio conference using speech recognition | |
JP2006087150A (en) | Multi-address communication system to distributed exchange network | |
JP2000270304A (en) | Multispot video conference system | |
JP2725218B2 (en) | Distributed teleconferencing controller | |
GB2493801A (en) | Improved audio quality in teleconferencing system with co-located devices | |
JPH0484553A (en) | Voice mixing device | |
CA1146247A (en) | Multiport conference circuit with voice level coding | |
US7970113B2 (en) | Caller number notification | |
JP2722862B2 (en) | Call system for multipoint distributed connection conference telephone service | |
US6891824B1 (en) | Audible communication with a modem over a wide area network | |
JPS6314588A (en) | Electronic conference system | |
JPH02228158A (en) | Video conference equipment | |
JPH02142238A (en) | Inter-multispot communication conference control system | |
JP2588970B2 (en) | Multipoint conference method | |
JP3102328B2 (en) | Audio conference system | |
JP2766682B2 (en) | Multimedia communication conference terminal | |
US20050058275A1 (en) | Audio source identification | |
JPH01233932A (en) | Alarm collection method for specific channel remote station | |
JPS63102520A (en) | Network supervisory equipment | |
Bially et al. | Information Processing Techniques Program. Volume 1. Packet Speech Systems Technology | |
JPS622762A (en) | Transmission system for voice information | |
JPH01200858A (en) | System for remote maintenance | |
JPH08237627A (en) | Multi-point video conference system | |
JPS6292653A (en) | Speech processor |