JPH07303148A

JPH07303148A - Communication conference equipment

Info

Publication number: JPH07303148A
Application number: JP6095949A
Authority: JP
Inventors: Ikuichirou Kinoshita; 郁一郎木下; Shigeaki Aoki; 茂明青木
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1994-05-10
Filing date: 1994-05-10
Publication date: 1995-11-14

Abstract

PURPOSE:To attain the communication of quality equal to that in a same space conversation by implementing signal processing providing effective sound image localization to a recipient in a short processing time in a voice communication among plural points. CONSTITUTION:A call information detection section 2 detects a call start signal and a call end signal based on a voice signal picked up by a microphone 1. A line decision section 3 decides a line through which a voice signal whose localization is to be processed or a line through which the voice signal is added based on call information. A localization processing adder section 4 synthesizes left and right channels of voice signals to provide a sense of localization to a prescribed sound image position and adds the voice signal sent from other point to the voice signal subject to localization processing. Through the constitution above, n-sets of localization processing sections making sound image localization signal processing whose localization number is (n) are installed in parallel and a sense of localization is provided to the different voice signal among n-points at maximum.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、通信会議装置に関
し、特に、複数地点を結んだ音声通信において音声を再
生するに際して、通信相手である送話者の音声を受話者
が音像定位技術により所定の位置に定位して聴取する通
信会議装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a communication conference apparatus, and more particularly, when a voice is reproduced in voice communication connecting a plurality of points, the voice of a talker who is a communication partner is specified by a listener by a sound image localization technique. The present invention relates to a communication conference device that localizes and listens to a position.

【０００２】[0002]

【従来の技術】互いに離隔した複数地点間の音声通信に
おいて、受話者は、通信相手の音声信号を単に加算して
再生聴取することのみによっては、多数の地点における
通信相手である送話者を同時に同定することができず、
発生内容を明瞭に了解することは困難である。2. Description of the Related Art In voice communication between a plurality of points which are separated from each other, a listener only needs to add a voice signal of a communication partner and listen to it to listen to a talker who is a communication partner at many points. Could not be identified at the same time,
It is difficult to understand clearly what happened.

【０００３】この問題を解決するに、複数の地点から送
信された音声信号に信号処理を施して複数の音像をそれ
ぞれ異なる位置に定位する様にする試みがなされてい
る。人間は音源から発せられた音を聴取することにより
音源の位置および方向を知覚しており、即ち、定位感を
得ることができる。スピーカ、イヤホンの如き音源によ
り音声を再生する場合、音声信号に信号処理を施すこと
によりこれらの音源とは異なる空間方向に音声を定位さ
せることができる。そして、複数の音声が同時に受話者
に呈示されている場合、それぞれの音声を互いに異なる
空間方向に定位させることにより音量を増加させること
なくしてこれらの音声の明瞭度を向上し、話者同定を容
易にすることができることも知られている。In order to solve this problem, it has been attempted to perform signal processing on the voice signals transmitted from a plurality of points to localize a plurality of sound images at different positions. Human beings perceive the position and direction of the sound source by listening to the sound emitted from the sound source, that is, they can obtain a sense of localization. When sound is reproduced by a sound source such as a speaker or an earphone, the sound can be localized in a spatial direction different from those of the sound source by performing signal processing on the sound signal. Then, when multiple voices are presented to the listener at the same time, the intelligibility of these voices is improved without increasing the volume by localizing each voice in mutually different spatial directions, and speaker identification is performed. It is also known that it can be facilitated.

【０００４】ここで、図１ないし図３を参照して音像定
位技術の概略を説明する。図１は伝達関数の畳み込みに
よる音像定位技術の概念図を示す。図１（ａ）は音源と
受話者との間の関連を示す図であり、図１（ｂ）は相異
なる位置に設置される音源１および音源２を使用した場
合においても、音源と受話者間の伝達関数を制御して音
響信号に畳み込み、聴取位置における実音源による音響
信号を再現することにより、聴取位置において図１
（ａ）における音源の位置に音像を定位することができ
る。An outline of the sound image localization technique will be described with reference to FIGS. 1 to 3. FIG. 1 shows a conceptual diagram of a sound image localization technique by convolution of a transfer function. FIG. 1 (a) is a diagram showing a relationship between a sound source and a listener, and FIG. 1 (b) is a sound source and a listener even when using a sound source 1 and a sound source 2 installed at different positions. 1 is reproduced at the listening position by controlling the transfer function between the convolutions with the acoustic signal and reproducing the acoustic signal from the real sound source at the listening position.
The sound image can be localized at the position of the sound source in (a).

【０００５】図２はレベル差の制御による音像定位位置
の制御を説明する図である。図２（ａ）は受話者前方左
右２個の音源である音源Ａおよび音源Ｂを使用し、同一
信号にレベル差を付与して再生した場合の概念図であ
り、図２（ｂ）は音源Ａおよび音源Ｂそれぞれに対する
相対信号強度と受話者の音像の定位位置との間の関係を
示す図である。FIG. 2 is a diagram for explaining control of the sound image localization position by controlling the level difference. FIG. 2 (a) is a conceptual diagram when a sound source A and a sound source B, which are two sound sources on the left and right in front of the listener, are used and reproduced by giving a level difference to the same signal, and FIG. 2 (b) is a sound source. It is a figure which shows the relationship between the relative signal strength with respect to each of A and the sound source B, and the localization position of the sound image of a listener.

【０００６】図３は時間差の制御による音像定位位置の
制御例をを説明する図である。図３（ａ）は受話者前方
左右２個の音源である音源Ａおよび音源Ｂを使用し、同
一信号に時間差を付与して再生した場合の概念図を示す
図である。図３（ｂ）は音源Ａと音源Ｂとの間の左右時
間差と受話者の音像の定位位置との間の関係を示す図で
ある。なお、Ｂが先行したときは（Ｂ−Ａ）は負値であ
る。FIG. 3 is a diagram for explaining an example of controlling the sound image localization position by controlling the time difference. FIG. 3A is a diagram showing a conceptual diagram in the case of using the sound source A and the sound source B, which are the two sound sources on the left and right in front of the listener, and reproducing the same signal with a time difference. FIG. 3B is a diagram showing the relationship between the left-right time difference between the sound source A and the sound source B and the localization position of the sound image of the listener. When B precedes, (B-A) is a negative value.

【０００７】上述した通り、音像の定位位置を制御する
には、図１に示される如く音源と聴取位置間の伝達関数
を音響信号に畳み込んだり、図２或は図３に示される如
く複数音源間のレベル差或は時間差を制御する方法があ
る。しかし、この様な信号処理を施すには多量の演算量
を要し、信号処理に必要なパラメータは音像の定位位置
に依存して大きくなる。異なる空間方向に定位感を付与
する場合、予め異なる定位方向をもたらすパラメータを
設定して定位処理を行なうことが考えられる。そして、
信号処理時間は、相互に離隔した複数地点間において自
然な意志疎通を許容する遅延時間とされる数１０ｍｓよ
り短くしなければならない。As described above, in order to control the localization position of the sound image, the transfer function between the sound source and the listening position is convoluted into the acoustic signal as shown in FIG. 1 or plural as shown in FIG. 2 or FIG. There is a method of controlling the level difference or time difference between sound sources. However, a large amount of calculation is required to perform such signal processing, and the parameters required for signal processing increase depending on the localization position of the sound image. In the case of giving a localization feeling to different spatial directions, it is considered that a localization process is performed in advance by setting parameters that bring about different localization directions. And
The signal processing time must be shorter than several tens of ms which is a delay time that allows natural communication between a plurality of points separated from each other.

【０００８】この様にするには、それぞれの地点から送
信される音声信号について音像定位信号処理即ち定位処
理を、信号処理専用の素子を使用して並列に１段階処理
する必要がある。そして、複数の通信相手との間の音声
通信において、それぞれ異なる空間方向への定位感を実
現するには、各空間方向について各別の信号処理用素子
を使用して定位処理することが現実的である。ここで、
複数地点間の音声通信において、各地点から送信される
音声が同時に発話されている場合、各地点から送信され
る音声信号をそれぞれ各別の信号処理用素子に入力する
ことにより遅延時間を少なくしてそれぞれ異なる方向の
定位感を各地点から送信される音声に付与することがで
きるに到る。数１０ｍｓ以下程度のオーダーの短い遅延
時間の音声信号処理用回路素子も既に開発されている。In order to do so, it is necessary to perform the sound image localization signal processing, that is, the localization processing on the audio signals transmitted from the respective points in parallel in one step by using an element dedicated to the signal processing. Then, in voice communication with a plurality of communication partners, in order to realize localization feeling in different spatial directions, it is realistic to perform localization processing using different signal processing elements in each spatial direction. Is. here,
In the voice communication between multiple points, when the voices transmitted from each point are spoken at the same time, the delay time can be reduced by inputting the voice signals transmitted from each point to each different signal processing element. Thus, localization feelings in different directions can be added to the voice transmitted from each point. An audio signal processing circuit element having a short delay time on the order of several tens of ms or less has already been developed.

【０００９】一方、同一空間において通信装置を介さず
に同時に複数の話者が対話している場合、受話者は音声
のみを手掛かりとして最大４〜５人程度までしか話者を
同定することができないことが知られている。そして、
通常の対話においては同時に多数の話者が発話している
ことは一話者のみが発話している状態と比較して稀な状
態であり、実際は同時に発話が開始されることは少な
い。また、音像定位、話者同定するには発話開始直後に
おける音声が大きく貢献することも調べられている。On the other hand, when a plurality of speakers are talking at the same time without using a communication device in the same space, the listener can identify only up to about 4 to 5 speakers by using only the voice as a clue. It is known. And
In a normal dialogue, it is rare that a large number of speakers are speaking at the same time as compared with a state where only one speaker is speaking, and in reality, speaking is rarely started at the same time. It has also been investigated that the sound immediately after the start of speech greatly contributes to sound image localization and speaker identification.

【００１０】[0010]

【発明が解決しようとする課題】ところで、同時発話中
の通信相手数が受話回路装置において装備される定位処
理部の数より多い場合、最後に発話開始された或る地点
から送信された音声信号を優先して音像定位処理を施
し、それ以外の地点から送信される音声信号を音像定位
処理した音声信号に単に加算すれば定位感および話者同
定を損なうことなく音声を再生することができると考え
られている。しかし、長時間に亘り音声信号を単に加算
して再生していると、受話者は時間が経過するに伴い各
話者の音声の特徴を忘却して話者同定が困難になる。By the way, when the number of communicating parties during simultaneous utterance is larger than the number of localization processing units equipped in the receiving circuit device, a voice signal transmitted from a certain point where the last utterance is started. By prioritizing the sound image localization process and adding the sound signal transmitted from other points to the sound signal subjected to the sound image localization process, the sound can be reproduced without impairing the localization feeling and speaker identification. It is considered. However, if the voice signals are simply added and played back for a long time, the listener forgets the features of the voice of each speaker over time, and it becomes difficult to identify the speaker.

【００１１】この発明は、上述した通りの人間の知覚、
対話の特徴を利用し、複数地点間の音声通信において、
短い処理時間により受話者に効果的な音像定位をもたら
す信号処理を行なうことにより、受話者による話者同定
を容易にすると共に、音響信号処理に起因する遅延によ
り自然性が損なわれることの少ない同一空間内における
対話と同等な品質の通信をする通信会議装置を提供する
ものである。The present invention is based on the above-mentioned human perception,
Utilizing the features of dialogue, in voice communication between multiple points,
Signal processing that provides effective sound image localization to the listener with a short processing time facilitates speaker identification by the listener, and the naturalness is less likely to be impaired by delays caused by acoustic signal processing. It is intended to provide a communication conference apparatus for performing communication with a quality equivalent to that of dialogue in a space.

【００１２】[0012]

【課題を解決するための手段】音声信号を収録して他地
点へ伝送する送話回路装置（１）、他地点から伝送され
る音声信号を再生する受話回路装置（３、４、５）、相
異なる複数ｍ地点同士を物理的或は論理的に接続する伝
送チャネルを有する伝送回線（６）より成る通信会議装
置において、伝送される音声信号に音像定位処理を施す
定位処理部（４１）および発話中の通信相手による音声
信号を音像定位処理された音声信号に加算する加算部
（４２）より成る定位処理加算部（４）ｎ組を並列に具
備し、通信相手の音声信号を伝送する伝送回線（６）と
ｎ組の定位処理部（４１）および加算部（４２）との間
の接続を切り替える回線決定部（３）を具備する通信会
議装置を構成した。Means for Solving the Problems A transmitter circuit device (1) for recording a voice signal and transmitting it to another point, a receiver circuit device (3, 4, 5) for reproducing a voice signal transmitted from another point, In a communication conferencing apparatus comprising a transmission line (6) having a transmission channel for physically or logically connecting different m points, a localization processing unit (41) for performing sound image localization processing on a transmitted audio signal, and A transmission for transmitting a voice signal of a communication partner, which is provided with n sets of localization processing addition units (4) in parallel, each of which includes an addition unit (42) for adding a voice signal of a communication partner who is uttering a voice signal subjected to sound image localization processing. A communication conferencing apparatus is provided with a line determining unit (3) for switching the connection between the line (6) and the n sets of localization processing units (41) and adding units (42).

【００１３】そして、回線決定部（３）は通信相手の発
話開始および発話終了を示す発話情報に基づいて制御さ
れるものである通信会議装置を構成した。また、送話回
路装置は発話情報検出部（２）を具備し、検出された発
話情報を伝送回線（６）を介して受話回路装置に伝送す
る通信会議装置を構成した。更に、発話中の音声信号の
内の最後に発話開始された音声信号に音像定位処理を優
先して施し、その他の音声信号はこれを音像定位処理を
施した音声信号に加算する通信会議装置を構成した。The line determining unit (3) constitutes a communication conference apparatus which is controlled based on the utterance information indicating the utterance start and utterance end of the communication partner. Further, the transmitting circuit device is provided with the utterance information detecting section (2), and the communication conferencing device configured to transmit the detected utterance information to the receiving circuit device via the transmission line (6). Furthermore, a communication conferencing apparatus that prioritizes the sound image localization process on the last voice signal of the voice signals being uttered and adds the other voice signals to the voice signal subjected to the sound image localization process is provided. Configured.

【００１４】また、最初に加算開始された音声信号が発
話された地点から伝送された音声信号を優先して音像定
位処理を再開する通信会議装置をも構成した。Further, a communication conferencing apparatus for resuming the sound image localization processing by prioritizing the voice signal transmitted from the point where the voice signal whose addition has been started first is uttered is also configured.

【００１５】[0015]

【実施例】この発明の実施例を図４ないし図１０を参照
して具体的に説明する。先ず、図５を参照するに、各地
点の送話回路装置において発話情報を検出し、検出した
発話情報を他地点の受話回路装置に伝送し、受話回路装
置において伝送された発話情報に基づいて定位処理すべ
き音声信号を伝送した回線および加算すべき音声信号を
伝送した回線を決定する回線決定部を具備し、１組の定
位処理部および加算部を具備して、他地点から伝送され
る音声信号を両耳イヤホンにより再生するこの発明の通
信会議装置の一例である。この通信会議装置は送話回路
装置、受話回路装置および伝送回線６より成る。１はマ
イクロホン、２は発話情報検出部であり、これらマイク
ロホン１および発話情報検出部２により送話回路装置は
構成される。この発話情報検出部２は、マイクロホン１
により収録される音声信号に基づいて発話情報である発
話開始信号および発話終了信号を検出する。３は回線決
定部、４は定位処理加算部であり、これら回線決定部３
および定位処理加算部４により受話回路装置は構成され
る。この回線決定部３は、発話情報に基づいて、定位処
理されるべき音声信号を伝送した回線或は加算処理され
るべき回線を決定する部である。定位処理加算部４は、
音声信号に対して所定の音像位置への定位感を付与する
音声信号を左右２チャネルについて合成し、他の地点か
ら伝送された音声信号を定位処理された音声信号に加算
する部である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT An embodiment of the present invention will be specifically described with reference to FIGS. First, referring to FIG. 5, utterance information is detected in the transmitting circuit device at each point, the detected utterance information is transmitted to the receiving circuit device at another point, and based on the utterance information transmitted in the receiving circuit device. A line determining unit for determining a line transmitting a voice signal to be subjected to localization processing and a line transmitting a voice signal to be added is provided, and a set of localization processing unit and addition unit is provided to transmit from another point. 1 is an example of a communication conference apparatus of the present invention that reproduces a voice signal by a binaural earphone. This communication conferencing device comprises a transmitter circuit device, a receiver circuit device and a transmission line 6. Reference numeral 1 is a microphone, 2 is an utterance information detection unit, and the microphone 1 and the utterance information detection unit 2 constitute a transmission circuit device. The utterance information detection unit 2 is provided with the microphone 1
The utterance start signal and utterance end signal, which are utterance information, are detected based on the voice signal recorded by. Reference numeral 3 is a line determination unit, and 4 is a localization processing addition unit.
And the localization processing addition unit 4 constitutes a reception circuit device. The line determining unit 3 is a unit that determines a line that has transmitted a voice signal to be localized or a line to be added, based on the utterance information. The localization processing addition unit 4
This is a unit for synthesizing audio signals that give a feeling of localization to a predetermined sound image position to the audio signals for the left and right two channels, and adding the audio signals transmitted from other points to the audio signals subjected to localization processing.

【００１６】定位処理加算部４の詳細は図６に示され
る。ｎ組（イヤホン聴取の場合左右２ｎ個）の定位処理
部４１_Lおよび４１_R、各定位処理部の後段に対応接続
されるｎ組の加算部４２_Lおよび４２_R、各地点から伝
送される音声信号を受け渡す端子Ｔ₁およびＴ₂、と定
位処理部と加算部４２に音声信号を受け渡す（ｎ＋１）
個の端子ｔ₁、ｔ₂、ｔ₃より成る回線決定部４１とに
より構成される。音声信号は矢印つき実線、発話情報は
矢印つき破線に示す順路に従って各機能間を受け渡され
る。５は左右２チャネルのイヤホンを示す。図５および
図６はｎ＝１の場合を示している。The details of the localization processing addition section 4 are shown in FIG. n sets (2n left and right in the case of listening to earphones) of localization processing units 41 _L and 41 _R , n sets of addition units 42 _L and 42 _R correspondingly connected to the subsequent stages of the localization processing units, and audio transmitted from each point Transfers audio signals to terminals T ₁ and T _{2 for} transferring signals, and localization processing unit and addition unit 42 (n + 1)
It is configured by a line determining unit 41 including individual terminals t ₁ , t ₂ , and t ₃ . The voice signal is transferred between the respective functions according to the route shown by the solid line with an arrow and the utterance information according to the broken line with an arrow. Reference numeral 5 denotes a left and right two-channel earphone. 5 and 6 show the case where n = 1.

【００１７】図５の通信会議装置において定位処理およ
び加算処理の切り替えをするには、各地点の送話者の発
話開始および発話終了を示す発話情報を必要とするが、
これは発話情報検出部２により検出伝送される。なお、
通常の会話においては発話情報量は数１０ｂ／ｓ以下の
オーダーであり、これは数１０ｋｂ／ｓ・チャネルのオ
ーダーの音声情報量と比較して格段に少ないので、発話
情報の伝送に伴う情報量の増加は殆ど無視することがで
きる。そして、発話情報検出部２により検出伝送される
発話情報により回線決定部３を制御し、各地点に接続す
る音声信号回線と定位処理加算部４との間の接続を交換
する。即ち、発話情報は各地点について独立であるか
ら、各地点における送話回路装置により発話開始および
発話終了を検出して他の地点の受話回路装置に対してこ
れら検出された発話情報を送信し、受話回路装置におい
て各送話回路装置から伝送された発話情報を受信して定
位処理すべき音声信号を伝送する回線および加算処理す
べき音声信号を伝送する回線を決定する回線決定部３を
具備する。また、音声信号を伝送する回線と定位処理部
および加算部との間の接続は物理的接続のみならず論理
的接続によるものとすることができる。To switch between the localization process and the addition process in the communication conference apparatus of FIG. 5, utterance information indicating the utterance start and utterance end of the speaker at each point is required.
This is detected and transmitted by the utterance information detector 2. In addition,
In a normal conversation, the amount of utterance information is on the order of several tens of b / s or less, which is significantly smaller than the amount of voice information on the order of several tens of kb / s · channel. The increase in is almost negligible. Then, the line determining unit 3 is controlled by the utterance information detected and transmitted by the utterance information detecting unit 2, and the connection between the voice signal line connected to each point and the localization processing adding unit 4 is exchanged. That is, since the utterance information is independent for each point, the utterance start and utterance end are detected by the utterance circuit device at each point and the utterance information detected is transmitted to the receiver circuit device at another point, The receiving circuit device is provided with a line determining unit 3 which receives the speech information transmitted from each transmitting circuit device and determines a line for transmitting a voice signal to be localized and a line for transmitting a voice signal to be added. . Further, the connection between the line transmitting the audio signal and the localization processing unit and the addition unit may be not only a physical connection but also a logical connection.

【００１８】ここで、図４は、Ａ地点、Ｂ地点、Ｃ地点
間の対話の場合にＡ地点およびＢ地点の音声をＣ地点に
おいて聴取する際の状況を示している。なお、定位処理
機能の数ｎを１としている。図４は、最初にＡ地点にお
いて発話開始し、次にＢ地点において発話開始および発
話終了し、最後にＡ地点において発話終了する状況を一
例として想定している。先ず、Ａ地点において発話開始
したとき、Ｂ地点において発話開始されるまでＡ地点に
おいて収録された音声に定位処理を施す。次に、Ｂ地点
における発話開始から終了に到るまでは、Ｂ地点におい
て収録された音声信号に定位処理を施し、Ａ地点におい
て収録された音声については音像定位信号処理を中断し
てこれをＢ地点において収録されて定位処理が施された
音声信号に加算する。Ｂ地点において発話が終了した場
合に、再びＡ地点において収録された音声信号に定位処
理を施す。図５に示されるこの発明の通信会議装置は上
述の如き処理を実施するものである。Here, FIG. 4 shows a situation in which the sounds at the points A and B are heard at the point C in the case of a dialogue between the points A, B, and C. Note that the number n of localization processing functions is one. FIG. 4 assumes, as an example, a situation in which utterance starts first at point A, then utterance starts and ends at point B, and finally utters at point A. First, when speech is started at point A, localization processing is performed on the voice recorded at point A until speech is started at point B. Next, from the start to the end of utterance at the point B, the sound signal recorded at the point B is subjected to localization processing, and the sound image recorded at the point A is interrupted by the sound image localization signal processing. It is added to the audio signal recorded at the point and subjected to localization processing. When the utterance ends at the point B, the localization processing is performed again on the audio signal recorded at the point A. The communication conference apparatus of the present invention shown in FIG. 5 carries out the above-described processing.

【００１９】図７は定位処理すべき音声信号を伝送する
回線および加算する音声信号を伝送する回線を決定する
回線決定部３の動作を説明するフローチャートである。
発呼後、通信を行なうために接続された相手の地点の確
認を行なう。次に、位置情報、定位処理部４１或は加算
部４２の制御に必要なパラメータを設定する。パラメー
タ設定完了後、通信を開始する。発話開始信号受信直
後、発話開始に伴う回線の定位処理部４１への接続或は
定位処理部４１から加算部４２への切替を行なう。ま
た、発話終了信号受信直後、発話終了に伴う定位処理部
４１への回線接続の切断或は加算部４２から定位処理部
４１への切替を行なう。なお、通信中は呼が終了しない
限り発話開始および発話終了信号を受信待ちにする。一
例として、定位処理すべき或は加算すべき音声信号を伝
送する回線を決定する制御パラメータとして定位優先度
ｐ、各地点からの回線についてｐｒｉｏを導入し、以下
の通りの操作を行なうことによりｐの値に現在加算部に
接続している回線数を示し、ｐｒｉｏに加算部に接続さ
れている回線の接続開始された時間的順序を示す様に設
定する。FIG. 7 is a flow chart for explaining the operation of the line deciding unit 3 for deciding the line for transmitting the voice signal to be localized and the line for transmitting the voice signal to be added.
After making a call, the location of the other party connected for communication is confirmed. Next, position information and parameters necessary for controlling the localization processing unit 41 or the addition unit 42 are set. Communication is started after parameter setting is completed. Immediately after the utterance start signal is received, the line is connected to the localization processing unit 41 or the localization processing unit 41 is switched to the addition unit 42 when the utterance starts. Immediately after receiving the utterance end signal, the line connection to the localization processing unit 41 is disconnected or the addition unit 42 is switched to the localization processing unit 41 when the utterance ends. During communication, the utterance start and utterance end signals are placed on standby until the call ends. As an example, localization priority p is introduced as a control parameter for determining a line for transmitting a voice signal to be localized or added, and prio is introduced for a line from each point, and p is obtained by performing the following operation. The value of is set to indicate the number of lines currently connected to the adder, and the prio is set to indicate the time sequence in which the lines connected to the adder are started to be connected.

【００２０】図８は他地点における発話開始に伴う回線
の接続或は切替手順を示す。先ず、発話開始信号に対応
する地点から音声信号を伝送する回線を定位処理部４１
に接続する。既に定位処理部４１にその他の地点を結ぶ
回線が接続されていない場合は、そのまま次の動作へ進
み、発話開始或は終了信号の受信待ちにする。既に定位
処理部４１に他の相手の回線が接続されている場合は、
ｐの値に１を加算し、既に定位処理加算部４に接続され
た回線を加算部４２に接続して、新たに加算部４２に接
続された回線に対応するｐｒｉｏの値にｐの値を与え
る。その後、次の動作へ進む。FIG. 8 shows a connection or switching procedure of a line at the start of speech at another point. First, the localization processing unit 41 sets a line for transmitting a voice signal from a point corresponding to the utterance start signal.
Connect to. When the line connecting the other points is not already connected to the localization processing unit 41, the process proceeds to the next operation as it is, and waits for reception of the utterance start or end signal. When the other party's line is already connected to the localization processing unit 41,
1 is added to the value of p, the line already connected to the localization processing addition unit 4 is connected to the addition unit 42, and the value of p is added to the value of prio corresponding to the line newly connected to the addition unit 42. give. After that, the operation proceeds to the next operation.

【００２１】図９は発話終了に伴う回線の切断或は切替
手順を示す。先ず、発話終了信号を受信した地点に対応
する回線と定位処理部４１または加算部４２との間の接
続を切断する。切断された回線が定位処理部４１に接続
され対応するｐｒｉｏの値が０であった場合は、そのま
ま次の動作へ進む。ｐの値が０よりも大きい場合は、ｐ
ｒｉｏ＝１となる回線を定位処理部へ接続する。未だ加
算部４２へ接続されている全ての回線に対応するｐｒｉ
ｏの値から１を減ずる。更に、現在のｐの値から１を減
ずる。切断された回線が加算部４２に接続されていた場
合は、切断された相手から対応するｐｒｉｏの値よりも
大きいｐｒｉｏの値（但し、未だ加算部に接続されてい
る回線に対応するもののうち）から１を減ずる。更に、
ｐの値に１を減算する。FIG. 9 shows a procedure for disconnecting or switching the line when the utterance ends. First, the connection between the line corresponding to the point where the utterance end signal is received and the localization processing unit 41 or the addition unit 42 is disconnected. When the disconnected line is connected to the localization processing unit 41 and the value of the corresponding prio is 0, the process directly proceeds to the next operation. If the value of p is greater than 0, p
The line with rio = 1 is connected to the localization processing unit. Pri corresponding to all the lines still connected to the adding unit 42
Subtract 1 from the value of o. Further, 1 is subtracted from the current value of p. If the disconnected line is connected to the addition unit 42, the prio value larger than the corresponding prio value from the disconnected partner (however, the one corresponding to the line still connected to the addition unit) Subtract 1 from. Furthermore,
Subtract 1 from the value of p.

【００２２】図１０は図４に示される発話状況および信
号処理状況における、各相手に対応する回線と定位処理
部４１および加算部４２への接続状況、各相手に対応す
るｐｒｉｏの値、ｐの値を示す。図１０（ａ）はＡ地点
の音声信号を伝送する回線が定位処理部４１に接続され
る状態を示し、Ａ地点から伝送される音声信号に定位処
理が４１施される。図１０（ｂ）はＡ地点の音声信号を
伝送する回線が加算部４２に、Ｂ地点から伝送される回
線が定位処理部４１に接続されている状態を示し、Ｂ地
点から伝送される音声信号は定位処理されると共に、Ａ
地点から伝送される音声信号は定位処理されたＢ地点か
ら伝送された音声信号に加算される。FIG. 10 shows the lines corresponding to the respective parties and the connection statuses to the localization processing section 41 and the addition section 42, the prio values corresponding to the respective parties, and p in the utterance and signal processing situations shown in FIG. Indicates a value. FIG. 10A shows a state in which the line for transmitting the audio signal at the point A is connected to the localization processing unit 41, and the localization processing 41 is performed on the audio signal transmitted from the point A. FIG. 10B shows a state in which the line transmitting the audio signal at the point A is connected to the adding unit 42 and the line transmitting from the point B is connected to the localization processing unit 41, and the audio signal transmitted from the point B is shown. Is localized and A
The audio signal transmitted from the point is added to the audio signal transmitted from the point B, which has been subjected to the localization processing.

【００２３】[0023]

【発明の効果】以上の通り、この発明は、複数ｍ地点間
の音声通信において、一例として図５および図６に示さ
れる様に少ない遅延時間で定位方向数ｎの音像定位信号
処理を行なうｎ組の定位処理部を並列に設置して音声回
線と回線交換する回線決定部を具備し、離れた地点から
送信される複数の音声に相異なる方向へ定位感を最大ｎ
地点間までの通信において付与することができる。As described above, according to the present invention, in the voice communication between a plurality of m points, the sound image localization signal processing of the localization direction number n is performed with a short delay time as shown in FIGS. 5 and 6, for example. It has a line determination unit that installs a set of localization processing units in parallel and exchanges a line with a voice line, and provides a maximum of n localization feelings in different directions for a plurality of voices transmitted from distant points.
It can be given in communication between points.

【００２４】そして、同時に発話されている相手数ｋが
定位方向数ｎを上回るときは、直近のｍ地点において発
話された音声に対して定位感を優先的に付与する定位処
理を施し、その他の（ｋ−ｎ）個の相手については発話
された音声を単に加算することにより定位処理部の数を
削減することができる。また、定位処理が施されている
発話が終了したとき、最初に加算開始された音声信号に
ついて優先して定位処理を再開することにより、受話者
はその音声の特徴を音像定位によって再び容易に捉える
ことができ、話者同定を回復することができる。When the number k of people who are uttering at the same time exceeds the number n of localization directions, localization processing is performed to give a feeling of localization preferentially to the speech uttered at the nearest m point, and other The number of localization processing units can be reduced by simply adding the uttered voices to the (k−n) opponents. Also, when the utterance that has undergone localization processing ends, by prioritizing the localization processing for the voice signal that was first started to be added, the listener easily captures the characteristics of the voice again by sound image localization. And the speaker identification can be restored.

[Brief description of drawings]

【図１】伝達関数の畳み込みによる音像定位技術の概念
を示す図。FIG. 1 is a diagram showing a concept of a sound image localization technique by convolution of a transfer function.

【図２】レベル差による音像定位技術の概念を示す図。FIG. 2 is a diagram showing a concept of a sound image localization technique based on a level difference.

【図３】時間差による音像定位技術の概念を示す図。FIG. 3 is a diagram showing a concept of a sound image localization technique based on a time difference.

【図４】伝送される音声信号に施すべき処理を示す図。FIG. 4 is a diagram showing processing to be performed on a transmitted audio signal.

【図５】実施例を説明する図。FIG. 5 is a diagram illustrating an example.

【図６】定位処理加算部を説明する図。FIG. 6 is a diagram illustrating a localization processing addition unit.

【図７】回線決定部の動作を説明するフローチャート。FIG. 7 is a flowchart illustrating the operation of the line determining unit.

【図８】図７における発話開始に伴う動作を説明する
図。FIG. 8 is a diagram for explaining an operation accompanying the start of speech in FIG. 7.

【図９】図７における発話終了に伴う動作を説明する
図。FIG. 9 is a diagram for explaining an operation accompanying the end of utterance in FIG. 7.

【図１０】定位処理部および加算部における回線接続状
況、ｐ、ｐｒｉｏの値を示す図。FIG. 10 is a diagram showing line connection statuses and p and prio values in a localization processing unit and an addition unit.

[Explanation of symbols]

１マイクロホン２発話情報検出部３回線決定部４定位処理加算部４１定位処理部４２加算部５イヤホン DESCRIPTION OF SYMBOLS 1 Microphone 2 Speech information detection unit 3 Line determination unit 4 Localization processing addition unit 41 Localization processing unit 42 Addition unit 5 Earphone

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０４Ｓ 1/00 Ｋ ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification code Office reference number FI technical display location H04S 1/00 K

Claims

[Claims]

1. A transmitter circuit device for recording a voice signal and transmitting it to another point, a receiver circuit device for reproducing a voice signal transmitted from another point, and a plurality of different m points physically or logically. In a communication conference apparatus including a transmission line having a transmission channel to be connected, a localization processing unit that performs a sound image localization process on a transmitted audio signal and an addition that adds a voice signal by a communication partner who is speaking to the sound image localization processed voice signal. And a line determining unit for switching the connection between the transmission line for transmitting the voice signal of the communication partner and the n sets of the localization processing unit and the adding unit. Characteristic teleconference equipment.

2. The communication conferencing apparatus according to claim 1, wherein the line determining unit is controlled based on utterance information indicating utterance start and utterance end of the communication partner. .

3. The communication conference apparatus according to claim 2, wherein the transmission circuit device includes an utterance information detection unit, and transmits the detected utterance information to the reception circuit device via a transmission line. Teleconference equipment to be.

4. The communication conference apparatus according to any one of claims 1 to 3, wherein the sound image localization processing is prioritized to the last voice signal of the voice signals being uttered which is started to be uttered. The communication conferencing apparatus is characterized in that the other audio signals are added to the audio signals subjected to the sound image localization processing.

5. The communication conferencing apparatus according to claim 4, wherein the sound image localization process is restarted by giving priority to the audio signal transmitted from the point where the audio signal first started to be added is uttered. Teleconference equipment to do.