JPH02230898A - Voice reproduction system - Google Patents

Voice reproduction system

Info

Publication number
JPH02230898A
JPH02230898A JP5147589A JP5147589A JPH02230898A JP H02230898 A JPH02230898 A JP H02230898A JP 5147589 A JP5147589 A JP 5147589A JP 5147589 A JP5147589 A JP 5147589A JP H02230898 A JPH02230898 A JP H02230898A
Authority
JP
Japan
Prior art keywords
sound
speaker
voice
frequency band
masking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP5147589A
Other languages
Japanese (ja)
Other versions
JP2523366B2 (en
Inventor
Naofumi Inmaki
印牧 直文
Toshiharu Tanabe
田邊 敏晴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP5147589A priority Critical patent/JP2523366B2/en
Publication of JPH02230898A publication Critical patent/JPH02230898A/en
Application granted granted Critical
Publication of JP2523366B2 publication Critical patent/JP2523366B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Abstract

PURPOSE:To allow the content of conversation to be listened by concerned persons only by synthesizing a voice at a frequency band other than the voice effecting the understanding of the content of conversation with a masking voice and reproducing the synthesized voice through a nondirectional speaker. CONSTITUTION:The frequency band is split (split into A and B) at a frequency split circuit 11 to an audio signal of a monaural sound X(=A+B) transferred from an input terminal 10. For example, the voice A of a noticed characteristic frequency region is reproduced by a directivity speaker 12 and the voice B of the other frequency is transferred to a making sound addition section 25. The masking sound addition section 25 adds the 'masking sound C with respect to A' to the voice B and the synthesized sound (B+C) is reproduced by the nondirectional speaker 13. Thus, the directivity speaker 12 acts it that the content of conversation is not listened to a person not concerned resident therearound (not understood) but listened to a person concerned 15.

Description

【発明の詳細な説明】 「産業上の利用分野」 この発明は例えば通信会議システムに用いられる音声再
生方式に関する. 「従来の技術」 音声会議、テレビ会議等の通信会議システムを実現する
際には、会議の性格上、再生装置を長時間使用すること
が多く、受話器やイヤホンを用いると受聴者に対して重
圧感、圧迫惑を生じさせるという第1の問題が発生する
. 他方、受話器やイヤホンを用いず拡声スピ一カを使う方
式が考えられるが、この場合は受話器やイヤホンでは問
題視されなかった欠点が生じる。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] This invention relates to an audio reproduction method used, for example, in a teleconferencing system. ``Prior art'' When realizing a communication conference system such as an audio conference or a video conference, playback devices are often used for long periods of time due to the nature of the conference, and using a telephone receiver or earphones places a heavy burden on listeners. The first problem arises, which is the feeling of pressure and confusion. On the other hand, it is possible to use a loudspeaker instead of a telephone receiver or earphones, but in this case, there are drawbacks that are not seen as problems with telephone receivers or earphones.

即ち、会議とは無関係な人間(非当事者)が周囲にいる
環境の中で、通信会議を行う場合、再生される会議内容
が当該非当事者に受聰されてしまうという第2の問題点
が生じる. この点に対処するために、拡声スピーカに狭指向性を持
たせる再生方式が考えられる.狭指向性によって小さく
させた受聴範囲内に当事者だけが位置すると、会議とは
無関係な非当事者の受聴が困難になるということを利用
している.ところが、この従来方式では音の波長が大き
い低音域の指向性については実現が難しく、無理に実現
するとスピーカ口径が大きくなり装置の大型化という問
題が生じる. この問題を解決するために、会話内容の了解性に影響を
与える周波数帯域(例えばホルマント周波数帯域)に対
して狭指向性スピーカで再生し、それ以外の周波数帯域
に対しては無指向性スピーカで再生する方式が提案され
ている.この発明はこの方式を改良し、周囲の当該非当
事者への会話内容の了解性を更に低減させる、即ち、周
囲への会話内容のもれを防止するものである.この発明
の目的は、上記従来の欠点を除去するため、会話内容の
了解性に影響を与える周波数帯域に対して狭指向性スピ
ーカで再生し、マスキング効果を活用して周囲の当該非
当事者への会話内容の了解性を低減させるためのマスキ
ング音を無指向性スピ一カで再生する音声再生方式を提
供することにある。
In other words, when a teleconference is held in an environment where there are people (non-parties) who are unrelated to the conference, the second problem arises that the content of the conference being played back may be heard by the non-parties. .. To address this issue, a playback method can be considered in which the loudspeaker has narrow directivity. This takes advantage of the fact that if only the parties involved are located within the listening range reduced by the narrow directivity, it becomes difficult for non-parties unrelated to the conference to hear the conference. However, with this conventional method, it is difficult to achieve directivity in the bass range, where the wavelength of sound is large, and if this is achieved forcibly, the speaker diameter becomes large, resulting in an increase in the size of the device. To solve this problem, the frequency bands that affect the intelligibility of conversation content (for example, the formant frequency band) are played back using narrow directional speakers, and the other frequency bands are played back using omnidirectional speakers. A method for reproducing the data has been proposed. This invention improves this method to further reduce the intelligibility of the conversation to surrounding non-parties, that is, to prevent the conversation from leaking to the surroundings. The purpose of the present invention is to eliminate the above-mentioned conventional drawbacks by using a narrow directional speaker to reproduce the frequency band that affects the intelligibility of conversation content, and by utilizing the masking effect to make it easier for surrounding non-parties to hear the sound. An object of the present invention is to provide an audio reproduction method for reproducing masking sound for reducing the intelligibility of conversation contents using an omnidirectional speaker.

「課題を解決するための手段」 この発明は、会話内容の了解性に影響する度合いによっ
て周波数帯域を2分割し、その了解性に影響する周波数
帯域(指向性周波数帯域)の音声を狭指向性スピーカで
再生し、前記了解性に影響を与えない周波数帯域(無指
向性周波数帯域)の音声に、前記狭指向性スピーカより
再生される音声をマスキングするマスキング音を加えて
合成し無指向性スピーカで再生することを最も主要な特
徴とする.会話内容の了解性に影響する周波数帯域とし
ては、日本語の5母音を特徴づける成分音であるホルマ
ントの周波数帯域や個人を判別できる周波数帯域等があ
る. 「実施例」 第1図は、この発明の特徴を示す第1のシステム例であ
る.入力端子10から転送されてくるモノラル音X (
=A+B)のオーディオ信号に対して周波数分割回路1
lで周波数の帯域分割(AとBとの分割)を行い、例え
ば着目するホルマント周波数(その付近の周波数を含む
)の音声Aを指向性スピーカl2により再生し、それ以
外の周波数の音声Bをマスキング音付加部25に転送す
る。
"Means for Solving the Problem" This invention divides the frequency band into two according to the degree to which it affects the intelligibility of conversation content, and narrowly directs the sound in the frequency band that affects the intelligibility (directional frequency band). A masking sound that masks the sound played from the narrow directional speaker is added to the sound in a frequency band that does not affect the intelligibility (omnidirectional frequency band) that is played back by the speaker, and synthesized. The main feature is that it can be played back. Frequency bands that affect the intelligibility of conversation content include the frequency band of formants, which are component sounds that characterize the five vowels of Japanese, and the frequency band that allows identification of individuals. ``Embodiment'' FIG. 1 is a first example of a system showing the features of this invention. Monaural sound X transferred from input terminal 10 (
=A+B) frequency division circuit 1 for the audio signal
Frequency band division (dividing into A and B) is performed using l, and for example, audio A with the formant frequency of interest (including frequencies around it) is reproduced by directional speaker l2, and audio B with other frequencies is reproduced. It is transferred to the masking sound adding section 25.

マスキング音付加部25は、マスキング音用入力端子1
5から転送されてくる“Aに対するマスキング音C”を
音声Bに付加して、この合成音(B十C)を無指向性.
スピーカ13より再生する。マスキング音Cは例えば音
声Aと類似した他の音声である。
The masking sound addition section 25 has a masking sound input terminal 1
The "masking sound C for A" transferred from 5 is added to sound B, and this synthesized sound (B + C) is omnidirectional.
It is played back from the speaker 13. Masking sound C is another sound similar to sound A, for example.

指向性スピーカ12は、会話内容が周囲の非当事者14
に聞えず(理解されずに)、当事者15に聞えるように
する役割を有し、無指向性スピーカl3は、音の拡がり
怒、厚み惑等を出す役割を有する.即ち、当事者15は
スピーカ12とスピーカl3から再生される音声X+C
 (=A+B+C)を聞き、非当事者14はスピーカl
3から再生される音声B十〇を聞くことになり、マスキ
ング音Cの音量を適宜制御することによって、非当事者
l4への当事者15の会話内容の了解性を低滅させるこ
とが出来る. 第2図はこの発明の特徴を示す第2のシステム例である
.入力端子10から転送されてくるステレオ音のオーデ
ィオ信号(L信号.R信号)に対して、L信号とR信号
は、それぞれ第1図で述べた再生動作に従って、例えば
ホルマント周波数帯域の音声とそれ以外の周波数帯域の
音声とに分割され、前者の音声はL信号用指向性スピー
カ12.LとR信号用指向性スピーカ12Rとがらそれ
ぞれ再生され、後者の音声(L信号,R信号)は、マス
キング音付加部25へ転送される。マスキング音付加部
は、マスキング音用入力端子15がら転送されてくるマ
スキングするステレオ音と周波数分割回路11から転送
されるステレオ音とを対応するし信号、R信号毎に合成
し、この合成音をそれぞれL信号用無指向性スピーカ1
3LとR信号用無指向性スピーカ13Rとから再生する
。当事者l5は会話内容を音像定位して聞くことができ
る. 従来の技術では、全周波数帯域に指向性を持たせたため
、スピーカが大型化する傾向にあったが、この発明では
、会話内容の了解性に影響を与える周波数帯域に対して
指向性を高め、更に、マスキング効果を活用して、了解
性の効果を高める点が従来との大きな相違点である. 第3図はこの発明の実施例の構成を示すブロック図であ
る.制御部24の指令により、周波数帯域設定部23は
、男声、女声、会話の効果音等を考慮して予め定められ
た指向性帯域設定データと無指向性帯域設定データとを
それぞれ指向性帯域抽出再生部21と無指向性帯域抽出
再生部22とに転送する.指向性帯域抽出再生部2lは
、その指向性帯域設定データに基づき、初期設定を行い
、その設定完了後、その完了通知を周波数帯域設定部2
3に転送する.同時に、無指向性帯域抽出再生部22は
、前記無指向性帯域設定データに基づき、初期設定を行
い、その設定完了後、その完了通知を周波数帯域設定部
23に転送する.周波数帯域設定部23は指向性帯域抽
出再生部2lから受信する前記完了通知と無指向性帯域
抽出再生部22から受信する前記完了通知とを受けとっ
た後、起動開始指令を指向性帯域抽出再生部21と無指
向性帯域抽出再生部22とに通知する.その通知完了後
、指向性帯域抽出再生部21は、入力端子10から送ら
れてくるオーディオ信号に対して初期設定された音声の
帯域のみ抽出し、その音声を指向性スピーカ12を介し
て再生する.他方、無指向性帯域抽出再生部22は、初
期設定された音声のみ抽出し、その音声をマスキング音
付加部25に転送する.マスキング音付加部25は、無
指向性帯域抽出再生部22から転送されてくる音声に、
マスキング音用入力端子15から入力されるマスキング
音を加えて合成し、その合成音を無指向性スピーカ13
を介して再生する. 「発明の効果」 以上説明したように、この発明による音声再生方式によ
れば、会話内容の了解性に影響する例えばホルマント周
波数帯域(その付近の周波数を含む)の音声に対して、
指向性スピーカを介して再生し、それ以外の周波数帯域
の音声に対してはマスキング音を加えて合成し、その合
成音に対して無指向性スピーカを介して再生することが
ら、受聴者がハンドフリーとなる利点があるとともに、
マスキング効果によって会話内容が当事者だけに聞こえ
、周囲の非当事者には聞こえないという利点がある.更
に、人間の発声範囲の100Hz〜8000Hzの全周
波数に対して指向性を与えるのではなく、中域の300
Hz〜2000Hzのうちのいくつかの周波数帯域のみ
に指向性を与えることから、スピーカ(スピーカ口径)
の小型化が図れるとともに経済化が図れるという利点が
ある。また無指向性スピーカを用いることによって音の
拡がり感や厚み感を出すことが出来るという利点もある
The directional speaker 12 allows the conversation content to be transmitted to surrounding non-parties 14.
The non-directional speaker 13 has the role of making the sound audible to the person concerned 15 without being heard (understood) by others, and the omnidirectional speaker 13 has the role of making the sound appear wider, louder, and thicker. That is, the party 15 listens to the audio X+C played from the speaker 12 and the speaker l3.
(=A+B+C), the non-party 14 uses the speaker l
By controlling the volume of the masking sound C appropriately, the intelligibility of the conversation content of the party 15 to the non-party 14 can be reduced. Figure 2 is a second system example showing the features of this invention. Regarding the stereo audio signals (L signal and R signal) transferred from the input terminal 10, the L signal and R signal are respectively processed according to the playback operation described in FIG. The former sound is transmitted through the L signal directional speaker 12. The L and R signal directional speakers 12R are respectively reproduced, and the latter sounds (L signal, R signal) are transferred to the masking sound adding section 25. The masking sound adding section synthesizes the masking stereo sound transferred from the masking sound input terminal 15 and the stereo sound transferred from the frequency division circuit 11 for each corresponding R signal and R signal, and generates this synthesized sound. Omnidirectional speaker 1 for each L signal
3L and the R signal omnidirectional speaker 13R. Participant 15 can listen to the content of the conversation by localizing the sound image. In conventional technology, the speaker tends to become larger because the entire frequency band has directivity, but with this invention, the directivity is increased in the frequency band that affects the intelligibility of conversation content. Furthermore, a major difference from conventional methods is that it utilizes masking effects to increase the effect of intelligibility. FIG. 3 is a block diagram showing the configuration of an embodiment of this invention. In response to a command from the control unit 24, the frequency band setting unit 23 extracts directional band setting data and non-directional band setting data that are predetermined in consideration of male voices, female voices, conversation sound effects, etc. The data is transferred to the playback section 21 and the omnidirectional band extraction and playback section 22. The directional band extraction and reproduction unit 2l performs initial settings based on the directional band setting data, and after completing the setting, sends a completion notification to the frequency band setting unit 2.
Transfer to 3. At the same time, the omnidirectional band extraction and reproducing section 22 performs initial setting based on the omnidirectional band setting data, and after completing the setting, transfers a completion notification to the frequency band setting section 23. After receiving the completion notification received from the directional band extraction and playback unit 2l and the completion notification received from the omnidirectional band extraction and playback unit 22, the frequency band setting unit 23 issues an activation start command to the directional band extraction and playback unit. 21 and the omnidirectional band extraction and reproduction section 22. After the notification is completed, the directional band extraction and playback unit 21 extracts only the initially set audio band for the audio signal sent from the input terminal 10 and plays the audio via the directional speaker 12. .. On the other hand, the omnidirectional band extraction and playback unit 22 extracts only the initially set audio and transfers the audio to the masking sound addition unit 25. The masking sound addition unit 25 adds sound to the audio transferred from the omnidirectional band extraction and playback unit 22.
The masking sound input from the masking sound input terminal 15 is added and synthesized, and the synthesized sound is output to the omnidirectional speaker 13.
Play via . "Effects of the Invention" As explained above, according to the audio reproduction method according to the present invention, for example, the voice in the formant frequency band (including frequencies in the vicinity), which affects the intelligibility of conversation content, can be reproduced.
It is played back through a directional speaker, and the sound in other frequency bands is synthesized by adding masking sound, and the synthesized sound is played back through an omnidirectional speaker. It has the advantage of being free, and
The masking effect has the advantage that the conversation can only be heard by the person concerned, but not by those around them. Furthermore, instead of providing directivity for all frequencies in the human vocal range of 100Hz to 8000Hz,
Since it provides directivity only in some frequency bands from Hz to 2000Hz, the speaker (speaker aperture)
This has the advantage that it can be made smaller and more economical. Another advantage of using omnidirectional speakers is that you can create a sense of spaciousness and depth in the sound.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図はこの発明の特徴を示す第1のシステム例を示す
ブロック図、第2図はこの発明の特徴を示す第2のシス
テム例を示すブロック図、第3図はこの発明の実施例の
構成を示すブロック図である. ヤ 1 図 十 固 木 図
FIG. 1 is a block diagram showing a first system example showing the features of this invention, FIG. 2 is a block diagram showing a second system example showing features of this invention, and FIG. 3 is a block diagram showing a second system example showing features of this invention. It is a block diagram showing the configuration. 1 Figure 1: Hardwood diagram

Claims (1)

【特許請求の範囲】[Claims] (1)指向性を持つ指向性スピーカと無指向性を持つ無
指向性スピーカとを有する音声再生システムであって、 入力されたオーディオ信号に対して再生する際に指向性
を持たせる指向性周波数帯域と指向性を持たせない無指
向性周波数帯域とを設定する手段と、 上記入力オーディオ信号から上記指向性周波数帯域のオ
ーディオ信号を抽出して上記指向性スピーカを用いて再
生する手段と、 上記入力オーディオ信号から上記無指向性周波数帯域の
オーディオ信号を抽出する手段と、上記指向性周波数帯
域のオーディオ信号をマスキングするオーディオ信号を
上記無指向性周波数帯域のオーディオ信号に加えて、上
記無指向性スピーカを用いて再生する手段とを具備する
ことを特徴とする音声再生方式。
(1) An audio reproduction system that has a directional speaker with directivity and an omnidirectional speaker with omnidirectionality, which has a directional frequency that gives directionality when playing back an input audio signal. means for setting a band and an omnidirectional frequency band having no directivity; means for extracting an audio signal in the directional frequency band from the input audio signal and reproducing it using the directional speaker; means for extracting an audio signal in the omnidirectional frequency band from an input audio signal; and a means for adding an audio signal for masking the audio signal in the omnidirectional frequency band to the audio signal in the omnidirectional frequency band; 1. A sound reproduction method comprising means for reproducing using a speaker.
JP5147589A 1989-03-03 1989-03-03 Audio playback method Expired - Lifetime JP2523366B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5147589A JP2523366B2 (en) 1989-03-03 1989-03-03 Audio playback method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5147589A JP2523366B2 (en) 1989-03-03 1989-03-03 Audio playback method

Publications (2)

Publication Number Publication Date
JPH02230898A true JPH02230898A (en) 1990-09-13
JP2523366B2 JP2523366B2 (en) 1996-08-07

Family

ID=12887978

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5147589A Expired - Lifetime JP2523366B2 (en) 1989-03-03 1989-03-03 Audio playback method

Country Status (1)

Country Link
JP (1) JP2523366B2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005076661A1 (en) 2004-02-10 2005-08-18 Mitsubishi Denki Engineering Kabushiki Kaisha Mobile body with superdirectivity speaker
JP2007235864A (en) * 2006-03-03 2007-09-13 Glory Ltd Voice processor and voice processing method
WO2009022278A1 (en) 2007-08-14 2009-02-19 Koninklijke Philips Electronics N.V. An audio reproduction system comprising narrow and wide directivity loudspeakers
JP2012073411A (en) * 2010-09-29 2012-04-12 Yamaha Corp Masker sound output control device
JP2013123266A (en) * 2010-09-08 2013-06-20 Panasonic Corp Sound reproduction device

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005076661A1 (en) 2004-02-10 2005-08-18 Mitsubishi Denki Engineering Kabushiki Kaisha Mobile body with superdirectivity speaker
EP1715717A1 (en) * 2004-02-10 2006-10-25 HONDA MOTOR CO., Ltd. Mobile body with superdirectivity speaker
EP1715717A4 (en) * 2004-02-10 2009-04-08 Honda Motor Co Ltd Mobile body with superdirectivity speaker
JP2007235864A (en) * 2006-03-03 2007-09-13 Glory Ltd Voice processor and voice processing method
JP4680099B2 (en) * 2006-03-03 2011-05-11 グローリー株式会社 Audio processing apparatus and audio processing method
WO2009022278A1 (en) 2007-08-14 2009-02-19 Koninklijke Philips Electronics N.V. An audio reproduction system comprising narrow and wide directivity loudspeakers
JP2010537465A (en) * 2007-08-14 2010-12-02 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio reproduction system having a narrow directional loudspeaker and a wide directional loudspeaker
US8472652B2 (en) 2007-08-14 2013-06-25 Koninklijke Philips Electronics N.V. Audio reproduction system comprising narrow and wide directivity loudspeakers
JP2013123266A (en) * 2010-09-08 2013-06-20 Panasonic Corp Sound reproduction device
US9743186B2 (en) 2010-09-08 2017-08-22 Panasonic Intellectual Property Management Co., Ltd. Sound reproduction device
JP2012073411A (en) * 2010-09-29 2012-04-12 Yamaha Corp Masker sound output control device

Also Published As

Publication number Publication date
JP2523366B2 (en) 1996-08-07

Similar Documents

Publication Publication Date Title
US8509454B2 (en) Focusing on a portion of an audio scene for an audio signal
US8488820B2 (en) Spatial audio processing method, program product, electronic device and system
CN111464905A (en) Hearing enhancement method and system based on intelligent wearable device and wearable device
JPH03236691A (en) Audio circuit for television receiver
US20080004866A1 (en) Artificial Bandwidth Expansion Method For A Multichannel Signal
US20070109977A1 (en) Method and apparatus for improving listener differentiation of talkers during a conference call
JP2002511715A (en) Echo cancellation
WO2011104418A1 (en) Modifying spatial image of a plurality of audio signals
JP4992591B2 (en) Communication system and communication terminal
EA013670B1 (en) Method and apparatus for recording, transmitting and playing back sound events for communication applications
JP2645731B2 (en) Sound image localization reproduction method
JP2523367B2 (en) Audio playback method
JP2523366B2 (en) Audio playback method
JP2000059893A (en) Hearing aid device and its method
CN110891216A (en) Earphone device
JP2539672B2 (en) Audio playback method
JP2003518891A (en) Audio signal processing device
WO2021057214A1 (en) Sound field extension method, computer apparatus, and computer readable storage medium
WO2022043906A1 (en) Assistive listening system and method
JPH03114000A (en) Voice reproduction system
CN110856068A (en) Communication method of earphone device
WO2017211448A1 (en) Method for generating a two-channel signal from a single-channel signal of a sound source
JPH03141799A (en) Loudspeaker system
JP2003069968A (en) Method for realizing electronic conference with sense of reality
JP2023107364A (en) Communication device and communication method

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090531

Year of fee payment: 13

EXPY Cancellation because of completion of term