JP2000333297A

JP2000333297A - Stereophonic sound generator, method for generating stereophonic sound, and medium storing stereophonic sound

Info

Publication number: JP2000333297A
Application number: JP11134976A
Authority: JP
Inventors: Yuzo Okamoto; 勇三岡元; Narutoshi Uchida; 成俊内田; Haruo Hamada; 晴夫浜田
Original assignee: DIMAGIC Inc; SOUND VISION KK
Current assignee: DIMAGIC Inc; SOUND VISION KK
Priority date: 1999-05-14
Filing date: 1999-05-14
Publication date: 2000-11-30

Abstract

PROBLEM TO BE SOLVED: To obtain a reproduction sound with excellent presence that applies an optional localization to a specific sound source by combining a binaural source where sound localization is ensured from a recording state with a virtual source applying sound localization to processed conventional sound. SOLUTION: This stereophonic sound generator is provided with a 1st acoustic signal generating means 10 that generates a virtual source applying sound image localization processing to a recorded sound or a synthesis sound or the like, a 2nd acoustic signal generating means 20 that generates a binaural source that is recorded by a dummy head microphone and whose sound image is localized from the recording state, an adder 30 that sums 1st and 2nd acoustic signals, and left right loudspeakers 40L, 40R that reproduce a sound signal outputted from the adder 30.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ダミーヘッドマイ
クによって収録した第１の音源からの音響信号と、通常
のマイクで収録したり、音声合成によって生成した通常
音に対して音像の定位を加えた第２の音源からの音響信
号とを加算して、臨場感に優れた立体的な再生音を得る
ようにした立体音生成装置、立体音生成方法及び立体音
を記録した媒体に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method of adding a sound signal from a first sound source recorded by a dummy head microphone to a normal sound recorded by a normal microphone or a normal sound generated by voice synthesis. The present invention relates to a three-dimensional sound generation device, a three-dimensional sound generation method, and a medium on which a three-dimensional sound is recorded by adding a sound signal from a second sound source to obtain a three-dimensional reproduction sound with excellent realism. .

【０００２】[0002]

【従来の技術】最近、舞台や放送現場、映画、ＴＶゲー
ム機などの音を扱う分野においては、実際に収録した音
声やコンピュータによる合成音に対して種々の加工を施
すことが行われている。その一つに、左右２チャンネル
の信号系の音声信号にそれぞれ時間遅延および振幅調整
を施すことにより、各信号系間に時間差および振幅差を
発生させて、収録音や合成音などの音源とは異なった方
向感および距離感を有する再生音を実際のスピーカの得
るようにした音像定位装置が知られている。このような
音像定位装置によって得られる音像は、現実の音像では
なくコンピュータなどを利用して仮想的に生成されたも
のであり、また、実際の音源やスピーカとは異なる任意
の位置に音像が定位しているように感じさせることか
ら、一般にバーチャルソースとも呼ばれる。2. Description of the Related Art Recently, in the field of handling sounds such as a stage, a broadcast site, a movie, a TV game machine, etc., various processings are performed on actually recorded sounds and synthesized sounds by a computer. . One of them is to apply time delay and amplitude adjustment to the audio signals of the left and right two-channel signal system, respectively, to generate a time difference and amplitude difference between each signal system. 2. Description of the Related Art A sound image localization apparatus has been known in which a reproduced sound having a different sense of direction and a different sense of distance is obtained from an actual speaker. The sound image obtained by such a sound image localization apparatus is not a real sound image but is virtually generated using a computer or the like, and the sound image is localized at an arbitrary position different from an actual sound source or a speaker. It is generally called a virtual source because it makes you feel like you are doing it.

【０００３】この音像定位装置としては、各種のものが
知られているが、両耳における信号のレベル差と位相差
（時間差）によって特定位置（特定方向）に音源を感じ
させるものがある。たとえば、特開平２−２９８２００
号、特開平６−２５３３９９などが知られている。この
デジタル回路を使用した音像定位の方法は、音源からの
信号をＦＦＴ（Fast Fourier Transform）変換して周波
数軸上で処理し、左右の両チャンネル信号に周波数に依
存したレベル差と位相差とを与えて、音像の定位をデジ
タル的に制御するものである。Various types of sound image localization devices are known, and there are devices that allow a sound source to be sensed at a specific position (specific direction) by a signal level difference and a phase difference (time difference) between both ears. For example, JP-A-2-298200
And JP-A-6-253399 are known. The method of sound image localization using this digital circuit is to convert a signal from a sound source into an FFT (Fast Fourier Transform), process it on the frequency axis, and convert both left and right channel signals into a frequency-dependent level difference and a phase difference. In this case, the localization of the sound image is digitally controlled.

【０００４】一方、原音場における音圧をヘッドフォン
を利用して、厳密に生成する方法として、従来からバイ
ノーラルシステムが知られている。このシステムは、原
音場にダミーヘッド（人間の頭に似たものの左右の耳の
位置にマイクロフォンが設置されたもの）と呼ばれる特
殊なマイクを設置し、人が実際に聞いている状態で、両
耳に到達する音をそのまま２チャンネルで録音する方式
である。録音された音響信号は、記録・伝送などを経た
あと、ヘッドフォンを用いて受聴者の耳元で生成され
る。On the other hand, a binaural system has been conventionally known as a method for strictly generating sound pressure in an original sound field using headphones. In this system, a special microphone called a dummy head (similar to a human head but with microphones installed at the left and right ears) is installed in the original sound field, and both people are listening while people are actually listening. In this method, the sound reaching the ear is recorded as it is on two channels. The recorded acoustic signal is generated at the listener's ear using headphones after recording and transmission.

【０００５】この録音方式では、マイクの全周囲の音声
を録音し、これを収録時の音の定位そのままの状況を再
現することができるため、たいへん優れた原音場の定位
感や臨場感のある再生が特徴であるが、従来は、実質上
ヘッドホンによる再生に限られていた。しかしこの場
合、ヘッドホンの特性の混入や、装着時に違和感を伴う
という問題がある。[0005] In this recording method, the sound around the microphone can be recorded, and the sound can be reproduced as it is when the sound is localized. Therefore, there is a very good sense of localization and presence of the original sound field. Reproduction is a feature, but conventionally, reproduction was substantially limited to headphones. However, in this case, there is a problem that the characteristics of the headphones are mixed and that the headphone is uncomfortable.

【０００６】そのため、最近では、スピーカを用いて再
生するトランスオーラル（transaural）方式が提案され
ている。この方式ではヘッドフォンを使用した前者に比
べて再生装置が複雑になるが、周波数特性の乱れや、違
和感があるという問題を解消することができる。一方、
トランスオーラル方式は、原音場においての受聴者の両
耳に達する音響信号と等価な信号を、再生音場内のスピ
ーカにより受聴者の両耳に生成する方式である。つま
り、原音場での右耳( 右チャンネル入力) の信号は再生
音場での右耳にのみ、左チャンネルの信号は左耳にのみ
正確に到達することを理想とする。しかしスピーカを用
いることによって、片側の信号が逆の耳に到達してしま
うという現象（クロストーク）が生じてしまう。また、
スピーカの周波数特性や、スピーカ−耳間の伝達特性に
より、全体の周波数特性も乱されてしまう。これらのこ
とにより所期の目的を果たせなくなってしまう。よっ
て、トランスオーラル方式ではクロストークを押さえ、
周波数特性も平坦にする必要がある。これをDSP (Digit
al Signal Processor)によるディジタルフィルタを用い
て実現している。For this reason, recently, a transaural system for reproducing using a speaker has been proposed. In this system, the playback device is more complicated than in the former using headphones, but it is possible to solve the problem that the frequency characteristics are disturbed and the user feels uncomfortable. on the other hand,
The transaural method is a method in which a signal equivalent to an acoustic signal reaching the listener's both ears in the original sound field is generated in the listener's both ears by a speaker in the reproduction sound field. In other words, it is ideal that the signal of the right ear (right channel input) in the original sound field reaches the right ear only in the reproduction sound field, and the signal of the left channel only reaches the left ear in the reproduction sound field. However, the use of the speaker causes a phenomenon (crosstalk) that a signal on one side reaches the opposite ear. Also,
The overall frequency characteristics are also disturbed by the frequency characteristics of the speaker and the transfer characteristics between the speaker and the ear. As a result, the intended purpose cannot be fulfilled. Therefore, the trans-oral system suppresses crosstalk,
The frequency characteristics also need to be flat. This is called DSP (Digit
al Signal Processor).

【０００７】[0007]

【発明が解決しようとする課題】ところが、前記のよう
なバーチャルソースを利用した音像定位装置は、受聴者
の周囲の任意の位置に音像を定位させることができる反
面、個々のマイクで収録された音源や合成音ごとに音像
の定位を行うものであるから、臨場感を得るために音源
の種類が極めて多数に上る場合には、多数の音源のそれ
ぞれについて異なった音像の定位を行う必要があり、そ
のための作業が非常に困難であった。たとえば、空港や
駅などの都会の雑踏、コンサートホールやホテルの内
部、鳥の声や小川のせせらぎなど多くの音にあふれた自
然の中などにおいて発生するすべての音声を個別に録音
したり合成した後、更にそれぞれの音に対して独自に音
像の定位を行うことは、音の種類が極めた多数に上るた
めに、事実上は不可能であった。However, the sound image localization apparatus using a virtual source as described above can localize a sound image at an arbitrary position around a listener, but is recorded using individual microphones. Since the sound image is localized for each sound source or synthesized sound, if the number of sound sources is extremely large in order to obtain a sense of reality, it is necessary to perform different sound image localization for each of the large number of sound sources. The work for that was very difficult. For example, we recorded and synthesized all the sounds that occur in urban busy places such as airports and train stations, inside concert halls and hotels, in nature filled with many sounds such as birds and babbling streams. After that, it was practically impossible to localize the sound image independently for each sound because of the large number of types of sounds.

【０００８】一方、バイノーラルシステムは、現実の音
源をそのままの形で収録して再生するものであるから、
極めて多数の音源が存在する場合であっても、臨場感に
優れた再生音を得ることができるものの、その反面、収
録時から音像の定位が確定されているために、収録した
各音源ごとに音像の定位などの加工を行うことはでき
ず、所望の音源に対して任意の音の定位を与えることが
できないといった問題があった。その結果、テレビゲー
ム機やアニメ映画などのように、現実の音源を収録する
ことができず、収録音や合成音に対して音の定位を与え
る必要のある用途には、使用することができない欠点が
あった。[0008] On the other hand, the binaural system records and reproduces an actual sound source as it is,
Even if there are a very large number of sound sources, it is possible to obtain a playback sound with a great sense of realism, but on the other hand, since the sound image has been localized from the time of recording, Processing such as localization of a sound image cannot be performed, and there has been a problem that an arbitrary localization of sound cannot be given to a desired sound source. As a result, it cannot be used for applications where it is not possible to record a real sound source, such as a video game machine or an animated movie, and it is necessary to provide sound localization for the recorded or synthesized sound. There were drawbacks.

【０００９】本発明は、前記のような従来技術の問題点
を解決するために提案されたものであって、その目的
は、収録時から音の定位を確保したバイノーラルソース
と通常音を処理して音の定位を加えたバーチャルソース
とを組み合わせて、臨場感に優れしかも特定の音源に対
して任意の定位を加えた再生音を得ることができるよう
にした立体音生成装置、立体音生成方法及び立体音を記
録した媒体を提供することにある。The present invention has been proposed to solve the above-mentioned problems of the prior art. It is an object of the present invention to process a binaural source that secures sound localization from the time of recording and a normal sound. Sound generating device and method for generating a reproduced sound with an excellent locality and an arbitrary localization added to a specific sound source by combining with a virtual source to which sound localization has been added And a medium on which a three-dimensional sound is recorded.

【００１０】また、本発明の他の目的は、前記バイノー
ラルソースとバーチャルソースとを組み合わせた音源を
ステレオ・ダイポール方式（ＳＤ方式）によって再生す
ることによって、より実用的かつ効果的な再生音を得る
ことのできる立体音生成装置、立体音生成方法及び立体
音を記録した媒体を提供することにある。Another object of the present invention is to obtain a more practical and effective reproduced sound by reproducing a sound source obtained by combining the binaural source and the virtual source by a stereo dipole method (SD method). It is an object of the present invention to provide a three-dimensional sound generation device, a three-dimensional sound generation method, and a medium in which three-dimensional sound is recorded.

【００１１】[0011]

【課題を解決するための手段】前記の目的を達成するた
めに、請求項１の発明は、録音あるいは合成された第１
の音響信号に対して音の定位を加える音像定位演算手段
と、この音像定位演算手段から出力される第１の音響信
号に対して、第１の音響信号を左右のスピーカから出力
した場合に生じる各チャンネルのクロストークをキャン
セルするための処理を施す第１のクロストークキャンセ
ル演算手段と、ダミーヘッドマイクによって収録した第
２の音響信号に対して、第２の音響信号を左右のスピー
カから出力した場合に生じる各チャンネルのクロストー
クをキャンセルするための処理を施す第２のクロストー
クキャンセル演算手段と、これら第１および第２のクロ
ストークキャンセル演算手段からの音響信号を加算して
左右のチャンネルのスピーカから出力する手段とを備え
ていることを特徴とする。In order to achieve the above object, a first aspect of the present invention is to provide a first recorded or synthesized first
Sound image localization calculating means for adding sound localization to the first sound signal, and the first sound signal is output from the left and right speakers with respect to the first sound signal output from the sound image localization calculation means. The first and second crosstalk canceling means for performing processing for canceling the crosstalk of each channel, and the second sound signal are output from the left and right speakers with respect to the second sound signal recorded by the dummy head microphone. A second crosstalk canceling means for performing processing for canceling crosstalk of each channel generated in the case, and adding sound signals from the first and second crosstalk canceling means to add left and right channels. Means for outputting from a speaker.

【００１２】このような構成を有する請求項１の発明に
よれば、第１の音響信号生成手段によって得られた音の
定位を任意に与えられた第１の音響信号と、ダミーヘッ
ドマイクによって収録された第２の音響信号を加算する
ことにより、収録時から音の定位を確保したバイノーラ
ルソースと音の定位を任意に加えたバーチャルソースと
を組み合わせた臨場感に優れた立体音を得ることができ
る。According to the first aspect of the present invention having such a configuration, the first sound signal to which the localization of the sound obtained by the first sound signal generating means is arbitrarily given is recorded by the dummy head microphone. By adding the obtained second acoustic signals, it is possible to obtain a highly realistic three-dimensional sound combining a binaural source that has secured sound localization from the time of recording and a virtual source to which sound localization has been arbitrarily added. it can.

【００１３】請求項２の発明は、録音あるいは合成され
た第１の音響信号に対して音の定位を加える音像定位演
算手段と、この音像定位演算手段から出力される第１の
音響信号に対して、第１の音響信号を左右のスピーカか
ら出力した場合に生じる各チャンネルのクロストークを
キャンセルするための処理を施す第１のクロストークキ
ャンセル演算手段とから第１の音響信号生成手段を構成
し、ダミーヘッドマイクによって収録した第２の音響信
号に対して、第２の音響信号を左右のスピーカから出力
した場合に生じる各チャンネルのクロストークをキャン
セルするための処理を施す第２のクロストークキャンセ
ル演算手段によって第２の音響信号生成手段を構成し、
前記第１の音響信号生成手段を音の定位を行う音源の数
に応じて複数個並列に設け、これら複数の第１の音響信
号生成手段および第２の音響信号生成手段からの音響信
号を加算して左右のチャンネルのスピーカから出力する
手段とを備えていることを特徴とする。According to a second aspect of the present invention, there is provided a sound image localization calculating means for adding sound localization to a recorded or synthesized first sound signal, and a first sound signal output from the sound image localization calculating means. And a first crosstalk canceling operation unit for performing processing for canceling crosstalk of each channel generated when the first audio signal is output from the left and right speakers, and constitutes a first audio signal generation unit. Second crosstalk cancellation processing for canceling the crosstalk of each channel generated when the second audio signal is output from the left and right speakers to the second audio signal recorded by the dummy head microphone The second acoustic signal generating means is constituted by the arithmetic means,
A plurality of the first sound signal generating means are provided in parallel in accordance with the number of sound sources for localizing sounds, and the sound signals from the plurality of first sound signal generating means and the sound signals from the second sound signal generating means are added. Means for outputting from left and right channel speakers.

【００１４】このような構成を有する請求項２の発明に
よれば、第１の音響信号生成手段を複数個設けることに
より、異なった音源に対して異なった定位を与えること
が可能となり、第２の音響信号生成手段によって生成し
た音響信号を再生する音場内において、複数の音源をそ
れぞれ独立に移動させることが可能となる。According to the second aspect of the present invention having such a configuration, by providing a plurality of the first acoustic signal generating means, it becomes possible to give different localizations to different sound sources. It is possible to move a plurality of sound sources independently in a sound field for reproducing the acoustic signal generated by the acoustic signal generating means.

【００１５】請求項３の発明は、請求項１または請求項
２に記載の立体音生成装置において、前記左右のチャン
ネルのスピーカが受聴者のリスニングポイントから見
て、見開き角度が６度から２０度の間に配置されている
ことを特徴とする。According to a third aspect of the present invention, in the three-dimensional sound generating apparatus according to the first or second aspect, the speakers of the left and right channels have a spread angle of 6 to 20 degrees when viewed from the listener's listening point. It is characterized by being arranged between.

【００１６】このような構成を有する請求項３の発明に
よれば、音像定位演算手段および各クロストークキャン
セル演算手段と、近接した左右チャンネルのスピーカに
よって、ＳＤ方式を利用した本発明の立体音生成装置を
構成することが可能となる。According to the third aspect of the present invention having such a configuration, the sound image localization calculation means and the respective crosstalk cancellation calculation means, and the speakers of the adjacent left and right channels, and the stereophonic sound generation method of the present invention utilizing the SD system. The device can be configured.

【００１７】請求項４の発明は、前記請求項１の発明を
方法の観点から捕らえたものであって、録音あるいは合
成された第１の音響信号に対して音の定位を加え、この
第１の音響信号に対して、第１の音響信号を左右のスピ
ーカから出力した場合に生じる各チャンネルのクロスト
ークをキャンセルするための処理を施し、ダミーヘッド
マイクによって収録した第２の音響信号に対して、第２
の音響信号を左右のスピーカから出力した場合に生じる
各チャンネルのクロストークをキャンセルするための処
理を施し、これらの処理を施した第１および第２の音響
信号を加算して左右のチャンネルのスピーカから出力す
ることを特徴とする。According to a fourth aspect of the present invention, the first aspect of the present invention is captured from the viewpoint of a method, in which sound localization is added to a recorded or synthesized first sound signal, and the first sound signal is added to the first sound signal. Processing for canceling the crosstalk of each channel which occurs when the first sound signal is output from the left and right speakers, and the second sound signal recorded by the dummy head microphone is , Second
To cancel the crosstalk of each channel generated when the left and right speakers output the left and right speakers, add the first and second sound signals subjected to these processes, and add the left and right speakers. Output from

【００１８】請求項５の発明は、前記請求項１ないし請
求項４に記載の立体音生成装置及び立体音生成方法を実
現するための記録媒体に関するものであって、録音ある
いは合成された第１の音響信号に対して音の定位を加
え、この第１の音響信号に対して、第１の音響信号を左
右のスピーカから出力した場合に生じる各チャンネルの
クロストークをキャンセルするための処理を施し、ダミ
ーヘッドマイクによって収録した第２の音響信号に対し
て、第２の音響信号を左右のスピーカから出力した場合
に生じる各チャンネルのクロストークをキャンセルする
ための処理を施し、これらの処理を施した第１および第
２の音響信号を左右のチャンネルのスピーカから出力可
能に記録したことを特徴とする。According to a fifth aspect of the present invention, there is provided a recording medium for realizing the three-dimensional sound generating apparatus and the three-dimensional sound generating method according to any one of the first to fourth aspects. The sound localization is added to the sound signal of the first sound signal, and the first sound signal is subjected to processing for canceling crosstalk of each channel generated when the first sound signal is output from the left and right speakers. The second audio signal recorded by the dummy head microphone is subjected to a process for canceling crosstalk of each channel that occurs when the second audio signal is output from the left and right speakers, and these processes are performed. The recorded first and second acoustic signals are recorded so as to be output from left and right channel speakers.

【００１９】[0019]

【発明の実施の形態】以下、本発明の実施形態の１つを
図面に従って具体的に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be specifically described below with reference to the drawings.

【００２０】（１）実施形態の構成（１−１）全体構成本実施形態の立体音生成装置は、図１に示すとおり、収
録音や合成音などの原音に対して音像の定位処理を施し
たバーチャルソースを生成する第１の音響信号生成手段
１０と、ダミーヘッドマイクロフォンによって収録され
収録時から音像が定位されたバイノーラルソースを生成
する第２の音響信号生成手段２０と、これら第１および
第２の音響信号生成手段によって生成された各音響信号
を加算する加算器３０と、この加算器３０から出力され
た音響信号を再生する左右チャンネルのスピーカ４０
Ｌ，４０Ｒとを備えている。(1) Configuration of the Embodiment (1-1) Overall Configuration As shown in FIG. 1, the three-dimensional sound generator of this embodiment performs localization processing of a sound image on original sounds such as recorded sounds and synthesized sounds. A first sound signal generating means 10 for generating a virtual source, a second sound signal generating means 20 for generating a binaural source recorded by a dummy head microphone and having a sound image localized from the time of recording, and a first and a second signal. Adder 30 for adding the respective sound signals generated by the two sound signal generating means, and left and right channel speakers 40 for reproducing the sound signals output from the adder 30
L, 40R.

【００２１】（１−２）第１の音響信号生成手段１０バーチャルソースを生成する第１の音響信号生成手段１
０は、マイクロフォンによって収録した楽音や、コンピ
ュータによって合成した楽音などの原音生成手段１１に
接続され、この原音生成手段１１から出力された音響信
号を受聴者の周囲の任意の方向に定位させる音像定位演
算手段１２と、クロストークキャンセル処理を施すクロ
ストークキャンセル演算手段１４の２段構成からなる。(1-2) First acoustic signal generating means 10 First acoustic signal generating means 1 for generating a virtual source
Reference numeral 0 denotes a sound image localization which is connected to original sound generating means 11 such as a musical sound recorded by a microphone or a musical sound synthesized by a computer, and localizes an acoustic signal output from the original sound generating means 11 in an arbitrary direction around a listener. It has a two-stage configuration of a calculating means 12 and a crosstalk canceling calculating means 14 for performing a crosstalk canceling process.

【００２２】（１−３）音像定位演算手段１２音源からの音波は、受聴者のいる部屋や空間等の場の伝
達系と受聴者の頭部、耳介、肩等の反射、回折、共振に
よる伝達系の作用を受けて、受聴者の両耳（鼓膜）に至
る。ここでは、これらの伝達系の伝達関数をまとめて頭
部伝達関数（ Head Related Transfer Functions）と称
する。音像定位演算手段１２は、受聴者と音像定位させ
んとする音源の方向、受聴者と各々２つのスピーカ４０
Ｌ，４０Ｒの方向によって決まる頭部伝達関数を使っ
て、入力された信号を処理し、受聴者の周囲の任意の位
置に音像を定位させるものである。(1-3) Sound image localization calculation means 12 The sound wave from the sound source is transmitted to a transmission system of a field such as a room or space where the listener is located, and reflected, diffracted, and resonated by the head, auricles, shoulders, etc. of the listener. To the ears (eardrum) of the listener. Here, the transfer functions of these transfer systems are collectively referred to as Head Related Transfer Functions. The sound image localization calculation means 12 includes: a listener and a direction of a sound source to be sound image localized;
The input signal is processed by using a head-related transfer function determined by the directions of L and 40R, and a sound image is localized at an arbitrary position around the listener.

【００２３】以下、この音像定位演算手段１２の原理を
図４により説明する。図４は、受聴者が両耳で音源０の
音響信号０を聞いている状況と、ヘッドホンＰＬ，ＰＲ
の音響信号ＰＬ、ＰＲを聞いている状況を表わす。この
状況は、前述したクロストークを考慮する必要がない状
況を示している。図４において、ＨＬは音源０から受聴
者の左側の耳への頭部伝達関数、ＨＲは同じく右側の耳
への頭部伝達関数である。ＨＨはヘッドホンＰＬ，ＰＲ
から耳への音響伝達関数（ヘッドホンのスピーカを含
む）、ＰＬ，ＰＲはヘッドホンからの音響信号（音圧レ
ベル）、ＥＬ，ＥＲは受聴者の外耳道入口での音響信号
である。Hereinafter, the principle of the sound image localization calculating means 12 will be described with reference to FIG. FIG. 4 shows a situation where the listener is listening to the sound signal 0 of the sound source 0 with both ears and the headphones PL and PR.
Represents a situation in which the user is listening to the acoustic signals PL and PR of. This situation indicates a situation where it is not necessary to consider the above-mentioned crosstalk. In FIG. 4, HL is the head-related transfer function from the sound source 0 to the left ear of the listener, and HR is the head-related transfer function to the right ear. HH is headphone PL, PR
Transfer function (including a headphone speaker) from the head to the ear, PL and PR are sound signals (sound pressure levels) from the headphones, and EL and ER are sound signals at the entrance of the ear canal of the listener.

【００２４】ヘッドホンＰＬ，ＰＲからの音響信号が、
あたかも受聴者左側後方の音源０の位置で音響レベル０
の音が鳴っているように聞こえるようにするには、受聴
者の両耳（外耳道入口）の位置において、音源０からの
音響信号０とヘッドホンＰＬ，ＰＲからの信号ＰＬ，Ｐ
Ｒとが等しくなるようにすればよい。また、２つのスピ
ーカ４０Ｌ，４０ＲＳＬ，ＳＲからの音声信号が、あた
かも受聴者左側後方の音源０の位置で音圧レベル０の音
で鳴っているように聞こえるようにするには、受聴者の
両耳の位置（外耳道入口）において、音源ＶＳからの音
響信号Ｖ０と２つのスピーカ４０Ｌ，４０Ｒからの音響
信号ＳＬ，ＳＲとが等しくなるようにすればよい。The sound signals from the headphones PL and PR are
Sound level 0 at the position of sound source 0 behind the left side of the listener
In order to make it sound as if it is sounding, the sound signal 0 from the sound source 0 and the signals PL, P from the headphones PL, PR are placed at the positions of both ears (entrances of the ear canal) of the listener.
R should be equal. In order for the sound signals from the two speakers 40L, 40RSL, and SR to sound as if they are sounding at a sound pressure level 0 at the position of the sound source 0 on the rear left side of the listener, it is necessary to use both of the listeners. The sound signal V0 from the sound source VS and the sound signals SL and SR from the two speakers 40L and 40R may be equal at the position of the ear (the entrance of the ear canal).

【００２５】図４から判るように、音源ＶＳから音が出
ている場合、左右各々の外耳道入口付近での信号は、As can be seen from FIG. 4, when a sound is emitted from the sound source VS, the signals near the entrances of the ear canals on the left and right sides are:

【数１】ＥＬ＝ＨＬ×０ＥＲ＝ＨＲ×０となる。## EQU1 ## EL = HL × 0 ER = HR × 0

【００２６】また、ヘッドホンＰＬ，ＰＲから放音する
場合の左右各々の外耳道入口付近での音響信号ＥＬ，Ｅ
Ｒは、When sound is emitted from the headphones PL and PR, the acoustic signals EL and E near the ear canal entrances on the left and right sides, respectively.
R is

【数２】ＥＬ＝ＨＨ×ＰＬＥＲ＝ＨＨ×ＰＲとなる。数式１、数式２より音響信号ＥＬ，ＥＲを消去
し、ヘッドホンＰＬ，ＰＲからの音響信号ＰＬ，ＰＲを
求めると、## EQU2 ## EL = HH × PLER = HH × PR When the sound signals EL and ER are eliminated from Expressions 1 and 2, and the sound signals PL and PR from the headphones PL and PR are obtained,

【数３】ＰＬ＝（ＨＬ／ＨＨ）×Ｖ０ＰＲ＝（ＨＲ／ＨＨ）×Ｖ０となる。## EQU3 ## PL = (HL / HH) × V0 PR = (HR / HH) × V0

【００２７】このように、音像定位演算手段１２は、音
源からの音響信号Ｖ０に対して（ＨＬ／ＨＨ）あるいは
（ＨＲ／ＨＨ）の処理を施すフィルタによって構成さ
れ、この音像定位演算手段の出力信号ＰＬ，ＰＲが後段
のクロストークキャンセル演算手段１４に出力される。As described above, the sound image localization calculating means 12 is constituted by the filter for performing (HL / HH) or (HR / HH) processing on the acoustic signal V0 from the sound source. The signals PL and PR are output to the subsequent stage crosstalk canceling calculation means 14.

【００２８】さらに、前記のような音像定位演算手段１
２には、音像の定位置を変更するための調整器１３が設
けられている。すなわち、ある音源からの音が受聴者に
対して常に一定の位置から聞こえている場合には、その
音についての音像定位演算手段１２における定位処理は
一定でよいが、再生音場内で音源が移動するような定位
を行う場合には、前記調整器１３を使用して音像定位演
算手段１２におけるフィルタのパラメータを変更するこ
とで定位位置を可変とする。Further, the sound image localization calculating means 1 as described above
2 is provided with an adjuster 13 for changing the fixed position of the sound image. That is, when a sound from a certain sound source is always heard from a fixed position to the listener, the localization processing of the sound by the sound image localization calculating means 12 may be constant, but the sound source moves within the reproduction sound field. When performing such localization, the localization position is made variable by changing the parameters of the filter in the sound image localization calculation means 12 using the adjuster 13.

【００２９】（１−４）クロストークキャンセル演算手
段１４前記の音像定位演算手段１２によって得られた音響信号
は、ヘッドホンによって聴取することを前提として音像
の定位を行うものである。このヘッドホンは左右チャン
ネルの音響信号をそれぞれ直接聴取者の耳に送り込むも
のであるが、本発明のように、左右チャンネルの２つの
スピーカ４０Ｌ，４０Ｒを用いて再生する時には左のス
ピーカから右耳へ、また右のスピーカから左耳へクロス
トークするする信号があり、このクロストーク信号が現
実の音源からの音が持っている定位感に対して違和感を
与える。そこで、本実施形態では、このクロストーク信
号を打ち消す信号を演算して生成し、音像定位演算手段
１２の出力信号ＰＬ，ＰＲに付加するクロストークキャ
ンセル演算手段１４が設けられている。このクロストー
クキャンセル演算手段１４は、一例として、音像定位演
算手段１２からの左右チャンネルの出力信号ＰＬ，ＰＲ
に対して伝達関数の演算を行う第１フィルタ１４ＡＬ，
１４ＡＲ及び第２のフィルタ１４ＢＬ，１４ＢＲからな
る。(1-4) Crosstalk Canceling Calculation Means 14 The sound signal obtained by the sound image localization calculation means 12 is for localizing a sound image on the assumption that the sound signal is heard through headphones. These headphones send the left and right channel sound signals directly to the listener's ears. However, as in the present invention, when reproducing using the two left and right channel speakers 40L and 40R, the left speaker goes to the right ear. There is also a signal that causes crosstalk from the right speaker to the left ear, and this crosstalk signal gives a sense of incompatibility with the sense of localization of the sound from the real sound source. Therefore, in the present embodiment, there is provided a crosstalk canceling calculating means 14 for calculating and generating a signal for canceling the crosstalk signal and adding the signal to the output signals PL and PR of the sound image localization calculating means 12. As an example, the crosstalk cancellation calculating means 14 outputs the left and right channel output signals PL, PR from the sound image localization calculating means 12.
A first filter 14AL that calculates a transfer function for
14AR and the second filters 14BL and 14BR.

【００３０】このクロストークキャンセル演算手段１４
の原理を図５により説明する。この図５は、受聴者が両
耳で左後方にある音源ＶＳ（音圧レベルをＶ０とする）
の音を聞いている状況と、受聴者の前方正面に配置され
た２つのスピーカ４０Ｌ，４０Ｒ（音圧レベルをそれぞ
れＳＬ，ＳＲとする）の音を聞いている状況を表す。こ
の場合、受聴者の耳には、耳に近い方のスピーカと遠い
方のスピーカの両方からの音響信号が聞こえてくるクロ
ストークが発生している。ＨＥはスピーカ４０Ｌまたは
４０Ｒから受聴者の当該スピーカに近い側の耳への頭部
伝達関数（スピーカも含む）であり、ＨＸは同じく受聴
者の当該スピーカから遠い側の耳への頭部伝達関数（ス
ピーカも含む）である。This crosstalk cancel operation means 14
The principle will be described with reference to FIG. FIG. 5 shows the sound source VS (the sound pressure level is set to V0) in which the listener is behind and behind both ears.
And the situation where the user is listening to the sounds of two speakers 40L and 40R (the sound pressure levels are SL and SR, respectively) arranged in front of the listener. In this case, crosstalk occurs in the listener's ear in which sound signals from both the speaker closer to the ear and the speaker farther from the ear can be heard. HE is a head-related transfer function from the speaker 40L or 40R to the ear of the listener closer to the speaker (including the speaker), and HX is a head-related transfer function of the listener to the ear farther from the speaker. (Including speakers).

【００３１】受聴者まで等距離となるように受聴者の前
側に配置させた左スピーカ４０Ｌと右スピーカ４０Ｒか
ら放音する場合、伝達関数ＨＥとＨＸは互いに対称とな
って、左右各々の外耳道入り口付近での信号ＥＬ，ＥＲ
は、When sound is emitted from the left speaker 40L and the right speaker 40R disposed in front of the listener so as to be at the same distance to the listener, the transfer functions HE and HX are symmetrical with each other, and the entrances of the left and right ear canals are respectively. Signals EL and ER in the vicinity
Is

【数４】ＥＬ＝ＨＥ＊ＳＬ＋ＨＸ＊ＳＲＥＲ＝ＨＥ＊ＳＲ＋ＨＸ＊ＳＬとなる。数式１、数式４より信号ＥＬ，ＥＲを消去し、
スピーカ４０Ｌ，４０Ｒの信号ＳＬ，ＳＲを求めると、## EQU4 ## EL = HE * SL + HX * SR ER = HE * SR + HX * SL Eliminating the signals EL and ER from Equations 1 and 4,
When the signals SL and SR of the speakers 40L and 40R are obtained,

【数５】ＳＬ＝（ＨＬ＊ＨＥ＊Ｖ０−ＨＲ＊ＨＸ＊Ｖ
０）／（ＨＥ²−ＨＸ²）ＳＲ＝（ＨＲ＊ＨＥ＊Ｖ０−ＨＬ＊ＨＸ＊Ｖ０）／（Ｈ
Ｅ²−ＨＸ²）となる。## EQU5 ## SL = (HL * HE * V0-HR * HX * V
0) / (HE ² −HX ² ) SR = (HR * HE * V0−HL * HX * V0) / (H
E ² −HX ² ).

【００３２】前記数式３を数式５に代入して音源ＶＳに
係わる音響信号Ｖ０、伝達関数ＨＬ，ＨＲを消去する
と、Substituting Equation 3 into Equation 5 to eliminate the acoustic signal V0 and the transfer functions HL and HR related to the sound source VS,

【数６】ＳＬ＝（ＨＨ／ＨＥ）／（１−（ＨＸ／ＨＥ）
²）＊（ＰＬ−ＰＲ＊（ＨＸ／ＨＥ））ＳＲ＝（ＨＨ／ＨＥ）／（１−（ＨＸ／ＨＥ）²）＊
（ＰＲ−ＰＬ＊（ＨＸ／ＨＥ））となる。## EQU6 ## SL = (HH / HE) / (1- (HX / HE)
² ) * (PL-PR * (HX / HE)) SR = (HH / HE) / (1- (HX / HE) ² ) *
(PR-PL * (HX / HE)).

【００３３】本実施形態のクロストークキャンセル演算
手段１４は、前記数式６の演算を行う第１及び第２のフ
ィルタによって構成されている。なお、このクロストー
クキャンセル演算手段は、図示のようなフィルタ構成に
限定されるものではなく、左右チャンネルのスピーカに
よる音響信号の再生時に生じるクロストークを打ち消す
ものであれば、他の構成のものを適宜使用できる。The crosstalk canceling calculation means 14 of the present embodiment is constituted by first and second filters for performing the calculation of the equation (6). Note that the crosstalk canceling calculation means is not limited to the filter configuration as shown in the drawing, and any other configuration may be used as long as it cancels the crosstalk generated when the audio signals are reproduced by the left and right channel speakers. Can be used as appropriate.

【００３４】（１−５）第２の音響信号生成手段２０バイノーラルソースを生成する第２の音響信号生成手段
２０は、収録用の音場内に設置された左右チャンネルの
ダミーヘッドマイクロフォン２１によって収録された音
声を記録した収録装置２２の出力端子に接続されてい
る。この第２の音響信号生成手段２０は、収録装置２２
から入力された左右チャンネルの音響信号に対してクロ
ストークキャンセル処理を施すクロストークキャンセル
演算手段２３を備えている。このクロストークキャンセ
ル演算手段２３の構成は、前記第１の音響信号生成手段
に採用されたクロストークキャンセル演算手段と同様な
構成を有する。(1-5) Second Acoustic Signal Generating Unit 20 The second acoustic signal generating unit 20 for generating the binaural source is recorded by the right and left channel dummy head microphones 21 installed in the recording sound field. Connected to the output terminal of the recording device 22 that records the recorded audio. The second acoustic signal generating means 20 includes a recording device 22
And a crosstalk canceling operation unit 23 that performs a crosstalk canceling process on the left and right channel audio signals input from the. The configuration of the crosstalk cancellation calculation means 23 has the same configuration as that of the crosstalk cancellation calculation means employed in the first acoustic signal generation means.

【００３５】なお、前記の音像定位演算手段１２および
クロストークキャンセル演算手段１４，２３は、通常、
ＤＳＰ（Digital Signal Processor）あるいはマイクロ
コンピュータなどを用いて構成され、上述の各種伝達関
数の演算処理を行っている。そして、音像定位演算手段
１２を構成する上記各フィルタの伝達関数はＦＩＲフィ
ルタ（Finite Impulse Response Filter）を用いて実現
されている。これはＦＩＲフィルタが複雑な伝達関数を
含む種々の伝達関数を実現しやすいためである。また、
実現できる伝達関数に現実的な制限があるが、ＩＩＲフ
ィルタ（Infinite Impulse Response Filter）を使用す
ることも可能である。このＩＩＲフィルタはＦＩＲフィ
ルタに比べてハードウェア規模もソフトウェア規模も小
さくてよい利点がある。また、前記フィルタの構成を決
定する頭部伝達関数についても、受聴者のリスニングポ
イントに配置された剛球を使用した近似結果から算出し
たものを使用することができる。The sound image localization calculation means 12 and the crosstalk cancellation calculation means 14 and 23 are usually
It is configured using a DSP (Digital Signal Processor) or a microcomputer, etc., and performs arithmetic processing of the above-described various transfer functions. The transfer function of each of the filters constituting the sound image localization calculation means 12 is realized using an FIR filter (Finite Impulse Response Filter). This is because the FIR filter can easily realize various transfer functions including a complicated transfer function. Also,
Although there are practical limitations on the transfer functions that can be realized, it is also possible to use an IIR filter (Infinite Impulse Response Filter). This IIR filter has an advantage that the hardware scale and the software scale may be smaller than the FIR filter. Further, as the head-related transfer function for determining the configuration of the filter, a function calculated from an approximation result using a hard sphere arranged at the listener's listening point can be used.

【００３６】（１−６）左右チャンネルのスピーカ４０
Ｌ，４０Ｒ本実施形態では、左右チャンネルのスピーカ４０Ｌ，４
０Ｒをその中心間の距離が４５ｃｍかそれ以下となるよ
うに配置する。また、このスピーカ４０Ｌ，４０Ｒの見
開き角度θは受聴者に対して６度から２０度、好ましく
は約１０度に設定される。すなわち、通常のステレオシ
ステムにおいては、左右チャンネルのスピーカ４０Ｌ，
４０Ｒの見開き角度は６０度程度に設定されていたが、
本実施形態では、前記のクロストークキャンセル演算手
段とこのような見開き角度の小さく近接した左右チャン
ネルのスピーカ４０Ｌ，４０Ｒを使用することにより、
ＳＤ方式と呼ばれる音場再生システムを構成する。(1-6) Left and right channel speakers 40
L, 40R In the present embodiment, left and right channel speakers 40L, 4R
The OR is positioned so that the distance between its centers is 45 cm or less. The spread angle θ of the speakers 40L and 40R is set to 6 to 20 degrees, preferably about 10 degrees with respect to the listener. That is, in a normal stereo system, the left and right channel speakers 40L,
The spread angle of 40R was set to about 60 degrees,
In the present embodiment, by using the above-described crosstalk cancel calculation means and the speakers 40L and 40R of the adjacent left and right channels having such a small spread angle,
A sound field reproduction system called the SD system is configured.

【００３７】（２）実施形態の作用このような構成を有する本実施形態の装置の作用を図２
に従って説明する。(2) Operation of the Embodiment The operation of the apparatus of this embodiment having such a configuration is shown in FIG.
It will be described according to.

【００３８】まず、バーチャルソースを生成するための
第１の音響信号生成手段１０に与える音響信号を得るた
めに、音像の定位を行うことのできる音源からの音をス
テレオあるいはモノラルのマイクロフォンによって収録
したり、コンピュータを利用して希望する合成音を生成
する。例えば、図２のＡ−１に示すように、人物の台詞
や足音などの楽音を収録する。次に、収録された原音に
対して、音像定位演算手段１２を利用して音の定位を与
える。例えば、台詞や足音に対してある定位を与え、次
いで調整器１３を利用してその定位を経時的に変化させ
ることにより、図２のＢ−１に示すように、あたかも人
が台詞を言いながら歩いているような再生音が得られる
ようにする。なお、この例では、人の台詞と足音とは同
じ定位を与えればよいので、台詞と足音とをどうしに１
本のマイクで収録するか、それぞれ別に収録あるいは合
成した後両者の音響信号を加算してから音像の定位処理
を施す。このようにして、収録音や合成音に対して音像
の定位を与えた仮想音源からの音響信号については、ク
ロストークキャンセル演算手段１４によって左右チャン
ネルのスピーカ４０Ｌ，４０Ｒから再生した場合のクロ
ストークキャンセル処理を施し、加算器３０に出力す
る。First, in order to obtain an acoustic signal to be provided to the first acoustic signal generating means 10 for generating a virtual source, sound from a sound source capable of localizing a sound image is recorded by a stereo or monaural microphone. Or use a computer to generate the desired synthesized sound. For example, as shown at A-1 in FIG. Next, sound localization is given to the recorded original sound using the sound image localization calculating means 12. For example, by giving a certain localization to a dialogue or a footstep, and then using the adjuster 13 to change the localization over time, as shown in B-1 of FIG. Try to get the sound of walking. In this example, since it is sufficient to give the same localization to the speech and the footstep of the person, the speech and the footstep are set to 1
After recording with a book microphone or separately recording or synthesizing each, the sound signals of both are added, and then the sound image is localized. As described above, the sound signal from the virtual sound source in which the sound image is localized with respect to the recorded sound and the synthesized sound is canceled by the crosstalk canceling calculation unit 14 when the sound signals are reproduced from the left and right channel speakers 40L and 40R. Processing is performed, and the result is output to the adder 30.

【００３９】一方、バイノーラルソースを得るための第
２の音響信号生成手段２０には、ダミーヘッドマイクロ
フォンによって収録されたバイノーラルソースを記録装
置を解して供給する。このバイノーラルソースは、図４
のＡ−２に示すように、自然環境の中に配置されたダミ
ーヘッドマイクロフォン２１を使用して、周囲の小枝の
音、小鳥のさえずり、川の流れる音など、受聴者の全周
囲に存在する音を収録装置２２に収録する。次に、収録
された音をクロストークキャンセル演算手段２３に入力
して、収録時の音の定位はそのままで、左右チャンネル
のスピーカ４０Ｌ，４０Ｒによって再生する場合のクロ
ストークがキャンセルされるようにクロストークキャン
セル処理を施す。その結果、図２のＢ−２に示すよう
に、左右チャンネルのスピーカ４０Ｌ，４０Ｒによって
再生した場合にも、ダミーヘッドマイクロフォン２１に
よる収録時と同様の立体的な再生音を出力するための音
響信号が得られる。On the other hand, the binaural source recorded by the dummy head microphone is supplied to the second acoustic signal generating means 20 for obtaining the binaural source through the recording device. This binaural source is shown in FIG.
As shown in A-2, using the dummy head microphone 21 arranged in the natural environment, the sound exists around the listener, such as the sound of the surrounding twigs, the sound of birds, and the sound of the river flowing. The sound is recorded on the recording device 22. Next, the recorded sound is input to the crosstalk canceling calculation means 23, and the sound is localized at the time of recording, and the crosstalk is reproduced so that the crosstalk when the sound is reproduced by the left and right channel speakers 40L and 40R is canceled. Perform talk cancellation processing. As a result, as shown in B-2 of FIG. 2, even when the sound is reproduced by the left and right channel speakers 40L and 40R, an acoustic signal for outputting the same three-dimensional reproduced sound as that at the time of recording by the dummy head microphone 21 is obtained. Is obtained.

【００４０】このようにして得られた第１の音響信号生
成手段１０から出力されたバーチャルソースによる音響
信号と、第２の音響信号生成手段２０から出力されたバ
イノーラルソースによる音響信号とを加算器３０のおい
て加算して、加算された音響信号を左右チャンネルのス
ピーカ４０Ｌ，４０Ｒから出力する。すると、図２のＣ
に示すように、受聴者の耳には、川の流れ音や小枝のな
る音を背景とした自然の環境の中で人が台詞を言いなが
ら歩いているような音が聞こえる。The thus obtained sound signal from the virtual source output from the first sound signal generating means 10 and the sound signal from the binaural source output from the second sound signal generating means 20 are added to each other. At 30, the added audio signals are output from the left and right channel speakers 40 </ b> L and 40 </ b> R. Then, C in FIG.
As shown in Fig. 5, a listener can hear the sound of a person walking while speaking in a natural environment against the backdrop of river sounds and twigs.

【００４１】（３）実施形態の効果以上の通り、本実施形態によれば、多数の音源があって
そのすべてに音像の定位を行うことが不可能な場合であ
っても、ダミーヘッドマイクロフォンによって収録した
バイノーラルソースと、任意に音像の定位を行うことの
できるバーチャルソースとを組み合わせることにより、
臨場感に優れた立体音を得ることができる。(3) Effects of the Embodiment As described above, according to this embodiment, even if there are a large number of sound sources and it is impossible to localize the sound image to all of them, the dummy head microphone can be used. By combining the recorded binaural source with a virtual source that can arbitrarily localize the sound image,
It is possible to obtain a three-dimensional sound excellent in a sense of reality.

【００４２】また、本実施形態のように、ＳＤ方式を利
用すれば、左右にごく近接して配置した２つのスピーカ
４０Ｌ，４０Ｒだけで臨場感のある立体音を再生できる
ので、従来離した位置に２つ必要であったスピーカ４０
Ｌ，４０Ｒを実質的に１台分のスペースにまとめること
が可能となる。また、ＳＤ方式は、従来の３Ｄ音場再生
システムに比べ、３Ｄ効果が体感できる範囲（サービス
エリア）が広く、音像の定位（音が聞こえる位置）も安
定しているのが特徴である。さらに、左右のスピーカ４
０Ｌ，４０Ｒに距離を持たせる必要がないため、通常離
れた位置に配置されている２つのステレオスピーカ４０
Ｌ，４０Ｒを間隔を置かず中心に配置しても左右に拡が
りのある３Ｄ音場を再生することができ、例えば通常の
横に長いＣＤラジカセをＳＤ方式によりスピーカ４０
Ｌ，４０Ｒを中心に集約しても、通常のラジカセよりは
るかに立体的な音場を再生することができる。このよう
にＳＤ方式を使用した本実施形態では、コンパクトなシ
ステムで立体感と拡がりのある豊かなステレオ音場を実
現でき、これにより、スピーカ４０Ｌ，４０Ｒの設置ス
ペースの特性を生かした、全く新しい設計・デザインの
製品を創造することも可能である。Further, if the SD system is used as in the present embodiment, a three-dimensional sound with a sense of realism can be reproduced only by the two speakers 40L and 40R arranged very close to the left and right. Speaker 40 which was necessary for two
L and 40R can be substantially combined into one space. Further, the SD system is characterized in that the range in which the 3D effect can be experienced (service area) is wide and the localization of the sound image (the position where sound can be heard) is stable, compared to the conventional 3D sound field reproduction system. Furthermore, left and right speakers 4
Since it is not necessary to provide a distance between the stereo speakers 40L and 40L, two stereo speakers 40 which are normally arranged at distant positions.
Even if the L and 40R are arranged at the center without an interval, a 3D sound field having a wide left and right can be reproduced.
Even if L and 40R are concentrated, a sound field much more three-dimensional than a normal boombox can be reproduced. As described above, in the present embodiment using the SD system, a rich stereo sound field having a three-dimensional effect and a spaciousness can be realized with a compact system. It is also possible to create design products.

【００４３】（４）他の実施の形態本発明は前記の実施形態に限定されるものではなく、次
のような他の実施形態も包含する。(4) Other Embodiments The present invention is not limited to the above-described embodiments, but includes the following other embodiments.

【００４４】(a) 収録音や合成音など音像の定位を行う
音源が複数ある場合には、図２に示すように、第１の音
響信号生成手段１０−１から１０−ｎを複数個並列に設
け、これら複数個の第１の音響信号生成手段１０−１か
ら１０−ｎの出力信号を加算器３０によって、第２の音
響信号生成手段２０からの出力信号と加算して、左右チ
ャンネルのスピーカ４０Ｌ，４０Ｒから出力することが
できる。この場合、音源ごとに異なって定位を与えるこ
とができるので、バイノーラルソースを背景として異な
った距離や方向から種々の音が聞こえてくるような再生
音を得ることができる。その結果、電子会議システムの
ように共通の音場内の異なった場所に複数の発言者がい
る状況や、ゲーム機やコンピュータなどで生成された仮
想空間内において、共通の音場を形成し、この音場内で
それぞれ異なった方向に移動する複数の音源（例えば、
ゲームのキャラクターや乗り物など）を表現することが
できる。(A) When there are a plurality of sound sources for localizing a sound image such as a recorded sound and a synthesized sound, as shown in FIG. 2, a plurality of first sound signal generating means 10-1 to 10-n are arranged in parallel. The output signals of the plurality of first sound signal generation means 10-1 to 10-n are added to the output signals of the second sound signal generation means 20 by an adder 30 to obtain the left and right channels. Output can be made from the speakers 40L and 40R. In this case, since different localizations can be given to each sound source, it is possible to obtain a reproduced sound in which various sounds can be heard from different distances and directions with the binaural source as a background. As a result, a common sound field is formed in a situation where there are multiple speakers in different places in a common sound field such as an electronic conference system, or in a virtual space generated by a game machine or a computer. Multiple sound sources (for example, moving in different directions in the sound field)
Game characters and vehicles).

【００４５】(b) ＳＤ方式を使用することなく、従来の
ステレオシステムと同様に見開き角度が大きくなるよう
に左右チャンネルのスピーカ４０Ｌ，４０Ｒの間隔を広
げることもできる。(B) Without using the SD system, the distance between the left and right channel speakers 40L and 40R can be increased so that the spread angle becomes large as in the conventional stereo system.

【００４６】(c) ＳＤ方式や通常のステレオシステムの
場合に、左右チャンネルのスピーカ４０Ｌ，４０Ｒに加
えて、他のスピーカを設け、この他のスピーカに対して
も音像定位演算手段およびクロストークキャンセル演算
手段による処理を施した音響信号を供給することもでき
る。この場合、左右チャンネルのスピーカ４０Ｌ，４０
Ｒおよび他のスピーカに対しても、各スピーカ間のクロ
ストークを打ち消す処理を施す必要がある。(C) In the case of the SD system or a normal stereo system, other speakers are provided in addition to the left and right channel speakers 40L and 40R, and the sound image localization calculation means and the crosstalk cancellation are provided for the other speakers. It is also possible to supply an acoustic signal that has been processed by the arithmetic means. In this case, the left and right channel speakers 40L, 40L
It is necessary to perform processing for canceling crosstalk between the speakers for R and other speakers.

【００４７】(d) 前記実施形態は装置として本発明を表
現したものであるが、本発明はこの実施形態の装置のみ
によって実現されるものではなく、装置の構成自体は異
なっても前記実施形態に表現されたものと同様な手順に
よって立体音を生成するすべての方法を包含する。(D) Although the above-described embodiment expresses the present invention as an apparatus, the present invention is not realized only by the apparatus of this embodiment. And all methods for generating a three-dimensional sound by a procedure similar to that described in the above.

【００４８】(e) 本発明は、前記実施形態のような装置
及び実施形態に表現されるような方法に限定されるもの
ではなく、第１及び第２の音響信号を左右のスピーカか
ら出力できるように記録したＣＤ−ＲＯＭ、ビデオテー
プ、ＤＶＤなどの記録媒体も本発明の一実施形態であ
る。(E) The present invention is not limited to the device as in the above embodiment and the method as described in the embodiment, and the first and second acoustic signals can be output from the left and right speakers. A recording medium such as a CD-ROM, a video tape, or a DVD recorded as described above is also an embodiment of the present invention.

【００４９】[0049]

【発明の効果】以上の通り、本発明によれば、収録後に
おいて音像の定位が困難な音場についてダミーヘッドマ
イクロフォンによって収録したバイノーラルソースを利
用し、このバイノーラルソースと操作者が自由に音像の
定位を設定できるバーチャルソースとを加算して再生す
るようにしたものであるから、複雑な音響信号を有する
背景音の中で所望の音を自由に移動させることができ、
臨場感があり表現力に優れた再生音を容易に得ることが
可能となる。As described above, according to the present invention, a binaural source recorded by a dummy head microphone is used for a sound field in which it is difficult to localize a sound image after recording, and the binaural source and an operator can freely generate a sound image. Since it is made to add and reproduce a virtual source that can set the localization, it is possible to freely move a desired sound in a background sound having a complex sound signal,
It is possible to easily obtain a reproduced sound that is realistic and has excellent expressive power.

[Brief description of the drawings]

【図１】本発明による立体音生成装置の第１実施の形態
を示すブロック図。FIG. 1 is a block diagram showing a first embodiment of a three-dimensional sound generator according to the present invention.

【図２】図１の実施形態の作用を説明する図。FIG. 2 is a view for explaining the operation of the embodiment of FIG. 1;

【図３】本発明による立体音生成装置の第２実施の形態
を示すブロック。FIG. 3 is a block diagram showing a second embodiment of the three-dimensional sound generator according to the present invention.

【図４】本発明による立体音生成装置に使用する音像定
位演算手段の原理を示す図。FIG. 4 is a diagram showing the principle of a sound image localization calculation unit used in the three-dimensional sound generation device according to the present invention.

【図５】本発明による立体音生成装置に使用するクロス
トークキャンセル演算手段の原理を示す図。FIG. 5 is a diagram showing the principle of a crosstalk cancel calculation means used in the three-dimensional sound generation device according to the present invention.

[Explanation of symbols]

１０…第１の音響信号生成手段１１…原音生成手段１２…音像定位演算手段１３…調整器１４，２３…クロストークキャンセル演算手段２０…第２の音響信号生成手段２１…ダミーヘッドマイクロフォン２２…収録装置３０…加算器４０Ｌ，４０Ｒ…スピーカ DESCRIPTION OF SYMBOLS 10 ... 1st sound signal generation means 11 ... Original sound generation means 12 ... Sound image localization calculation means 13 ... Adjusters 14, 23 ... Crosstalk cancellation calculation means 20 ... 2nd sound signal generation means 21 ... Dummy head microphone 22 ... Recording Apparatus 30 ... Adder 40L, 40R ... Speaker

───────────────────────────────────────────────────── フロントページの続き (72)発明者岡元勇三東京都大田区大森西３−１−25 トミンハイム204号 (72)発明者内田成俊千葉県船橋市北本町２−40−１ブライトシティ621号 (72)発明者浜田晴夫東京都武蔵野市関前２−２−16 Ｆターム(参考） 5D011 AB12 5D062 AA64 ──────────────────────────────────────────────────続き Continued on the front page (72) Inventor Yuzo Okamoto 3-1-25 Ominishi, Omori-nishi, Ota-ku, Tokyo 204 (72) Inventor Shigetoshi Uchida 2-40-1 Kitahonmachi, Funabashi-shi, Chiba Prefecture Bright City 621 No. (72) Inventor Haruo Hamada 2-2-16 Sekimae, Musashino-shi, Tokyo F-term (reference) 5D011 AB12 5D062 AA64

Claims

[Claims]

1. A sound image localization calculating means for adding sound localization to a recorded or synthesized first acoustic signal, and a first sound signal output from the sound image localization calculating means, A first crosstalk cancellation calculating means for performing processing for canceling crosstalk of each channel generated when an audio signal is output from the left and right speakers, and a second audio signal recorded by a dummy head microphone. A second crosstalk canceling calculating means for performing processing for canceling crosstalk of each channel generated when the second audio signal is output from the left and right speakers; and a first and a second crosstalk canceling calculating means. Means for adding sound signals from the speakers and outputting the signals from left and right channel speakers. Sound generator.

2. A sound image localization calculating means for adding sound localization to a recorded or synthesized first acoustic signal, and a first sound signal output from the sound image localization calculating means, A first crosstalk canceling operation unit for performing a process for canceling crosstalk of each channel generated when an audio signal is output from the left and right speakers, and a first audio signal generation unit; The second crosstalk canceling operation means for performing processing for canceling crosstalk of each channel generated when the second audio signal is output from the left and right speakers to the recorded second audio signal by the second crosstalk cancellation calculating means And a plurality of the first acoustic signal generating means are provided in parallel in accordance with the number of sound sources for localizing sound. Stereo sound generation apparatus, characterized in that it comprises a means for outputting the first audio signal generating means and the second left and right channel speakers by adding the sound signal from the sound signal generator means number.

3. The speaker according to claim 1, wherein the left and right channel speakers are arranged at a spread angle of 6 to 20 degrees when viewed from the listener's listening point. Three-dimensional sound generator.

4. A method in which sound localization is added to a recorded or synthesized first acoustic signal, and the first acoustic signal is output from the left and right speakers with respect to the first acoustic signal. Performs processing for canceling channel crosstalk, and cancels the crosstalk of each channel generated when the second audio signal is output from the left and right speakers with respect to the second audio signal recorded by the dummy head microphone. A stereophonic sound generating method, wherein the first and second acoustic signals subjected to these processes are added and output from left and right channel speakers.

5. A method in which sound localization is added to a recorded or synthesized first acoustic signal, and the first acoustic signal is output from the left and right speakers with respect to the first acoustic signal. Performs processing for canceling channel crosstalk, and cancels the crosstalk of each channel generated when the second audio signal is output from the left and right speakers with respect to the second audio signal recorded by the dummy head microphone. A medium for recording a three-dimensional sound, wherein the first and second acoustic signals subjected to these processes are recorded so as to be output from speakers on the left and right channels.