JP3363921B2

JP3363921B2 - Sound image localization device

Info

Publication number: JP3363921B2
Application number: JP23363292A
Authority: JP
Inventors: 聡一西山; 和之渡辺
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1992-09-01
Filing date: 1992-09-01
Publication date: 2003-01-08
Anticipated expiration: 2018-01-08
Also published as: JPH0686400A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】近年、マンマシン・インタフェー
ス、ヒューマン・インタフェースというような人間と計
算機との対話方法に対する要求があり、人工現実感（Ａ
Ｒ）や仮想現実（ＶＲ）といった技術により人間の５感
を利用した直感的な対話方法が開発されるようになって
きた。しかし、これまでに開発された対話方法は視覚に
よるものが大多数をしめ、聴覚によるものは少ない。[Industrial application] In recent years, there has been a demand for human-computer interaction methods such as a man-machine interface and a human interface.
Technologies such as R) and virtual reality (VR) have led to the development of intuitive dialogue methods that utilize the human senses. However, most of the dialog methods that have been developed so far are visual and less auditory.

【０００２】人間の現実感の認識を考えると視覚からの
情報が最も重要であると思われるが、視覚のみで効果的
な現実感は得られない。なぜなら、人間は現実感の認識
を５感全ての情報によって認識しているからである。こ
れからの対話装置を考えると、音像の定位が行える音響
装置が必要である。この音響定位装置を用いることによ
って音の方向や周りの環境をリアルに人間に与えること
ができ、現実感が向上するであろう。例えば、視覚装置
にヘッド・マウント・ディスプレイ（ＨＭＤ）と呼ばれ
る頭部搭載型立体視装置を用い、音響再生装置に音像の
定位が行える音響装置を用いることで、実時間で変化す
る仮想世界（景観シミュレーション、ＣＡＤ／ＣＡＭな
ど）をコンピュータ・グラフィクス（ＣＧ）やコンピュ
ータ・サウンド（ＣＳ）で体験し直感的な操作をするこ
とができる。本発明は、上記した音響定位装置に関し、
特に本発明は聴者に任意の音像の定位を知覚させること
ができる音響定位装置に関するものである。From the viewpoint of human perception of reality, it is considered that information from the visual sense is the most important, but effective sense of reality cannot be obtained only by the visual sense. This is because human beings recognize the perception of reality by all five senses of information. Considering an interactive device in the future, an audio device capable of localizing a sound image is required. By using this acoustic localization device, the direction of sound and the surrounding environment can be realistically given to humans, and the sense of reality will be improved. For example, by using a head-mounted stereoscopic device called a head mounted display (HMD) as a visual device and an acoustic device capable of localizing a sound image as a sound reproducing device, a virtual world that changes in real time (landscape) You can experience simulation, CAD / CAM, etc.) with computer graphics (CG) and computer sound (CS), and perform intuitive operations. The present invention relates to the acoustic localization device described above,
In particular, the present invention relates to an acoustic localization device that allows a listener to perceive an arbitrary localization of a sound image.

【０００３】[0003]

【従来の技術】仮想世界に立体音を取り入れ現実感を表
現する人工現実感の聴覚装置として、現在、開発されて
いるものには、例えば、ヘッドホンを用いて立体音を
生成するシステム、あるいは、スピーカを用いて立体
音を生成するシステムが知られている。（１）ヘッドホンを用いて立体音を生成するシステム。図９（ａ）は上記システムの構成を示す図である。同図
において、９１は音源となるＣＤ等のドライ・ソース、
９２はディスク、９３はサンプラ、９４はＲＳ２３２Ｃ
シリアル・ポート、９５はホスト・コンピュータ、９６
は立体音生成用の専用計算機、９７はヘッドホン、９８
は位置検出装置、９９は聴者である。同図において、ま
ず、出力したいソース（ドライ・ソース）をサンプラ９
３に録音する。ホスト・コンピュータ９５はＲＳ−２３
２Ｃのシリアル・ポートを介してＭＩＤＩ（ミュージカ
ル・インストルメント・デジタル・インタフェース、デ
ジタル音楽信号の伝送規約であり、以下ＭＩＤＩとい
う）に変換し、ドライ・ソースの選択、４本の出力ライ
ンの選択および出力タイミングの制御を行う。2. Description of the Related Art Artificial-reality hearing devices that incorporate three-dimensional sound into a virtual world to express reality include, for example, a system for generating three-dimensional sound using headphones, or A system that generates a stereoscopic sound using a speaker is known. (1) A system that generates stereoscopic sound using headphones. FIG. 9A is a diagram showing the configuration of the above system. In the figure, 91 is a dry source such as a CD as a sound source,
92 is a disk, 93 is a sampler, and 94 is RS232C.
Serial port, 95 is host computer, 96
Is a dedicated computer for three-dimensional sound generation, 97 is headphones, 98
Is a position detecting device, and 99 is a listener. In the figure, the source (dry source) to be output is sampler 9 first.
Record to 3. The host computer 95 is RS-23
Converted to MIDI (Musical Instrument Digital Interface, a digital music signal transmission protocol, hereinafter referred to as MIDI) via the 2C serial port, and selects dry source, selects four output lines, and Controls output timing.

【０００４】聴者９９の位置、方向（位置、方向あわせ
て６自由度）は位置検出装置９８により検出され、ホス
ト・コンピュータ９５より立体音生成用の専用計算機９
６に伝えられる。立体音生成用の専用計算機９６は聴者
９９の位置、方向情報からドライ・ソースを立体音に変
換して、左右の音として聴者９９のヘッドホン９８に出
力する。本システムの特徴は次の通りである。ヘッ
ドホンを使用する個人用の立体音生成装置である。
ヘッドホンを使用するため聴者の骨格や髪形によって聴
覚に個人差が生じ、その補正が困難である。サンプ
ラによりドライ・ソースを表現する。聴者には６自
由度（位置、方向）が与えられる。聴者前面の音の
定位が困難である。映像装置はヘッド・マウント・
ディスプレイ（ＨＭＤ）を用いたシステムである。The position and direction of the listener 99 (six degrees of freedom in position and direction) are detected by a position detection device 98, and a dedicated computer 9 for generating a stereophonic sound from a host computer 95.
6. The three-dimensional sound generation dedicated computer 96 converts the dry source into three-dimensional sound based on the position and direction information of the listener 99, and outputs it to the headphones 98 of the listener 99 as left and right sounds. The features of this system are as follows. This is a personal stereophonic sound generation device that uses headphones.
Since headphones are used, there are individual differences in hearing due to the skeleton and hairstyle of the listener, and it is difficult to correct them. Express the dry sauce with a sampler. The listener is given 6 degrees of freedom (position, direction). It is difficult to localize the sound in front of the listener. The video equipment is head mount
It is a system using a display (HMD).

【０００５】（２）スピーカを用いて立体音を生成する
システム図９（ｂ）は上記システムの構成を示す図であり、同図
において、１０１は音を生成することができるコンピュ
ータ、１０２はデジタル・サラウンド・デコーダ、１０
３ａないし１０３ｅはスピーカ、１０４は聴者である。
同図において、まず録音したいドライ・ソースをコンピ
ュータ１０１のメモリに記憶させる。コンピュータ１０
１は聴者１０５の位置と音源（最大４種類のドライ・ソ
ースを同時出力可能）位置からデジタル・サラウンド・
デコーダ１０２の特性を考慮して左右の音を生成する。
デジタル・サラウンド・デコーダ１０２は生成された左
右の音に方向性強調処理を行い４方向のスピーカ１０３
ａないし１０３ｅから立体音を出力する。本システムの
特徴は次の通りである。複数人が聴くことができる
スピーカ・タイプの立体音生成装置である。スピー
カの配置範囲内に音場を提供する。市販のデジタル
・サラウンド・デコーダによって立体音を表現するため
方向性強調処理依存となる可能性がある。聴者には
位置のみの３自由度が与えられる。映像装置は単一
平面画面である。(2) System for Generating Stereoscopic Sound Using Speaker FIG. 9 (b) is a diagram showing the configuration of the above system. In FIG. 9, 101 is a computer capable of generating sound, and 102 is a digital system.・ Surround decoder, 10
3a to 103e are speakers, and 104 is a listener.
In the figure, first, the dry source to be recorded is stored in the memory of the computer 101. Computer 10
1 is digital surround sound from the position of the listener 105 and the sound source (up to 4 types of dry sources can be output simultaneously).
The left and right sounds are generated in consideration of the characteristics of the decoder 102.
The digital surround decoder 102 performs directionality enhancement processing on the generated left and right sounds and a four-direction speaker 103.
Stereo sound is output from a to 103e. The features of this system are as follows. It is a speaker-type three-dimensional sound generation device that can be heard by a plurality of people. Providing a sound field within the speaker placement range. Since stereoscopic sound is expressed by a commercially available digital surround decoder, there is a possibility that it will depend on the directional enhancement processing. The listener is given three degrees of freedom only in position. The video device is a single plane screen.

【０００６】[0006]

【発明が解決しようとする課題】ところで、上記した
（１）のシステムは体験者の位置と方向により立体音を
生成するが、一般的にヘッドホンを用いるため、空間へ
の音の定位が弱く、音像の移動も左右の移動を表現でき
る程度である。特に、ヘッドホンの特性上、前方の音は
頭の中に、後方の音は上位に定位する傾向があるため、
あらゆる方向の定位が困難である。また、ヘッドホンを
用いたシステムにおいては、頭部伝達関数（骨格、耳の
形、頭髪など）の個人差によって音の認識に差が生じ、
現実感の表現をしにくい。By the way, the system of (1) described above generates stereoscopic sound depending on the position and direction of the experiencer, but since headphones are generally used, localization of sound into space is weak, The movement of the sound image is also such that left and right movement can be expressed. In particular, due to the characteristics of headphones, the sound in the front tends to be localized in the head and the sound in the rear tends to be localized in the higher level,
Localization in all directions is difficult. In addition, in a system using headphones, there are differences in sound recognition due to individual differences in head related transfer functions (skeleton, ear shape, hair, etc.),
It is difficult to express reality.

【０００７】（２）のシステムは前面を向いた聴者に立
体音を提供するシステムであり、音像の移動については
空間内の移動がある程度表現できるが、空間への音の定
位が弱い。また、人工現実感の認識という点からみて、
聴者の自由度や方向性強調処理への依存特性などに問題
がある。本発明は上記した従来技術の問題点に鑑みなさ
れたものであって、聴者（体験者）の頭部の動きに追従
させて、任意の音の音像を定位して聞かせることによ
り、聴者に音像を現実感をもって認識させることができ
る音像定位装置を提供することを目的とする。The system (2) is a system for providing a stereophonic sound to a listener facing the front. The movement of the sound image can be expressed to some extent in the space, but the localization of the sound to the space is weak. Also, in terms of recognition of artificial reality,
There are problems with the degree of freedom of the listener and the dependence on the directionality enhancement processing. The present invention has been made in view of the above-mentioned problems of the prior art, and makes the listener follow the movement of the head of the listener (experiencer) and localize and hear a sound image of an arbitrary sound. It is an object of the present invention to provide a sound image localization device capable of recognizing a sound image with a sense of reality.

【０００８】[0008]

【課題を解決するための手段】図１は本発明の基本構成
図である。上記課題を解決するため、本発明の請求項１
の発明は、任意の音を発生する音発生装置１と、聴者５
の位置、角度等の聴点を検出する位置検出装置３と、音
像を生成する音制御装置２と、音像を再生する音再生装
置４ａ，４ｂ，４ｃ，４ｄと、仮想的な音の世界を提供
する管理計算機６と、音再生装置が設けられた場所とは
異なる場所で発生する任意の音を集音し、音の発生位置
とともに音制御装置に送る集音装置１’を備え、管理計
算機６が位置検出装置３の出力に基づき任意の音の動き
あるいは変化を時間的に管理し、音制御装置２が管理計
算機６の出力に基づき位置検出装置３からの聴者５の位
置に関する情報、音再生装置４ａ，４ｂ，４ｃ，４ｄの
配置、および、任意に設定する発生音もしくは収集音の
定位位置との関係により発生音もしくは収集音に効果を
与えて再生するように構成したものである。FIG. 1 is a basic configuration diagram of the present invention. In order to solve the above problems, claim 1 of the present invention
Of the invention, a sound generator 1 for generating an arbitrary sound, and a listener 5
A position detecting device 3 for detecting a listening point such as a position and an angle, a sound control device 2 for generating a sound image, sound reproducing devices 4a, 4b, 4c, 4d for reproducing a sound image, and a virtual sound world. The management computer 6 provided and the place where the sound reproduction device is installed
Collecting arbitrary sounds that occur in different places , the sound generation position
A sound collection device 1 ′ for sending to the sound control device is provided together, the management computer 6 temporally manages the movement or change of any sound based on the output of the position detection device 3, and the sound control device 2 outputs the output of the management computer 6. Based on the information on the position of the listener 5 from the position detection device 3, the arrangement of the sound reproduction devices 4a, 4b, 4c, 4d, and the relation between the sound reproduction device 4a, 4b, 4c, and 4d and the localization position of the generated sound or the collected sound, the generated sound or the collected sound is collected. It is configured to give an effect to the sound and reproduce it.

【０００９】本発明の請求項２の発明は、請求項１の発
明において、音再生装置４ａ，４ｂ，４ｃ，４ｄを移動
させる可動筐体を備え、可動筐体が、音像定位の認識低
下を防ぐように聴者５の移動にあわせて音再生装置４
ａ，４ｂ，４ｃ，４ｄを移動させるように構成したもの
である。本発明の請求項３の発明は、請求項１または請
求項２の発明において、音に同期した映像を生成する映
像生成装置７を備え、映像生成装置７が音再生装置４
ａ，４ｂ，４ｃ，４ｄが出力する音像と聴者５の動きに
同期した映像８を聴者５に提供するように構成したもの
である。According to a second aspect of the present invention, in the first aspect of the invention, a movable casing for moving the sound reproducing devices 4a, 4b, 4c, 4d is provided, and the movable casing reduces the recognition of the sound image localization. The sound reproduction device 4 is adapted to the movement of the listener 5 so as to prevent it.
It is configured to move a, 4b, 4c and 4d. The invention of claim 3 of the present invention is the invention of claim 1 or contract.
In the invention of claim 2 , the image generation device 7 for generating an image synchronized with a sound is provided, and the image generation device 7 is the sound reproduction device 4.
The sound image output by a, 4b, 4c, and 4d and the image 8 synchronized with the movement of the listener 5 are provided to the listener 5.

【００１０】[0010]

【作用】本発明の請求項１の発明において、聴者５の頭
部の位置は位置検出装置３により検出され、音制御装置
２に与えられる。管理計算機６が位置検出装置３の出力
に基づき任意の音の動きあるいは変化を時間的に管理す
る。集音装置１’は音再生装置が設けられた場所とは異
なる場所で発生する任意の音を集音し、音の発生位置と
ともに音制御装置に送り、音制御装置２は位置検出装置
３により検出された聴者５の頭部の位置、方向から聴者
の聴点と音再生装置４ａないし４ｄの位置関係を求め、
これと、任意の音の発生位置を設定して、これら３者の
位置関係に基づき、音発生装置１が発生する音情報に、
その音圧比の制御、音情報の遅延、周波数変換等の効果
を与え、音再生装置４ａ，４ｂ，４ｃ，４ｄに出力す
る。聴者の頭部の位置、方向に追従させて音像を定位さ
せているので、聴者に現実感をもって音像定位を認識さ
せることができる。また、集音装置１’が遠隔地で発生
する音を取り込み、音制御装置２が収集された音に効果
を与えて音再生装置４ａ，４ｂ，４ｃ，４ｄに出力して
いるので、遠隔地の音を現実感をもって聴者に体験させ
ることができる。In the first aspect of the present invention, the position of the head of the listener 5 is detected by the position detecting device 3 and given to the sound control device 2. The management computer 6 temporally manages the movement or change of an arbitrary sound based on the output of the position detection device 3 .
It The sound collector 1'is different from the place where the sound reproduction device is installed.
Collect any sound that is generated in
Both are sent to the sound control device , and the sound control device 2 obtains the positional relationship between the listener's listening point and the sound reproduction devices 4a to 4d from the position and direction of the head of the listener 5 detected by the position detection device 3,
By setting this and an arbitrary sound generation position, the sound information generated by the sound generation device 1 is added to the sound information based on the positional relationship between the three parties.
The effects such as control of the sound pressure ratio, delay of sound information, frequency conversion, etc. are given and output to the sound reproduction devices 4a, 4b, 4c, 4d. Since the sound image is localized by following the position and direction of the listener's head, the listener can recognize the sound image localization with a sense of reality. Further, since the sound collecting device 1 ′ captures the sound generated at the remote place and the sound control device 2 gives an effect to the collected sound and outputs the sound to the sound reproducing devices 4a, 4b, 4c, 4d, the remote place. The sound of can be experienced by the listener.

【００１１】本発明の請求項２の発明においては、音像
定位の認識低下を防ぐように聴者５の移動にあわせて音
再生装置４ａ，４ｂ，４ｃ，４ｄを移動させるように構
成したので、音再生装置の再生性能の不足や聴者の移動
による音像定位の認識の低下を防ぐことができる。本発
明の請求項３の発明においては、音に同期した映像を生
成する映像生成装置７を備え、映像生成装置７が音再生
装置４ａ，４ｂ，４ｃ，４ｄが出力する音像と聴者５の
動きに同期した映像８を聴者５に提供するように構成し
たので、任意の音に映像を付加することができ、一層現
実感を高めることができる。According to the second aspect of the present invention, the sound reproducing devices 4a, 4b, 4c and 4d are moved in accordance with the movement of the listener 5 so as to prevent the sound image localization from being deteriorated. It is possible to prevent a reduction in the reproduction performance of the reproduction device and a reduction in the recognition of the sound image localization due to the movement of the listener. According to the third aspect of the present invention, there is provided the image generation device 7 for generating the image synchronized with the sound, and the image generation device 7 outputs the sound images output by the sound reproduction devices 4a, 4b, 4c, 4d and the movement of the listener 5. Since the image 8 synchronized with the above is provided to the listener 5, the image can be added to an arbitrary sound and the sense of reality can be further enhanced.

【００１２】[0012]

【実施例】図２は本発明の第１の実施例を示す図であ
り、同図において、２１は音情報を（ドライ・ソース）
を発生する音発生装置、２２は音発生装置２１が発生す
る音情報に効果を与える音制御装置、２３は聴者の頭部
の位置（位置、方向の６自由度）を検出する位置検出装
置、２４ａないし２４ｄは例えばスピーカ等からなる音
再生装置、２５は聴者である。図２において、音再生装
置２４ａないし２４ｄは聴者２５を取り囲むように配置
されており、聴者２５の頭部の位置は位置検出装置２３
により検出され、音制御装置２２に与えられる。音制御
装置２２は位置検出装置２３により検出された聴者２５
の頭部の位置、方向から聴者の聴点と音再生装置２４ａ
ないし２４ｄの位置関係を求め、これと、任意の音の発
生位置を設定して、これら３者の位置関係に基づき、音
発生装置２１が発生する音情報に効果を与える。FIG. 2 is a diagram showing a first embodiment of the present invention, in which 21 is sound information (dry source).
, 22 is a sound control device that exerts an effect on the sound information generated by the sound generation device 21, 23 is a position detection device that detects the position of the listener's head (6 degrees of freedom in position and direction), Reference numerals 24a to 24d are sound reproducing devices including, for example, speakers, and 25 is a listener. In FIG. 2, the sound reproduction devices 24a to 24d are arranged so as to surround the listener 25, and the position of the head of the listener 25 is determined by the position detection device 23.
Is detected by the sound control device 22 and given to the sound control device 22. The sound control device 22 controls the listener 25 detected by the position detection device 23.
From the position and direction of the head of the listener, the listening point of the listener and the sound reproducing device 24a
To 24d, the position of generation of an arbitrary sound is set, and based on the positional relationship of these three, the sound information generated by the sound generator 21 is effective.

【００１３】音情報に与える効果としては、例えば、音
発生装置２１から発生する音情報を音再生装置２４ａな
いし２４ｄに割り振るとき、音圧比を変えたり、音情報
に遅延を与えたり、あるいは、周波数を変換したりす
る。また、反射音や残響音などの計算を行って、音像の
定位を行うとともに、部屋の大きさなどを聴者に認識さ
せる。本実施例においては、上記のように、聴者の頭部
の位置、方向に追従させて音像を定位させているので、
聴者に現実感をもって音像を認識させることができる。As an effect to be given to the sound information, for example, when the sound information generated from the sound generating device 21 is allocated to the sound reproducing devices 24a to 24d, the sound pressure ratio is changed, the sound information is delayed, or the frequency is changed. Or convert. In addition, the reflected sound and reverberant sound are calculated to localize the sound image, and the listener recognizes the size of the room. In the present embodiment, as described above, since the sound image is localized by following the position and direction of the listener's head,
The listener can recognize the sound image with a sense of reality.

【００１４】図３は本発明の第２の実施例を示す図であ
り、図２に示したものと同一のものには同一の符号が付
されており、本実施例においては、図２に示した音発生
装置２１に換え、遠隔地に設けた音収集装置２１’を設
けたものであり、その他の構成は第１の実施例と同一で
ある。図３において、音収集装置２１’は聴者２５とは
離れた空間の音を収集し、音制御装置２２に送る。その
際、必要に応じて、音収集装置２１’は収集する音の発
生位置に関する情報を送る。音制御装置２２は音収集装
置２１’より送られた音情報に、第１の実施例で説明し
たのと同様に効果を与え、音再生装置２４ａないし２４
ｄに与え、音像を定位させる。本実施例においては、上
記のように、遠隔地の音を現実感をもって聴者に体験さ
せることができる。FIG. 3 is a diagram showing a second embodiment of the present invention, in which the same components as those shown in FIG. 2 are designated by the same reference numerals, and in this embodiment, FIG. Instead of the sound generating device 21 shown, a sound collecting device 21 'provided at a remote place is provided, and the other configurations are the same as those of the first embodiment. In FIG. 3, the sound collecting device 21 ′ collects the sound in the space apart from the listener 25 and sends it to the sound control device 22. At that time, if necessary, the sound collecting device 21 'sends information on the generation position of the sound to be collected. The sound control device 22 applies the same effect to the sound information sent from the sound collecting device 21 ′ as described in the first embodiment, and the sound reproducing devices 24 a through 24 a.
The sound image is localized at d. In the present embodiment, as described above, the listener can experience the sound of a remote place with a sense of reality.

【００１５】図４は本発明の第３の実施例を示す図であ
り、図２に示したものと同一のものには同一の符号が付
されており、本実施例は、音再生装置２４ａないし２４
ｄを可動筐体２９ａないし２９ｄに取り付け、絶えず聴
者２５の方向に向けることができるようにしたものであ
る。同図において、音制御装置２２は、第１の実施例と
同様、音発生装置２１が発生する音に効果を与え各音再
生装置２４ａないし２４ｄに与えるとともに、位置検出
装置２３により検出された聴者２５の位置情報に基づき
音再生装置２４ａないし２４ｄを絶えず聴者２５の方向
に向かせるための可動筐体移動情報を与える。本実施例
においては、上記のように、可動筐体２９ａないし２９
ｄにより音再生装置２４ａないし２４ｄを聴者２５の方
向を向けるので、音再生装置の再生性能の不足や聴者の
移動による音像定位の認識の低下を防ぐことができる。FIG. 4 is a diagram showing a third embodiment of the present invention, in which the same components as those shown in FIG. 2 are designated by the same reference numerals, and in this embodiment, a sound reproducing device 24a is used. Through 24
d is attached to the movable housings 29a to 29d so that it can be constantly directed toward the listener 25. In the figure, as in the first embodiment, the sound control device 22 gives an effect to the sound generated by the sound generation device 21 and gives it to each of the sound reproduction devices 24a to 24d, and the listener detected by the position detection device 23. Based on the position information of 25, the movable housing movement information for constantly orienting the sound reproducing devices 24a to 24d toward the listener 25 is given. In the present embodiment, as described above, the movable casings 29a to 29a are used.
Since the sound reproduction devices 24a to 24d are directed to the listener 25 by d, it is possible to prevent the reproduction performance of the sound reproduction device from being insufficient and the reduction in the recognition of the sound image localization due to the movement of the listener.

【００１６】図５は本発明の第４の実施例を示す図であ
り、図２に示したものと同一のものには同一の符号が付
されており、本実施例は、図２のものに任意の音を管理
するための管理計算機２６を設けたものであり、その他
の構成は第１の実施例と同一である。図５において、管
理計算機２６は任意の音の移動や音の変化を管理する
（時間成分を含む場合もある）計算機である。例えば、
蜜蜂が聴者２５の周りを飛び回る音を聴者２５に認識さ
せる場合には、蜂が飛んでいる間は管理計算機２６が音
発生装置２１に蜂の飛んでいる音を出させて音制御装置
２２には蜂の位置情報を伝達する。音制御装置２２は、
第１の実施例と同様、管理制御装置２６から与えられる
位置情報に基づき音発生装置２１が発生する音を各再生
装置２４ａないし２４ｄに割り振り、聴者２５に蜂が飛
んでいる音を認識させる。本実施例においては、上記の
ように、管理計算機２６を設けて任意の音の移動や音の
変化を管理するので、一層現実感を高めて聴者２５に音
を認識させることができる。FIG. 5 is a diagram showing a fourth embodiment of the present invention. The same parts as those shown in FIG. 2 are designated by the same reference numerals, and this embodiment is the same as that of FIG. A management computer 26 for managing arbitrary sounds is provided in the second embodiment, and other configurations are the same as those in the first embodiment. In FIG. 5, the management computer 26 is a computer that manages the movement of any sound and the change of sound (may include a time component). For example,
When making the listener 25 recognize the sound of the bees flying around the listener 25, the management computer 26 causes the sound generation device 21 to emit the sound of the bees flying and causes the sound control device 22 to operate while the bees are flying. Conveys bee location information. The sound control device 22 is
Similar to the first embodiment, the sound generated by the sound generation device 21 is assigned to each of the reproduction devices 24a to 24d based on the position information provided from the management control device 26, and the listener 25 is made to recognize the sound of a bee flying. In the present embodiment, as described above, since the management computer 26 is provided to manage the movement and change of any sound, it is possible to make the listener 25 recognize the sound more realistically.

【００１７】図６は本発明の第５の実施例を示す図であ
り、図２に示したものと同一のものには同一の符号が付
されており、本実施例は、図２のものに映像再生装置２
７および映像を表示するための画面２８ａないし２８ｄ
を設けたものであり、その他の構成は第１の実施例と同
一である。図６において、映像再生装置２７は音を発生
する物体の位置情報などから体験者（聴者）が現実感を
得られるような映像を生成して体験者（聴者）に体験さ
せる。以上のように、本実施例においては、映像再生装
置２７および映像を表示するための画面２８ａないし２
８ｄを設けたので、任意の音に映像を付加することがで
き、現実感を高めることができる。FIG. 6 is a diagram showing a fifth embodiment of the present invention. The same parts as those shown in FIG. 2 are designated by the same reference numerals, and this embodiment is the same as that of FIG. Video playback device 2
7 and screens 28a to 28d for displaying images
Is provided, and the other structure is the same as that of the first embodiment. In FIG. 6, the image reproducing device 27 generates an image that gives the experience person (listener) a sense of reality based on the position information of the object generating the sound and causes the experience person (listener) to experience. As described above, in the present embodiment, the video playback device 27 and the screens 28a to 2 for displaying the video are used.
Since 8d is provided, it is possible to add an image to an arbitrary sound and enhance the sense of reality.

【００１８】図８は本発明の具体的実施例を示す図であ
り、本実施例は図５に示した第４の実施例と図６に示し
た実施例のものに、ヘッド・マウント・ディスプレイ
（ＨＭＤ）による立体画面表示装置と実時間画像生成装
置を設けた実施例を示したものであ。なお、ヘッド・マ
ウント・ディスプレイ（ＨＭＤ）は眼前の２枚の液晶表
示装置を光学系により左右の目に個々に見せることによ
り、立体視を可能とする装置である。図７は図８に示す
実施例におけるスピーカの配置を示す図であり、５０
Ｌ，５０Ｒはそれぞれ体験者（聴者）２５の前面に設け
られた左と右のスピーカ、５１Ｌ，５１Ｒは体験者（聴
者）２５の側面に設けられた左と右のスピーカ、５２
Ｌ，５２Ｒは体験者（聴者）２５の背面に設けられた左
と右のスピーカである。FIG. 8 is a diagram showing a specific embodiment of the present invention. This embodiment is the same as the fourth embodiment shown in FIG. 5 and the embodiment shown in FIG. It shows an embodiment in which a stereoscopic screen display device by (HMD) and a real-time image generation device are provided. A head-mounted display (HMD) is a device that enables stereoscopic viewing by allowing two liquid crystal display devices in front of the eye to be individually viewed by the left and right eyes by an optical system. FIG. 7 is a diagram showing the arrangement of speakers in the embodiment shown in FIG.
L and 50R are left and right speakers provided on the front surface of the experience person (listener) 25, 51L and 51R are left and right speakers provided on the side surface of the experience person (listener) 25, 52
L and 52R are left and right speakers provided on the back surface of the experience (listener) 25.

【００１９】図８において、図５、図６の実施例に示し
たものと同一のものには同一の符号が付されており、同
図において、２３は体験者（聴者）の頭部の位置（位
置、方向の６自由度）を検出する位置検出装置、２５は
体験者（聴者）、２７は立体画像を実時間で生成する映
像生成装置、２８’は映像生成装置２７により生成され
る立体画面、２９は音に効果を与えるための制御信号を
発生する音制御装置であり、以下に述べるデジタル・サ
ンプラ、プログラマブル・ライン・セレクタ等を制御す
る信号を出力する。また、３１はプログラムされている
音をＭＩＤＩのタイミングで発生させるデジタル・サン
プラ、３２はデジタル・サンプラ３１が発生する音をデ
ジタル・サウンド・プロセッサに割り振るプログラマブ
ル・ライン・セレクタ、３３ａないし３３ｃはプログラ
マブル・ライン・セレクタ３２からの入力音源に対して
音圧、遅延、周波数の変換を行うデジタル・サウンド・
プロセッサである。In FIG. 8, the same components as those shown in the embodiment of FIGS. 5 and 6 are designated by the same reference numerals, and in FIG. 8, 23 is the position of the head of the experience person (listener). A position detection device that detects (6 degrees of freedom in position and direction), 25 is an experience person (listener), 27 is a video generation device that generates a stereoscopic image in real time, and 28 'is a stereoscopic image generated by the video generation device 27. A screen, 29 is a sound control device for generating a control signal for giving an effect to a sound, and outputs a signal for controlling a digital sampler, a programmable line selector, etc. described below. Further, 31 is a digital sampler for generating a programmed sound at a MIDI timing, 32 is a programmable line selector for allocating the sound generated by the digital sampler 31 to a digital sound processor, and 33a to 33c are programmable line selectors. Digital sound that performs sound pressure, delay, and frequency conversion on the input sound source from the line selector 32.
It is a processor.

【００２０】３４ａないし３４ｆはデジタル・サウンド
・プロセッサ３３ａないし３３ｃの出力を増幅する増幅
器、３５ａないし３５ｃは増幅器３４ａないし３４ｆの
出力とＤＡＴ３８の出力の音圧制御を行うデジタル・ミ
キシング・プロセッサ、３６ａないし３６ｆはデジタル
・ミキシング・プロセッサ３５ａないし３５ｃの出力を
増幅して図７の体験者（聴者）２５の前面、側面、背面
の左右の各スピーカ５０Ｌ，５０Ｒ，５１Ｌ，５１Ｒ，
５２Ｌ，５２Ｒに出力する増幅器、３８は音が録音され
たＤＡＴ（デジタル・オーディオ・テープ）である。４
０，４１，４２はＲＳ−２３２Ｃ／ＭＩＤＩシリアル・
ポートであり、音制御装置とデジタル・サンプラ３１、
プログラマブル・ライン・セレクタ３２、デジタル・サ
ウンド・プロセッサ３３ａ〜３３ｃおよびデジタル・ミ
キシング・プロセッサ３５ａ〜３５ｃとの間の制御信号
の伝送を行う。また、４３，４４，４５はＲＳ２３２Ｃ
のシリアル信号をＭＩＤＩに変換するＭＩＤＩエキスパ
ンダである。34a to 34f are amplifiers for amplifying the outputs of the digital sound processors 33a to 33c, 35a to 35c are digital mixing processors for controlling the sound pressure of the outputs of the amplifiers 34a to 34f and the output of the DAT 38, and 36a to 36c. 36f amplifies the outputs of the digital mixing processors 35a to 35c to amplify the outputs of the experience (listener) 25 shown in FIG. 7 to the left, right, left and right speakers 50L, 50R, 51L, 51R, respectively.
An amplifier for outputting to 52L and 52R, and 38 is a DAT (digital audio tape) on which sound is recorded. Four
0, 41, 42 are RS-232C / MIDI serial
Port, sound controller and digital sampler 31,
Control signals are transmitted between the programmable line selector 32, the digital sound processors 33a to 33c, and the digital mixing processors 35a to 35c. In addition, 43, 44 and 45 are RS232C
Is a MIDI expander for converting a serial signal of the above into MIDI.

【００２１】図８において、体験者（聴者）２５の頭部
の位置、方向が位置検出装置２３で検出され、管理計算
機２６に与えられる。管理計算機２６は体験者（聴者）
２５の頭部の位置、方向信号に基づき、図５の第４の実
施例と同様、発生する音の種類、音の位置情報を音制御
装置２９に与えるとともに、映像生成装置２７に表示す
る映像と体験者（聴者）２５の位置情報を与える。映像
生成装置２７は管理計算機２６の出力に基づき、体験者
（聴者）２５の動きにあわせて仮想世界の物体の位置を
リアル・タイムでヘッド・マウント・ディスプレイ（Ｈ
ＭＤ）の左右に表示し、体験者（聴者）２５にコンピュ
ータ・グラフィクスの立体映像を提供する。音制御装置
２９は管理計算機２６が出力する音の位置情報に基づき
ＲＳ−２３２Ｃ／ＭＩＤＩシリアル・ポートを介してデ
ジタル・サンプラ３１、プログラマブル・ライン・セレ
クタ３２、デジタル・サウンド・プロセッサ３３ａ〜３
３ｃおよびデジタル・ミキシング・プロセッサ３５ａ〜
３５ｃを制御して立体音を生成させる。In FIG. 8, the position and direction of the head of the experiencer (listener) 25 are detected by the position detection device 23 and given to the management computer 26. Management computer 26 is an experienced person (listener)
Based on the position and direction signals of the head of No. 25, the kind of sound to be generated and sound position information are given to the sound control device 29 and displayed on the video generation device 27 as in the fourth embodiment of FIG. And the position information of the experience person (listener) 25 is given. Based on the output of the management computer 26, the video generation device 27 displays the position of the object in the virtual world in real time according to the motion of the experiencer (listener) 25 in a head mounted display (H).
It is displayed on the left and right of the MD) and provides the experience (listener) 25 with a stereoscopic image of computer graphics. The sound control device 29 uses the position information of the sound output by the management computer 26 to transmit a digital sampler 31, a programmable line selector 32, and digital sound processors 33a-3a through an RS-232C / MIDI serial port.
3c and digital mixing processor 35a-
35c is controlled to generate a three-dimensional sound.

【００２２】デジタル・サンプラ３１はプログラムされ
ている音を音制御装置２９が出力するＭＩＤＩのタイミ
ングに合わせて出力し、プログラマブル・ライン・セレ
クタ３２に与える。プログラマブル・ライン・セレクタ
３２はデジタル・サンプラ３１が発生する音を音制御装
置２９の制御信号に基づきデジタル・サウンド・プロセ
ッサ３３ａ〜３３ｃに割り振る。デジタル・サウンド・
プロセッサ３３ａ〜３３ｃは割り振られた入力音源に対
して体験者（聴者）２５の前面、側面、背面の左右の音
の音圧、遅延、周波数の変換を行う。デジタル・サウン
ド・プロセッサ３３ａ〜３３ｃの出力は増幅器３４ａ〜
３４ｆにより増幅され、デジタル・ミキシング・プロセ
ッサ３５ａ〜３５ｃに与えられる。デジタル・ミキシン
グ・プロセッサ３５ａ〜３５ｃは増幅器３４ａ〜３４ｆ
の出力とＤＡＴ３７の出力を音制御装置２９の制御信号
に基づきミキシングして増幅器３６ａ〜３６ｆを介して
図７に示す各スピーカ５０Ｌ，５０Ｒ，５１Ｌ，５１
Ｒ，５２Ｌ，５２Ｒに出力し、体験者（聴者）２５に立
体音を提供する。The digital sampler 31 outputs the programmed sound at the timing of the MIDI output from the sound control device 29 and gives it to the programmable line selector 32. The programmable line selector 32 allocates the sound generated by the digital sampler 31 to the digital sound processors 33a to 33c based on the control signal of the sound control device 29. Digital sound
The processors 33a to 33c perform sound pressure, delay, and frequency conversion of sounds on the front, side, and back of the experience (listener) 25 with respect to the allocated input sound source. The outputs of the digital sound processors 33a to 33c are amplifiers 34a to
It is amplified by 34f and given to the digital mixing processors 35a to 35c. The digital mixing processors 35a to 35c are amplifiers 34a to 34f.
Of the speaker 50L, 50R, 51L, 51 shown in FIG. 7 via the amplifiers 36a to 36f by mixing the output of the DAT and the output of the DAT 37 based on the control signal of the sound control device 29.
It outputs to R, 52L, 52R, and provides a stereoscopic sound to the experience person (listener) 25.

【００２３】以上のように、本実施例においては、体験
者（聴者）の頭部の動きを検出して仮想世界の中の体験
者（聴者）と物体の位置関係により、体験者（聴者）の
動きに合わせた仮想世界の立体画像と立体音を提供する
ので、体験者（聴者）は仮想世界を現実としてとらえる
ことができ、仮想世界の操作性を向上させることができ
る。As described above, in this embodiment, the experience (listener) is detected based on the positional relationship between the experience (listener) and the object in the virtual world by detecting the head movement of the experience (listener). Since the stereoscopic image and the stereoscopic sound of the virtual world that match the movement of the virtual world are provided, the experience person (listener) can perceive the virtual world as reality and improve the operability of the virtual world.

【００２４】[0024]

【発明の効果】以上説明したことから明らかなように、
本発明においては、体験者（聴者）の頭部の位置、方向
等を検出して、聴者の位置に関する情報、音再生装置の
配置、および、任意に設定する発生音の定位位置との関
係により発生音に効果を与えて再生しているので、人間
と計算機のインタラクションで音像の定位を行うことが
でき、現実感を高めて体験者（聴者）に音を認識させる
ことができる。特に、映像生成装置と組み合わせること
により、ＣＡＤ／ＣＡＭ、テレ・プレゼンス、テレ・イ
グジスタンス、教育、建築エンターテーメントなどの多
様な分野に応用することができ、その効果は極めて大き
い。As is apparent from the above description,
In the present invention, the position, direction, etc. of the head of the experiencer (listener) is detected, and the relationship between the position of the listener, the position of the sound reproduction device, and the localization position of the generated sound is arbitrarily set. Since the generated sound is reproduced by giving an effect, it is possible to localize the sound image by the interaction between the human and the computer, and it is possible to enhance the sense of reality and make the experience person (listener) recognize the sound. In particular, by combining with an image generation device, it can be applied to various fields such as CAD / CAM, tele presence, tele presence, education, and architectural entertainment, and the effect is extremely large.

[Brief description of drawings]

【図１】本発明の基本構成図である。FIG. 1 is a basic configuration diagram of the present invention.

【図２】本発明の第１の実施例を示す図である。FIG. 2 is a diagram showing a first embodiment of the present invention.

【図３】本発明の第２の実施例を示す図である。FIG. 3 is a diagram showing a second embodiment of the present invention.

【図４】本発明の第３の実施例を示す図である。FIG. 4 is a diagram showing a third embodiment of the present invention.

【図５】本発明の第４の実施例を示す図である。FIG. 5 is a diagram showing a fourth embodiment of the present invention.

【図６】本発明の第５の実施例を示す図である。FIG. 6 is a diagram showing a fifth embodiment of the present invention.

【図７】本発明の具体的実施例の再生装置の配置を示す
図である。FIG. 7 is a diagram showing an arrangement of a reproducing apparatus according to a specific embodiment of the present invention.

【図８】本発明の具体的実施例を示す図である。FIG. 8 is a diagram showing a specific example of the present invention.

【図９】従来例を示す図である。FIG. 9 is a diagram showing a conventional example.

[Explanation of symbols]

１，２１音発生装置１’，２１’ 集音装置２，２２音制御装置３，２３位置検出装置４ａ，４ｂ，４ｃ，４ｄ，２４ａ，２４ｂ，２４ｃ，２
４ｄ，５０Ｌ，５０Ｒ，５１Ｌ，５１Ｒ，５２Ｌ，５２
Ｒ音再生装置５，２５聴者６，２６管理計算機７，２７映像生成装置２９ａ，２９ｂ，２９ｃ，２９ｄ可動筺体３１デジタル・サンプラ３２プログラマブル・ライン・セレクタ３３ａ，３３ｂ，３３ｃデジタル・サウンド・プロセ
ッサ３４ａ，３４ｂ，３４ｃ，３４ｄ，３４ｅ，３４ｆ，３
６ａ，３６ｂ，３６ｄ，３６ｅ，３６ｆ増幅器３５ａ，３５ｂ，３５ｃデジタル・ミキシング・プロ
セッサ３８デジタル・オーディオ・テー
プ（ＤＡＴ）1, 21 Sound generating device 1 ', 21' Sound collecting device 2, 22 Sound control device 3, 23 Position detecting device 4a, 4b, 4c, 4d, 24a, 24b, 24c, 2
4d, 50L, 50R, 51L, 51R, 52L, 52
R sound reproduction device 5,25 listener 6,26 management computer 7,27 video generation device 29a, 29b, 29c, 29d movable housing 31 digital sampler 32 programmable line selector 33a, 33b, 33c digital sound processor 34a, 34b, 34c, 34d, 34e, 34f, 3
6a, 36b, 36d, 36e, 36f Amplifiers 35a, 35b, 35c Digital mixing processor 38 Digital audio tape (DAT)

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平１−279700（ＪＰ，Ａ) 特開平１−239674（ＪＰ，Ａ) 特開昭52−30402（ＪＰ，Ａ) 特開平４−192066（ＪＰ，Ａ) 実開平３−105099（ＪＰ，Ｕ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H04S 7/00 H04S 1/00 ─────────────────────────────────────────────────── ─── Continuation of the front page (56) Reference JP-A-1-279700 (JP, A) JP-A-1-239674 (JP, A) JP-A-52-30402 (JP, A) JP-A-4- 192066 (JP, A) Actual Kaihei 3-105099 (JP, U) (58) Fields investigated (Int.Cl. ⁷ , DB name) H04S 7/00 H04S 1/00

Claims

(57) [Claims]

1. A sound generation device for generating an arbitrary sound, a position detection device for detecting a listening point such as a position and an angle of a listener, a sound control device for generating a sound image, and a sound reproduction device for reproducing the sound image. , Occurs in a place different from the place where the management computer that provides the virtual sound world and the sound playback device are installed
Collect any sound, and send it to the sound control device together with the position of the sound.
A sound collecting device is provided, the management computer temporally manages the movement or change of any sound based on the output of the position detection device, and the sound control device relates to the position of the listener from the position detection device based on the output of the management computer. A sound image localization device characterized in that an effect is exerted on a generated sound or a collected sound depending on the relationship between the information, the arrangement of the sound reproduction device, and the localization position of the generated sound or the collected sound which is arbitrarily set.

2. The sound reproducing device is provided with a movable casing, and the movable casing moves the sound reproducing device in accordance with the movement of the listener so as to prevent deterioration of recognition of sound image localization. 1. Sound image localization device.

3. An image generating apparatus for generating an image synchronized with sound, wherein the image generating apparatus provides the listener with an image synchronized with the sound image output by the sound reproducing apparatus and the movement of the listener. The sound image localization apparatus according to claim 1 or 2 .