JP7463796B2

JP7463796B2 - DEVICE SYSTEM, SOUND QUALITY CONTROL METHOD AND SOUND QUALITY CONTROL PROGRAM

Info

Publication number: JP7463796B2
Application number: JP2020054233A
Authority: JP
Inventors: 幸生多田; 和也粂原; 光希有田
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2020-03-25
Filing date: 2020-03-25
Publication date: 2024-04-09
Anticipated expiration: 2040-03-25
Also published as: US20210301581A1; US11693621B2; JP2021158426A

Description

One embodiment of the present invention relates to an acoustic device and a sound quality control method that change the sound quality of a virtual sound source in response to movement of an obstruction and of the virtual sound source hidden by the obstruction.

AR systems have been proposed that let a user experience augmented reality (AR) by wearing an acoustic device such as headphones or earphones. Such an AR system emits, from the acoustic device, sound that depends on where the user is located. To localize a virtual sound source at a predetermined position, the AR system detects the user's current position and the orientation of the user's head. It then localizes the virtual sound source at the predetermined position by applying signal processing to the sound using a head-related transfer function that corresponds to the detected position and head orientation.

A head-related transfer function is the transfer function of sound from a sound source position to the ear canals of the user's two ears. Before sound generated at the sound source position reaches the user's ears, its frequency characteristics are changed by the shape of the head, the shape of the auricles, and so on, in a way that depends on the direction of the sound source. The head-related transfer function represents this change in frequency characteristics and is prepared for each sound source direction. The user judges the direction from which a sound arrives by distinguishing the frequency characteristics specific to each direction. Therefore, by processing sound with the head-related transfer function for a given direction and reproducing it, the AR system can give the user the sensation that the sound is arriving from that direction.

When there is a real or virtual obstruction between the user's position and the virtual sound source, it is preferable that the effect of the obstruction be reflected in the sound quality. For example, Patent Document 1 discloses a voice navigation system that navigates the user by presenting a route according to where the user is located. That document proposes that, when a guide voice for an object (destination) is reproduced and there is an obstruction between the object and the user's position, the sound quality is changed (attenuated). Note that the system of Patent Document 1 attenuates the navigation voice when there is an obstruction between the object and the user's position; it does not localize the navigation voice at the position of the object.

International Publication No. 2017/018298

Conventional AR systems have not controlled the sound quality of a virtual sound source by taking into account whether its localization position is hidden behind a real or virtual obstruction such as a door or a window. As a result, the realism of the augmented reality is reduced.

Accordingly, one object of an embodiment of the present invention is to make it possible to process how a virtual sound source is heard more realistically by taking into account the extent to which the virtual sound source is obstructed by a real-world obstruction such as a door or a window.

A device system according to one embodiment of the present invention includes a device, a sensor, and sound generating means. The device is worn by a user. The sensor detects movement of a movable obstruction. The sound processing unit generates sound with a first sound quality, which is the quality of sound from a virtual sound source localized on the opposite side of the obstruction when it is obstructed by the obstruction, and emits the sound from the device. When the sensor detects that the obstruction has moved from the position in which it obstructs the virtual sound source, the sound processing unit changes the sound quality from the first sound quality to a second sound quality, which is the quality when the sound is not obstructed.

A sound quality control method according to one embodiment of the present invention is characterized in that a device system, which includes an acoustic device worn by a user and a sensor that detects movement of a movable obstruction, generates sound with a first sound quality, which is the quality of sound from a virtual sound source localized on the opposite side of the obstruction when it is obstructed by the obstruction, emits the sound from the acoustic device, and, when the sensor detects that the obstruction has moved, changes the sound quality from the first sound quality to a second sound quality, which is the quality when the sound is not obstructed by the obstruction.

A sound quality control program according to one embodiment of the present invention is characterized in that it causes a control unit of a mobile terminal device, which communicates with an acoustic device worn by a user and with a sensor that detects movement of a movable obstruction, to function as sound control means. The sound control means generates sound with a first sound quality, which is the quality of sound from a virtual sound source localized on the opposite side of the obstruction when it is obstructed by the obstruction, emits the sound from the acoustic device, and, when the sensor detects that the obstruction has moved, changes the sound quality from the first sound quality to a second sound quality, which is the quality when the sound is not obstructed by the obstruction.

According to one embodiment of the present invention, how a virtual sound source is heard can be processed more realistically by taking into account the extent to which the virtual sound source is obstructed by an obstruction.

FIG. 1 is a diagram showing the configuration of an audio reproduction system according to an embodiment of the present invention.
FIG. 2 is a block diagram of a mobile terminal device of the audio reproduction system.
FIG. 3 is a block diagram of headphones of the audio reproduction system.
FIG. 4 is a block diagram of a door sensor of the audio reproduction system.
FIG. 5 is a plan view of a building in which the audio reproduction system is used.
FIG. 6 is a diagram explaining indirect sound in the audio reproduction system.
FIG. 7 is a diagram explaining direct sound in the audio reproduction system.
FIG. 8 is a diagram explaining transmitted sound in the audio reproduction system.
FIG. 9 is a diagram showing the configuration of a signal processing unit of the mobile terminal device.
FIG. 10 is a flowchart showing a processing operation of the mobile terminal device.
FIG. 11 is a flowchart showing a processing operation of the mobile terminal device.
FIG. 12 is a flowchart showing a processing operation of the mobile terminal device.
FIG. 13 is a plan view of the building for the case in which the sound source moves in the audio reproduction system.

FIG. 1 is a diagram showing the configuration of an audio reproduction system 1 to which the present invention is applied. FIG. 2 is a block diagram of a mobile terminal device 10 of the audio reproduction system 1. The audio reproduction system 1 includes the mobile terminal device 10, headphones 20 serving as an acoustic device, and a door sensor 30. FIG. 1 shows an example in which a user L holds the mobile terminal device 10 in one hand and wears the headphones 20. Equipped in this way, the user L enters a room 201 shown in FIG. 5. When the user L enters the room 201, the mobile terminal device 10 reproduces sound localized in a room 202 based on a scenario file 72 (also simply referred to as the scenario 72). The headphones 20 correspond to the acoustic device of the present invention.

The mobile terminal device 10 is, for example, a smartphone (multi-function mobile phone). The mobile terminal device 10 and the headphones 20 are connected by Bluetooth (registered trademark) and can communicate with each other. The connection between the mobile terminal device 10 and the headphones 20 is not limited to Bluetooth and may use another wireless communication standard or a wired connection. The headphones 20 are of the so-called ear-hook type, combining two speakers 21R and 21L with a headband 22. The headphones 20 have a posture sensor 23 on the headband 22 and can track the orientation of the head of the user L. The posture sensor 23 may be any of a three-axis gyro (angular velocity) sensor, a six-axis sensor (three-axis gyro sensor plus three-axis motion (acceleration) sensor), or a nine-axis sensor (three-axis gyro sensor plus three-axis motion sensor plus three-axis compass (direction) sensor). Earphones may be used as the acoustic device instead of the headphones 20.

The door sensor 30 detects the open/closed state of a door 502 provided in a fixed wall 501 (see FIG. 5) and transmits information on that state to the mobile terminal device 10. The door sensor 30 and the mobile terminal device 10 are connected by Bluetooth Low Energy (BLE). BLE and conventional Bluetooth can be used together on a single device. In this embodiment, as described above, conventional Bluetooth is used for the connection between the mobile terminal device 10 and the headphones 20. The connection between the door sensor 30 and the mobile terminal device 10 is not limited to BLE. For example, the door sensor 30 and the mobile terminal device 10 may be connected via the Internet, for example over Wi-Fi (registered trademark) or a mobile communication network.

FIG. 2 is a block diagram of the mobile terminal device 10. In terms of hardware, the mobile terminal device 10 is a smartphone that includes a control unit 100, a storage unit 103, a sound generation unit 105, a signal processing unit 106, a communication processing unit 107, and so on. The control unit 100 includes a CPU. The storage unit 103 includes a ROM, a RAM, and a flash memory.

The mobile terminal device 10, the headphones 20, and the door sensor 30 function as the audio reproduction system 1 when the mobile terminal device 10 starts a program 70 stored in the storage unit 103.

The storage unit 103 stores the program 70 described above, as well as the scenario file 72 and audio data 73. The program 70 is an application program that causes the mobile terminal device 10, the headphones 20, and the door sensor 30 to function as the audio reproduction system 1. The scenario file 72 is a file that describes the procedure for sequentially reproducing predetermined sounds for the user L. The scenario file 72 includes a layout table 72A that describes the place where the sound is reproduced, for example the shape of the building 200 shown in FIG. 5 and the arrangement of the wall 500. The audio data 73 is the data of the sound reproduced according to the scenario file 72. The audio data 73 may be an audio signal such as PCM or MP4, or may be sound synthesis data that uses the sound generation unit 105 as a synthesizer. When the audio data 73 is an audio signal, the control unit 100 may act as the sound generation unit 105 and read out the audio data 73.

Filter coefficients 71 include head-related transfer functions and impulse responses at predetermined positions in the room 201. These filter coefficients are used to localize the virtual sound source SP1 shown in FIG. 5 and elsewhere at a predetermined position relative to the user L. The filter coefficients 71 are used in the signal processing unit 106. In the following description, SP1 is also referred to as the virtual sound source position.

The control unit 100 controls the operation of the mobile terminal device 10. When the program 70 is started, the control unit 100 also functions as a position determination unit 101 and a head direction determination unit 102. The position determination unit 101 determines the current position of the mobile terminal device 10. In the audio reproduction system 1, the position of the mobile terminal device 10 is used as the position of the user L. Outdoors, the position determination unit 101 may use a satellite positioning system such as GPS or Michibiki. Indoors, it may use a position measurement system such as beacons installed in the building. Even for indoor use, the position determination unit 101 may first determine an accurate position outdoors using a satellite positioning system (calibration) and then determine the indoor position by tracing the movement of the user L with the posture sensor 23. When the movement of the user L is traced in this way, the posture sensor 23 is preferably a six-axis or nine-axis sensor, which can detect the motion of the user L.

The head direction determination unit 102 determines the orientation of the head of the user L based on the detection values of the posture sensor 23 acquired from the headphones 20. When the posture sensor 23 is a three-axis gyro sensor, the control unit 100 first has the user L face a predetermined direction and takes that as the initial head direction (calibration). The control unit 100 then acquires angular velocity information from the posture sensor 23 and determines the current orientation of the head of the user L by accumulating this angular velocity information onto the head direction. When the posture sensor 23 includes both a gyro sensor and a compass sensor, the fast-responding gyro sensor can be used to follow changes in the head direction of the user L while the slow-responding compass sensor is used to cancel the integration error of the gyro sensor.
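
The gyro-plus-compass scheme described above can be sketched as a simple complementary filter. This is only an illustration under stated assumptions: the function names, the single-axis (yaw-only) model, and the blending weight `alpha` are hypothetical and do not come from the patent, which does not specify how the compass correction is computed.

```python
import math


def wrap_angle(angle):
    """Wrap an angle in radians into the range [-pi, pi)."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi


def update_heading(heading, gyro_rate, dt, compass=None, alpha=0.98):
    """Advance a yaw estimate by one sensor sample (hypothetical helper).

    heading   -- previous head yaw in radians (set once by calibration)
    gyro_rate -- angular velocity about the vertical axis in rad/s
    dt        -- time since the previous sample in seconds
    compass   -- absolute yaw from a compass sensor, or None if unavailable
    alpha     -- assumed weight given to the gyro prediction (0..1)
    """
    # Fast path: integrate the angular velocity onto the previous heading.
    predicted = wrap_angle(heading + gyro_rate * dt)
    if compass is None:
        return predicted
    # Slow path: pull the prediction slightly toward the compass reading,
    # which gradually cancels the accumulated integration error.
    error = wrap_angle(compass - predicted)
    return wrap_angle(predicted + (1.0 - alpha) * error)
```

With a three-axis gyro sensor only, `compass` is left as `None` and the estimate is pure integration from the calibrated starting direction; with a nine-axis sensor, each compass reading nudges the estimate back toward the absolute direction.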

The sound generation unit 105 reproduces the audio data 73. The audio data 73 may be an audio signal such as PCM or MP4, or may be sound synthesis data that uses the sound generation unit 105 as a synthesizer. The signal processing unit 106 controls the sound quality according to the positions of the virtual sound source SP1 (see FIG. 5) and the user L. The signal processing unit 106 also determines the localization of the sound according to the position and orientation of the user L acquired from the position determination unit 101 and the head direction determination unit 102, reads out the head-related transfer function for the localization direction, and filters the sound. The sound generation unit 105 and the signal processing unit 106 correspond to the sound processing unit of the present invention.

Based on the scenario file 72, the audio reproduction system 1 localizes sound, for the user L in the room 201, in the adjacent room 202 separated by the wall 500 (see FIGS. 5 to 8). The wall 500 is provided with the door 502. When the door 502 is closed, the audio reproduction system 1 reproduces the sound with a quality as if it were heard from the room beyond the wall 500. When the door 502 is open, the audio reproduction system 1 reproduces the sound with a quality as if the sound reverberating in the room 202 were heard through the door frame 503. The mobile terminal device 10 (communication processing unit 107) transmits the processed sound to the headphones 20. The headphones 20 output the received sound from the speakers 21R and 21L. In this way, the user L hears the sound as if it were coming from the localization position determined in advance by the scenario file 72.

The communication processing unit 107 communicates with the headphones 20 and the door sensor 30, which are Bluetooth-compatible devices. It communicates with a communication processing unit 24 of the headphones 20 via Bluetooth and with a communication processing unit 32 of the door sensor 30 via BLE. The communication processing unit 107 transmits audio signals to the headphones 20 and receives the detection values of the posture sensor 23 from the headphones 20. It also receives information on the open/closed state of the door 502 from the door sensor 30.

FIG. 3 is a block diagram showing the configuration of the headphones 20. The headphones 20 include the speakers 21L and 21R, the posture sensor 23, the communication processing unit 24, an AIF 25, DACs 26L and 26R, and amplifiers 27L and 27R.

The communication processing unit 24 communicates with the mobile terminal device 10 (communication processing unit 107) via Bluetooth or BLE. The AIF (Audio Interface) 25 separates the audio signal received from the mobile terminal device 10 into left and right channel signals and sends them to the DACs 26L and 26R. The DACs (Digital to Analog Converters) 26L and 26R convert the digital signals input from the AIF 25 into analog signals. The amplifiers 27L and 27R amplify the analog signals input from the DACs 26L and 26R and supply them to the speakers 21L and 21R. As a result, the audio signal received from the mobile terminal device 10 is emitted as sound from the speakers 21L and 21R. Because the headphones 20 are worn on the head of the user L, the sound emitted from the speakers 21L and 21R is heard by the left and right ears of the user L.

FIG. 4 is a block diagram of the door sensor 30. As shown in FIG. 5, the door sensor 30 is attached near the hinge of the door 502 and detects and outputs information on the open/closed state of the door 502 relative to the fixed wall 501. This information is expressed as an angle relative to the fixed wall 501. The door sensor 30 includes a sensor module 31 and the communication processing unit 32. The sensor module 31 detects the open/closed state of the door 502 and is composed of, for example, a rotary encoder, a semiconductor sensor, or a photoelectric sensor. A rotary encoder rotates coaxially with the hinge of the door 502 and detects the rotation angle or absolute angle of the door 502. A semiconductor sensor detects the angular velocity produced by opening and closing the door 502 and calculates the angle of the door 502 by integrating this angular velocity. A photoelectric sensor has a light emitting unit and a light receiving unit on the door 502 and the fixed wall 501, respectively, and detects the angle of the door 502 from the change in the position at which light from the light emitting unit reaches the light receiving unit. The sensor module 31 is not limited to these examples. For example, the sensor module 31 may be a potentiometer. If the sensor module 31 only needs to detect whether the door 502 is completely closed or open even slightly, it may be a limit switch. The communication processing unit 32 transmits the information on the open/closed state of the door 502 detected by the sensor module 31 to the mobile terminal device 10.

FIG. 5 is a plan view of a building 200 into which the user L is led and in which a scenario is executed by the audio reproduction system of the present invention. The building 200 has the room 201 and the room 202, which are partitioned by the wall 500. The wall 500 has the fixed wall 501 and the door 502 provided in part of the fixed wall 501. The door 502 is attached to a door frame 503 formed in the fixed wall 501 and swings freely toward the room 201. The wall 500 corresponds to the obstruction of the present invention.

The building 200 and positions inside it are specified by XY coordinates. As shown at the lower left of FIG. 5, the XY coordinates are defined by an X axis set in the left-right direction of the figure and a Y axis set in the up-down direction. The shapes of the rooms 201 and 202, the position of the wall 500, and the position of the door 502 are all expressed in XY coordinates and stored in the layout table 72A. The audio reproduction system 1 of this embodiment performs sound image localization in two dimensions; that is, it treats the position of the sound source, the positions of the ears of the user L, and so on as all being at the same height. To perform sound image localization in three dimensions including height, a Z axis for the height direction can be set perpendicular to the plane of the drawing.

The door 502 is provided with the door sensor 30 for detecting the open/closed state of the door 502. In FIG. 5, the door 502 is open toward the room 201. The door 502 may be opened manually by the user L or automatically by an actuator or the like (not shown).

The user L listens to the sound reproduced by the audio reproduction system 1 while moving around the room 201. The audio reproduction system 1 consults the scenario file 72 using the position of the user L, the time, and so on, and reproduces the sound specified by the scenario file 72. In the scene shown in FIG. 5, the audio reproduction system 1 reproduces a piano performance sound localized at the virtual sound source position SP1 in the room 202. In FIG. 5 an actual piano 300 is placed at the virtual sound source position SP1, but the actual piano 300 is not essential.

In FIG. 5, when the user L hears the piano performance sound from the headphones 20, the user L moves to a position LP1 near the door 502 and opens the door 502 (FIG. 5 shows the door 502 open). The user L thereby recognizes that the piano 300 is sounding in the room 202, but cannot see the piano 300 (virtual sound source position SP1) directly from the position LP1 because the fixed wall 501 is in the way. After opening the door 502, the user L moves to a position LP2 to look for the place where the sound is coming from. Having moved to the position LP2, the user L finds the piano 300 (virtual sound source position SP1) by looking into the room 202 through the door frame 503.

When the user L acts as described above, the audio reproduction system 1 controls the sound quality of the sound (piano performance sound) as follows. While the door 502 is closed, the audio reproduction system 1 reproduces the piano performance sound for the user L with a quality as if the piano 300 were being played on the other side of the wall 500 and the sound were leaking through the wall 500 (door 502). When the door 502 is opened, the audio reproduction system 1 reproduces the piano performance sound with a quality as if the sound reverberating in the room 202 were heard through the door frame 503. At the position LP1, however, the user L cannot hear the direct sound of the piano performance sound localized at the virtual sound source position SP1. The user L then moves to the position LP2, from which the piano 300 is visible. When the user L has moved to the position LP2, the audio reproduction system 1 reproduces the direct sound of the piano performance sound for the user L.
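
The three situations described above (door closed, door open without a view of the source, door open with a view of the source) can be summarized as a small selection function. This is a simplified sketch, not the patent's implementation: the names `SoundQuality` and `select_sound_quality` and the opening threshold are hypothetical, and the sketch picks exactly one mode even though an actual system might mix several components.

```python
from enum import Enum


class SoundQuality(Enum):
    TRANSMITTED = "transmitted"  # sound leaking through the closed wall/door
    INDIRECT = "indirect"        # reverberant sound heard via the door frame
    DIRECT = "direct"            # sound heard straight from the virtual source


def select_sound_quality(door_angle_deg, source_is_visible, open_threshold_deg=1.0):
    """Pick the rendering mode from the door angle and a visibility flag.

    door_angle_deg     -- door angle relative to the fixed wall (0 = closed)
    source_is_visible  -- True if the listener has a clear view of the source
    open_threshold_deg -- assumed angle below which the door counts as closed
    """
    if door_angle_deg < open_threshold_deg:
        return SoundQuality.TRANSMITTED
    if source_is_visible:
        return SoundQuality.DIRECT
    return SoundQuality.INDIRECT
```

In the walkthrough above, the closed door gives `TRANSMITTED`, position LP1 with the door open gives `INDIRECT`, and position LP2 gives `DIRECT`.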

With reference to FIGS. 6 to 8, the following describes how the sound generated at the virtual sound source position SP1 propagates when the door 502 is closed, when the door 502 is open and the user L is at the position LP1, and when the user L has moved to the position LP2. In the following description, the sound generated at the virtual sound source position SP1 is called sound S(SP1). In the labels of FIGS. 5 to 9 and FIG. 13, the parentheses "( )" in notations such as sound S(SP1) are replaced with "-".

FIG. 6 is a diagram explaining the sound (indirect sound) that reaches the user L through the door frame 503 when the door 502 is open. The sound S(SP1) propagates throughout the room 202 and, by reflecting off the walls and so on, forms reverberation in the room 202. This reverberant sound is also present at a position SP2 in the door frame 503. The sound heard at the position SP2 is hereinafter called sound S(SP2). The propagation of sound from the virtual sound source position SP1 to the position SP2 is represented by an impulse response measured with a sound source placed at the virtual sound source position SP1 and a microphone placed at the position SP2. This impulse response is hereinafter called impulse response IR(1-2). The impulse response IR(1-2) represents the response waveform obtained when the piano performance sound S(SP1) reverberating in the room 202 is heard at the position SP2. The sound S(SP2) is obtained by filtering the sound S(SP1) with an FIR filter that convolves it with the impulse response IR(1-2).
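
The FIR filtering step that turns S(SP1) into S(SP2) is a discrete convolution of the source signal with the measured impulse response. The following is a minimal pure-Python sketch; the function name `fir_convolve` and the toy sample values are illustrative assumptions, and a real implementation would more likely use an optimized or FFT-based routine.

```python
def fir_convolve(signal, impulse_response):
    """Convolve a sample sequence with an impulse response (FIR filtering).

    Returns a list of length len(signal) + len(impulse_response) - 1,
    or an empty list if either input is empty.
    """
    if not signal or not impulse_response:
        return []
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for n, x in enumerate(signal):
        for k, h in enumerate(impulse_response):
            # Each input sample excites a scaled, delayed copy of the response.
            out[n + k] += x * h
    return out


# Toy example: a two-sample "source" and a two-tap "room response".
s_sp1 = [1.0, 2.0]   # stands in for S(SP1)
ir_1_2 = [1.0, 0.5]  # stands in for IR(1-2)
s_sp2 = fir_convolve(s_sp1, ir_1_2)  # [1.0, 2.5, 1.0]
```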

When the door 502 is open, the sound S(SP2) that has reached the door frame 503 propagates into the room 201 and reaches the user L. The propagation of sound from position SP2 of the door frame 503 to both ears of the user L is represented by a head-related transfer function corresponding to the position of the user L and the orientation of the user L's head. Hereinafter, this head-related transfer function is referred to as head-related transfer function HRTF(2-L). The indirect sound S(L open), which is the sound that the user L hears from the open door 502 (door frame 503), is obtained by processing the sound S(SP2) with the head-related transfer function HRTF(2-L). Specifically, the indirect sound S(L open) is obtained by filtering the sound S(SP2) with an FIR filter that convolves it with the head impulse response, which is the head-related transfer function HRTF(2-L) converted into a time-domain coefficient sequence. To simplify the processing, reverberation (the impulse response) in the room 201 is not taken into consideration.

FIG. 7 is a diagram explaining the sound (direct sound) that reaches the user L directly from the sound source. When the user L is at position LP2, the virtual sound source position SP1 is directly visible, so the direct sound of the sound S(SP1) is heard. The propagation of the direct sound from the virtual sound source position SP1 is represented by a head-related transfer function corresponding to the position of the user L and the orientation of the user L's head. Hereinafter, this head-related transfer function is referred to as head-related transfer function HRTF(1-L). The direct sound S(L direct) is obtained by processing the sound S(SP1) with the head-related transfer function HRTF(1-L). Specifically, the direct sound S(L direct) is obtained by filtering the sound S(SP1) with an FIR filter that convolves it with the head impulse response, which is the head-related transfer function HRTF(1-L) converted into a time-domain coefficient sequence.
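Because a head-related transfer function is prepared for each sound source direction, choosing one according to the user's position and head orientation amounts to computing the direction of the source relative to the head and looking up the nearest prepared direction. The following sketch illustrates this under stated assumptions: a two-dimensional plan view, a yaw angle measured counterclockwise from the +x axis, positive azimuth meaning "to the left", and a hypothetical table keyed by azimuth in degrees. None of these conventions or names come from the embodiment.

```python
import math
from typing import Mapping, Tuple

Point = Tuple[float, float]


def relative_azimuth_deg(listener: Point, head_yaw_deg: float, source: Point) -> float:
    """Direction of `source` relative to the facing direction, in (-180, 180]."""
    bearing = math.degrees(math.atan2(source[1] - listener[1], source[0] - listener[0]))
    rel = (bearing - head_yaw_deg) % 360.0
    return rel - 360.0 if rel > 180.0 else rel


def nearest_direction(table: Mapping[int, object], azimuth_deg: float) -> int:
    """Return the table key (degrees) closest to `azimuth_deg` on the circle."""
    def angular_gap(key: int) -> float:
        d = abs(key - azimuth_deg) % 360.0
        return min(d, 360.0 - d)
    return min(table, key=angular_gap)


# Hypothetical table: azimuth in degrees -> identifier of a stored head impulse response.
hrir_table = {-90: "hrir_right", 0: "hrir_front", 90: "hrir_left", 180: "hrir_back"}
azimuth = relative_azimuth_deg((0.0, 0.0), 90.0, (2.0, 0.0))  # source to the right
selected = hrir_table[nearest_direction(hrir_table, azimuth)]
```

A real table would be far denser and would usually also cover elevation; interpolating between neighboring directions is a common alternative to nearest-neighbor selection.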

FIG. 8 is a diagram explaining the sound (transmitted sound) that reaches the user L through the closed door 502. In this embodiment, the fixed wall 501 is assumed not to transmit sound at all. When the door 502 is closed, the user L in the room 201 hears the sound that passes from the room 202 through the door 502. As described above, the sound that reaches the door 502 is the sound S(SP2). The transmitted sound S(L door) is the sound S(SP2) that has reached the door 502, passed through the closed door 502 to the door surface SP20 (on the room 201 side), and propagated from the door surface SP20 to the user L. Therefore, the propagation of the sound S(L door) is represented by the following three responses: the impulse response from the virtual sound source position SP1 to the door 502 (SP2), the impulse response from the door 502 (SP2) to the door surface SP20, and the head-related transfer function HRTF(20-L) from the door surface SP20 to the user L. To simplify the processing, reverberation (the impulse response) in the room 201 is not taken into consideration.

The head-related transfer function HRTF(20-L) is considered to be approximately the same as the head-related transfer function HRTF(2-L) of FIG. 6. Therefore, the head-related transfer function HRTF(2-L) may be used as the head-related transfer function HRTF(20-L).

The impulse response from the door 502 (SP2) to the door surface SP20 is the sound insulation characteristic of the door 502. Hereinafter, the impulse response of the sound insulation characteristic of the door 502 is referred to as impulse response IR(door).

The transmitted sound S(L door), which is the sound that passes through the closed door 502 and reaches the user L, is obtained by processing the sound S(SP20) at the door surface SP20 with the head-related transfer function HRTF(20-L). Specifically, the transmitted sound S(L door) is obtained by filtering the sound S(SP20) with an FIR filter that convolves it with the head impulse response, which is the head-related transfer function HRTF(20-L) converted into a time-domain coefficient sequence.

As shown in FIG. 8, when the door 502 is closed, the audio playback system 1 plays back only the transmitted sound S(L door) to the user L.

As shown in FIG. 6, when the door 502 is open but the user L is at a position from which the piano 300 is not visible (for example, position LP1), the audio playback system 1 plays back to the user L the indirect sound S(L open) heard from the door frame 503.

As shown in FIG. 7, when the door 502 is open and the user L is at a position from which the piano 300 is visible (for example, position LP2), the audio playback system 1 plays back to the user L both the direct sound S(L direct) from the virtual sound source position SP1 (piano 300) and the indirect sound S(L open). This is because the indirect sound S(L open) still reaches the user L even when the user L is at a position from which the piano 300 is visible.

The gain of the direct sound S(L direct) may be changed depending on whether the user L is at a position from which the piano 300 (virtual sound source position SP1) is completely visible or only partially visible. In this case, the sound quality may also be adjusted, for example by slightly attenuating the high-frequency range in the frequency domain.

FIG. 9 is a functional block diagram of the signal processing unit 106. The signal processing unit 106 is implemented by, for example, a DSP (digital signal processor), and a program configures the various functional units that process the sound generated by the sound generation unit 105. As described above, the sound generation unit 105 generates sound such as a piano performance. The signal processing unit 106 processes this sound to generate the transmitted sound S(L door), the indirect sound S(L open), and the direct sound S(L direct). The illustrated filters 64-69 are all FIR filters. Although FIG. 9 shows the signal flow with single lines, the sound signals being processed are two-channel (left and right) signals.

The serially connected filters 64-66 generate the transmitted sound S(L door) for the case where the door 502 is closed. The impulse response IR(1-2) from the virtual sound source position SP1 to position SP2 of the door 502 is set in the filter 64. The filter 64 filters the sound S(SP1) to generate the sound S(SP2). The impulse response IR(door), which is the sound insulation characteristic of the door 502, is set in the filter 65. The filter 65 filters the sound S(SP2) to generate the sound S(SP20). The head-related transfer function (head impulse response) HRTF(20-L) from position SP20 on the room 201 side of the door 502, corresponding to the position and head orientation of the user L, is set in the filter 66. The filter 66 filters the sound S(SP20) to generate the transmitted sound S(L door). In this embodiment, three filters 64-66 are provided to apply the impulse response IR(1-2), the impulse response (sound insulation characteristic) IR(door), and the head-related transfer function HRTF(20-L). However, the transmitted sound S(L door) may instead be generated by a single filter whose coefficients combine these three.
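The equivalence between the three cascaded filters and one combined filter follows from the associativity of convolution: convolving the three coefficient sequences with one another yields a single sequence that produces the same output. The sketch below illustrates this for one ear; the function names and all coefficient values are hypothetical toy data, not values from the embodiment.

```python
from functools import reduce
from typing import List, Sequence


def convolve(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Full linear convolution of two coefficient sequences."""
    if not a or not b:
        return []
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def combine_filters(*responses: Sequence[float]) -> List[float]:
    """Merge cascaded FIR responses into one equivalent coefficient sequence."""
    return list(reduce(convolve, responses))


# Hypothetical toy coefficients for one ear (assumptions).
ir_1_2 = [0.6, 0.0, 0.3]    # stands in for IR(1-2), SP1 -> SP2
ir_door = [0.2, 0.1]        # stands in for IR(door), SP2 -> SP20
hrir_20_l = [0.9, 0.05]     # stands in for HRTF(20-L) as a head impulse response
combined = combine_filters(ir_1_2, ir_door, hrir_20_l)
```

Since the head impulse response differs between the left and right ears, the combination would be computed once per channel. The combined sequence has to be recomputed whenever the head-related transfer function changes, which is one reason to keep the filters separate.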

The serially connected filters 67 and 68 generate the indirect sound S(L open) for the case where the door 502 is open. The impulse response IR(1-2) from the virtual sound source position SP1 to position SP2 of the door 502 is set in the filter 67. The filter 67 filters the sound S(SP1) to generate the sound S(SP2). The head-related transfer function (head impulse response) HRTF(2-L) from position SP2 of the door 502, corresponding to the position and head orientation of the user L, is set in the filter 68. The filter 68 filters the sound S(SP2) to generate the indirect sound S(L open). In this embodiment, two filters 67 and 68 are provided for the impulse response IR(1-2) and the head-related transfer function HRTF(2-L). However, the indirect sound S(L open) may instead be generated by a single filter whose coefficients combine these two.

The filter 69 generates the direct sound S(L direct). The head-related transfer function (head impulse response) HRTF(1-L) from the virtual sound source position SP1, corresponding to the position of the user L and the orientation of the user L's head, is set in the filter 69. The filter 69 filters the sound S(SP1) to generate the direct sound S(L direct).

The gain adjustment units 61-63 respectively switch on or off, and adjust the gain of, the generated transmitted sound S(L door), indirect sound S(L open), and direct sound S(L direct). Because the impulse responses and head-related transfer functions already include a volume-control component, the signal processing unit 106 normally does not need to adjust the gain after generating the transmitted sound S(L door) and the other sounds. The gain adjustment units 61-63 are used, for example, when adjusting the gain of the indirect sound S(L open) according to the opening angle of the door 502, or when cross-fading the transmitted sound S(L door) and the indirect sound S(L open).

The adder 80 adds the transmitted sound S(L door), the indirect sound S(L open), and the direct sound S(L direct), whose gains have been adjusted by the gain adjustment units 61-63, to generate the sound S(L) to be output to the headphones 20. The signal processing unit 106 inputs the sound S(L) to the communication processing unit 107. The communication processing unit 107 transmits the sound S(L) to the headphones 20. The signal processing unit 106 can also combine the filter coefficients and gain values of all the filters 64-69 and all the gain adjustment units 61-63 into a single set of filter coefficients, and generate the sound S(L) with one FIR filter using those coefficients.
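The roles of the gain adjustment units 61-63 and the adder 80 can be sketched as a weighted sum of the three generated sounds. The following is an illustrative sketch for one channel only; the function name `mix` and the sample values are hypothetical, and treating a gain of exactly 0.0 as "off" is an assumption made for the example.

```python
from typing import List, Sequence


def mix(paths: Sequence[Sequence[float]], gains: Sequence[float]) -> List[float]:
    """Scale each path by its gain and sum them sample by sample."""
    if len(paths) != len(gains):
        raise ValueError("one gain is required per path")
    length = max((len(p) for p in paths), default=0)
    out = [0.0] * length          # shorter paths are implicitly zero-padded
    for path, gain in zip(paths, gains):
        if gain == 0.0:           # path switched off
            continue
        for i, value in enumerate(path):
            out[i] += gain * value
    return out


# Hypothetical toy signals standing in for S(L door), S(L open), S(L direct).
s_door, s_open, s_direct = [1.0, 2.0], [10.0, 20.0, 30.0], [100.0]
s_l = mix([s_door, s_open, s_direct], [1.0, 0.5, 0.0])
```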

FIG. 10 is a flowchart showing the sound signal processing operation of the mobile terminal device 10. This process is executed while sound is being generated based on the scenario file 72. The process is executed periodically, for example every 20 milliseconds, by the control unit 100, the signal processing unit 106, and other units of the mobile terminal device 10.

The control unit 100 acquires the current position and head orientation of the user L (steps S11, S12). Hereinafter, in this flowchart, step Sn (where n is an arbitrary number) is referred to simply as Sn. The control unit 100 evaluates the signal received from the door sensor 30 to determine whether the door 502 is open (S13). If the door 502 is closed (NO in S13), the user L hears only the transmitted sound, so the control unit 100 instructs the signal processing unit 106 shown in FIG. 9 to generate the transmitted sound S(L door) (S14). The transmitted sound S(L door) output from the signal processing unit 106 is output to the communication processing unit 107 (S15).

If the door 502 is open (YES in S13), the control unit 100 determines whether the user L is at a position from which the virtual sound source position SP1 (piano 300) can be directly viewed (S21). If so (YES in S21), the control unit 100 advances the process to S25. If not (NO in S21), the control unit 100 advances the process to S22.

If the user L is at a position from which the virtual sound source position SP1 cannot be directly viewed (NO in S21), the control unit 100 instructs the signal processing unit 106 to generate the indirect sound S(L open) (S22). At this time, the control unit 100 selects one head-related transfer function based on the position and head orientation of the user L and sets it in the filter 68. The indirect sound S(L open) output from the signal processing unit 106 is output to the communication processing unit 107 (S15).

If the door 502 is open and the user L is at a position from which the virtual sound source position SP1 can be directly viewed (YES in S21), the control unit 100 instructs the signal processing unit 106 to generate the direct sound S(L direct) and the indirect sound S(L open) (S25, S26). At this time, the control unit 100 sets one head-related transfer function in each of the filters 68 and 69 based on the position and head orientation of the user L. The signal processing unit 106 adds the generated direct sound S(L direct) and indirect sound S(L open) to generate the sound S(L), and outputs it to the communication processing unit 107 (S27).
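The branching of FIG. 10 described above reduces to a choice of which sounds to generate from two conditions: whether the door is open and whether the virtual sound source is directly visible. The sketch below restates that decision; the function name and the string labels are hypothetical, and the acquisition of position, head orientation, and sensor data (S11 to S13) is assumed to have happened elsewhere.

```python
from typing import FrozenSet

TRANSMITTED = "door"    # transmitted sound S(L door)
INDIRECT = "open"       # indirect sound S(L open)
DIRECT = "direct"       # direct sound S(L direct)


def select_paths(door_open: bool, source_visible: bool) -> FrozenSet[str]:
    """Return the set of sounds to generate for the current state."""
    if not door_open:                          # S13: NO
        return frozenset({TRANSMITTED})        # S14
    if not source_visible:                     # S21: NO
        return frozenset({INDIRECT})           # S22
    return frozenset({DIRECT, INDIRECT})       # S25, S26


# Example: door open, piano not in view -> only the indirect sound is generated.
paths = select_paths(door_open=True, source_visible=False)
```

In the embodiment this decision is re-evaluated periodically (for example, every 20 milliseconds), so the selected set follows the door and the user as they move.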

In step S13 of FIG. 10, the control unit 100 determines whether the door 502 is open or closed. When the door 502 is open, the control unit 100 may also determine the angle to which it is open and adjust the gain of the indirect sound S(L open) according to that angle. Furthermore, the control unit 100 may adjust the sound quality of the indirect sound S(L open) according to the opening angle of the door 502.

FIG. 11 is a flowchart showing the process of adjusting the gain of the indirect sound S(L open) according to the opening angle of the door 502. The control unit 100 acquires the opening angle of the door 502 from the door sensor 30 (S31). Based on the acquired opening angle, the control unit 100 sets a corresponding gain in the gain adjustment unit 62 (S32). At this time, the control unit 100 may perform a cross-fade, increasing the gain of the indirect sound S(L open) and decreasing the gain of the transmitted sound S(L door) as the opening angle of the door 502 increases.
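One possible mapping from opening angle to the two gains is a linear cross-fade, sketched below. The linear curve, the 90-degree "fully open" angle, and the function name are all assumptions chosen for illustration; the embodiment only requires that the indirect gain rise and the transmitted gain fall as the angle increases.

```python
from typing import Tuple


def crossfade_gains(angle_deg: float, full_open_deg: float = 90.0) -> Tuple[float, float]:
    """Return (transmitted_gain, indirect_gain) for a door opening angle."""
    if full_open_deg <= 0.0:
        raise ValueError("full_open_deg must be positive")
    # Clamp the normalized opening to the range [0, 1].
    t = min(max(angle_deg / full_open_deg, 0.0), 1.0)
    return 1.0 - t, t


# Example: a door opened halfway gives equal weight to both sounds.
door_gain, open_gain = crossfade_gains(45.0)
```

An equal-power curve (for example, cosine and sine of the normalized angle) is a common alternative when the two sounds are largely uncorrelated.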

In step S21 of FIG. 10, the control unit 100 determines whether the user L is at a position from which the virtual sound source SP1 can be directly viewed. The control unit 100 may give the virtual sound source SP1 a predetermined size, such as that of the piano 300, and adjust the gain of the direct sound S(L direct) according to how much of the virtual sound source SP1 the user L can directly view. Furthermore, the control unit 100 may adjust the sound quality of the direct sound S(L direct) according to how much of the virtual sound source SP1 the user L can directly view.

FIG. 12 is a flowchart showing the process of adjusting the gain of the direct sound S(L direct) according to how much of the virtual sound source SP1 the user L can directly view. The control unit 100 calculates how much of the virtual sound source SP1 can be directly viewed from the position of the user L (S33). This directly visible range is calculated based on the coordinates of the user L, the virtual sound source SP1, and the door frame 503. The control unit 100 sets a gain in the gain adjustment unit 63 according to the calculated range (S34). That is, the control unit 100 sets a gain of 100% when the user L can directly view the entire virtual sound source SP1, and sets a smaller gain as the range of the virtual sound source SP1 that the user L can directly view becomes narrower.
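The calculation of the directly visible range from the coordinates of the user, the sound source, and the door frame can be sketched in a two-dimensional plan view. The sketch below rests on several assumptions that are not in the embodiment: the wall lies along the line y = 0, the user is at y < 0, the source is a line segment at y > 0 sampled at evenly spaced points, the door opening spans an interval on the x axis, and the gain is taken to be equal to the visible fraction.

```python
from typing import Tuple

Point = Tuple[float, float]


def crossing_x(user: Point, point: Point) -> float:
    """x coordinate where the sight line from user to point crosses the wall (y = 0)."""
    (ux, uy), (px, py) = user, point
    if not (uy < 0.0 < py):
        raise ValueError("user must be at y < 0 and the source at y > 0")
    t = -uy / (py - uy)
    return ux + t * (px - ux)


def visible_fraction(user: Point, src_a: Point, src_b: Point,
                     door_left: float, door_right: float, samples: int = 101) -> float:
    """Fraction of sample points on segment src_a-src_b seen through the opening."""
    visible = 0
    for i in range(samples):
        f = i / (samples - 1)
        p = (src_a[0] + f * (src_b[0] - src_a[0]), src_a[1] + f * (src_b[1] - src_a[1]))
        if door_left <= crossing_x(user, p) <= door_right:
            visible += 1
    return visible / samples


# Example: a 2 m wide source, roughly half of which is seen through a 1 m opening.
direct_gain = visible_fraction((0.0, -2.0), (0.0, 2.0), (2.0, 2.0), -0.5, 0.5)
```

A gain of 1.0 corresponds to the 100% case in which the entire source is visible; the same fraction could also drive a tone adjustment such as mild high-frequency attenuation.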

The embodiment above describes a case where the virtual sound source SP1 (piano 300) does not move. The following embodiment describes a case where the virtual sound source moves. In this embodiment, parts configured in the same way as in the embodiment above are given the same reference numbers, and their description is omitted.

FIG. 13 is a diagram showing the arrangement of a virtual sound source SP10 and the user L in the building 200. The layout of the rooms is the same as that shown in FIG. 5. In FIG. 13, the virtual sound source SP10, drawn as a bird, moves from position SP10(1) to position SP10(2). The door 502 is open. In the embodiment of FIGS. 6 and 7, the user L becomes able to directly view the virtual sound source SP1 by moving. In this embodiment, the user L becomes able to directly view the virtual sound source SP10 because the virtual sound source SP10 moves.

The user L remains at position LP10. When the virtual sound source SP10 is at position SP10(1), the user L cannot directly view the virtual sound source SP10 and hears the indirect sound S(L open). The indirect sound S(L open) is the same as that described with reference to FIG. 6, and is calculated using the impulse response at SP2 for the virtual sound source at SP10(1) and the head-related transfer function from SP2 to the user L. When the virtual sound source SP10 has moved to position SP10(2), the user L can directly view the virtual sound source SP10 through the door frame 503 and hears the direct sound S(L direct). The direct sound S(L direct) is the same as that described with reference to FIG. 7, and is calculated using the head-related transfer function from SP10(2) to the user L. While the direct sound is heard, the indirect sound is also heard in parallel. This indirect sound is calculated using the impulse response at SP2 for the virtual sound source at SP10(2) and the head-related transfer function from SP2 to the user L.

When the virtual sound source SP10 moves, the control unit 100 may, in the flowchart of FIG. 10, acquire the position of the virtual sound source SP10 in parallel with S11 and S12 and calculate whether the user L can directly view the virtual sound source SP10.

The example of FIG. 13 shows a case where the user L is stationary. An embodiment in which the user L moves as in FIGS. 5 to 8 while the virtual sound source SP10 also moves is possible as well.

The door 502 of the embodiment is a swinging door on hinges. The door 502 is not limited to a swinging door and may open and close by another mechanism, such as a sliding door. An embodiment in which the door 502 is always open, and an embodiment without the door 502, that is, with only the door frame 503 (opening), are also possible.

The example of FIG. 5 concerns a case where the user L opens the door 502. An embodiment in which the user L closes the open door 502 is also possible.

The example of FIG. 8 is an embodiment in which, when the door 502 is closed, the transmitted sound is heard only through the door 502. An embodiment in which the transmitted sound is heard through the entire wall 500 when the door 502 is closed is also possible.

In the embodiments above, the sound generation unit 105 and the signal processing unit 106 are provided in the mobile terminal device 10. The sound generation unit 105 and the signal processing unit 106 may instead be provided in the headphones 20.

In the embodiments above, examples were described in which the building 200, the rooms 201 and 202, the wall 500, the door 502, and so on actually exist. When the present invention is applied to VR (virtual reality), the building 200, the rooms 201 and 202, the wall 500, the door 502, and so on may be virtual.

In the embodiments above, processing with a head-related transfer function was shown as the sound processing that changes the sound quality to the second sound quality, that is, the sound quality when the sound source is not obstructed by the obstruction. However, processing that changes the volume, for example, is also an example of such sound processing.

From the embodiments described in detail above, the following aspects can be understood.

Aspect 1
A device system according to Aspect 1 of the present disclosure includes a device, a sensor, and a sound processing unit. The device is worn by a user. The sensor detects movement of a movable obstruction. The sound processing unit generates sound of a first sound quality, which is the sound quality when a virtual sound source localized on the opposite side of the obstruction is obstructed by the obstruction, and emits the sound from the device. When the sensor detects that the obstruction has moved from a position obstructing the virtual sound source, the sound processing unit changes the sound quality of the sound from the first sound quality to a second sound quality, which is the sound quality when the virtual sound source is not obstructed.

Aspect 2
In a device system according to Aspect 2 of the present disclosure, when at least one of the obstruction, the user, and the virtual sound source moves so that the user can directly view the localization position of the virtual sound source, sound of a third sound quality, which is the sound quality when the localization position of the virtual sound source can be directly viewed, is generated instead of the second sound quality. This allows the user to hear the direct sound.

Aspect 3
A device system according to Aspect 3 of the present disclosure uses, as the obstruction, a wall that separates a first space in which the user is present from a second space in which the virtual sound source is localized, together with a door provided in the wall. A sensor that detects the opening and closing of the door is used as the sensor. This reproduces the sound quality of listening to sound from an adjoining room.

Aspect 4
A device system according to Aspect 4 of the present disclosure uses, as the sensor, a sensor that detects the degree to which the door is open. The sound processing unit generates sound whose sound quality is a cross-fade of the first sound quality and the second sound quality according to the degree of opening detected by the sensor. This reproduces the change in sound quality when the door 502 is opened slightly or gradually.

Aspect 5
A device system according to Aspect 5 of the present disclosure realizes the second sound quality by filtering with the impulse response, within the second space, of the virtual sound source at position SP2 of the open door, and with the head-related transfer function from position SP2 to the user.

1 Audio playback system
10 Mobile terminal device (smartphone)
20 Headphones
21 Speaker
23 Attitude sensor
24 Communication processing unit
30 Door sensor
31 Sensor module
30 Communication processing unit
70 Application program
100 Control unit
106 Signal processing unit
500 Wall
501 Fixed wall
502 Door
503 Door frame
SP1, SP10 Virtual sound source

Claims

A device system comprising:
an acoustic device worn by a user;
a sensor that detects movement of an openable and closable obstruction; and
a sound processing unit that generates sound of a first sound quality, which is the sound quality when a virtual sound source localized on the opposite side of the obstruction is obstructed by the obstruction, emits the sound from the acoustic device, and, when the sensor detects that the obstruction has moved from a position obstructing the virtual sound source, changes the sound quality of the sound from the first sound quality to a second sound quality, which is the sound quality when the virtual sound source is not obstructed by the obstruction,
wherein, when at least one of the obstruction, the user, and the virtual sound source moves so that the user can directly view the localization position of the virtual sound source, sound of a third sound quality, which is the sound quality when the localization position of the virtual sound source can be directly viewed, is generated instead of the second sound quality.

The device system according to claim 1, wherein the obstruction comprises a wall that separates a first space in which the user is present from a second space in which the virtual sound source is localized, and a door provided in the wall, and
the sensor detects whether the door is open or closed.

The device system according to claim 2, wherein the sensor detects the degree to which the door is open, and
the sound processing unit generates sound having a sound quality obtained by cross-fading the first sound quality and the second sound quality in accordance with the degree of opening of the door detected by the sensor.

The device system according to claim 2 or 3, wherein the second sound quality is a sound quality filtered by
an impulse response of the virtual sound source in the second space at the position of the open door, and
a head-related transfer function from the position of the open door to the user.

The device system according to claim 4, wherein
the first sound quality is a sound quality filtered by the impulse response of the virtual sound source in the second space at the position of the open door, an impulse response of the sound insulation characteristic of the obstruction, and the head-related transfer function from the position of the open door to the user, and
the third sound quality is a sound quality filtered by a head-related transfer function from the localization position of the virtual sound source to the user.
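
The three filter chains described above can be sketched as cascaded convolutions. This is a minimal single-ear sketch under stated assumptions: every filter, including the head-related transfer function, is assumed to be available as a time-domain impulse response given as a list of samples, and all function and parameter names (`convolve`, `render`, `room_ir`, `insulation_ir`, `hrir_door`, `hrir_source`) are hypothetical.

```python
def convolve(x, h):
    """Direct-form linear convolution of two non-empty sample lists."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def render(dry, *filters):
    """Apply each impulse response in turn to the dry source signal."""
    out = list(dry)
    for f in filters:
        out = convolve(out, f)
    return out


def first_quality(dry, room_ir, insulation_ir, hrir_door):
    # Door closed: room response at the door, door insulation, HRTF from the door.
    return render(dry, room_ir, insulation_ir, hrir_door)


def second_quality(dry, room_ir, hrir_door):
    # Door open: room response at the door, HRTF from the door.
    return render(dry, room_ir, hrir_door)


def third_quality(dry, hrir_source):
    # Direct view: HRTF from the localization position of the source.
    return render(dry, hrir_source)
```

A practical renderer would run this once per ear and would typically use FFT-based convolution for long impulse responses; the direct form is used here only to keep the sketch dependency-free.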

A sound quality control method in which a device system, including an acoustic device worn by a user and a sensor that detects movement of a movable obstruction:
generates sound of a virtual sound source located on the opposite side of the obstruction with a first sound quality, used when the virtual sound source is blocked by the obstruction, and emits the sound from the acoustic device;
changes the sound quality of the sound from the first sound quality to a second sound quality, used when the virtual sound source is not blocked by the obstruction, when the sensor detects that the obstruction has moved from a position blocking the virtual sound source; and
generates, instead of the second sound quality, sound of a third sound quality, used when the user can directly view the localization position of the virtual sound source, when the user is in a position from which the user can directly view that localization position.

The sound quality control method according to claim 6, wherein
the obstruction comprises a wall that separates a first space in which the user is present from a second space in which the virtual sound source is located, and a door provided in the wall, and
the sensor detects whether the door is open or closed.

The sound quality control method according to claim 7, wherein
the sensor detects the degree to which the door is open, and
the method further comprises generating sound having a sound quality obtained by cross-fading the first sound quality and the second sound quality in accordance with the degree of opening detected by the sensor.

The sound quality control method according to claim 7 or 8, wherein the second sound quality is a sound quality filtered by an impulse response of the virtual sound source in the second space at the position of the open door and a head-related transfer function from the position of the open door to the user.

The sound quality control method according to claim 9, wherein
the first sound quality is a sound quality filtered by the impulse response of the virtual sound source in the second space at the position of the open door, an impulse response of the sound insulation characteristic of the obstruction, and the head-related transfer function from the position of the open door to the user, and
the third sound quality is a sound quality filtered by a head-related transfer function from the localization position of the virtual sound source to the user.

A sound quality control program that causes a control unit of a mobile terminal device, which communicates with an acoustic device worn by a user and with a sensor that detects movement of a movable obstruction, to:
generate sound of a virtual sound source located on the opposite side of the obstruction with a first sound quality, used when the virtual sound source is blocked by the obstruction, and emit the sound from the acoustic device;
change the sound quality of the sound from the first sound quality to a second sound quality, used when the virtual sound source is not obstructed by the obstruction, when the sensor detects that the obstruction has moved from a position obstructing the virtual sound source; and
generate, instead of the second sound quality, sound of a third sound quality, used when the user can directly view the localization position of the virtual sound source, when the user is in a position from which the user can directly view that localization position.