JP2021052315A

JP2021052315A - Out-of-head localization filter determination system, out-of-head localization processing device, out-of-head localization filter determination device, out-of-head localization filter determination method, and program

Info

Publication number: JP2021052315A
Application number: JP2019174232A
Authority: JP
Inventors: 邦明高地; Kuniaki Kochi
Original assignee: JVCKenwood Corp
Current assignee: JVCKenwood Corp
Priority date: 2019-09-25
Filing date: 2019-09-25
Publication date: 2021-04-01
Also published as: WO2021059984A1

Abstract

To provide an out-of-head localization filter determination system that can determine a filter appropriately, an out-of-head localization processing device, an out-of-head localization filter determination device, an out-of-head localization filter determination method, and a program.

SOLUTION: An out-of-head localization filter determination system according to an embodiment includes a frequency conversion unit 112 that performs frequency conversion on a sound pickup signal to obtain frequency characteristics, a smoothing unit 113 that smooths the frequency characteristics, a feature amount extraction unit 114 that obtains peaks and notches of the smoothed frequency characteristics and extracts a feature amount of the frequency characteristics as a user feature amount on the basis of the peaks and notches, a data extraction unit 302 that extracts second preset data from a data storage unit 303, a comparison unit 304 that compares the user feature amount with the extracted second preset data, and a selection unit 305 that selects first preset data from a plurality of pieces of first preset data on the basis of the comparison result.

SELECTED DRAWING: Figure 4

Description

The present invention relates to an out-of-head localization filter determination system, an out-of-head localization processing device, an out-of-head localization filter determination device, an out-of-head localization filter determination method, and a program.

One sound image localization technique is out-of-head localization, which uses headphones to localize a sound image outside the listener's head. Out-of-head localization cancels the characteristics from the headphones to the ears and applies the four characteristics from stereo speakers to the ears, thereby localizing the sound image outside the head.

In out-of-head localization reproduction, measurement signals (impulse sounds or the like) emitted from two-channel (hereinafter, "ch") speakers are recorded with microphones placed in the ears of the listener (user). A processing device then creates filters based on the sound pickup signals obtained as impulse responses. Convolving the created filters with a 2ch audio signal achieves out-of-head localization reproduction.

Furthermore, to generate a filter that cancels the characteristics from the headphones to the ears, the characteristics from the headphones to the vicinity of the ear or to the eardrum (the ear canal transfer function ECTF, also called the ear canal transfer characteristics) are measured with microphones placed in the listener's own ears.

Patent Document 1 discloses a binaural listening device that uses an out-of-head sound image localization filter. This device converts spatial transfer functions measured in advance for a large number of people into feature parameter vectors corresponding to human auditory characteristics, and uses data that has been reduced to a small number of sets by clustering. The device also clusters the spatial transfer functions measured in advance and the real-ear headphone inverse transfer functions according to human physical dimensions, and uses the data of the person closest to the centroid of each cluster.

Patent Document 2 discloses an out-of-head localization filter determination device including headphones and a microphone unit. In Patent Document 2, a server device stores first preset data, which relates to the spatial acoustic transfer characteristics from a sound source to the ears of a subject, in association with second preset data, which relates to the ear canal transfer characteristics of the subject's ears. A user terminal measures measurement data relating to the user's ear canal transfer characteristics and transmits user data based on the measurement data to the server device. The server device compares the user data with a plurality of pieces of second preset data. Specifically, it compares feature amounts based on the ear canal transfer characteristics of the user and of the subjects. The server device extracts first preset data based on the comparison result.

Patent Document 1: Japanese Unexamined Patent Publication No. H8-111899
Patent Document 2: Japanese Unexamined Patent Publication No. 2018-191208

However, because the device of Patent Document 1 performs clustering based on physical dimensions, the physical dimensions of each individual user must be measured. In addition, the clustering may not be performed appropriately. In that case, an out-of-head sound image localization filter suited to the user cannot be used.

In Patent Document 2, feature amounts based on the ear canal transfer characteristics of the user and of the subjects are compared. Specifically, the similarity between feature amounts is calculated, and the first preset data of a subject with high correlation is extracted. The feature amount is the frequency-amplitude characteristic from 2 kHz to 20 kHz. The spatial acoustic transfer characteristics associated with the ear canal transfer characteristics of the subject with high similarity are extracted and used as the user's spatial acoustic filter.

When a spatial acoustic filter is determined from the user's ear canal transfer characteristics, as in Patent Document 2, the processing should be performed appropriately and quickly. However, because Patent Document 2 uses the frequency-amplitude characteristic from 2 kHz to 20 kHz as the feature amount, it is difficult to reduce the amount of data. When preset data exists for many subjects, the processing time becomes long.

The present embodiment has been made in view of the above points, and its object is to provide an out-of-head localization filter determination system, an out-of-head localization processing device, an out-of-head localization filter determination device, an out-of-head localization filter determination method, and a program that can perform processing appropriately and quickly.

The out-of-head localization filter determination system according to the present embodiment includes: an output unit that is worn by a user and outputs sound toward the user's ears; a microphone unit that is worn on the user's ears and picks up the sound output from the output unit; a measurement processing device that outputs a measurement signal to the output unit and measures the sound pickup signal output from the microphone unit; and a server device that can communicate with the measurement processing device. The server device includes a data storage unit that stores, in association with each other, first preset data relating to the spatial acoustic transfer characteristics from a sound source to the ears of a subject and second preset data relating to a feature amount of the ear canal transfer characteristics of the subject's ears, the data storage unit storing a plurality of pieces of the first and second preset data acquired for a plurality of subjects. The system further includes: a frequency conversion unit that performs frequency conversion on the sound pickup signal picked up by the microphone unit to obtain frequency characteristics; a smoothing unit that smooths the frequency characteristics; a feature amount extraction unit that obtains peaks and notches of the smoothed frequency characteristics and extracts a feature amount of the frequency characteristics as a user feature amount based on the peaks and notches; a data extraction unit that extracts, based on the user feature amount, a subset of the plurality of pieces of second preset data stored in the data storage unit; a comparison unit that compares the user feature amount with the extracted second preset data; and a selection unit that selects first preset data from among the plurality of pieces of first preset data based on the comparison result.
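As an illustration only, the frequency conversion, smoothing, and peak/notch steps named above might be sketched in Python as follows. The naive DFT, the moving-average smoother, the strict local-extremum test, and every function and parameter name are assumptions made for this sketch; the text above does not specify how these units are implemented or which peak and notch values form the feature amount.

```python
import cmath
import math


def amplitude_spectrum_db(signal):
    """Naive DFT amplitude spectrum in dB for bins 0..N/2 (assumed method)."""
    n = len(signal)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(signal))
        # Small offset avoids log10(0) for empty bins.
        spectrum.append(20.0 * math.log10(abs(acc) + 1e-12))
    return spectrum


def smooth(values, half_width=2):
    """Moving average; the window shrinks near the edges (assumed smoother)."""
    out = []
    for i in range(len(values)):
        lo = max(0, i - half_width)
        hi = min(len(values), i + half_width + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out


def peaks_and_notches(values):
    """Return indices of strict local maxima (peaks) and minima (notches)."""
    peaks, notches = [], []
    for i in range(1, len(values) - 1):
        if values[i] > values[i - 1] and values[i] > values[i + 1]:
            peaks.append(i)
        elif values[i] < values[i - 1] and values[i] < values[i + 1]:
            notches.append(i)
    return peaks, notches


def extract_feature(signal, sample_rate, half_width=2):
    """Hypothetical user feature: (frequency in Hz, level in dB) per extremum."""
    smoothed = smooth(amplitude_spectrum_db(signal), half_width)
    peaks, notches = peaks_and_notches(smoothed)
    bin_hz = sample_rate / len(signal)
    return {
        "peaks": [(i * bin_hz, smoothed[i]) for i in peaks],
        "notches": [(i * bin_hz, smoothed[i]) for i in notches],
    }
```

Keeping only the extrema of the smoothed curve yields far fewer values than a full amplitude spectrum, which is consistent with the stated aim of reducing the amount of data to be compared.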

The out-of-head localization processing device according to the present embodiment includes: a frequency conversion unit that performs frequency conversion on a sound pickup signal, acquired by picking up with a microphone unit a measurement signal output from an output unit, to obtain frequency characteristics; a smoothing unit that smooths the frequency characteristics; a feature amount extraction unit that obtains peaks and notches of the smoothed frequency characteristics and extracts a feature amount of the frequency characteristics as a user feature amount based on the peaks and notches; a spatial acoustic filter setting unit that sets a spatial acoustic filter based on the spatial acoustic transfer characteristics associated with a feature amount similar to the user feature amount; and an inverse filter calculation unit that calculates, based on the sound pickup signal, an inverse filter that cancels the characteristics of the output unit.

The out-of-head localization filter determination device according to the present embodiment includes: a data storage unit that stores, in association with each other, first preset data relating to the spatial acoustic transfer characteristics from a sound source to the ears of a subject and second preset data relating to a feature amount of the ear canal transfer characteristics of the subject's ears, the data storage unit storing a plurality of pieces of the first and second preset data acquired for a plurality of subjects; a data extraction unit that extracts, based on a user feature amount, a subset of the plurality of pieces of second preset data stored in the data storage unit; a comparison unit that compares the user feature amount with the feature amount of the second preset data; and a selection unit that selects first preset data from among the plurality of pieces of first preset data based on the comparison result. The frequency characteristics of the ear canal transfer characteristics are smoothed, and the feature amount is extracted based on the peaks and notches of the smoothed frequency characteristics.

The out-of-head localization filter determination method according to the present embodiment determines an out-of-head localization filter for a user by using an output unit that is worn by the user and outputs sound toward the user's ears, and a microphone unit that is worn on the user's ears and has microphones that pick up the sound output from the output unit. The method includes the steps of: performing frequency conversion on the sound pickup signal picked up by the microphone unit to obtain frequency characteristics; smoothing the frequency characteristics; obtaining peaks and notches of the smoothed frequency characteristics and extracting a feature amount of the frequency characteristics as a user feature amount based on the peaks and notches; extracting, based on the user feature amount, a subset of a plurality of pieces of second preset data stored in a data storage unit; comparing the user feature amount with the extracted second preset data; and selecting first preset data from among a plurality of pieces of first preset data based on the comparison result.
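The last three steps of the method (extracting a subset of the second preset data, comparing, and selecting) could be sketched as below. This is a hedged illustration: the narrowing criterion (a tolerance on one feature component), the Euclidean distance, the dictionary layout, and all names are hypothetical choices for this sketch, not details given in the text above.

```python
import math


def narrow_candidates(user_feature, presets, key_index=0, tolerance=500.0):
    """Keep presets whose chosen feature component is near the user's value.

    The single-component tolerance test is an assumed narrowing rule.
    """
    return [
        p for p in presets
        if abs(p["feature"][key_index] - user_feature[key_index]) <= tolerance
    ]


def select_first_preset(user_feature, presets, key_index=0, tolerance=500.0):
    """Compare the user feature only with the narrowed subset, then pick the closest."""
    candidates = narrow_candidates(user_feature, presets, key_index, tolerance)
    if not candidates:
        # Assumed fallback: compare against everything if the subset is empty.
        candidates = presets
    best = min(candidates, key=lambda p: math.dist(user_feature, p["feature"]))
    return best["first_preset"]


# Hypothetical data: feature = (first peak frequency in Hz, first notch frequency in Hz).
presets = [
    {"feature": (3100.0, 7200.0), "first_preset": "subject A"},
    {"feature": (8000.0, 7000.0), "first_preset": "subject B"},
    {"feature": (2900.0, 9000.0), "first_preset": "subject C"},
]
chosen = select_first_preset((3000.0, 7000.0), presets)
```

Because the comparison runs only over the narrowed subset, the number of distance evaluations grows with the subset size rather than with the size of the whole database.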

The program according to the present embodiment causes a computer to execute an out-of-head localization filter determination method that determines an out-of-head localization filter for a user by using an output unit that is worn by the user and outputs sound toward the user's ears, and a microphone unit that is worn on the user's ears and has microphones that pick up the sound output from the output unit. The out-of-head localization filter determination method includes the steps of: performing frequency conversion on the sound pickup signal picked up by the microphone unit to obtain frequency characteristics; smoothing the frequency characteristics; obtaining peaks and notches of the smoothed frequency characteristics and extracting a feature amount of the frequency characteristics as a user feature amount based on the peaks and notches; extracting, based on the user feature amount, a subset of a plurality of pieces of second preset data stored in a data storage unit; comparing the user feature amount with the extracted second preset data; and selecting first preset data from among a plurality of pieces of first preset data based on the comparison result.

According to the present embodiment, it is possible to provide an out-of-head localization filter determination system, an out-of-head localization processing device, an out-of-head localization filter determination device, an out-of-head localization filter determination method, and a program that can perform processing appropriately and quickly.

FIG. 1 is a block diagram showing the out-of-head localization processing device according to the present embodiment.
FIG. 2 is a diagram schematically showing the configuration of a measuring device.
FIG. 3 is a diagram schematically showing the configuration of a measuring device.
FIG. 4 is a block diagram showing the configuration of the out-of-head localization filter determination system.
FIG. 5 is a flowchart showing the process of extracting a feature amount.
FIG. 6 is a graph schematically showing smoothed characteristics.
FIG. 7 is a table showing preset data stored in the data storage unit.
FIG. 8 is a table showing feature amount data.
FIG. 9 is a flowchart showing processing in the server device.
FIG. 10 is a flowchart showing the process of calculating the distance between feature vectors.
FIG. 11 is a flowchart showing the process of calculating the correlation between user data and second preset data.
FIG. 12 is a graph schematically showing interpolated characteristics.
FIG. 13 is a diagram for explaining clusters of preset data.

(Overview)
First, an overview of sound image localization processing is given. Out-of-head localization processing is described here as an example of sound image localization processing. The out-of-head localization processing according to the present embodiment uses spatial acoustic transfer characteristics and ear canal transfer characteristics. The spatial acoustic transfer characteristics are the transfer characteristics from a sound source such as a speaker to the ear canal. The ear canal transfer characteristics are the transfer characteristics from the ear canal entrance to the eardrum. In the present embodiment, the ear canal transfer characteristics are measured while headphones are worn, and out-of-head localization processing is realized by using the measurement data.

The out-of-head localization processing according to the present embodiment is executed on a user terminal such as a personal computer (PC), a smartphone, or a tablet terminal. The user terminal is an information processing device that has processing means such as a processor, storage means such as a memory or a hard disk, display means such as a liquid crystal monitor, and input means such as a touch panel, buttons, a keyboard, or a mouse. The user terminal has a communication function for transmitting and receiving data. In addition, output means (an output unit) having headphones or earphones is connected to the user terminal.

To obtain a high localization effect, the out-of-head localization filter must be generated by measuring the characteristics of the user in person. An individual user's spatial acoustic transfer characteristics are generally measured in a listening room where acoustic equipment such as speakers and the room acoustics have been properly set up. That is, the user must either go to a listening room or prepare a listening room at home or elsewhere. As a result, it may not be possible to measure the individual user's spatial acoustic transfer characteristics appropriately.

Even when speakers are installed at the user's home or elsewhere to prepare a listening room, the speakers may be placed asymmetrically, or the acoustic environment of the room may not be optimal for listening to music. In such cases, it is very difficult to measure appropriate spatial acoustic transfer characteristics at home.

In contrast, an individual user's ear canal transfer characteristics are measured while the user wears a microphone unit and headphones. That is, the ear canal transfer characteristics can be measured as long as the user wears the microphone unit and headphones. The user does not need to go to a listening room or set up an elaborate listening room at home. In addition, the generation of the measurement signal for measuring the ear canal transfer characteristics, the recording of the sound pickup signal, and so on can be performed with a user terminal such as a smartphone or a PC.

As described above, it may be difficult to measure the spatial acoustic transfer characteristics of an individual user. The out-of-head localization processing system according to the present embodiment therefore determines a filter corresponding to the spatial acoustic transfer characteristics based on the measurement result of the ear canal transfer characteristics. That is, it determines an out-of-head localization processing filter suited to the user based on the measurement result of the individual user's ear canal transfer characteristics.

Specifically, the out-of-head localization processing system includes a user terminal and a server device. The server device stores spatial acoustic transfer characteristics and ear canal transfer characteristics measured in advance for a plurality of subjects other than the user. That is, a measuring device different from the user terminal is used to measure the spatial acoustic transfer characteristics with speakers as the sound source (hereinafter also referred to as the first pre-measurement) and to measure the ear canal transfer characteristics with headphones (hereinafter also referred to as the second pre-measurement). The first pre-measurement and the second pre-measurement are performed on subjects other than the user.

The server device stores first preset data corresponding to the results of the first pre-measurement and second preset data corresponding to the results of the second pre-measurement. Performing the first and second pre-measurements on a plurality of subjects yields a plurality of pieces of first preset data and a plurality of pieces of second preset data. The server device stores the first preset data relating to the spatial acoustic transfer characteristics and the second preset data relating to the ear canal transfer characteristics in association with each other for each subject. The server device holds the plurality of pieces of first preset data and the plurality of pieces of second preset data in a database.

For the individual user who will execute the out-of-head localization processing, only the ear canal transfer characteristics are measured, using the user terminal (hereinafter referred to as the user measurement). Like the second pre-measurement, the user measurement uses headphones as the sound source. The user terminal acquires measurement data relating to the ear canal transfer characteristics and transmits user data based on the measurement data to the server device. The server device compares the user data with each of the plurality of pieces of second preset data. Based on the comparison results, the server device determines, from among the plurality of pieces of second preset data, the second preset data that has a high correlation with the user data.

The server device then reads out the first preset data associated with the highly correlated second preset data. That is, based on the comparison results, the server device extracts, from among the plurality of pieces of first preset data, the first preset data suited to the individual user. The server device transmits the extracted first preset data to the user terminal. The user terminal then performs out-of-head localization processing using a filter based on the first preset data and an inverse filter based on the user measurement.
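The server-side flow just described (store paired preset data per subject, find the second preset data most correlated with the user data, return the associated first preset data) can be sketched as follows. This is a minimal sketch under stated assumptions: the `PresetEntry` record, the use of the Pearson correlation coefficient, and the list-based "database" are all hypothetical; the overview only states that a high-correlation entry is chosen.

```python
import math
from dataclasses import dataclass


@dataclass
class PresetEntry:
    """One subject's paired data (hypothetical record layout)."""
    subject_id: str
    first_preset: dict   # spatial acoustic transfer data, e.g. Hls, Hlo, Hro, Hrs
    second_preset: list  # data derived from the ear canal transfer characteristics


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (assumed similarity measure)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant sequence carries no correlation information
    return cov / math.sqrt(var_a * var_b)


def select_first_preset(user_data, database):
    """Return the first preset data paired with the best-correlated second preset data."""
    best = max(database, key=lambda e: pearson(user_data, e.second_preset))
    return best.first_preset


# Hypothetical two-subject database.
database = [
    PresetEntry("A", {"Hls": [1.0]}, [2.0, 4.0, 6.0]),
    PresetEntry("B", {"Hls": [0.5]}, [3.0, 2.0, 1.0]),
]
result = select_first_preset([1.0, 2.0, 3.0], database)
```

In this toy data, the user data rises in the same way as subject A's second preset data, so subject A's first preset data is returned to the user terminal.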

Embodiment 1.
(Out-of-head localization processing device)
First, FIG. 1 shows an out-of-head localization processing device 100, which is an example of the sound field reproduction device according to the present embodiment. FIG. 1 is a block diagram of the out-of-head localization processing device 100. The out-of-head localization processing device 100 reproduces a sound field for a user U wearing headphones 43. To do so, the out-of-head localization processing device 100 performs sound image localization processing on the Lch and Rch stereo input signals XL and XR. The Lch and Rch stereo input signals XL and XR are analog audio reproduction signals output from a CD (Compact Disc) player or the like, or digital audio data such as mp3 (MPEG Audio Layer-3). The out-of-head localization processing device 100 is not limited to a physically single device, and part of the processing may be performed by a different device. For example, part of the processing may be performed by a PC or the like, and the rest may be performed by a DSP (Digital Signal Processor) or the like built into the headphones 43.

The out-of-head localization processing device 100 includes an out-of-head localization processing unit 10, a filter unit 41, a filter unit 42, and the headphones 43. The out-of-head localization processing unit 10, the filter unit 41, and the filter unit 42 can be implemented by a processor.

The out-of-head localization processing unit 10 includes convolution calculation units 11, 12, 21, and 22 and adders 24 and 25. The convolution calculation units 11, 12, 21, and 22 perform convolution processing using the spatial acoustic transfer characteristics. The stereo input signals XL and XR from a CD player or the like are input to the out-of-head localization processing unit 10, in which the spatial acoustic transfer characteristics are set. The out-of-head localization processing unit 10 convolves a filter of the spatial acoustic transfer characteristics (hereinafter also referred to as a spatial acoustic filter) with the stereo input signals XL and XR of each channel. The spatial acoustic transfer characteristics may be a head-related transfer function (HRTF) measured at the head or auricles of a subject, or may be the head-related transfer function of a dummy head or of a third party.

The four spatial acoustic transfer characteristics Hls, Hlo, Hro, and Hrs taken as one set constitute the spatial acoustic transfer function. The data used for convolution in the convolution calculation units 11, 12, 21, and 22 serves as the spatial acoustic filters. Each of the spatial acoustic transfer characteristics Hls, Hlo, Hro, and Hrs is measured using a measuring device described later.

そして、畳み込み演算部１１は、Ｌｃｈのステレオ入力信号ＸＬに対して空間音響伝達特性Ｈｌｓに応じた空間音響フィルタを畳み込む。畳み込み演算部１１は、畳み込み演算データを加算器２４に出力する。畳み込み演算部２１は、Ｒｃｈのステレオ入力信号ＸＲに対して空間音響伝達特性Ｈｒｏに応じた空間音響フィルタを畳み込む。畳み込み演算部２１は、畳み込み演算データを加算器２４に出力する。加算器２４は２つの畳み込み演算データを加算して、フィルタ部４１に出力する。 Then, the convolution calculation unit 11 convolves the spatial acoustic filter corresponding to the spatial acoustic transmission characteristic Hls with respect to the stereo input signal XL of the Lch. The convolution calculation unit 11 outputs the convolution calculation data to the adder 24. The convolution calculation unit 21 convolves a spatial acoustic filter corresponding to the spatial acoustic transmission characteristic Hro with respect to the stereo input signal XR of Rch. The convolution calculation unit 21 outputs the convolution calculation data to the adder 24. The adder 24 adds two convolution operation data and outputs the data to the filter unit 41.

畳み込み演算部１２は、Ｌｃｈのステレオ入力信号ＸＬに対して空間音響伝達特性Ｈｌｏに応じた空間音響フィルタを畳み込む。畳み込み演算部１２は、畳み込み演算データを、加算器２５に出力する。畳み込み演算部２２は、Ｒｃｈのステレオ入力信号ＸＲに対して空間音響伝達特性Ｈｒｓに応じた空間音響フィルタを畳み込む。畳み込み演算部２２は、畳み込み演算データを、加算器２５に出力する。加算器２５は２つの畳み込み演算データを加算して、フィルタ部４２に出力する。 The convolution calculation unit 12 convolves a spatial acoustic filter corresponding to the spatial acoustic transmission characteristic Hlo with respect to the Lch stereo input signal XL. The convolution calculation unit 12 outputs the convolution calculation data to the adder 25. The convolution calculation unit 22 convolves a spatial acoustic filter corresponding to the spatial acoustic transmission characteristic Hrs with respect to the stereo input signal XR of Rch. The convolution calculation unit 22 outputs the convolution calculation data to the adder 25. The adder 25 adds two convolution operation data and outputs the data to the filter unit 42.

フィルタ部４１、４２にはヘッドホン特性（ヘッドホンの再生ユニットとマイク間の特性）をキャンセルする逆フィルタが設定されている。そして、頭外定位処理部１０での処理が施された再生信号（畳み込み演算信号）に逆フィルタを畳み込む。フィルタ部４１で加算器２４からのＬｃｈ信号に対して、逆フィルタを畳み込む。同様に、フィルタ部４２は加算器２５からのＲｃｈ信号に対して逆フィルタを畳み込む。逆フィルタは、ヘッドホン４３を装着した場合に、ヘッドホンユニットからマイクまでの特性をキャンセルする。マイクは、外耳道入口から鼓膜までの間ならばどこに配置してもよい。逆フィルタは、ユーザＵ本人の特性の測定結果から算出されている。 Inverse filters that cancel the headphone characteristics (the characteristics between the headphone playback unit and the microphone) are set in the filter units 41 and 42. The inverse filter is then convolved with the reproduction signal (convolution calculation signal) processed by the out-of-head localization processing unit 10. The filter unit 41 convolves the inverse filter with the Lch signal from the adder 24. Similarly, the filter unit 42 convolves the inverse filter with the Rch signal from the adder 25. The inverse filter cancels the characteristics from the headphone unit to the microphone when the headphones 43 are worn. The microphone may be placed anywhere between the ear canal entrance and the eardrum. The inverse filter is calculated from the measurement result of the characteristics of the user U himself / herself.

フィルタ部４１は、補正されたＬｃｈ信号をヘッドホン４３の左ユニット４３Ｌに出力する。フィルタ部４２は、補正されたＲｃｈ信号をヘッドホン４３の右ユニット４３Ｒに出力する。ユーザＵは、ヘッドホン４３を装着している。ヘッドホン４３は、Ｌｃｈ信号とＲｃｈ信号をユーザＵに向けて出力する。これにより、ユーザＵの頭外に定位された音像を再生することができる。 The filter unit 41 outputs the corrected Lch signal to the left unit 43L of the headphones 43. The filter unit 42 outputs the corrected Rch signal to the right unit 43R of the headphones 43. The user U is wearing the headphones 43. The headphone 43 outputs the Lch signal and the Rch signal toward the user U. As a result, the sound image localized outside the head of the user U can be reproduced.

このように、頭外定位処理装置１００は、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じた空間音響フィルタと、ヘッドホン特性の逆フィルタを用いて、頭外定位処理を行っている。以下の説明において、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じた空間音響フィルタと、ヘッドホン特性の逆フィルタとをまとめて頭外定位処理フィルタとする。２ｃｈのステレオ再生信号の場合、頭外定位フィルタは、４つの空間音響フィルタと、２つの逆フィルタとから構成されている。そして、頭外定位処理装置１００は、ステレオ再生信号に対して合計６個の頭外定位フィルタを用いて畳み込み演算処理を行うことで、頭外定位処理を実行する。 As described above, the out-of-head localization processing device 100 performs the out-of-head localization processing by using the spatial acoustic filter corresponding to the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs and the inverse filter of the headphone characteristics. In the following description, the spatial acoustic filter corresponding to the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs and the inverse filter of the headphone characteristics are collectively referred to as an out-of-head localization processing filter. In the case of a 2ch stereo reproduction signal, the out-of-head localization filter is composed of four spatial acoustic filters and two inverse filters. Then, the out-of-head localization processing device 100 executes the out-of-head localization processing by performing a convolution calculation process on the stereo reproduction signal using a total of six out-of-head localization filters.
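The signal flow described above (convolution calculation units 11, 12, 21, 22, adders 24 and 25, and filter units 41 and 42) can be sketched as follows. This is a minimal illustration using only the Python standard library, not the implementation of the embodiment. The function names are hypothetical, the filters are assumed to be plain lists of time-domain coefficients, and `linv` / `rinv` stand for the left and right inverse filters.

```python
def convolve(x, h):
    """Full linear convolution of two sequences (length len(x) + len(h) - 1)."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def add(a, b):
    """Sample-wise sum of two sequences, zero-padding the shorter one."""
    n = max(len(a), len(b))
    a = list(a) + [0.0] * (n - len(a))
    b = list(b) + [0.0] * (n - len(b))
    return [p + q for p, q in zip(a, b)]


def out_of_head_localization(xl, xr, hls, hlo, hro, hrs, linv, rinv):
    """Apply four spatial acoustic filters and two inverse filters to XL, XR."""
    # Adder 24: XL filtered by Hls (unit 11) plus XR filtered by Hro (unit 21).
    left = add(convolve(xl, hls), convolve(xr, hro))
    # Adder 25: XL filtered by Hlo (unit 12) plus XR filtered by Hrs (unit 22).
    right = add(convolve(xl, hlo), convolve(xr, hrs))
    # Filter units 41 and 42: convolve the inverse filters of the headphone
    # characteristics before output to the left and right headphone units.
    return convolve(left, linv), convolve(right, rinv)
```

A practical implementation would more likely use FFT-based or block convolution for long filters; the direct form is used here only because it mirrors the description one-to-one.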

（空間音響伝達特性の測定装置） (Measuring device for spatial acoustic transmission characteristics)
図２を用いて、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを測定する測定装置２００について説明する。図２は、被測定者１に対して第１の事前測定を行うための測定構成を模式的に示す図である。 The measuring device 200 for measuring the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs will be described with reference to FIG. 2. FIG. 2 is a diagram schematically showing a measurement configuration for performing the first pre-measurement on the person to be measured 1.

図２に示すように、測定装置２００は、ステレオスピーカ５とマイクユニット２を有している。ステレオスピーカ５が測定環境に設置されている。測定環境は、ユーザＵの自宅の部屋やオーディオシステムの販売店舗やショールーム等でもよい。測定環境は、スピーカや音響の整ったリスニングルームであることが好ましい。 As shown in FIG. 2, the measuring device 200 includes a stereo speaker 5 and a microphone unit 2. The stereo speaker 5 is installed in the measurement environment. The measurement environment may be a room in the home of the user U, an audio system sales store, a showroom, or the like. The measurement environment is preferably a listening room with well-appointed speakers and acoustics.

本実施の形態では、測定装置２００の測定処理装置２０１が、空間音響フィルタを適切に生成するための演算処理を行っている。測定処理装置２０１は、例えば、ＣＤプレイヤー等の音楽プレイヤーなどを有している。測定処理装置２０１は、パーソナルコンピュータ（ＰＣ）、タブレット端末、スマートホン等であってもよい。また、測定処理装置２０１は、サーバ装置自体であってもよい。 In the present embodiment, the measurement processing device 201 of the measuring device 200 performs arithmetic processing for appropriately generating the spatial acoustic filter. The measurement processing device 201 includes, for example, a music player such as a CD player. The measurement processing device 201 may be a personal computer (PC), a tablet terminal, a smart phone, or the like. Further, the measurement processing device 201 may be the server device itself.

測定処理装置２０１は、メモリ、及びプロセッサを備えている。メモリは、処理プログラムや各種パラメータや測定データなどを記憶している。プロセッサは、メモリに格納された処理プログラムを実行する。プロセッサが処理プログラムを実行することで、各処理が実行される。プロセッサは、例えば、ＣＰＵ（Central Processing Unit）、ＦＰＧＡ（Field-Programmable Gate Array）、ＤＳＰ（Digital Signal Processor），ＡＳＩＣ（Application Specific Integrated Circuit）、又は、GPU(Graphics Processing Unit)等であってもよい。 The measurement processing device 201 includes a memory and a processor. The memory stores processing programs, various parameters, measurement data, and the like. The processor executes a processing program stored in the memory. Each process is executed when the processor executes the processing program. The processor may be, for example, a CPU (Central Processing Unit), an FPGA (Field-Programmable Gate Array), a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), a GPU (Graphics Processing Unit), or the like.

ステレオスピーカ５は、左スピーカ５Ｌと右スピーカ５Ｒを備えている。例えば、被測定者１の前方に左スピーカ５Ｌと右スピーカ５Ｒが設置されている。左スピーカ５Ｌと右スピーカ５Ｒは、インパルス応答測定を行うためのインパルス音等を出力する。以下、本実施の形態では、音源となるスピーカの数を２（ステレオスピーカ）として説明するが、測定に用いる音源の数は２に限らず、１以上であればよい。すなわち、1chのモノラル、または、5.1ch、7.1ch等の、いわゆるマルチチャンネル環境においても同様に、本実施の形態を適用することができる。 The stereo speaker 5 includes a left speaker 5L and a right speaker 5R. For example, a left speaker 5L and a right speaker 5R are installed in front of the person to be measured 1. The left speaker 5L and the right speaker 5R output an impulse sound or the like for measuring an impulse response. Hereinafter, in the present embodiment, the number of speakers serving as sound sources will be described as 2 (stereo speakers), but the number of sound sources used for measurement is not limited to 2, and may be 1 or more. That is, the present embodiment can be similarly applied to a so-called multi-channel environment such as 1ch monaural or 5.1ch, 7.1ch, etc.

マイクユニット２は、左のマイク２Ｌと右のマイク２Ｒを有するステレオマイクである。左のマイク２Ｌは、被測定者１の左耳９Ｌに設置され、右のマイク２Ｒは、被測定者１の右耳９Ｒに設置されている。具体的には、左耳９Ｌ、右耳９Ｒの外耳道入口から鼓膜までの位置にマイク２Ｌ、２Ｒを設置することが好ましい。マイク２Ｌ、２Ｒは、ステレオスピーカ５から出力された測定信号を収音して、収音信号を取得する。マイク２Ｌ、２Ｒは収音信号を測定処理装置２０１に出力する。被測定者１は、人でもよく、ダミーヘッドでもよい。すなわち、本実施形態において、被測定者１は人だけでなく、ダミーヘッドを含む概念である。 The microphone unit 2 is a stereo microphone having a left microphone 2L and a right microphone 2R. The left microphone 2L is installed in the left ear 9L of the person to be measured 1, and the right microphone 2R is installed in the right ear 9R of the person to be measured 1. Specifically, it is preferable to install microphones 2L and 2R at positions from the entrance of the ear canal to the eardrum of the left ear 9L and the right ear 9R. The microphones 2L and 2R pick up the measurement signal output from the stereo speaker 5 and acquire the sound pick-up signal. The microphones 2L and 2R output the sound pick-up signal to the measurement processing device 201. The person to be measured 1 may be a person or a dummy head. That is, in the present embodiment, the person to be measured 1 is a concept including not only a person but also a dummy head.

上記のように、左スピーカ５Ｌ、右スピーカ５Ｒで出力されたインパルス音をマイク２Ｌ、２Ｒで測定することでインパルス応答が測定される。測定処理装置２０１は、インパルス応答測定により取得した収音信号をメモリなどに記憶する。これにより、左スピーカ５Ｌと左マイク２Ｌとの間の空間音響伝達特性Ｈｌｓ、左スピーカ５Ｌと右マイク２Ｒとの間の空間音響伝達特性Ｈｌｏ、右スピーカ５Ｒと左マイク２Ｌとの間の空間音響伝達特性Ｈｒｏ、右スピーカ５Ｒと右マイク２Ｒとの間の空間音響伝達特性Ｈｒｓが測定される。すなわち、左スピーカ５Ｌから出力された測定信号を左マイク２Ｌが収音することで、空間音響伝達特性Ｈｌｓが取得される。左スピーカ５Ｌから出力された測定信号を右マイク２Ｒが収音することで、空間音響伝達特性Ｈｌｏが取得される。右スピーカ５Ｒから出力された測定信号を左マイク２Ｌが収音することで、空間音響伝達特性Ｈｒｏが取得される。右スピーカ５Ｒから出力された測定信号を右マイク２Ｒが収音することで、空間音響伝達特性Ｈｒｓが取得される。 As described above, the impulse response is measured by measuring the impulse sound output by the left speaker 5L and the right speaker 5R with the microphones 2L and 2R. The measurement processing device 201 stores the sound pick-up signal acquired by the impulse response measurement in a memory or the like. As a result, the spatial acoustic transmission characteristic Hls between the left speaker 5L and the left microphone 2L, the spatial acoustic transmission characteristic Hlo between the left speaker 5L and the right microphone 2R, the spatial acoustic transmission characteristic Hro between the right speaker 5R and the left microphone 2L, and the spatial acoustic transmission characteristic Hrs between the right speaker 5R and the right microphone 2R are measured. That is, the spatial acoustic transmission characteristic Hls is acquired by the left microphone 2L picking up the measurement signal output from the left speaker 5L. The spatial acoustic transmission characteristic Hlo is acquired by the right microphone 2R picking up the measurement signal output from the left speaker 5L. The spatial acoustic transmission characteristic Hro is acquired by the left microphone 2L picking up the measurement signal output from the right speaker 5R. The spatial acoustic transmission characteristic Hrs is acquired by the right microphone 2R picking up the measurement signal output from the right speaker 5R.

また、測定装置２００は、収音信号に基づいて、左右のスピーカ５Ｌ、５Ｒから左右のマイク２Ｌ、２Ｒまでの空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じた空間音響フィルタを生成してもよい。例えば、測定処理装置２０１は、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを所定のフィルタ長で切り出す。測定処理装置２０１は、測定した空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを補正してもよい。 Further, the measuring device 200 may generate, based on the sound pick-up signals, spatial acoustic filters corresponding to the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs from the left and right speakers 5L and 5R to the left and right microphones 2L and 2R. For example, the measurement processing device 201 cuts out the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs with a predetermined filter length. The measurement processing device 201 may correct the measured spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs.

このようにすることで、測定処理装置２０１は、頭外定位処理装置１００の畳み込み演算に用いられる空間音響フィルタを生成する。図１で示したように、頭外定位処理装置１００が、左右のスピーカ５Ｌ、５Ｒと左右のマイク２Ｌ、２Ｒとの間の空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じた空間音響フィルタを用いて頭外定位処理を行う。すなわち、空間音響フィルタをオーディオ再生信号に畳み込むことにより、頭外定位処理を行う。 By doing so, the measurement processing device 201 generates the spatial acoustic filters used for the convolution calculation of the out-of-head localization processing device 100. As shown in FIG. 1, the out-of-head localization processing device 100 performs the out-of-head localization processing using the spatial acoustic filters corresponding to the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs between the left and right speakers 5L and 5R and the left and right microphones 2L and 2R. That is, the out-of-head localization processing is performed by convolving the spatial acoustic filters with the audio reproduction signal.

測定処理装置２０１は、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓのそれぞれに対応する収音信号に対して同様の処理を実施している。すなわち、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに対応する４つの収音信号に対して、それぞれ同様の処理が実施される。これにより、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに対応する空間音響フィルタをそれぞれ生成することができる。 The measurement processing device 201 performs the same processing on the sound pick-up signals corresponding to each of the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs. That is, the same processing is performed on each of the four sound pick-up signals corresponding to the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs. As a result, it is possible to generate spatial acoustic filters corresponding to the spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs, respectively.
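As a rough illustration of cutting out the four measured characteristics with a predetermined filter length and applying the same processing to each of them, consider the sketch below. The description does not specify where the cut starts or whether a window is applied, so cutting from the first sample with zero-padding is an assumption, and the function names are hypothetical.

```python
def cut_out(response, filter_length):
    """Keep the first filter_length samples; zero-pad if the response is shorter."""
    head = list(response[:filter_length])
    return head + [0.0] * (filter_length - len(head))


def make_spatial_filters(responses, filter_length):
    """Apply the same cut-out to every measured characteristic.

    responses maps a name such as "Hls", "Hlo", "Hro", "Hrs" to its
    time-domain sound pick-up signal (a list of samples).
    """
    return {name: cut_out(r, filter_length) for name, r in responses.items()}
```

Any correction of the measured characteristics mentioned in the text would be applied before or after this step; it is omitted here because the text does not describe it.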

（外耳道伝達特性の測定） (Measurement of ear canal transmission characteristics)
次に、外耳道伝達特性を測定するための測定装置２００について、図３を用いて説明する。図３は、被測定者１に対して第２の事前測定を行うための構成を示している。 Next, the measuring device 200 for measuring the external auditory canal transmission characteristic will be described with reference to FIG. 3. FIG. 3 shows a configuration for performing the second pre-measurement on the person to be measured 1.

測定処理装置２０１には、マイクユニット２と、ヘッドホン４３と、が接続されている。マイクユニット２は、左マイク２Ｌと、右マイク２Ｒとを備えている。左マイク２Ｌは、被測定者１の左耳９Ｌに装着される。右マイク２Ｒは、被測定者１の右耳９Ｒに装着される。測定処理装置２０１、及びマイクユニット２は、図２の測定処理装置２０１、及びマイクユニット２と同じものでもよく、異なるものでもよい。 The microphone unit 2 and the headphones 43 are connected to the measurement processing device 201. The microphone unit 2 includes a left microphone 2L and a right microphone 2R. The left microphone 2L is attached to the left ear 9L of the person to be measured 1. The right microphone 2R is attached to the right ear 9R of the person to be measured 1. The measurement processing device 201 and the microphone unit 2 may be the same as or different from the measurement processing device 201 and the microphone unit 2 of FIG.

ヘッドホン４３は、ヘッドホンバンド４３Ｂと、左ユニット４３Ｌと、右ユニット４３Ｒとを、有している。ヘッドホンバンド４３Ｂは、左ユニット４３Ｌと右ユニット４３Ｒとを連結する。左ユニット４３Ｌは被測定者１の左耳９Ｌに向かって音を出力する。右ユニット４３Ｒは被測定者１の右耳９Ｒに向かって音を出力する。ヘッドホン４３は密閉型、開放型、半開放型、または半密閉型等、ヘッドホンの種類を問わない。ヘッドホン４３は、ヘッドホン４３が装着された状態で、マイクユニット２が被測定者１に装着される。すなわち、左マイク２Ｌ、右マイク２Ｒが装着された左耳９Ｌ、右耳９Ｒにヘッドホン４３の左ユニット４３Ｌ、右ユニット４３Ｒがそれぞれ装着される。ヘッドホンバンド４３Ｂは、左ユニット４３Ｌと右ユニット４３Ｒとをそれぞれ左耳９Ｌ、右耳９Ｒに押し付ける付勢力を発生する。 The headphones 43 have a headphone band 43B, a left unit 43L, and a right unit 43R. The headphone band 43B connects the left unit 43L and the right unit 43R. The left unit 43L outputs sound toward the left ear 9L of the person to be measured 1. The right unit 43R outputs sound toward the right ear 9R of the person to be measured 1. The headphones 43 may be of any type, such as a closed type, an open type, a semi-open type, or a semi-closed type. The headphones 43 are worn by the person to be measured 1 with the microphone unit 2 attached. That is, the left unit 43L and the right unit 43R of the headphones 43 are placed on the left ear 9L and the right ear 9R to which the left microphone 2L and the right microphone 2R are attached, respectively. The headphone band 43B generates an urging force that presses the left unit 43L and the right unit 43R against the left ear 9L and the right ear 9R, respectively.

左マイク２Ｌは、ヘッドホン４３の左ユニット４３Ｌから出力された音を収音する。右マイク２Ｒは、ヘッドホン４３の右ユニット４３Ｒから出力された音を収音する。左マイク２Ｌ、及び右マイク２Ｒのマイク部は、外耳孔近傍の収音位置に配置される。左マイク２Ｌ、及び右マイク２Ｒは、ヘッドホン４３に干渉しないように構成されている。すなわち、左マイク２Ｌ、及び右マイク２Ｒは左耳９Ｌ、右耳９Ｒの適切な位置に配置された状態で、被測定者１がヘッドホン４３を装着することができる。なお、左マイク２Ｌ、及び右マイク２Ｒは、それぞれヘッドホン４３の左ユニット４３Ｌ、及び右ユニット４３Ｒに内蔵されていてもよく、ヘッドホン４３と別個に設けられていても良い。 The left microphone 2L collects the sound output from the left unit 43L of the headphones 43. The right microphone 2R collects the sound output from the right unit 43R of the headphones 43. The microphone portions of the left microphone 2L and the right microphone 2R are arranged at sound collecting positions near the outer ear canal. The left microphone 2L and the right microphone 2R are configured so as not to interfere with the headphone 43. That is, the subject 1 can wear the headphones 43 in a state where the left microphone 2L and the right microphone 2R are arranged at appropriate positions of the left ear 9L and the right ear 9R. The left microphone 2L and the right microphone 2R may be built in the left unit 43L and the right unit 43R of the headphone 43, respectively, or may be provided separately from the headphone 43.

測定処理装置２０１は、左ユニット４３Ｌ、及び右ユニット４３Ｒに対して測定信号を出力する。これにより、左ユニット４３Ｌ、及び右ユニット４３Ｒはインパルス音などを発生する。具体的には、左ユニット４３Ｌから出力されたインパルス音を左マイク２Ｌで測定する。右ユニット４３Ｒから出力されたインパルス音を右マイク２Ｒで測定する。このようにすることで、インパルス応答測定が実施される。 The measurement processing device 201 outputs a measurement signal to the left unit 43L and the right unit 43R. As a result, the left unit 43L and the right unit 43R generate an impulse sound or the like. Specifically, the impulse sound output from the left unit 43L is measured by the left microphone 2L. The impulse sound output from the right unit 43R is measured by the right microphone 2R. In this way, the impulse response measurement is performed.

測定処理装置２０１は、インパルス応答測定に基づく収音信号をメモリなどに記憶する。これにより、左ユニット４３Ｌと左マイク２Ｌとの間の伝達特性（すなわち、左耳の外耳道伝達特性）と、右ユニット４３Ｒと右マイク２Ｒとの間の伝達特性（すなわち、右耳の外耳道伝達特性）が取得される。ここで、左マイク２Ｌが取得した左耳の外耳道伝達特性の測定データを測定データＥＣＴＦＬとし、右マイク２Ｒが取得した右耳の外耳道伝達特性の測定データを測定データＥＣＴＦＲとする。 The measurement processing device 201 stores a sound pick-up signal based on the impulse response measurement in a memory or the like. As a result, the transmission characteristic between the left unit 43L and the left microphone 2L (that is, the ear canal transmission characteristic of the left ear) and the transmission characteristic between the right unit 43R and the right microphone 2R (that is, the ear canal transmission characteristic of the right ear). ) Is acquired. Here, the measurement data of the external auditory canal transmission characteristic of the left ear acquired by the left microphone 2L is referred to as measurement data ECTFL, and the measurement data of the external auditory canal transmission characteristic of the right ear acquired by the right microphone 2R is referred to as measurement data ECTFR.

測定処理装置２０１は、測定データＥＣＴＦＬ、ＥＣＴＦＲをそれぞれ記憶するメモリなどを有している。なお、測定処理装置２０１は、外耳道伝達特性又は空間音響伝達特性を測定するための測定信号として、インパルス信号やＴＳＰ（ＴｉｍｅＳｔｒｅｔｃｈｅｄＰｕｌｓｅ）信号等を発生する。測定信号はインパルス音等の測定音を含んでいる。 The measurement processing device 201 has a memory for storing measurement data ECTFL and ECTFR, respectively. The measurement processing device 201 generates an impulse signal, a TSP (Time Stretched Pulse) signal, or the like as a measurement signal for measuring the external auditory canal transmission characteristic or the spatial acoustic transmission characteristic. The measurement signal includes a measurement sound such as an impulse sound.

図２、図３で示した測定装置２００によって、複数の被測定者１の外耳道伝達特性、及び空間音響伝達特性を測定する。本実施の形態では、図２の測定構成による第１の事前測定を複数の被測定者１に対して実施する。同様に、図３の測定構成による第２の事前測定を複数の被測定者１に対して実施する。これにより、被測定者１毎に、外耳道伝達特性、及び空間音響伝達特性が測定される。 The external auditory canal transmission characteristics and the spatial acoustic transmission characteristics of the plurality of subjects 1 are measured by the measuring device 200 shown in FIGS. 2 and 3. In the present embodiment, the first pre-measurement according to the measurement configuration of FIG. 2 is performed on a plurality of subjects 1. Similarly, the second pre-measurement according to the measurement configuration of FIG. 3 is performed on a plurality of subjects 1. As a result, the external auditory canal transmission characteristic and the spatial acoustic transmission characteristic are measured for each person to be measured 1.

（頭外定位フィルタ決定システム） (Out-of-head localization filter determination system)
次に、本実施の形態にかかる頭外定位フィルタ決定システム５００について、図４を用いて説明する。図４は、頭外定位フィルタ決定システム５００の全体構成を示す図である。頭外定位フィルタ決定システム５００は、頭外定位処理装置１００と、サーバ装置３００と、を備えている。 Next, the out-of-head localization filter determination system 500 according to the present embodiment will be described with reference to FIG. 4. FIG. 4 is a diagram showing the overall configuration of the out-of-head localization filter determination system 500. The out-of-head localization filter determination system 500 includes an out-of-head localization processing device 100 and a server device 300.

頭外定位処理装置１００とサーバ装置３００とは、ネットワーク４００を介して接続されている。ネットワーク４００は、例えば、インターネットや携帯電話通信網などの公衆ネットワークなどである。頭外定位処理装置１００とサーバ装置３００とは無線又は有線により通信可能になっている。なお、頭外定位処理装置１００とサーバ装置３００とは一体の装置であってもよい。 The out-of-head localization processing device 100 and the server device 300 are connected via a network 400. The network 400 is, for example, a public network such as the Internet or a mobile phone communication network. The out-of-head localization processing device 100 and the server device 300 can communicate with each other wirelessly or by wire. The out-of-head localization processing device 100 and the server device 300 may be integrated devices.

頭外定位処理装置１００は、図１で示したように、頭外定位処理された再生信号をユーザＵに出力するユーザ端末となる。さらに、頭外定位処理装置１００は、ユーザＵの外耳道伝達特性の測定を行う。そのため、頭外定位処理装置１００には、マイクユニット２とヘッドホン４３とが接続されている。頭外定位処理装置１００は、図３の測定装置２００と同様に、マイクユニット２と、ヘッドホン４３とを用いたインパルス応答測定を行う。なお、マイクユニット２、及びヘッドホン４３とＢｌｕｅＴｏｏｔｈ（登録商標）などにより無線接続されていてもよい。 As shown in FIG. 1, the out-of-head localization processing device 100 is a user terminal that outputs a reproduction signal that has undergone out-of-head localization processing to the user U. Further, the out-of-head localization processing device 100 measures the external auditory canal transmission characteristic of the user U. For this purpose, the microphone unit 2 and the headphones 43 are connected to the out-of-head localization processing device 100. The out-of-head localization processing device 100 performs impulse response measurement using the microphone unit 2 and the headphones 43, similarly to the measuring device 200 of FIG. 3. The out-of-head localization processing device 100 may be wirelessly connected to the microphone unit 2 and the headphones 43 by Bluetooth (registered trademark) or the like.

頭外定位処理装置１００は、特性取得部１１１、周波数変換部１１２，平滑化部１１３、特徴量抽出部１１４と、逆フィルタ算出部１１５と、フィルタ設定部１１６と、送信部１２１と、受信部１２２を備えている。なお、頭外定位処理装置１００とサーバ装置３００とが一体の装置である場合、該装置は受信部１２２に代えてユーザデータを取得する取得部を備えていてもよい。 The out-of-head localization processing device 100 includes a characteristic acquisition unit 111, a frequency conversion unit 112, a smoothing unit 113, a feature amount extraction unit 114, an inverse filter calculation unit 115, a filter setting unit 116, a transmission unit 121, and a reception unit 122. When the out-of-head localization processing device 100 and the server device 300 are an integrated device, the device may include an acquisition unit that acquires user data instead of the reception unit 122.

特性取得部１１１は、ユーザ測定を行うため、インパルス音となる測定信号をヘッドホン４３に出力する。ヘッドホン４３が出力したインパルス音をマイクユニット２が収音する。マイクユニット２は収音信号を特性取得部１１１に出力する。なお、インパルス応答測定については、図３の説明と同様であるため、適宜説明を省略する。すなわち、頭外定位処理装置１００が、図３の測定処理装置２０１と同様の機能を有している。頭外定位処理装置１００と、マイクユニット２と、ヘッドホン４３とがユーザ測定を行う。特性取得部１１１は、収音信号に対して、Ａ／Ｄ変換や同期加算処理などを行ってもよい。 The characteristic acquisition unit 111 outputs a measurement signal that becomes an impulse sound to the headphones 43 in order to perform user measurement. The microphone unit 2 collects the impulse sound output by the headphones 43. The microphone unit 2 outputs a sound pick-up signal to the characteristic acquisition unit 111. Since the impulse response measurement is the same as that described in FIG. 3, the description thereof will be omitted as appropriate. That is, the out-of-head localization processing device 100 has the same function as the measurement processing device 201 of FIG. The out-of-head localization processing device 100, the microphone unit 2, and the headphones 43 perform user measurement. The characteristic acquisition unit 111 may perform A / D conversion, synchronous addition processing, or the like on the sound pick-up signal.
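The synchronous addition mentioned above is commonly used to improve the signal-to-noise ratio of an impulse response measurement by summing time-aligned repetitions of the same measurement. A minimal sketch follows, assuming the repeated sound pick-up signals are already aligned to the start of each measurement signal and that the sum is normalized by the number of repetitions. Both are assumptions, and the function name is hypothetical.

```python
def synchronous_average(recordings):
    """Average several time-aligned recordings of the same measurement.

    recordings is a list of equally sampled sound pick-up signals; the
    result is truncated to the shortest recording.
    """
    n = min(len(r) for r in recordings)
    count = len(recordings)
    return [sum(r[i] for r in recordings) / count for i in range(n)]
```

Uncorrelated noise tends to average out across repetitions, while the response to the measurement signal adds coherently.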

特性取得部１１１は、外耳道伝達特性に関する測定データを取得する。特性取得部１１１は、インパルス応答測定等を行うために、図３の測定処理装置２０１と同様の機能を有している。測定データは、ユーザＵの左耳９Ｌの外耳道伝達特性に関する測定データと、右耳９Ｒの外耳道伝達特性に関する測定データとを含んでいる。ここで、時間領域の外耳道伝達特性をｆ（ｔ）とする。 The characteristic acquisition unit 111 acquires measurement data regarding the external auditory canal transmission characteristic. The characteristic acquisition unit 111 has the same function as the measurement processing device 201 of FIG. 3 for performing impulse response measurement and the like. The measurement data includes the measurement data regarding the external auditory canal transmission characteristic of the left ear 9L of the user U and the measurement data regarding the external auditory canal transmission characteristic of the right ear 9R. Here, let f (t) be the external auditory canal transmission characteristic in the time domain.

以下、図４と共に、図５を参照して外耳道伝達特性の特徴量を求める処理について説明する。図５は、特徴量を抽出する処理を示すフローチャートである。 Hereinafter, the process of obtaining the feature amount of the external auditory canal transmission characteristic will be described with reference to FIG. 5 together with FIG. 4. FIG. 5 is a flowchart showing the process of extracting the feature amount.

周波数変換部１１２が、外耳道伝達特性ｆ（ｔ）を周波数変換する（Ｓ１１）。例えば、周波数変換部１１２が、時間領域の外耳道伝達特性に対して離散フーリエ変換を行うことで、周波数振幅特性及び周波数位相特性を算出する。また、周波数変換部１１２は、離散フーリエ変換に限らず、離散コサイン変換などにより、周波数振幅特性及び周波数位相特性を算出してもよい。周波数振幅特性の代わりに、周波数パワー特性が用いられていてもよい。周波数変換で得られた周波数振幅特性をＦ（ｆ）とする。 The frequency conversion unit 112 frequency-converts the external auditory canal transmission characteristic f (t) (S11). For example, the frequency conversion unit 112 calculates the frequency amplitude characteristic and the frequency phase characteristic by performing a discrete Fourier transform on the external auditory canal transmission characteristic in the time domain. Further, the frequency conversion unit 112 may calculate the frequency amplitude characteristic and the frequency phase characteristic not only by the discrete Fourier transform but also by the discrete cosine transform or the like. The frequency power characteristic may be used instead of the frequency amplitude characteristic. Let F (f) be the frequency amplitude characteristic obtained by frequency conversion.
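Step S11 can be illustrated with a direct discrete Fourier transform. The sketch below is written for clarity rather than speed (a practical implementation would use an FFT), and the function names are hypothetical. It returns the frequency amplitude characteristic F(f) and the frequency phase characteristic for the non-negative frequency bins of a real-valued f(t).

```python
import cmath
import math


def dft(x):
    """Direct O(N^2) discrete Fourier transform of a real or complex sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def amplitude_and_phase(f_t):
    """Return (amplitude, phase) for bins 0 .. N/2 of the time signal f_t."""
    spectrum = dft(f_t)
    half = len(f_t) // 2 + 1  # bins above N/2 mirror these for real input
    amplitude = [abs(c) for c in spectrum[:half]]
    phase = [cmath.phase(c) for c in spectrum[:half]]
    return amplitude, phase
```

If the frequency power characteristic is used instead, each amplitude value would simply be squared.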

平滑化部１１３が、周波数振幅特性Ｆ（ｆ）を平滑化する（Ｓ１２）。平滑化手法としては、ケプストラム分析、単純移動平均、Savitzky-Golayフィルタ、平滑化スプライン、などを用いることができる。平滑化された周波数振幅特性を平滑化特性ＳＦ（ｆ）とする。 The smoothing unit 113 smoothes the frequency amplitude characteristic F (f) (S12). As the smoothing method, cepstrum analysis, simple moving average, Savitzky-Golay filter, smoothing spline, and the like can be used. The smoothed frequency amplitude characteristic is defined as the smoothing characteristic SF (f).
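Of the smoothing methods listed for step S12, the simple moving average is the easiest to sketch. The window width and the handling of the edges (here, the window is shrunk near both ends) are assumptions; the description does not specify them. The function name is hypothetical.

```python
def moving_average(values, window):
    """Centered simple moving average; the window shrinks at the edges."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed
```

Applied to F(f), the result plays the role of the smoothing characteristic SF(f). A wider window removes more fine structure, which leaves fewer peaks and notches to detect.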

特徴量抽出部１１４が、平滑化特性ＳＦ（ｆ）のピーク及びノッチを検出し、特徴量を抽出する（Ｓ１３）。具体的には、平滑化特性ＳＦ（ｆ）の傾きに基づいて、ピーク及びノッチを検出する。平滑化特性ＳＦ（ｆ）の極大値がピークとなり、極小値がノッチとなる。図６は、平滑化特性を模式的に示す図である。ここでは、平滑化特性に３つのピークと２つのノッチが含まれている例を示す。 The feature amount extraction unit 114 detects the peak and notch of the smoothing characteristic SF (f) and extracts the feature amount (S13). Specifically, peaks and notches are detected based on the slope of the smoothing characteristic SF (f). The maximum value of the smoothing characteristic SF (f) is the peak, and the minimum value is the notch. FIG. 6 is a diagram schematically showing smoothing characteristics. Here, an example is shown in which the smoothing characteristic includes three peaks and two notches.

最も低い周波数のピークを１番目のピークＰ［１］とし、最も低い周波数のノッチを１番目のノッチＮ［１］とする。同様に周波数が低い側から順番に、２番目のピークＰ［２］、２番目のノッチＮ［２］、３番目のピークＰ［３］とする。ピーク、及びノッチをＰ［ｌ］、Ｎ［ｌ］として一般化する。ｌは１以上の整数であり、ピーク及びノッチの番号を示す。 The peak with the lowest frequency is referred to as the first peak P [1], and the notch with the lowest frequency is referred to as the first notch N [1]. Similarly, the second peak P [2], the second notch N [2], and the third peak P [3] are set in order from the lower frequency side. Peaks and notches are generalized as P [l], N [l]. l is an integer of 1 or more and indicates a peak and notch number.
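Detecting peaks and notches from the slope of SF(f), as in step S13, can be sketched by looking for sign changes in the difference between neighboring bins. This is one simple way to realize the idea, not necessarily the one used in the embodiment. Flat plateaus are ignored for brevity, and the function name is hypothetical.

```python
def find_peaks_and_notches(sf):
    """Return (peak_indices, notch_indices) of a smoothed characteristic.

    A peak is a bin where the slope changes from positive to negative
    (local maximum); a notch is a bin where it changes from negative to
    positive (local minimum). Indices are in ascending frequency order,
    so peak_indices[0] corresponds to P[1] and notch_indices[0] to N[1].
    """
    peaks, notches = [], []
    for i in range(1, len(sf) - 1):
        slope_before = sf[i] - sf[i - 1]
        slope_after = sf[i + 1] - sf[i]
        if slope_before > 0 and slope_after < 0:
            peaks.append(i)
        elif slope_before < 0 and slope_after > 0:
            notches.append(i)
    return peaks, notches
```

For a characteristic shaped like the example in FIG. 6, this yields three peak indices and two notch indices.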

特徴量抽出部１１４は、ピーク周波数、ピーク周波数での振幅（利得）、ノッチ周波数、ノッチ周波数での振幅（利得）を特徴量として抽出する。なお、ピーク周波数での振幅、及びノッチ周波数での振幅の値を、それぞれピーク値、及びノッチ値とする。Ｐ［１］〜Ｐ［３］のピーク周波数をそれぞれｆｐ１〜ｆｐ３とし、ピーク値をｇｐ１〜ｇｐ３とする。Ｎ［１］〜Ｎ［２］のノッチ周波数をそれぞれｆｎ１〜ｆｎ２とし、ノッチ値をｇｎ１〜ｇｎ２とする。ピーク値、及びノッチ値は、平滑前の周波数振幅特性Ｆ（ｆ）の振幅値とするが、平滑化特性ＳＦ（ｆ）の振幅値であってもよい。もちろん、周波数パワー特性の場合は、ピーク値、及びノッチ値はパワー値となる。 The feature amount extraction unit 114 extracts the peak frequency, the amplitude (gain) at the peak frequency, the notch frequency, and the amplitude (gain) at the notch frequency as the feature amount. The amplitude at the peak frequency and the amplitude at the notch frequency are referred to as the peak value and the notch value, respectively. The peak frequencies of P [1] to P [3] are fp1 to fp3, respectively, and the peak values are gp1 to gp3. The notch frequencies of N [1] to N [2] are fn1 to fn2, respectively, and the notch values are gn1 to gn2. The peak value and the notch value are the amplitude values of the frequency amplitude characteristic F (f) before smoothing, but may be the amplitude values of the smoothing characteristic SF (f). Of course, in the case of the frequency power characteristic, the peak value and the notch value are power values.

ピークＰ［１］は（ｇｐ１，ｆｐ１）の２次元ベクトルで示される。同様に、ピークＰ［２］、Ｐ［３］はそれぞれ（ｇｐ２，ｆｐ２）、（ｇｐ３，ｆｐ３）の２次元ベクトルで示される。ノッチＮ［１］、Ｎ［２］はそれぞれ（ｇｎ１，ｆｎ１）、（ｇｎ２，ｆｎ２）の２次元ベクトルで示される。 The peak P [1] is represented by a two-dimensional vector of (gp1, fp1). Similarly, peaks P [2] and P [3] are represented by two-dimensional vectors of (gp2, fp2) and (gp3, fp3), respectively. Notches N [1] and N [2] are represented by two-dimensional vectors of (gn1, fn1) and (gn2, fn2), respectively.

全てのピーク及びノッチの周波数、及び振幅値を特徴量として抽出する。特徴量抽出部１１４が抽出した特徴量は、特徴ベクトルとして示される。特徴ベクトルは、ピーク数とノッチ数との和に応じた次元となっている。具体的には、ピーク数をｌ＿ｍａｘ、ノッチ数をｍ＿ｍａｘとすると、特徴ベクトルの次元数は、［２×（ｌ＿ｍａｘ＋ｍ＿ｍａｘ）］となる。例えば、ピークの数が３つ、ノッチの数が２つの場合、特徴量のベクトルは、（ｆｐ１、ｇｐ１、ｆｎ１、ｇｎ１、ｆｐ２、ｇｐ２、ｆｎ２、ｇｎ２、ｆｐ３、ｇｐ３）の１０次元となる。このように、特徴量抽出部１１４は、外耳道伝達特性の特徴量を抽出する。特徴量抽出部１１４は、左右の外耳道伝達特性の特徴量をそれぞれ抽出する。 All peak and notch frequencies and amplitude values are extracted as features. The feature amount extracted by the feature amount extraction unit 114 is shown as a feature vector. The feature vector has a dimension corresponding to the sum of the number of peaks and the number of notches. Specifically, assuming that the number of peaks is l_max and the number of notches is m_max, the number of dimensions of the feature vector is [2 × (l_max + m_max)]. For example, when the number of peaks is 3 and the number of notches is 2, the feature amount vector has 10 dimensions (fp1, gp1, fn1, gn1, fp2, gp2, fn2, gn2, fp3, gp3). In this way, the feature amount extraction unit 114 extracts the feature amount of the external auditory canal transmission characteristic. The feature amount extraction unit 114 extracts the feature amount of the left and right ear canal transmission characteristics, respectively.
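Assembling the feature vector in the order given in the example (frequency followed by amplitude, extrema in ascending frequency) can be sketched as below. The function name and the argument layout (one frequency and one amplitude per bin, plus lists of peak and notch bin indices) are hypothetical.

```python
def feature_vector(freqs, amplitude, peak_idx, notch_idx):
    """Build a flat feature vector of (frequency, amplitude) pairs.

    Peaks and notches are merged and sorted by bin index, so alternating
    extrema come out as (fp1, gp1, fn1, gn1, fp2, gp2, ...). The length is
    2 * (number of peaks + number of notches).
    """
    vector = []
    for i in sorted(list(peak_idx) + list(notch_idx)):
        vector.extend([freqs[i], amplitude[i]])
    return vector
```

With three peaks and two notches the vector has 2 × (3 + 2) = 10 elements, matching the dimension stated in the text. The same function would be called once for the left ear and once for the right ear.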

なお、特徴量抽出部１１４がピーク及びノッチを求める帯域は全周波数帯域であってもよく、一部の周波数帯域であってもよい。例えば、平滑化特性において、４ｋＨｚ以上の周波数帯域におけるピークとノッチを求めてもよい。 The band for which the feature amount extraction unit 114 obtains the peak and the notch may be the entire frequency band or a part of the frequency band. For example, in the smoothing characteristics, peaks and notches in a frequency band of 4 kHz or higher may be obtained.
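Restricting the search to part of the frequency band amounts to mapping each DFT bin to its frequency and discarding extrema outside the band. A small sketch follows, assuming an N-point transform at a known sampling rate. The function names are hypothetical, and the 4 kHz default simply mirrors the example in the text.

```python
def bin_frequency(k, n_fft, sample_rate):
    """Center frequency in Hz of DFT bin k for an n_fft-point transform."""
    return k * sample_rate / n_fft


def extrema_in_band(indices, n_fft, sample_rate, f_min=4000.0):
    """Keep only the peak or notch bin indices at or above f_min."""
    return [k for k in indices if bin_frequency(k, n_fft, sample_rate) >= f_min]
```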

送信部１２１は、特徴量抽出部１１４が抽出した特徴量をユーザデータとして、サーバ装置３００に送信する。ユーザデータは外耳道伝達特性に関するデータである。具体的には、ユーザデータは、ユーザの外耳道伝達特性の特徴量を含んでいる。ユーザの外耳道伝達特性の特徴量をユーザ特徴量ｈｐＬ＿Ｕ、ｈｐＲ＿Ｕとする。ユーザ特徴量ｈｐＬ＿Ｕはユーザの左耳の外耳道伝達特性の特徴量であり、ユーザ特徴量ｈｐＲ＿Ｕはユーザの右耳の外耳道伝達特性の特徴量である。 The transmission unit 121 transmits the feature amount extracted by the feature amount extraction unit 114 to the server device 300 as user data. The user data is data related to the external auditory canal transmission characteristics. Specifically, the user data includes the feature amounts of the user's external auditory canal transmission characteristics. The feature amounts of the user's external auditory canal transmission characteristics are referred to as user feature amounts hpL_U and hpR_U. The user feature amount hpL_U is the feature amount of the external auditory canal transmission characteristic of the user's left ear, and the user feature amount hpR_U is the feature amount of the external auditory canal transmission characteristic of the user's right ear.

逆フィルタ算出部１１５は外耳道伝達特性ｆ（ｔ）に基づいて、逆フィルタを算出する。例えば、逆フィルタ算出部１１５は、外耳道伝達特性ｆ（ｔ）の周波数振幅特性Ｆ（ｆ）や周波数位相特性を補正する。逆フィルタ算出部１１５は、逆離散フーリエ変換により、周波数特性と位相特性とを用いて時間信号を算出する。逆フィルタ算出部１１５は、時間信号を所定のフィルタ長で切り出すことで、逆フィルタを算出する。頭外定位処理装置１００は、逆フィルタをメモリなどに格納する。 The inverse filter calculation unit 115 calculates the inverse filter based on the external auditory canal transmission characteristic f (t). For example, the inverse filter calculation unit 115 corrects the frequency amplitude characteristic F (f) and the frequency phase characteristic of the external auditory canal transmission characteristic f (t). The inverse filter calculation unit 115 calculates a time signal using the frequency characteristic and the phase characteristic by the inverse discrete Fourier transform. The inverse filter calculation unit 115 calculates an inverse filter by cutting out a time signal with a predetermined filter length. The out-of-head localization processing device 100 stores the inverse filter in a memory or the like.

上記のように、逆フィルタはヘッドホン特性（ヘッドホンの再生ユニットとマイク間の特性）をキャンセルするフィルタである。頭外定位処理装置１００は、逆フィルタ算出部１１５が算出した左右の逆フィルタを記憶する。なお、逆フィルタの算出方法については、公知の手法を用いることができるため、詳細な説明を省略する。逆フィルタ算出部１１５は、左耳の外耳道伝達特性に基づいて、左の逆フィルタＬｉｎｖを生成する。逆フィルタ算出部１１５は、右耳の外耳道伝達特性に基づいて、右の逆フィルタＲｉｎｖを生成する。 As described above, the inverse filter is a filter that cancels the headphone characteristics (characteristics between the headphone playback unit and the microphone). The out-of-head localization processing device 100 stores the left and right inverse filters calculated by the inverse filter calculation unit 115. As for the calculation method of the inverse filter, a known method can be used, and therefore detailed description thereof will be omitted. The inverse filter calculation unit 115 generates the left inverse filter Linv based on the external auditory canal transmission characteristic of the left ear. The inverse filter calculation unit 115 generates the right inverse filter Rinv based on the external auditory canal transmission characteristic of the right ear.

なお、頭外定位処理装置１００が上記の処理を行っているが、一部又は全ての処理は、サーバ装置３００で行われていてもよい。例えば、頭外定位処理装置１００が測定した外耳道伝達特性ｆ（ｔ）をサーバ装置３００に送信して、サーバ装置３００が、周波数変換、平滑化、特徴量抽出の処理を行ってもよい。周波数変換、平滑化、特徴量抽出の処理は、サーバ装置３００及び頭外定位処理装置１００のいずれで行われてもよい。さらには、頭外定位処理装置１００又はサーバ装置３００以外の装置が一部の処理を行ってもよい。 Although the out-of-head localization processing device 100 performs the above processing, some or all of the processing may be performed by the server device 300. For example, the external auditory canal transmission characteristic f (t) measured by the out-of-head localization processing device 100 may be transmitted to the server device 300, and the server device 300 may perform frequency conversion, smoothing, and feature extraction processing. The frequency conversion, smoothing, and feature amount extraction processing may be performed by either the server device 300 or the out-of-head localization processing device 100. Furthermore, a device other than the out-of-head localization processing device 100 or the server device 300 may perform some processing.

サーバ装置３００について説明する。サーバ装置３００は、受信部３０１と、データ抽出部３０２と、データ格納部３０３と、比較部３０４と、選択部３０５と、送信部３０６とを備えている。 The server device 300 will be described. The server device 300 includes a receiving unit 301, a data extraction unit 302, a data storage unit 303, a comparison unit 304, a selection unit 305, and a transmission unit 306.

受信部３０１は、頭外定位処理装置１００から送信されたユーザデータを受信する。ユーザデータはユーザ特徴量ｈｐＬ＿Ｕ、ｈｐＲ＿Ｕを含んでいる。データ抽出部３０２は、ユーザ特徴量に基づいて、データ格納部３０３に格納されているプリセットデータの一部を抽出する。 The receiving unit 301 receives the user data transmitted from the out-of-head localization processing device 100. The user data includes user feature amounts hpL_U and hpR_U. The data extraction unit 302 extracts a part of the preset data stored in the data storage unit 303 based on the user feature amount.

データ格納部３０３は、事前測定で測定された複数の被測定者に関するデータをプリセットデータとして格納するデータベースである。データベースは、複数の記憶装置に分散されていてもよい。図７を参照して、データ格納部３０３に格納されているデータについて、説明する。図７は、データ格納部３０３に格納されているデータを示す表である。 The data storage unit 303 is a database that stores data related to a plurality of subjects measured in advance measurement as preset data. The database may be distributed across multiple storage devices. The data stored in the data storage unit 303 will be described with reference to FIG. 7. FIG. 7 is a table showing the data stored in the data storage unit 303.

データ格納部３０３は、被測定者の左右の耳毎にプリセットデータを格納している。具体的には、データ格納部３０３は、被測定者ＩＤ、耳の左右、特徴量、空間音響伝達特性１、及び空間音響伝達特性２が１行に並んだテーブル形式となっている。なお、図７に示すデータ形式は一例であり、テーブル形式ではなく、各パラメータのオブジェクトをタグ等で関連付けて保持するデータ形式等を採用してもよい。 The data storage unit 303 stores preset data for each of the left and right ears of the person to be measured. Specifically, the data storage unit 303 has a table format in which the subject ID, the left and right ears, the feature amount, the spatial acoustic transmission characteristic 1, and the spatial acoustic transmission characteristic 2 are arranged in one row. The data format shown in FIG. 7 is an example, and a data format or the like in which objects of each parameter are associated with each other by a tag or the like may be adopted instead of the table format.

データ格納部３０３には、１人の被測定者Ａに対して、２つのデータセットが格納されている。すなわち、データ格納部３０３は、被測定者Ａの左耳に関するデータセットと、被測定者Ａの右耳に関するデータセットが格納されている。 Two data sets are stored in the data storage unit 303 for one person A to be measured. That is, the data storage unit 303 stores a data set relating to the left ear of the subject A and a data set relating to the right ear of the subject A.

１つのデータセットには、被測定者ＩＤ、耳の左右、特徴量、空間音響伝達特性１、及び空間音響伝達特性２が含まれている。特徴量は、図３に示す測定装置２００による第２の事前測定に基づくデータである。特徴量は、外耳孔よりも前にある第１の位置からマイク２Ｌ、２Ｒまでの外耳道伝達特性から抽出されている。具体的には、被測定者１の外耳道伝達特性に対して、図５の処理を施すことで、特徴量が抽出される。ユーザの特徴量と、被測定者の特徴量は、それぞれの外耳道伝達特性に対して同様の処理を施すことで、抽出される。 One data set includes the subject ID, the ear side (left or right), the feature amount, the spatial acoustic transmission characteristic 1, and the spatial acoustic transmission characteristic 2. The feature amount is data based on the second pre-measurement by the measuring device 200 shown in FIG. 3. The feature amount is extracted from the external auditory canal transmission characteristics from the first position, located in front of the ear canal opening, to the microphones 2L and 2R. Specifically, the feature amount is extracted by applying the process of FIG. 5 to the external auditory canal transmission characteristic of the subject 1. The feature amount of the user and the feature amount of the subject are extracted by applying the same processing to their respective external auditory canal transmission characteristics.

被測定者Ａの左耳の特徴量は、特徴量ｈｐＬ＿Ａと示し、被測定者Ａの右耳の特徴量は、特徴量ｈｐＲ＿Ａと示している。被測定者Ｂの左耳の特徴量は、特徴量ｈｐＬ＿Ｂと示し、被測定者Ｂの右耳の特徴量は、特徴量ｈｐＲ＿Ｂと示している。 The feature amount of the left ear of the subject A is shown as the feature amount hpL_A, and the feature amount of the right ear of the subject A is shown as the feature amount hpR_A. The feature amount of the left ear of the subject B is shown as the feature amount hpL_B, and the feature amount of the right ear of the subject B is shown as the feature amount hpR_B.

空間音響伝達特性１、及び空間音響伝達特性２は、図２に示す測定装置２００による第１の事前測定に基づくデータである。被測定者Ａの左耳の場合、空間音響伝達特性１はＨｌｓ＿Ａとなり、空間音響伝達特性２は、Ｈｒｏ＿Ａとなる。被測定者Ａの右耳の場合、空間音響伝達特性１はＨｒｓ＿Ａとなり、空間音響伝達特性２は、Ｈｌｏ＿Ａとなる。このように、１つの耳に関する２つの空間音響伝達特性がペアとなっている。被測定者Ｂの左耳については、Ｈｌｓ＿ＢとＨｒｏ＿Ｂがペアとなり、被測定者Ｂの右耳については、Ｈｒｓ＿ＢとＨｌｏ＿Ｂがペアとなっている。空間音響伝達特性１、及び空間音響伝達特性２は、フィルタ長で切り出された後のデータでもよく、フィルタ長で切り出される前のデータでもよい。 The spatial acoustic transmission characteristic 1 and the spatial acoustic transmission characteristic 2 are data based on the first pre-measurement by the measuring device 200 shown in FIG. 2. For the left ear of the subject A, the spatial acoustic transmission characteristic 1 is Hls_A and the spatial acoustic transmission characteristic 2 is Hro_A. For the right ear of the subject A, the spatial acoustic transmission characteristic 1 is Hrs_A and the spatial acoustic transmission characteristic 2 is Hlo_A. In this way, the two spatial acoustic transmission characteristics for one ear form a pair. For the left ear of the subject B, Hls_B and Hro_B are paired, and for the right ear of the subject B, Hrs_B and Hlo_B are paired. The spatial acoustic transmission characteristic 1 and the spatial acoustic transmission characteristic 2 may be data after being cut out to the filter length, or data before being cut out to the filter length.

被測定者Ａの左耳については、特徴量ｈｐＬ＿Ａと、空間音響伝達特性Ｈｌｓ＿Ａと、空間音響伝達特性Ｈｒｏ＿Ａとが対応付けられて、１つのデータセットとなっている。同様に、被測定者Ａの右耳については、特徴量ｈｐＲ＿Ａと、空間音響伝達特性Ｈｒｓ＿Ａと、空間音響伝達特性Ｈｌｏ＿Ａとが対応付けられて、１つのデータセットとなっている。同様に、被測定者Ｂの左耳については、特徴量ｈｐＬ＿Ｂと、空間音響伝達特性Ｈｌｓ＿Ｂと、空間音響伝達特性Ｈｒｏ＿Ｂとが対応付けられて、１つのデータセットとなっている。同様に、被測定者Ｂの右耳については、特徴量ｈｐＲ＿Ｂと、空間音響伝達特性Ｈｒｓ＿Ｂと、空間音響伝達特性Ｈｌｏ＿Ｂとが対応付けて、１つのデータセットとなっている。 For the left ear of the subject A, the feature amount hpL_A, the spatial acoustic transmission characteristic Hls_A, and the spatial acoustic transmission characteristic Hro_A are associated with each other to form one data set. Similarly, for the right ear of the subject A, the feature amount hpR_A, the spatial acoustic transmission characteristic Hrs_A, and the spatial acoustic transmission characteristic Hlo_A are associated with each other to form one data set. Similarly, for the left ear of the subject B, the feature amount hpL_B, the spatial acoustic transmission characteristic Hls_B, and the spatial acoustic transmission characteristic Hro_B are associated with each other to form one data set. Similarly, for the right ear of the subject B, the feature amount hpR_B, the spatial acoustic transmission characteristic Hrs_B, and the spatial acoustic transmission characteristic Hlo_B are associated with each other to form one data set.

なお、空間音響伝達特性１、２のペアを第１のプリセットデータとする。すなわち、１つのデータセットを構成する空間音響伝達特性１、及び空間音響伝達特性２を第１のプリセットデータとする。１つのデータセットを構成する特徴量を第２のプリセットデータとする。１つのデータセットは、第１のプリセットデータ、及び第２のプリセットデータを含んでいる。そして、データ格納部３０３は、第１のプリセットデータと第２のプリセットデータとを被測定者の左右の耳毎に対応付けて記憶している。 The pair of spatial acoustic transmission characteristics 1 and 2 is used as the first preset data. That is, the spatial acoustic transmission characteristic 1 and the spatial acoustic transmission characteristic 2 constituting one data set are set as the first preset data. The features that make up one data set are used as the second preset data. One data set contains a first preset data and a second preset data. Then, the data storage unit 303 stores the first preset data and the second preset data in association with each of the left and right ears of the person to be measured.

ここで、ｎ（ｎは２以上の整数）人の被測定者１に対して、第１及び第２の事前測定が予め行われているとする。この場合、データ格納部３０３には、両耳分である２ｎ個のデータセットが格納されている。 Here, it is assumed that the first and second pre-measurements have been performed in advance on n subjects 1 (n is an integer of 2 or more). In this case, the data storage unit 303 stores 2n data sets, one for each ear of each subject.
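The data set structure described above might be modeled, purely as a sketch, with one record per ear. The class name `DataSet`, its field names, and the placeholder values are assumptions made for illustration; the embodiment only specifies which items are associated with each other.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class DataSet:
    """One row of the table: one ear of one subject (field names are hypothetical)."""
    subject_id: str
    ear: str                  # "L" or "R"
    feature: List[float]      # second preset data, e.g. hpL_A
    transfer_1: List[float]   # spatial acoustic transmission characteristic 1, e.g. Hls_A
    transfer_2: List[float]   # spatial acoustic transmission characteristic 2, e.g. Hro_A

    @property
    def first_preset(self) -> Tuple[List[float], List[float]]:
        """First preset data: the pair of spatial acoustic transmission characteristics."""
        return (self.transfer_1, self.transfer_2)


def build_store(subject_ids: List[str]) -> List[DataSet]:
    """Create two data sets (left and right ear) per subject, with empty placeholder data."""
    store: List[DataSet] = []
    for sid in subject_ids:
        for ear in ("L", "R"):
            store.append(DataSet(sid, ear, feature=[], transfer_1=[], transfer_2=[]))
    return store


store = build_store(["ID_A", "ID_B", "ID_C"])  # n = 3 subjects
```

For n subjects the store holds 2n records, so the three subjects above yield six data sets.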

図８を用いて、第２のプリセットデータを詳細に説明する。図８は、第２のプリセットデータを示す表である。被測定者１の左右の耳毎に、特徴量が含まれている。特徴量は、上記の通り、ピーク値、ピーク周波数、ノッチ値、及びノッチ周波数を含んでいる。ＧＰ１、ＦＰ１は、１番目のピークのピーク値、及びピーク周波数である。ＧＮ１、ＦＮ１は、１番目のノッチのノッチ値、及びノッチ周波数である。なお、図８では、２番目以降のピーク、及びノッチに関するデータを省略している。さらに、第２のプリセットデータは、平滑化特性のピーク数とノッチ数を含んでいてもよい。 The second preset data will be described in detail with reference to FIG. 8. FIG. 8 is a table showing the second preset data. A feature amount is included for each of the left and right ears of the subject 1. As described above, the feature amount includes the peak value, the peak frequency, the notch value, and the notch frequency. GP1 and FP1 are the peak value and the peak frequency of the first peak. GN1 and FN1 are the notch value and the notch frequency of the first notch. In FIG. 8, the data relating to the second and subsequent peaks and notches are omitted. Further, the second preset data may include the number of peaks and the number of notches of the smoothed characteristic.

また、被測定者１毎にピーク数またはノッチ数が異なる。また、同じ被測定者１であっても、左右の耳で、ピーク数またはノッチ数が異なることがある。よって、データセット毎に、特徴量として含まれるデータ数が異なる。なお、ピーク数とノッチ数は、全周波数帯域におけるピークとノッチの数でもよく、一部の周波数帯域における数でもよい。例えば、所定の周波数以上の帯域におけるピークとノッチを特徴量としてもよい。所定の周波数は、例えば、２ｋＨｚ〜４ｋＨｚの範囲にある周波数である。 In addition, the number of peaks or the number of notches is different for each person to be measured. Further, even in the same subject 1, the number of peaks or the number of notches may differ between the left and right ears. Therefore, the number of data included as a feature amount differs for each data set. The number of peaks and the number of notches may be the number of peaks and notches in all frequency bands, or may be the number in some frequency bands. For example, peaks and notches in a band above a predetermined frequency may be used as feature quantities. The predetermined frequency is, for example, a frequency in the range of 2 kHz to 4 kHz.

データ抽出部３０２は、ピーク数及びノッチ数が一致する第２のプリセットデータを探索する。そして、データ抽出部３０２は、ピーク数及びノッチ数が一致する第２のプリセットデータを抽出する。データ抽出部３０２は、データ格納部３０３に格納されているデータセットの中から、ピーク数及びノッチ数が一致する第２のプリセットデータを抽出する。 The data extraction unit 302 searches for the second preset data in which the number of peaks and the number of notches match. Then, the data extraction unit 302 extracts the second preset data in which the number of peaks and the number of notches match. The data extraction unit 302 extracts the second preset data in which the number of peaks and the number of notches match from the data set stored in the data storage unit 303.

例えば、図６では、ピーク数が３で、ノッチ数が２である。よって、図８に示すテーブルにおいて、ＩＤ＿Ａの被測定者の左耳と、ＩＤ＿Ｂの被測定者の右耳で、ピーク数とノッチ数が一致する。データ抽出部３０２は、ＩＤ＿Ａの被測定者１の左耳の特徴量ｈｐＬ＿Ａ、ＩＤ＿Ｂの被測定者１の右耳の特徴量ｈｐＲ＿Ｂを抽出する。データ抽出部３０２が抽出したデータを抽出データとする。 For example, in FIG. 6, the number of peaks is 3 and the number of notches is 2. Therefore, in the table shown in FIG. 8, the number of peaks and the number of notches match between the left ear of the person to be measured with ID_A and the right ear of the person to be measured with ID_B. The data extraction unit 302 extracts the feature amount hpL_A of the left ear of the person to be measured with ID_A and the feature amount hpR_B of the right ear of the person to be measured with ID_B. The data extracted by the data extraction unit 302 is used as the extracted data.
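The extraction step can be sketched as a simple filter on the peak and notch counts. The function name `extract_candidates`, the dictionary layout, and the counts assigned to the non-matching ears are hypothetical; only the matching entries (left ear of ID_A, right ear of ID_B) follow the example in the text.

```python
from typing import Dict, List


def pts(n: int) -> List[tuple]:
    """Placeholder list of n (frequency, amplitude) pairs; the values are irrelevant here."""
    return [(0.0, 0.0)] * n


def extract_candidates(user: Dict, presets: List[Dict]) -> List[Dict]:
    """Keep only preset entries whose peak count and notch count both match the user's."""
    n_peaks = len(user["peaks"])
    n_notches = len(user["notches"])
    return [
        p for p in presets
        if len(p["peaks"]) == n_peaks and len(p["notches"]) == n_notches
    ]


user = {"peaks": pts(3), "notches": pts(2)}
presets = [
    {"id": "ID_A", "ear": "L", "peaks": pts(3), "notches": pts(2)},
    {"id": "ID_A", "ear": "R", "peaks": pts(2), "notches": pts(2)},  # assumed counts
    {"id": "ID_B", "ear": "L", "peaks": pts(4), "notches": pts(3)},  # assumed counts
    {"id": "ID_B", "ear": "R", "peaks": pts(3), "notches": pts(2)},
]
extracted = extract_candidates(user, presets)
labels = [(p["id"], p["ear"]) for p in extracted]
```

Filtering on the counts first guarantees that every remaining candidate has a feature vector of the same dimension as the user's, which the later distance computation relies on.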

比較部３０４は、抽出データ（例えば、特徴量ｈｐＬ＿Ａ、特徴量ｈｐＲ＿Ｂ）と、ユーザデータ（例えば、ユーザ特徴量ｈｐＬ＿Ｕ）とを比較して、類似度を求める。比較部３０４は、抽出データの中で、ユーザデータに最も類似する第２のプリセットデータを求める。比較部３０４は、比較結果を選択部３０５に出力する。選択部３０５は、ユーザデータに最も類似する第２のプリセットデータに対応する第１のプリセットデータを選択する。第１のプリセットデータは、上記の通り、空間音響伝達特性１、及び空間音響伝達特性２を含んでいる。 The comparison unit 304 compares the extracted data (for example, the feature amounts hpL_A and hpR_B) with the user data (for example, the user feature amount hpL_U) to obtain the degree of similarity. The comparison unit 304 determines, among the extracted data, the second preset data that is most similar to the user data. The comparison unit 304 outputs the comparison result to the selection unit 305. The selection unit 305 selects the first preset data corresponding to the second preset data most similar to the user data. As described above, the first preset data includes the spatial acoustic transmission characteristic 1 and the spatial acoustic transmission characteristic 2.

例えば、ユーザ特徴量ｈｐＬ＿Ｕに最も類似する特徴量がＩＤ＿Ａの被測定者１の左耳の特徴量ｈｐＬ＿Ａであるとする。選択部３０５は、特徴量ｈｐＬ＿Ａに対応する空間音響伝達特性Ｈｌｓ＿Ａ、Ｈｒｏ＿Ａを選択する。選択部３０５で選択された特性を選択特性とし、選択特性のデータを選択データとする。選択データは１つのデータセットに含まれる第１のプリセットデータである。選択データは空間音響伝達特性のペアを含んでいる。 For example, it is assumed that the feature amount most similar to the user feature amount hpL_U is the feature amount hpL_A of the left ear of the subject 1 with ID_A. The selection unit 305 selects the spatial acoustic transmission characteristics Hls_A and Hro_A corresponding to the feature amount hpL_A. The characteristic selected by the selection unit 305 is used as the selection characteristic, and the data of the selection characteristic is used as the selection data. The selected data is the first preset data included in one data set. The selection data contains a pair of spatial acoustic transfer characteristics.
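The selection of the first preset data from the most similar second preset data might look like the following sketch. It assumes, for illustration, that similarity is judged by the plain Euclidean distance between already normalized feature vectors; the function name `select_first_preset`, the short four-element vectors, and the string placeholders for the transfer characteristics are all hypothetical.

```python
import math
from typing import Dict, List, Tuple


def select_first_preset(user_feature: List[float], candidates: List[Dict]) -> Tuple[str, str]:
    """Return the first preset data of the candidate whose feature is closest to the user's."""
    best = min(candidates, key=lambda c: math.dist(user_feature, c["feature"]))
    return best["first_preset"]


# Illustrative, already normalized feature vectors (shortened to four elements).
hpL_U = [0.10, 0.90, 0.40, 0.05]
candidates = [
    {"name": "hpL_A", "feature": [0.12, 0.88, 0.41, 0.07], "first_preset": ("Hls_A", "Hro_A")},
    {"name": "hpR_B", "feature": [0.30, 0.60, 0.70, 0.20], "first_preset": ("Hrs_B", "Hlo_B")},
]
selected = select_first_preset(hpL_U, candidates)
```

Here hpL_A is clearly the nearer vector, so the pair (Hls_A, Hro_A) becomes the selection data for the user's left ear.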

データ抽出部３０２、比較部３０４、及び選択部３０５は、左右のユーザ特徴量に対して、それぞれ同様の処理を行う。ユーザの左耳のユーザ特徴量ｈｐＬ＿Ｕについて、空間音響伝達特性のペア（例えば、Ｈｌｓ＿Ａ，Ｈｒｏ＿Ａ）が選択データとなる。同様に、ユーザの右耳のユーザ特徴量ｈｐＲ＿Ｕについて、空間音響伝達特性のペアが選択データとなる。選択部３０５は、ユーザのそれぞれの耳に対して、選択データを選択する。 The data extraction unit 302, the comparison unit 304, and the selection unit 305 perform the same processing on the left and right user feature quantities, respectively. For the user feature amount hpL_U of the user's left ear, a pair of spatial acoustic transmission characteristics (for example, Hls_A, Hro_A) is selected data. Similarly, for the user feature amount hpR_U of the user's right ear, the pair of spatial acoustic transmission characteristics becomes the selection data. The selection unit 305 selects selection data for each user's ear.

そして、送信部３０６は、選択データ（第１のプリセットデータ）を頭外定位処理装置１００に送信する。送信部３０６は、選択された第１のプリセットデータに対して、通信規格に応じた処理（例えば、変調処理）を行って、送信する。送信部３０６は、ユーザの左右の耳に関して、それぞれ空間音響伝達特性のペアを選択データとして送信する。 Then, the transmission unit 306 transmits the selection data (first preset data) to the out-of-head localization processing device 100. The transmission unit 306 performs processing (for example, modulation processing) according to the communication standard on the selected first preset data and transmits the selected first preset data. The transmission unit 306 transmits a pair of spatial acoustic transmission characteristics as selection data for each of the user's left and right ears.

このように、データ抽出部３０２は、ピーク及びノッチに基づいて、データ格納部３０３に格納されたデータセットの中から、一部のデータセットを抽出する。比較部３０４は、抽出されたデータセットの第２のプリセットデータをユーザデータと比較する。そして、選択部３０５は、第２のプリセットデータとユーザデータとの比較結果に基づいて、ユーザに適した第１のプリセットデータを選択する。 In this way, the data extraction unit 302 extracts a part of the data set from the data set stored in the data storage unit 303 based on the peak and the notch. The comparison unit 304 compares the second preset data of the extracted data set with the user data. Then, the selection unit 305 selects the first preset data suitable for the user based on the comparison result between the second preset data and the user data.

受信部１２２は、送信部３０６から送信された選択データ（第１のプリセットデータ）を受信する。受信部１２２は、受信した第１のプリセットデータに対して、通信規格に応じた処理（例えば、復調処理）を行う。受信部１２２は、左耳に関する第１のプリセットデータとして、空間音響伝達特性のペアを受信する。受信部１２２は、右耳に関する第１のプリセットデータとして、空間音響伝達特性のペアを受信する。 The receiving unit 122 receives the selection data (first preset data) transmitted from the transmitting unit 306. The receiving unit 122 performs processing (for example, demodulation processing) according to the communication standard on the received first preset data. The receiving unit 122 receives a pair of spatial acoustic transmission characteristics as the first preset data regarding the left ear. The receiving unit 122 receives a pair of spatial acoustic transmission characteristics as the first preset data regarding the right ear.

そして、フィルタ設定部１１６は、第１のプリセットデータに基づいて、空間音響フィルタを設定する。例えば、空間音響伝達特性のペアに含まれる空間音響伝達特性Ｈｌｓ＿ＡがユーザＵの空間音響伝達特性Ｈｌｓとなり、空間音響伝達特性Ｈｒｏ＿ＡがユーザＵの空間音響伝達特性Ｈｒｏとなる。同様に、選択データである第１のプリセットデータに含まれる空間音響伝達特性のペアが、ユーザＵの空間音響伝達特性Ｈｌｏ，Ｈｒｓとなる。ユーザＵの空間音響伝達特性Ｈｌｓ、及びＨｒｓは、図７の空間音響伝達特性１のデータから選ばれる。ユーザＵの空間音響伝達特性Ｈｌｏ及びＨｒｏは、図７の空間音響伝達特性２のデータから選ばれる。 Then, the filter setting unit 116 sets the spatial acoustic filters based on the first preset data. For example, the spatial acoustic transmission characteristic Hls_A included in the pair becomes the spatial acoustic transmission characteristic Hls of the user U, and the spatial acoustic transmission characteristic Hro_A becomes the spatial acoustic transmission characteristic Hro of the user U. Similarly, the pair of spatial acoustic transmission characteristics included in the first preset data selected for the right ear becomes the spatial acoustic transmission characteristics Hlo and Hrs of the user U. The spatial acoustic transmission characteristics Hls and Hrs of the user U are selected from the data of the spatial acoustic transmission characteristic 1 of FIG. 7. The spatial acoustic transmission characteristics Hlo and Hro of the user U are selected from the data of the spatial acoustic transmission characteristic 2 of FIG. 7.

なお、第１のプリセットデータがフィルタ長で切り出した後のデータである場合、フィルタ設定部１１６が第１のプリセットデータをそのまま、空間音響フィルタとして設定する。例えば、空間音響伝達特性Ｈｌｓ＿ＡがユーザＵの空間音響伝達特性Ｈｌｓとなる。第１のプリセットデータがフィルタ長で切り出される前のデータである場合、フィルタ設定部１１６が空間音響伝達特性をフィルタ長に切り出す処理を行う。 When the first preset data is the data after being cut out by the filter length, the filter setting unit 116 sets the first preset data as it is as a spatial acoustic filter. For example, the spatial acoustic transmission characteristic Hls_A becomes the spatial acoustic transmission characteristic Hls of the user U. When the first preset data is the data before being cut out by the filter length, the filter setting unit 116 performs a process of cutting out the spatial acoustic transmission characteristic to the filter length.

頭外定位処理装置１００は、頭外定位フィルタをメモリなどに格納する。フィルタ設定部１１６は、空間音響フィルタを図１の畳み込み演算部１１、１２、２１、２２に設定する。また、フィルタ設定部１１６は、逆フィルタ算出部１１５が算出した逆フィルタをフィルタ部４１，フィルタ部４２に設定する。頭外定位処理装置１００は、図１に示したように、４つの空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じた空間音響フィルタと、逆フィルタとを用いて、演算処理を行う。頭外定位処理装置１００は、４つの空間音響フィルタと、２つの逆フィルタとを用いて、ステレオ入力信号に上記の畳み込み演算処理等を行う。 The out-of-head localization processing device 100 stores the out-of-head localization filters in a memory or the like. The filter setting unit 116 sets the spatial acoustic filters in the convolution calculation units 11, 12, 21, and 22 of FIG. 1. Further, the filter setting unit 116 sets the inverse filters calculated by the inverse filter calculation unit 115 in the filter unit 41 and the filter unit 42. As shown in FIG. 1, the out-of-head localization processing device 100 performs arithmetic processing using the spatial acoustic filters corresponding to the four spatial acoustic transmission characteristics Hls, Hlo, Hro, and Hrs, together with the inverse filters. That is, the out-of-head localization processing device 100 applies the above-mentioned convolution processing and related operations to the stereo input signal using four spatial acoustic filters and two inverse filters.
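A minimal sketch of how four spatial acoustic filters and two inverse filters could be applied to a stereo signal is given below. The signal routing is an assumption inferred from the pairing described for each ear (Hls and Hro for the left ear, Hlo and Hrs for the right ear); the function names, the direct-form convolution, and the trivial filter values are illustrative only and do not describe the actual units of FIG. 1.

```python
from typing import List, Tuple


def convolve(x: List[float], h: List[float]) -> List[float]:
    """Direct-form linear convolution of signal x with filter h."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xv in enumerate(x):
        for j, hv in enumerate(h):
            y[i + j] += xv * hv
    return y


def add(a: List[float], b: List[float]) -> List[float]:
    """Element-wise sum; both inputs are assumed to have the same length."""
    return [u + v for u, v in zip(a, b)]


def out_of_head(xl, xr, hls, hlo, hro, hrs, linv, rinv) -> Tuple[List[float], List[float]]:
    """Assumed routing: left ear = XL*Hls + XR*Hro, right ear = XL*Hlo + XR*Hrs."""
    left = add(convolve(xl, hls), convolve(xr, hro))
    right = add(convolve(xl, hlo), convolve(xr, hrs))
    # Inverse filters cancel the headphone characteristics for each ear.
    return convolve(left, linv), convolve(right, rinv)


xl, xr = [1.0, 2.0, 3.0], [4.0, 5.0, 6.0]
# Trivial one-tap filters: direct paths pass through, right-to-left crosstalk is 0.5.
yl, yr = out_of_head(xl, xr, hls=[1.0], hlo=[0.0], hro=[0.5], hrs=[1.0],
                     linv=[1.0], rinv=[1.0])
```

In practice the filters would be long impulse responses of equal length, and an FFT-based convolution would normally replace the direct form for efficiency.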

このように、データ格納部３０３が、被測定者１毎に第１のプリセットデータと、第２のプリセットデータを対応付けて格納している。第１のプリセットデータは被測定者１の空間音響伝達特性に関するデータである。第２のプリセットデータは、被測定者１の外耳道伝達特性に関するデータである。具体的には、第２のプリセットデータは、被測定者１の外耳道伝達特性の特徴量を含んでいる。 In this way, the data storage unit 303 stores the first preset data and the second preset data in association with each other for each person to be measured 1. The first preset data is data relating to the spatial acoustic transmission characteristics of the subject 1. The second preset data is data relating to the external auditory canal transmission characteristic of the subject 1. Specifically, the second preset data includes the feature amount of the external auditory canal transmission characteristic of the subject 1.

比較部３０４はユーザデータを、第２のプリセットデータと比較する。ユーザデータは、ユーザ測定で得られた外耳道伝達特性に関するユーザ特徴量を含んでいる。比較部３０４は、特徴量の類似度を求めている。そして、比較部３０４は、ユーザの外耳道伝達特性と類似する被測定者１と、耳の左右とを決定する。 The comparison unit 304 compares the user data with the second preset data. The user data includes a user feature amount related to the external auditory canal transmission characteristic obtained by the user measurement. The comparison unit 304 obtains the similarity of the feature amounts. Then, the comparison unit 304 determines the subject 1, and the ear side (left or right), whose external auditory canal transmission characteristic is similar to that of the user.

選択部３０５は、決定された被測定者と耳の左右とに対応する第１のプリセットデータを読み出す。そして、送信部３０６は、選択された第１のプリセットデータを頭外定位処理装置１００に送信している。ユーザ端末である頭外定位処理装置１００は、第１のプリセットデータに基づく空間音響フィルタと、測定データに基づく逆フィルタとを用いて、頭外定位処理を行う。 The selection unit 305 reads out the first preset data corresponding to the determined subject and the left and right ears. Then, the transmission unit 306 transmits the selected first preset data to the out-of-head localization processing device 100. The out-of-head localization processing device 100, which is a user terminal, performs out-of-head localization processing using a spatial acoustic filter based on the first preset data and an inverse filter based on the measurement data.

このようにすることで、ユーザＵが空間音響伝達特性を測定しなくても、適切なフィルタを決定することができる。よって、ユーザがリスニングルームなどに行く必要や、ユーザの家にスピーカなどを設置する必要がなくなる。ユーザ測定はヘッドホン装着状態で実施される。すなわち、ユーザＵがヘッドホンとマイクとを装着していれば、ユーザ個人の外耳道伝達特性を測定することができる。よって、簡便な方法で、定位効果の高い頭外定位を実現できる。なお、ユーザ測定と、頭外定位受聴に用いられるヘッドホン４３は同じタイプのものであることが好ましい。 By doing so, an appropriate filter can be determined without the user U measuring the spatial acoustic transmission characteristics. The user therefore does not need to visit a listening room or install speakers at home. The user measurement is performed with the headphones worn. That is, as long as the user U wears the headphones and the microphones, the user's individual external auditory canal transmission characteristics can be measured. Out-of-head localization with a high localization effect can thus be realized by a simple method. It is preferable that the headphones 43 used for the user measurement and for out-of-head localization listening are of the same type.

さらに、ユーザデータと第２のプリセットデータが周波数振幅特性のピーク及びノッチに基づく特徴量となっている。これにより、処理するデータ量を削減することができる。特に、ピーク周波数、ピーク値、ノッチ周波数、及びノッチ値が特徴量として抽出されている。これにより、データ格納部３０３が、多数の被測定者のプリセットデータを格納している場合でも、少ないデータ量で適切にマッチングを行うことができる。 Further, the user data and the second preset data are features based on the peaks and notches of the frequency amplitude characteristic. As a result, the amount of data to be processed can be reduced. In particular, the peak frequency, the peak value, the notch frequency, and the notch value are extracted as feature quantities. As a result, even when the data storage unit 303 stores preset data of a large number of subjects, matching can be appropriately performed with a small amount of data.

また、本実施形態にかかる方法では、多数のプリセット特性を聴く聴感テストを行う必要がない。よって、ユーザ負担を軽減することができ、利便性を向上することができる。そして、被測定者とユーザの特徴量を比較することで、特性が似ている被測定者を選ぶことができる。そして選ばれた被測定者の耳の第１のプリセットデータから音響伝達フィルタが生成されるため、高い頭外定位効果が期待できる。 Further, in the method according to the present embodiment, it is not necessary to perform an auditory test for listening to a large number of preset characteristics. Therefore, the burden on the user can be reduced and the convenience can be improved. Then, by comparing the feature quantities of the person to be measured and the user, it is possible to select a person to be measured having similar characteristics. Then, since the acoustic transmission filter is generated from the first preset data of the selected subject's ear, a high out-of-head localization effect can be expected.

次に、図９を用いて、類似度を求める処理について説明する。図９は類似度を求める処理の一例を示すフローチャートである。本実施の形態では、比較部３０４が、ベクトル間距離または相関係数を用いて、類似度を算出している。なお、比較部３０４は、ベクトル間距離と相関係数の双方を用いて類似度を算出してもよい。 Next, the process of obtaining the similarity will be described with reference to FIG. 9. FIG. 9 is a flowchart showing an example of the process of obtaining the similarity. In the present embodiment, the comparison unit 304 calculates the similarity using the inter-vector distance or the correlation coefficient. The comparison unit 304 may also calculate the similarity using both the inter-vector distance and the correlation coefficient.

データ抽出部３０２が、評価範囲を算出する（Ｓ２１）。評価範囲は、ピーク及びノッチを抽出する周波数帯域である。ここでは、４ｋＨｚ以上の周波数帯域を評価範囲としている。４ｋＨｚ以上の周波数帯域に、個人の特性が現れやすいからである。なお、評価範囲の下限値は４ｋＨｚに限られるものではなく、例えば２ｋＨｚであってもよい。評価範囲の上限値は、周波数変換により得られた周波数特性の最大周波数とすることができる。なお、予め評価範囲内のピーク及びノッチのみが抽出されている場合、Ｓ２１は省略することができる。 The data extraction unit 302 calculates the evaluation range (S21). The evaluation range is a frequency band for extracting peaks and notches. Here, the evaluation range is a frequency band of 4 kHz or higher. This is because individual characteristics are likely to appear in the frequency band of 4 kHz or higher. The lower limit of the evaluation range is not limited to 4 kHz, and may be, for example, 2 kHz. The upper limit of the evaluation range can be the maximum frequency of the frequency characteristic obtained by frequency conversion. If only the peaks and notches within the evaluation range have been extracted in advance, S21 can be omitted.

データ抽出部３０２が、データ格納部３０３に格納されたプリセットデータの中からデータを抽出する（Ｓ２２）。すなわち、データ抽出部３０２が、評価範囲においてピーク数とノッチ数が一致する第２のプリセットデータを抽出する。 The data extraction unit 302 extracts data from the preset data stored in the data storage unit 303 (S22). That is, the data extraction unit 302 extracts the second preset data in which the number of peaks and the number of notches match in the evaluation range.

次に、比較部３０４が、ユーザデータ、及び第２のプリセットデータを正規化及び尺度変換する（Ｓ２３）。例えば、ユークリッド距離を求める場合、周波数と振幅値で評価範囲の値（単位）が異なる。このため、評価対象となる周波数帯域の最小周波数が０、最大周波数が１となるように、比較部３０４がピーク周波数とノッチ周波数を正規化する。振幅値では、評価対象の振幅値（ピーク値）の最大値が１となり、振幅値（ノッチ値）の最小値が０となるように、比較部３０４がピーク値、及びノッチ値を正規化する。例えば、図６に示す例において、横軸（周波数軸）上において、ｆｐ１〜ｆｐ３の範囲が０〜１となる。同様に、縦軸（振幅軸）上において、ｇｎ１〜ｇｐ３の範囲が０〜１となる。 Next, the comparison unit 304 normalizes and scale-converts the user data and the second preset data (S23). For example, when the Euclidean distance is to be obtained, the frequency and the amplitude value have different value ranges (units) within the evaluation range. Therefore, the comparison unit 304 normalizes the peak frequencies and the notch frequencies so that the minimum frequency of the frequency band to be evaluated becomes 0 and the maximum frequency becomes 1. For the amplitude values, the comparison unit 304 normalizes the peak values and the notch values so that the maximum amplitude value (peak value) to be evaluated becomes 1 and the minimum amplitude value (notch value) becomes 0. For example, in the example shown in FIG. 6, the range from fp1 to fp3 on the horizontal axis (frequency axis) is mapped to 0 to 1. Similarly, the range from gn1 to gp3 on the vertical axis (amplitude axis) is mapped to 0 to 1.
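The normalization described above amounts to a min-max rescaling of each axis. The sketch below assumes that the frequency bounds are passed in explicitly and that the amplitude bounds are taken from the extrema themselves; the function name `normalize` and the sample values are hypothetical.

```python
from typing import List, Tuple


def normalize(points: List[Tuple[float, float]],
              f_min: float, f_max: float) -> List[Tuple[float, float]]:
    """Rescale (frequency, amplitude) pairs so that both axes span 0 to 1.

    Frequencies are mapped so that f_min -> 0 and f_max -> 1.
    Amplitudes are mapped so that the lowest notch value -> 0 and the highest peak value -> 1.
    """
    if f_max <= f_min:
        raise ValueError("f_max must be greater than f_min")
    amps = [g for _, g in points]
    g_min, g_max = min(amps), max(amps)
    span = g_max - g_min
    return [
        ((f - f_min) / (f_max - f_min), (g - g_min) / span if span else 0.0)
        for f, g in points
    ]


# Peaks and notches in frequency order (illustrative values, amplitudes in dB).
points = [(4000.0, 6.0), (6000.0, -8.0), (8000.0, 9.0), (10000.0, -12.0), (12000.0, 4.0)]
norm = normalize(points, f_min=4000.0, f_max=12000.0)
```

After this step a difference of 0.1 means the same fraction of the evaluated range on either axis, so neither frequency (in Hz) nor amplitude dominates the Euclidean distance merely because of its unit.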

尺度変換は、対数軸において、離散的なスペクトルデータ（周波数振幅特性）が等間隔になるように包絡線データの尺度を変化する。周波数変換部で求められた周波数振幅特性は、周波数的に等間隔となっている。つまり、周波数振幅特性は、周波数線形軸において等間隔となっているため、周波数対数軸では非等間隔になっている。このため、周波数対数軸において周波数振幅特性のデータが等間隔になるように、補間処理を行う。 Scale conversion changes the scale of envelope data so that discrete spectral data (frequency amplitude characteristics) are evenly spaced on the logarithmic axis. The frequency amplitude characteristics obtained by the frequency converter are evenly spaced in terms of frequency. That is, since the frequency amplitude characteristics are evenly spaced on the frequency linear axis, they are not evenly spaced on the frequency logarithmic axis. Therefore, interpolation processing is performed so that the frequency amplitude characteristic data are evenly spaced on the frequency logarithmic axis.

周波数振幅特性において、対数軸上では、低周波数域になればなるほど隣接するデータ間隔は粗く、高周波数域になればなるほど隣接するデータ間隔は密になっている。そのため、比較部３０４は、データ間隔が粗い低周波数帯域のデータを補間する。具体的には、比較部３０４は、３次元スプライン補間等の補間処理を行うことで、対数軸において等間隔に配置された離散的な包絡線データを求める。尺度変換が行われた包絡線データを、尺度変換データとする。尺度変換データは、周波数と振幅値またはパワー値とが対応付けられているスペクトルとなる。 In the frequency amplitude characteristic, when viewed on the logarithmic axis, the spacing between adjacent data points becomes coarser toward lower frequencies and denser toward higher frequencies. Therefore, the comparison unit 304 interpolates the data in the low frequency band, where the data spacing is coarse. Specifically, the comparison unit 304 obtains discrete envelope data arranged at equal intervals on the logarithmic axis by performing interpolation processing such as cubic spline interpolation. The envelope data that has undergone scale conversion is referred to as scale conversion data. The scale conversion data is a spectrum in which each frequency is associated with an amplitude value or a power value.
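The resampling onto a logarithmic frequency axis can be sketched as follows. To keep the example dependency-free, it uses piecewise linear interpolation as a stand-in for the spline interpolation mentioned in the text; the function name `resample_log` and the test spectrum are hypothetical.

```python
import math
from bisect import bisect_right
from typing import List, Tuple


def resample_log(freqs: List[float], amps: List[float],
                 n_points: int) -> Tuple[List[float], List[float]]:
    """Resample a linearly spaced spectrum onto n_points equally spaced on a log10 axis.

    freqs must be positive and strictly increasing. Linear interpolation is used here
    only as a simple substitute for spline interpolation.
    """
    log_lo, log_hi = math.log10(freqs[0]), math.log10(freqs[-1])
    step = (log_hi - log_lo) / (n_points - 1)
    out_f: List[float] = []
    out_a: List[float] = []
    for k in range(n_points):
        f = 10.0 ** (log_lo + k * step)
        f = min(max(f, freqs[0]), freqs[-1])          # guard against rounding at the ends
        i = max(1, min(bisect_right(freqs, f), len(freqs) - 1))
        f0, f1 = freqs[i - 1], freqs[i]
        t = (f - f0) / (f1 - f0)
        out_f.append(f)
        out_a.append(amps[i - 1] + t * (amps[i] - amps[i - 1]))
    return out_f, out_a


# Linearly spaced test spectrum: 100 Hz to 10 kHz in 100 Hz steps, amplitude = f / 100.
freqs = [100.0 * k for k in range(1, 101)]
amps = [f / 100.0 for f in freqs]
out_f, out_a = resample_log(freqs, amps, n_points=20)
```

Consecutive output frequencies have a constant ratio, so many of the new points fall between the widely separated low-frequency samples of the original data, which is where interpolation is actually needed.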

The reason for converting to a logarithmic scale is as follows. Human sensory magnitudes are generally said to be logarithmic, so it is important to treat the frequency of audible sound on a logarithmic axis as well. After scale conversion the data are equally spaced in terms of this sensory magnitude, so data in all frequency bands can be treated equivalently. This simplifies mathematical operations, frequency band division, and weighting, and makes stable results obtainable. The comparison unit 304 is not limited to the logarithmic scale and may convert the envelope data to any scale close to human hearing (referred to as an auditory scale). Examples of auditory scales that may be used for the scale conversion include the logarithmic scale (log scale), the mel scale, the Bark scale, and the ERB (Equivalent Rectangular Bandwidth) scale. The comparison unit 304 converts the envelope data to the auditory scale by data interpolation. For example, the comparison unit 304 interpolates the data in the low-frequency band, where the data spacing is coarse on the auditory scale, to make the low-frequency data denser. Data that are equally spaced on the auditory scale are dense in the low-frequency band and coarse in the high-frequency band on a linear scale. In this way, the comparison unit 304 can generate scale-converted data that are equally spaced on the auditory scale. Of course, the scale-converted data need not be exactly equally spaced on the auditory scale.
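The scale conversion described above can be illustrated with a minimal sketch. It assumes the envelope is given as increasing frequencies with matching amplitude values, and it substitutes piecewise-linear interpolation on the log-frequency axis for the spline interpolation named in the text, purely to stay within the Python standard library. The function name `resample_log_axis` and its parameters are hypothetical and do not appear in the source.

```python
import math


def resample_log_axis(freqs, amps, num_points):
    """Resample an envelope onto num_points frequencies equally spaced in log10(f).

    freqs must be positive and strictly increasing. Interpolation is linear in
    log-frequency (an assumption; the text names spline interpolation).
    """
    if len(freqs) != len(amps) or len(freqs) < 2 or num_points < 2:
        raise ValueError("need matching inputs with at least two points")
    log_f = [math.log10(f) for f in freqs]
    step = (log_f[-1] - log_f[0]) / (num_points - 1)
    out_f, out_a = [], []
    j = 0  # index of the input segment [log_f[j], log_f[j + 1]] in use
    for i in range(num_points):
        x = min(log_f[0] + i * step, log_f[-1])  # guard against rounding overshoot
        while j < len(log_f) - 2 and log_f[j + 1] < x:
            j += 1
        t = (x - log_f[j]) / (log_f[j + 1] - log_f[j])
        out_f.append(10.0 ** x)
        out_a.append(amps[j] + t * (amps[j + 1] - amps[j]))
    return out_f, out_a
```

With input points one octave apart, every second output point falls midway (in log-frequency) between two inputs, which is the low-frequency densification the paragraph describes. A mel, Bark, or ERB axis could be used in the same way by replacing `math.log10` and its inverse with the corresponding scale mapping.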

In S23 and S25, the normalization and the scale conversion need not be the same processing. For example, the scale used for obtaining the distance may differ from the scale used for obtaining the correlation coefficient. Furthermore, only one of the normalization and the scale conversion may be performed, or both may be omitted. The normalization and the scale conversion may be performed in either order.

Next, the comparison unit 304 calculates the distance between the feature vectors (S24). The processing for obtaining this distance will be described with reference to FIG. 10, which is a flowchart showing the distance calculation. Here, the comparison unit 304 obtains the Euclidean distance between the feature vectors of the user data and the second preset data. First, for initialization, l = 0, m = 0, and q = 0 are set (S31), where q denotes the Euclidean distance.

The comparison unit 304 determines whether l = l_max (S32), where l_max is an integer indicating the number of peaks. If l is not equal to l_max (NO in S32), q += ||AP[l] - BP[l]|| is computed (S33). AP[l] is a two-dimensional vector representing the l-th peak in the user data, and BP[l] is a two-dimensional vector representing the l-th peak in the second preset data. In other words, the comparison unit 304 obtains the Euclidean distance between the l-th peaks.

The comparison unit 304 then increments l (S34) and repeats S32 to S34 until l = l_max. The distances between all corresponding peaks are thereby obtained; that is, q is the value obtained by adding the inter-peak distance l_max times. When l_max = 3, the distance between the first peaks (first inter-peak distance), the distance between the second peaks (second inter-peak distance), and the distance between the third peaks (third inter-peak distance) are obtained. Once S33 has been executed for all peaks, q is the sum of the first to third inter-peak distances.

If l = l_max (YES in S32), the comparison unit 304 determines whether m = m_max (S35), where m_max is an integer indicating the number of notches. If m is not equal to m_max (NO in S35), q += ||AN[m] - BN[m]|| is computed (S36). AN[m] is a two-dimensional vector representing the m-th notch in the user data, and BN[m] is a two-dimensional vector representing the m-th notch in the second preset data. The comparison unit 304 thus obtains the Euclidean distance between the m-th notches. In the above processing, AP[l], AN[m], BP[l], and BN[m] are data that have been normalized and scale-converted in S23.

The comparison unit 304 then increments m (S37) and repeats S35 to S37 until m = m_max. The distances between all corresponding notches are thereby obtained; that is, q is the sum of the l_max inter-peak distances and the m_max inter-notch distances. When l_max = 3 and m_max = 2, q is the sum of the first to third inter-peak distances and the first and second inter-notch distances.

The comparison unit 304 then sets q = q / (l_max + m_max), so that q becomes the average of the (l_max + m_max) distances. The distance between the feature vectors is obtained in this way. Because, as described above, only second preset data whose numbers of peaks and notches match those of the user data are extracted, the distance between the feature vectors can be obtained appropriately.
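The loop of S31 to S37 and the final averaging can be sketched as follows. This is a minimal illustration, assuming each peak or notch is a (frequency, amplitude) pair that has already been normalized and scale-converted; the function name `mean_feature_distance` is hypothetical.

```python
import math


def mean_feature_distance(a_peaks, a_notches, b_peaks, b_notches):
    """Average Euclidean distance between corresponding peaks and notches.

    a_* are the user data and b_* the second preset data; each item is a
    (frequency, amplitude) pair. The counts must match, as the text requires.
    """
    if len(a_peaks) != len(b_peaks) or len(a_notches) != len(b_notches):
        raise ValueError("peak and notch counts must match")
    count = len(a_peaks) + len(a_notches)
    if count == 0:
        raise ValueError("at least one peak or notch is required")
    q = 0.0
    # S32 to S34: accumulate the distance between the l-th peaks.
    for (fa, va), (fb, vb) in zip(a_peaks, b_peaks):
        q += math.hypot(fa - fb, va - vb)
    # S35 to S37: accumulate the distance between the m-th notches.
    for (fa, va), (fb, vb) in zip(a_notches, b_notches):
        q += math.hypot(fa - fb, va - vb)
    # Final step: q = q / (l_max + m_max).
    return q / count
```

Because frequency and amplitude enter the same Euclidean norm, the normalization in S23 matters: without it, the axis with the larger numeric range would dominate q.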

Returning to FIG. 9, the comparison unit 304 calculates the correlation coefficient (S26). The processing for obtaining the correlation coefficient will be described with reference to FIG. 11. In the following processing as well, AP[l], AN[m], BP[l], and BN[m] are data that have been normalized and scale-converted in S25.

The comparison unit 304 interpolates the feature amount data (S41). FIG. 12 shows an example of linearly interpolating the feature amount data. Here, the user data has three peaks AP[1] to AP[3] and two notches AN[1] and AN[2], and the second preset data has three peaks BP[1] to BP[3] and two notches BN[1] and BN[2]. The comparison unit 304 calculates the amplitude values between each peak and notch by linear interpolation. Of course, the interpolation is not limited to linear interpolation, and spline interpolation or the like may be used. The data obtained by interpolating the user data is referred to as interpolated data linAF(f), and the data obtained by interpolating the second preset data is referred to as interpolated data linBF(f).

The reason for generating the interpolated data is as follows. The two data sets being correlated may have only a small number of peaks and notches, in which case it is difficult to evaluate the similarity of the two waveforms adequately. The data (amplitude values) between the peaks and notches are therefore interpolated. This allows the correlation between the external auditory canal transmission characteristics to be obtained accurately.

Next, the comparison unit 304 calculates the evaluation range of the interpolated data (S42). After interpolation, for example, the maximum frequencies of the two data sets to be evaluated may differ. In this case, the smaller of the two maximum frequencies is used as the maximum value of the evaluation range on the frequency axis. In the example shown in FIG. 11, the peak frequency of BP[3] is higher than that of AP[3]. The interpolated data linAF(f) and linBF(f) are therefore generated with the peak frequency of AP[3] as the maximum frequency. In other words, the maximum value of the evaluation range is set based on the frequencies of AP[l_max], AN[m_max], BP[l_max], and BN[m_max].

If the peak frequency of AP[l_max] is higher than the notch frequency of AN[m_max] and the peak frequency of BP[l_max] is higher than the notch frequency of BN[m_max], the smaller of the peak frequencies of AP[l_max] and BP[l_max] is used as the maximum frequency of the interpolated data. If the peak frequency of AP[l_max] is lower than the notch frequency of AN[m_max] and the peak frequency of BP[l_max] is lower than the notch frequency of BN[m_max], the smaller of the notch frequencies of AN[m_max] and BN[m_max] is used as the maximum frequency of the interpolated data.
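Both cases above reduce to taking, for each data set, the highest frequency among its peaks and notches and then using the smaller of the two. A minimal sketch under that reading follows; the function name `evaluation_max_frequency` is hypothetical. The text does not describe the mixed case in which one data set ends in a peak and the other in a notch, so applying the same rule there is an assumption of this sketch.

```python
def evaluation_max_frequency(a_peaks, a_notches, b_peaks, b_notches):
    """Upper limit of the evaluation range on the frequency axis.

    Each argument is a list of (frequency, amplitude) pairs. The result is the
    smaller of the two data sets' highest feature frequencies.
    """
    a_max = max(f for f, _ in list(a_peaks) + list(a_notches))
    b_max = max(f for f, _ in list(b_peaks) + list(b_notches))
    return min(a_max, b_max)
```

Limiting the range this way ensures that both interpolated curves are defined by measured features, rather than by extrapolation, at every frequency that is evaluated.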

The minimum value of the evaluation range on the frequency axis may be set to 4 kHz, at which individual characteristics tend to appear. Because the frequency region in which individual characteristics tend to appear can differ depending on how the auricle characteristics are measured, the minimum value need not be 4 kHz. As described for S21, the minimum value can be set within the range of 2 kHz to 4 kHz. The evaluation range calculated in S42 may be the same as or different from the evaluation range calculated in S21.

The comparison unit 304 then calculates the correlation coefficient (S43). Specifically, the comparison unit 304 calculates the correlation coefficient r between the interpolated data linAF(f) and the interpolated data linBF(f) over the evaluation range.
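Steps S41 to S43 can be sketched as below: the sparse peak and notch features are linearly interpolated onto a common grid over the evaluation range, and the Pearson correlation coefficient of the two sampled curves is taken as r. The function names, the grid size `num_points`, and the choice to hold the end values outside the feature range are assumptions made for this illustration; the source does not specify them.

```python
import math


def interpolate(features, f):
    """Linearly interpolated amplitude at f from (frequency, amplitude) features.

    Frequencies are assumed distinct. Outside the feature range the nearest
    end value is held (an assumption of this sketch).
    """
    pts = sorted(features)
    if f <= pts[0][0]:
        return pts[0][1]
    if f >= pts[-1][0]:
        return pts[-1][1]
    for (f0, v0), (f1, v1) in zip(pts, pts[1:]):
        if f0 <= f <= f1:
            return v0 + (v1 - v0) * (f - f0) / (f1 - f0)
    raise ValueError("frequency not covered by features")


def correlation_coefficient(a_features, b_features, f_min, f_max, num_points=64):
    """Pearson correlation of the two interpolated curves on [f_min, f_max]."""
    grid = [f_min + (f_max - f_min) * i / (num_points - 1) for i in range(num_points)]
    xs = [interpolate(a_features, f) for f in grid]
    ys = [interpolate(b_features, f) for f in grid]
    mx, my = sum(xs) / num_points, sum(ys) / num_points
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0.0 or sy == 0.0:
        raise ValueError("correlation is undefined for a constant curve")
    return cov / (sx * sy)
```

Here `a_features` and `b_features` each hold the peaks and notches of one data set together, and the grid is uniform on whatever axis the features use after the scale conversion in S25.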

Returning to FIG. 9, the comparison unit 304 calculates the similarity based on the Euclidean distance q and the correlation coefficient r (S27). A smaller Euclidean distance q indicates that the feature vectors are closer, that is, that the characteristics are more alike. The correlation coefficient r takes a value between -1 and +1, and a value closer to +1 indicates that the characteristics are more alike. Accordingly, the smaller the value of (1 - r), the more similar the characteristics. The comparison unit 304 sets the similarity to q + (1 - r). Alternatively, the comparison unit 304 may set the similarity to q * (1 - r). Furthermore, the comparison unit 304 may weight q and (1 - r); for example, it may multiply at least one of q and (1 - r) by a weighting coefficient.
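The combinations described for S27 can be written as one small function. This is a sketch only: the name `similarity_score`, the weight parameters `w_q` and `w_r`, and the `combine` switch are hypothetical, and the source gives no weight values. Note that with these definitions a smaller score means more similar characteristics.

```python
def similarity_score(q, r, w_q=1.0, w_r=1.0, combine="sum"):
    """Combine the distance q and the correlation coefficient r.

    Returns w_q * q + w_r * (1 - r) for combine="sum", or the product of the
    two weighted terms for combine="product". Smaller means more similar.
    """
    r = max(-1.0, min(1.0, r))  # absorb tiny floating-point overshoot
    dist_term = w_q * q
    corr_term = w_r * (1.0 - r)
    if combine == "sum":
        return dist_term + corr_term
    if combine == "product":
        return dist_term * corr_term
    raise ValueError("combine must be 'sum' or 'product'")
```

The product form returns zero whenever either term is zero, so a perfect correlation would hide a nonzero distance; the sum form keeps both contributions visible.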

The comparison unit 304 calculates the similarity to the user characteristics for all the data extracted in S22. That is, the comparison unit 304 calculates the similarity to the user data for each piece of second preset data having the same number of peaks and the same number of notches as the user data. The selection unit 305 then selects the first preset data corresponding to the second preset data having the highest similarity, that is, the smallest value when the similarity is expressed as q + (1 - r) (S28). In the present embodiment, because the similarity is obtained based on both the correlation coefficient and the distance, a subject 1 whose characteristics resemble those of the user can be chosen. Because the selection unit 305 selects the first preset data for the ear of the chosen subject, a strong out-of-head localization effect can be expected.
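The selection in S28 then amounts to picking the candidate with the best score. The sketch below assumes the distance q and correlation coefficient r have already been computed for each extracted candidate and uses the q + (1 - r) form, for which the smallest value indicates the closest match. The function name and the preset identifiers in the usage example are hypothetical.

```python
def select_preset(candidates):
    """Return the identifier of the best-matching preset.

    candidates maps a preset identifier to a (q, r) pair, where q is the mean
    feature distance and r the correlation coefficient against the user data.
    """
    if not candidates:
        raise ValueError("no candidate presets were extracted")
    # Smallest q + (1 - r) corresponds to the most similar characteristics.
    return min(candidates, key=lambda pid: candidates[pid][0] + (1.0 - candidates[pid][1]))


# Usage with made-up values:
# select_preset({"subject_a": (0.5, 0.2), "subject_b": (0.3, 0.9)})
```

The returned identifier would then be used to look up the first preset data stored in association with that second preset data.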

In the above description the similarity is obtained based on both the correlation coefficient and the distance, but it may be obtained based on only one of them. Furthermore, in the present embodiment, only second preset data whose numbers of peaks and notches match those of the user data are extracted, so the feature vectors have the same number of dimensions. The distance between the vectors can therefore be calculated appropriately, enabling more suitable matching.

In the above description the correlation coefficient r between the interpolated data linAF(f) and the interpolated data linBF(f) is obtained, but the comparison unit 304 may instead obtain the correlation coefficient between the frequency-amplitude characteristic F(f) and the interpolated data linBF(f), or between the smoothed characteristic SF(f) and the interpolated data linBF(f). In these cases, the out-of-head localization processing device 100 may transmit the external auditory canal transmission characteristic f(t), the frequency-amplitude characteristic F(f), or the smoothed characteristic SF(f) to the server device 300. Part or all of the processing for obtaining the frequency-amplitude characteristic F(f) or the smoothed characteristic SF(f) may also be performed by the server device 300.

Modification
In the modification, the second preset data are clustered. FIG. 13 is a table illustrating an example in which clustering is performed based on the number of peaks and the number of notches. In the first embodiment, second preset data whose numbers of peaks and notches match those of the user feature amount are extracted in order to obtain the distance between the feature vectors. Accordingly, clustering the second preset data in advance according to the numbers of peaks and notches allows the data extraction to be performed quickly. For example, when the number of peaks is 3 and the number of notches is 2, the data extraction unit 302 extracts the second preset data included in the cluster 330.

Of course, the second preset data may be divided into clusters (groups) using criteria other than the numbers of peaks and notches. For example, clustering may be performed according to the feature vectors, specifically by using the k-means method or the like. The clustering may be hierarchical or non-hierarchical. It is preferable to perform the clustering so that each cluster contains a sufficient number of pieces of second preset data.

The data extraction unit 302 only needs to extract the second preset data included in the cluster to which the user feature amount belongs. For this purpose, it suffices that the plurality of pieces of second preset data in the data storage unit 303 are classified into two or more clusters.
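Clustering by peak and notch counts, as in the modification, can be sketched as a dictionary keyed by the pair (number of peaks, number of notches). The function names and the preset identifiers used below are hypothetical; the source specifies neither a storage layout nor an API.

```python
from collections import defaultdict


def build_clusters(presets):
    """Group preset identifiers by (number of peaks, number of notches).

    presets is an iterable of (preset_id, peaks, notches) tuples, where peaks
    and notches are lists of (frequency, amplitude) pairs.
    """
    clusters = defaultdict(list)
    for preset_id, peaks, notches in presets:
        clusters[(len(peaks), len(notches))].append(preset_id)
    return dict(clusters)


def extract_candidates(clusters, user_peaks, user_notches):
    """Return the presets in the cluster matching the user's feature counts."""
    return clusters.get((len(user_peaks), len(user_notches)), [])
```

Building the clusters once in advance turns the extraction for each user into a single lookup instead of a scan over all stored data, which is the speed-up the modification aims for.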

The out-of-head localization processing device 100 and the measurement processing device 201 perform arithmetic processing for appropriately generating a filter according to the measurement results. Each of them is a personal computer (PC), a tablet terminal, a smartphone, or the like, and includes a memory and a processor. The memory stores processing programs, various parameters, measurement data, and the like. The processor executes the processing programs stored in the memory, whereby each process is carried out. The processor may be, for example, a CPU (Central Processing Unit), an FPGA (Field-Programmable Gate Array), a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), or a GPU (Graphics Processing Unit).

Some of the blocks in the block diagrams of FIGS. 1 and 2 may be omitted. For example, in FIG. 2, the transmission unit 121, the transmission unit 306, the reception unit 122, and the reception unit 301 may be provided in other devices. Furthermore, the processing may be distributed among a plurality of devices. Part or all of the processing in the flowcharts of FIGS. 5 and 9 may be omitted. For example, S23 to S27 can be replaced with other comparison processing; that is, the comparison unit 304 may compare the user feature amount with the second preset data by processing other than S23 to S27.

Part or all of the above processing may be executed by a computer program. The program can be stored using various types of non-transitory computer-readable media and supplied to a computer. Non-transitory computer-readable media include various types of tangible storage media. Examples of non-transitory computer-readable media include magnetic recording media (for example, flexible disks, magnetic tapes, and hard disk drives), magneto-optical recording media (for example, magneto-optical disks), CD-ROMs (Read Only Memory), CD-Rs, CD-R/Ws, and semiconductor memories (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, and RAM (Random Access Memory)). The program may also be supplied to a computer by various types of transitory computer-readable media. Examples of transitory computer-readable media include electrical signals, optical signals, and electromagnetic waves. A transitory computer-readable medium can supply the program to a computer via a wired communication path, such as an electric wire or an optical fiber, or via a wireless communication path.

The invention made by the present inventor has been described above in specific terms based on the embodiment. Needless to say, the present invention is not limited to the above embodiment and can be modified in various ways without departing from its gist.

U User
1 Subject
2L Left microphone
2R Right microphone
5L Left speaker
5R Right speaker
9L Left ear
9R Right ear
10 Out-of-head localization processing unit
11 Convolution calculation unit
12 Convolution calculation unit
21 Convolution calculation unit
22 Convolution calculation unit
24 Adder
25 Adder
41 Filter unit
42 Filter unit
43 Headphones
100 Out-of-head localization processing device
111 Characteristic acquisition unit
112 Frequency conversion unit
113 Smoothing unit
114 Feature amount extraction unit
115 Inverse filter calculation unit
116 Filter setting unit
121 Transmission unit
122 Reception unit
200 Measurement device
201 Measurement processing device
300 Server device
301 Reception unit
302 Data extraction unit
303 Data storage unit
304 Comparison unit
305 Selection unit
306 Transmission unit

Claims

An out-of-head localization filter determination system comprising:
an output unit that is worn by a user and outputs sound toward an ear of the user;
a microphone unit that is worn on the ear of the user and collects the sound output from the output unit;
a measurement processing device that outputs a measurement signal to the output unit and measures a sound collection signal output from the microphone unit; and
a server device capable of communicating with the measurement processing device, wherein
the server device includes a data storage unit that stores, in association with each other, first preset data relating to spatial acoustic transmission characteristics from a sound source to an ear of a subject and second preset data relating to a feature amount of external auditory canal transmission characteristics of the ear of the subject, the data storage unit storing a plurality of pieces of the first and second preset data acquired for a plurality of subjects, and
the out-of-head localization filter determination system includes:
a frequency conversion unit that frequency-converts the sound collection signal collected by the microphone unit to obtain a frequency characteristic;
a smoothing unit that smooths the frequency characteristic;
a feature amount extraction unit that obtains peaks and notches of the smoothed frequency characteristic and extracts a feature amount of the frequency characteristic as a user feature amount based on the peaks and notches;
a data extraction unit that extracts, based on the user feature amount, some of the plurality of pieces of second preset data stored in the data storage unit;
a comparison unit that compares the user feature amount with the extracted second preset data; and
a selection unit that selects first preset data from among the plurality of pieces of first preset data based on a result of the comparison.

The out-of-head localization filter determination system according to claim 1, wherein the data extraction unit extracts the second preset data by using the numbers of the peaks and the notches.

The out-of-head localization filter determination system according to claim 2, wherein the data extraction unit extracts the second preset data whose numbers of peaks and notches in a predetermined frequency band match those of the user data.

The out-of-head localization filter determination system according to any one of claims 1 to 3, wherein the feature amount includes the peak frequency of each peak in a predetermined frequency band and the peak value at the peak frequency, and also includes the notch frequency of each notch in the predetermined frequency band and the notch value at the notch frequency.

The out-of-head localization filter determination system according to any one of claims 1 to 4, wherein the comparison unit obtains a similarity based on the distance between a feature vector including the user feature amount and a feature vector including the feature amount of the second preset data.

The out-of-head localization filter determination system according to any one of claims 1 to 5, wherein a similarity is obtained based on the correlation between a user characteristic and an interpolation characteristic obtained by interpolating between the peaks and the notches of the second preset data.

The out-of-head localization filter determination system according to any one of claims 1 to 6, wherein, in the data storage unit, the plurality of pieces of second preset data are classified into two or more clusters, and
the data extraction unit extracts the second preset data included in the cluster to which the user feature amount belongs.

An out-of-head localization processing device comprising:
a frequency conversion unit that frequency-converts a sound collection signal, obtained by collecting with a microphone unit a measurement signal output from an output unit, to obtain a frequency characteristic;
a smoothing unit that smooths the frequency characteristic;
a feature amount extraction unit that obtains peaks and notches of the smoothed frequency characteristic and extracts a feature amount of the frequency characteristic as a user feature amount based on the peaks and notches;
a spatial acoustic filter setting unit that sets a spatial acoustic filter based on spatial acoustic transmission characteristics associated with a feature amount similar to the user feature amount; and
an inverse filter calculation unit that calculates, based on the sound collection signal, an inverse filter that cancels the characteristics of the output unit.

An out-of-head localization filter determination device comprising:
a data storage unit that stores, in association with each other, first preset data relating to spatial acoustic transmission characteristics from a sound source to an ear of a subject and second preset data relating to a feature amount of external auditory canal transmission characteristics of the ear of the subject, the data storage unit storing a plurality of pieces of the first and second preset data acquired for a plurality of subjects;
a data extraction unit that extracts, based on a user feature amount, some of the plurality of pieces of second preset data stored in the data storage unit;
a comparison unit that compares the user feature amount with the feature amount of the second preset data; and
a selection unit that selects first preset data from among the plurality of pieces of first preset data based on a result of the comparison, wherein
the frequency characteristic of the external auditory canal transmission characteristics is smoothed, and the feature amount is extracted based on peaks and notches of the smoothed frequency characteristic.

An out-of-head localization filter determination method for determining an out-of-head localization filter for a user by using an output unit that is worn by the user and outputs sound toward an ear of the user, and a microphone unit that is worn on the ear of the user and has a microphone that collects the sound output from the output unit, the method comprising:
a step of frequency-converting a sound collection signal collected by the microphone unit to obtain a frequency characteristic;
a step of smoothing the frequency characteristic;
a step of obtaining peaks and notches of the smoothed frequency characteristic and extracting a feature amount of the frequency characteristic as a user feature amount based on the peaks and notches;
a step of extracting, based on the user feature amount, some of a plurality of pieces of second preset data stored in a data storage unit;
a step of comparing the user feature amount with the extracted second preset data; and
a step of selecting first preset data from among a plurality of pieces of first preset data based on a result of the comparison.

A program for causing a computer to execute an out-of-head localization filter determination method for determining an out-of-head localization filter for a user by using an output unit that is worn by the user and outputs sound toward an ear of the user, and a microphone unit that is worn on the ear of the user and has a microphone that collects the sound output from the output unit, the out-of-head localization filter determination method comprising:
a step of frequency-converting a sound collection signal collected by the microphone unit to obtain a frequency characteristic;
a step of smoothing the frequency characteristic;
a step of obtaining peaks and notches of the smoothed frequency characteristic and extracting a feature amount of the frequency characteristic as a user feature amount based on the peaks and notches;
a step of extracting, based on the user feature amount, some of a plurality of pieces of second preset data stored in a data storage unit;
a step of comparing the user feature amount with the extracted second preset data; and
a step of selecting first preset data from among a plurality of pieces of first preset data based on a result of the comparison.