JP2010091980A

JP2010091980A - Transmitting and receiving device, transmitting and receiving method, and program

Info

Publication number: JP2010091980A
Application number: JP2008264452A
Authority: JP
Inventors: Osamu Fujii; 修藤井
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2008-10-10
Filing date: 2008-10-10
Publication date: 2010-04-22
Anticipated expiration: 2028-10-10
Also published as: JP5419413B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a transmitting and receiving device which more precisely monitors noise when information is transmitted, and also to provide a transmitting and receiving method and a program therefor. <P>SOLUTION: An extraction part 12 extracts left voice data and right voice data which are both related to received voice sound signals. An HPF (high pass filter) 110 extracts frequency components having a prescribed frequency or higher from these extracted left and right voice data. An addition part 141 computes values relative to the time sequential addition signals of the left voice data and right voice data which are related to the extracted frequency components, and computes addition data based on the accumulated addition values, by prescribed time, of the computed values. A subtraction part 142 computes values relative to the time sequential differential signals of the left voice data and right voice data which are related to the extracted frequency components, and computes subtraction data based on the accumulated addition values, by the prescribed time, of the computed values. The addition data and the subtraction data are both added, as the metadata, to voice data by an addition part 17. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、音声音響信号を送受信装置にて受信し、受信した音声音響信号を前記送受信装置から外部へ送信する送受信装置、送受信方法及びプログラムに関する。 The present invention relates to a transmission / reception device, a transmission / reception method, and a program for receiving a sound / acoustic signal by a transmission / reception device and transmitting the received sound / acoustic signal from the transmission / reception device to the outside.

デジタル放送においては変調された情報がキー局から、複数の中継局を経てユーザのチューナに伝送される。情報伝送の際、通信路にて各種ノイズが混入する場合があることから、この影響を低減するために誤り訂正技術等が採用されている（例えば、特許文献１、２参照）。また、ＭＰ３（MPeg-1 audio layer 3）、ＡＡＣ（Advanced Audio Codec）またはdolby-E（登録商標）等の高能率符号化方法として、チャンネル間の相関を利用したＭ/Ｓstereo技術等が採用されている。（例えば、非特許文献１、２参照）
特開平９-１８５０７公報特開平１-２２８３３３公報ＩＳＯ／ＩＥＣ１１１７２−３ＩＳＯ／ＩＥＣ１３８１８−７ In digital broadcasting, modulated information is transmitted from a key station to a user's tuner via a plurality of relay stations. Since various noises may be mixed in the communication path during information transmission, an error correction technique or the like is employed to reduce this influence (see, for example, Patent Documents 1 and 2). In addition, M / Stereo technology using correlation between channels is adopted as a high-efficiency encoding method such as MP3 (MPeg-1 audio layer 3), AAC (Advanced Audio Codec) or dolby-E (registered trademark). ing. (For example, see Non-Patent Documents 1 and 2)
Japanese Patent Laid-Open No. 9-18507 JP-A-1-228333 ISO / IEC 11172-3 ISO / IEC 13818-7

しかしながら、情報伝送の際にはランダムノイズまたはバーストノイズ等の各種ノイズの影響を受ける可能性がある。映像データのみならず音声データについてもノイズ等の影響を伝送路中で受けることがあり、これを簡易な処理でかつ効果的に検知する必要があった。また伝送の際にはdolby-E等の各種形式により符号化及び復号処理がなされるが、この符号化及び復号処理に起因する量子化ノイズまたは演算誤差等を、誤って伝送障害ノイズとして検出するという問題もあった。なお、特許文献１及び２並びに非特許文献１及び２には当該問題を解決するための手段が記載されていない。 However, there is a possibility that information transmission may be affected by various noises such as random noise or burst noise. Not only video data but also audio data may be affected by noise or the like in the transmission path, and this must be detected with simple processing and effectively. Also, during transmission, encoding and decoding processes are performed in various formats such as dolby-E. Quantization noise or calculation errors resulting from the encoding and decoding processes are erroneously detected as transmission fault noise. There was also a problem. Note that Patent Documents 1 and 2 and Non-Patent Documents 1 and 2 do not describe means for solving the problem.

本発明は斯かる事情に鑑みてなされたものであり、その目的は符号化及び復号処理に伴う成分を除去した上でメタデータを音声音響信号に付加することにより、より精度良く情報伝送の際のノイズを監視することが可能な送受信装置、送受信方法及びプログラムを提供することにある。 The present invention has been made in view of such circumstances, and an object of the present invention is to remove components accompanying encoding and decoding processes and add metadata to the audio-acoustic signal, thereby transmitting information with higher accuracy. It is an object to provide a transmission / reception apparatus, a transmission / reception method, and a program capable of monitoring noise.

本願に開示の送受信装置は、音声音響信号を受信し、受信した音声音響信号を送信する送受信装置において、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出部と、該抽出部により抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出するハイパスフィルタと、該ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する加算部と、前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する減算部と、前記加算部及び減算部により算出した加算データ及び減算データをメタデータとして受信した音声音響信号に付加する付加部と、該付加部によりメタデータが付加された音声音響信号を外部へ送信する送信部とを備える。 The transmission / reception apparatus disclosed in the present application is a transmission / reception apparatus that receives a sound sound signal and transmits the received sound sound signal, and extracts an extraction unit that extracts a first sound sound signal and a second sound sound signal related to the received sound sound signal. A high-pass filter that extracts a frequency component equal to or higher than a predetermined frequency from the first audio-acoustic signal and the second audio-acoustic signal extracted by the extraction unit; a first audio-acoustic signal related to the frequency component extracted by the high-pass filter; A value related to a time-series sum signal of the second audio-acoustic signal is calculated, an addition unit that calculates addition data based on a cumulative addition value for a predetermined time of the calculated value, and a frequency component extracted by the high-pass filter A value related to a time-series difference signal between the first audio sound signal and the second audio sound signal is calculated, and subtraction data based on a cumulative addition value for a predetermined time of the calculated value. A subtracting unit that calculates the value, an adding unit that adds the addition data and subtracted data calculated by the adding unit and the subtracting unit to the audio-acoustic signal received as metadata, and a sound and audio signal to which metadata is added by the adding unit And a transmitting unit for transmitting to the outside.

本願に開示の送受信装置は、前記加算部により算出された加算データ及び前記減算部により算出された減算データを、所定の下限値及び上限値に基づき変換する変換部を備え、前記付加部は、前記変換部により変換された加算データ及び減算データをメタデータとして受信した音声音響信号に付加するよう構成してある。 The transmission / reception apparatus disclosed in the present application includes a conversion unit that converts the addition data calculated by the addition unit and the subtraction data calculated by the subtraction unit based on a predetermined lower limit value and an upper limit value, and the addition unit includes: The addition data and the subtraction data converted by the conversion unit are added to the audio-acoustic signal received as metadata.

本願に開示の送受信装置は、前記変換部は、前記加算部により算出された加算データ及び前記減算部により算出された減算データの絶対値の内、前記下限値よりも小さい加算データ及び減算データを零へ変換し、前記上限値を超える加算データ及び減算データの絶対値を前記上限値または上限値未満の値へ変換した後、絶対値算出前の符号を変換後の加算データ及び減算データに付加する上下限変換部と、加算データ及び減算データを整数へ変換する整数変換部とを備える。 In the transmitting / receiving apparatus disclosed in the present application, the conversion unit includes addition data and subtraction data smaller than the lower limit value among the absolute value of the addition data calculated by the addition unit and the subtraction data calculated by the subtraction unit. After converting the absolute value of the addition data and subtraction data exceeding the upper limit value to a value less than the upper limit value or lower limit value, the sign before calculating the absolute value is added to the converted addition data and subtraction data. An upper / lower limit conversion unit, and an integer conversion unit that converts the addition data and the subtraction data into integers.

本願に開示の送受信装置は、前記下限値は３、前記上限値は２５５である。 In the transmission / reception apparatus disclosed in the present application, the lower limit value is 3, and the upper limit value is 255.

本願に開示の送受信装置は、音声音響信号を受信し、受信した音声音響信号を送信する送受信装置において、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出部と、該抽出部により抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出するハイパスフィルタと、該ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する第１実効値算出部と、前記ハイパスフィルタにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する第２実効値算出部と、前記第１実効値算出部及び第２実効値算出部により算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する付加部と、該付加部によりメタデータが付加された音声音響信号を外部へ送信する送信部とを備える。 The transmission / reception apparatus disclosed in the present application is a transmission / reception apparatus that receives a sound sound signal and transmits the received sound sound signal, and extracts an extraction unit that extracts a first sound sound signal and a second sound sound signal related to the received sound sound signal. A high-pass filter that extracts a frequency component equal to or higher than a predetermined frequency from the first audio-acoustic signal and the second audio-acoustic signal extracted by the extraction unit, and a first audio-acoustic signal related to the frequency component extracted by the high-pass filter A first effective value calculation unit for calculating a first effective value in time series, and a second effective value calculation for calculating a second effective value in time series of the second audio-acoustic signal related to the frequency component extracted by the high-pass filter. And an adding unit that adds the first effective value and the second effective value calculated by the first effective value calculating unit and the second effective value calculating unit to the received audio-acoustic signal as metadata. And a transmission unit for transmitting the audio acoustic signals metadata is added by the adding unit to the outside.

本願に開示の送受信装置は、前記第１実効値算出部にて算出した第１実効値及び前記第２実効値算出部で算出した第２実効値の対数に基づき変換する変換部を備え、前記付加部は、前記変換部により変換された対数に係る第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加するよう構成してある。 The transmission / reception device disclosed in the present application includes a conversion unit that performs conversion based on a logarithm of the first effective value calculated by the first effective value calculation unit and the second effective value calculated by the second effective value calculation unit, The adding unit is configured to add the first effective value and the second effective value relating to the logarithm converted by the converting unit to the received audio-acoustic signal as metadata.

本願に開示の送受信装置は、前記所定周波数は２０Ｈｚである。 In the transmission / reception apparatus disclosed in the present application, the predetermined frequency is 20 Hz.

本願に開示の送受信装置は、前記抽出部は、受信した音声音響信号が第１音声音響信号及び第２音声音響信号を超える複数種類の音声音響信号を有する場合、該複数種類の音声音響信号を第１音声音響信号及び第２音声音響信号へ変換するよう構成してある。 In the transmission / reception device disclosed in the present application, when the received audio sound signal has a plurality of types of audio sound signals exceeding the first audio sound signal and the second audio sound signal, the extraction unit outputs the plurality of types of audio sound signals. The first voice sound signal and the second voice sound signal are converted.

本願に開示の送受信方法は、音声音響信号を送受信装置にて受信し、受信した音声音響信号を前記送受信装置から外部へ送信する送受信方法において、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出ステップと、該抽出ステップにより抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出する成分抽出ステップと、該成分抽出ステップにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する加算ステップと、前記成分抽出ステップにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する減算ステップと、前記加算ステップ及び減算ステップにより算出した加算データ及び減算データをメタデータとして受信した音声音響信号に付加する付加ステップと、該付加ステップによりメタデータが付加された音声音響信号を外部へ送信する送信ステップとを含む。 The transmission / reception method disclosed in the present application is a transmission / reception method in which a sound / acoustic signal is received by a transmission / reception device, and the received sound / acoustic signal is transmitted from the transmission / reception device to the outside. An extraction step for extracting the second audio sound signal, a component extraction step for extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step, and an extraction by the component extraction step An addition step of calculating a value related to a time-series sum signal of the first audio acoustic signal and the second audio acoustic signal related to the frequency component and calculating addition data based on a cumulative addition value for a predetermined time of the calculated value; And calculating a value related to a time-series difference signal between the first audio sound signal and the second audio sound signal related to the frequency component extracted by the component extraction step. A subtraction step for calculating subtraction data based on a cumulative addition value for a predetermined time of the calculated value, and the addition data and subtraction data calculated by the addition step and the subtraction step are added to the received audio-acoustic signal as metadata. An adding step, and a transmitting step of transmitting the audio-acoustic signal to which the metadata is added in the adding step to the outside.

本願に開示の送受信方法は、音声音響信号を送受信装置にて受信し、受信した音声音響信号を前記送受信装置から外部へ送信する送受信方法において、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出ステップと、該抽出ステップにより抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出する成分抽出ステップと、該成分抽出ステップにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する第１実効値算出ステップと、前記成分抽出ステップにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する第２実効値算出ステップと、前記第１実効値算出ステップ及び第２実効値算出ステップにより算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する付加ステップと、該付加ステップによりメタデータが付加された音声音響信号を外部へ送信する送信ステップとを含む。 The transmission / reception method disclosed in the present application is a transmission / reception method in which a sound / acoustic signal is received by a transmission / reception device, and the received sound / acoustic signal is transmitted from the transmission / reception device to the outside. An extraction step for extracting the second audio sound signal, a component extraction step for extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step, and an extraction by the component extraction step A first effective value calculating step of calculating a first effective value of the time series of the first audio-acoustic signal related to the frequency component thus obtained, and a time series of the second audio-acoustic signal related to the frequency component extracted by the component extracting step A second effective value calculating step for calculating a second effective value of the first effective value, and a first actual value calculated by the first effective value calculating step and the second effective value calculating step. Comprising an adding step of adding the voice sound signals received value and the second effective value as metadata, and a transmission step of transmitting the audio acoustic signals metadata is added by the addition step to the outside.

本願に開示のプログラムは、音声音響信号を受信して外部へ該音声音響信号を送信するコンピュータに用いられるプログラムにおいて、コンピュータに、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出ステップと、該抽出ステップにより抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出する成分抽出ステップと、該成分抽出ステップにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する加算ステップと、前記成分抽出ステップにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する減算ステップと、前記加算ステップ及び減算ステップにより算出した加算データ及び減算データをメタデータとして受信した音声音響信号に付加する付加ステップと、該付加ステップによりメタデータが付加された音声音響信号を外部へ送信する送信ステップとを実行させる。 The program disclosed in the present application is a program used in a computer that receives a sound sound signal and transmits the sound sound signal to the outside, and the first sound sound signal and the second sound sound related to the received sound sound signal are transmitted to the computer. An extraction step for extracting a signal, a component extraction step for extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step, and a frequency component extracted by the component extraction step An addition step of calculating a value related to a time-series sum signal of the first audio acoustic signal and the second audio acoustic signal according to the above and calculating addition data based on a cumulative addition value for a predetermined time of the calculated value; A value related to a time-series difference signal between the first audio sound signal and the second audio sound signal related to the frequency component extracted in the step. A subtraction step for calculating subtraction data based on a cumulative addition value for a predetermined time of the calculated value, and the addition data and subtraction data calculated by the addition step and the subtraction step are added to the received audio-acoustic signal as metadata. An adding step and a transmitting step for transmitting the audio-acoustic signal to which metadata is added in the adding step to the outside are executed.

本願に開示のプログラムは、音声音響信号を受信して外部へ該音声音響信号を送信するコンピュータに用いられるプログラムにおいて、コンピュータに、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出ステップと、該抽出ステップにより抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出する成分抽出ステップと、該成分抽出ステップにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する第１実効値算出ステップと、前記成分抽出ステップにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する第２実効値算出ステップと、前記第１実効値算出ステップ及び第２実効値算出ステップにより算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する付加ステップと、該付加ステップによりメタデータが付加された音声音響信号を外部へ送信する送信ステップとを実行させる。 The program disclosed in the present application is a program used in a computer that receives a sound sound signal and transmits the sound sound signal to the outside, and the first sound sound signal and the second sound sound related to the received sound sound signal are transmitted to the computer. An extraction step for extracting a signal, a component extraction step for extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step, and a frequency component extracted by the component extraction step A first effective value calculating step for calculating a first effective value of the time series of the first audio-acoustic signal related to the second effective value of the time series of the second audio-acoustic signal related to the frequency component extracted by the component extracting step. A first effective value calculating step for calculating a value, and a first effective value calculating step calculated by the first effective value calculating step and the second effective value calculating step. An adding step of adding the voice sound signals received the effective value and the second effective value as metadata, to execute a transmission step of transmitting voice sound signals metadata is added by the addition step to the outside.

本願に開示する装置によれば、抽出部は、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する。ハイパスフィルタは、これら抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出する。加算部は、抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する。同様に減算部は、抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する。この加算データ及び減算データはメタデータとして付加部により音声音響信号に付加される。最後に、送信部はメタデータが付加された音声音響信号を外部へ送信する。 According to the device disclosed in the present application, the extraction unit extracts the first audio acoustic signal and the second audio acoustic signal related to the received audio acoustic signal. The high-pass filter extracts a frequency component of a predetermined frequency or higher from the extracted first audio sound signal and second audio sound signal. The adding unit calculates a value related to the time-series sum signal of the first audio acoustic signal and the second audio acoustic signal related to the extracted frequency component, and adds the addition data based on a cumulative addition value for a predetermined time of the calculated value. calculate. Similarly, the subtraction unit calculates a value related to the time-series difference signal between the first audio acoustic signal and the second audio acoustic signal related to the extracted frequency component, and subtracts the calculated value based on a cumulative addition value for a predetermined time. Calculate the data. The addition data and subtraction data are added as metadata to the audio-acoustic signal by the adding unit. Finally, the transmission unit transmits the audio / acoustic signal to which the metadata is added to the outside.

本願に開示する装置によれば、変換部は、算出した加算データ及び減算データを、所定の下限値及び上限値に基づき変換する。そして付加部は、変換部により変換された加算データ及び減算データをメタデータとして受信した音声音響信号に付加する。 According to the device disclosed in the present application, the conversion unit converts the calculated addition data and subtraction data based on the predetermined lower limit value and upper limit value. Then, the adding unit adds the addition data and subtraction data converted by the conversion unit to the received audio-acoustic signal as metadata.

本願に開示する装置によれば、抽出部は、受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する。ハイパスフィルタは、これら抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出する。第１実効値算出部は、ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する。同様に第２実効値算出部は、ハイパスフィルタにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する。そして、付加部はこの算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する。最後に送信部はメタデータが付加された音声音響信号を外部へ送信する。 According to the device disclosed in the present application, the extraction unit extracts the first audio acoustic signal and the second audio acoustic signal related to the received audio acoustic signal. The high-pass filter extracts a frequency component of a predetermined frequency or higher from the extracted first audio sound signal and second audio sound signal. The first effective value calculation unit calculates a first effective value in a time series of the first audio-acoustic signal related to the frequency component extracted by the high-pass filter. Similarly, a 2nd effective value calculation part calculates the 2nd time effective 2nd effective value of the 2nd audio | voice sound signal which concerns on the frequency component extracted by the high pass filter. The adding unit adds the calculated first effective value and second effective value to the received audio-acoustic signal as metadata. Finally, the transmission unit transmits the audio / acoustic signal to which the metadata is added to the outside.

本願に開示する装置によれば、変換部は、第１実効値算出部にて算出した第１実効値及び第２実効値算出部で算出した第２実効値の対数に基づき変換する。そして、付加部は変換部により変換された対数に係る第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する。 According to the device disclosed in the present application, the conversion unit performs conversion based on the logarithm of the first effective value calculated by the first effective value calculation unit and the second effective value calculated by the second effective value calculation unit. The adding unit adds the first effective value and the second effective value relating to the logarithm converted by the converting unit to the received audio-acoustic signal as metadata.

当該装置の一観点によれば、ハイパスフィルタにより第１音声音響信号及び第２音声音響信号の直流成分を排除することができる。従って伝送の際の符号化及び復号処理に伴い直流成分が多く存在する符号化復号形式または直流成分が排除された符号化復号形式の如何にかかわらず、加算データ及び減算データを算出する際、量子化ノイズ等を伝送障害ノイズと誤って判断する事態を回避することが可能となる等、本発明は優れた効果を奏する。 According to one aspect of the apparatus, the high-pass filter can eliminate the direct current components of the first audio sound signal and the second audio sound signal. Therefore, when calculating addition data and subtraction data, regardless of the encoding / decoding format in which a large amount of DC component is present or the encoding / decoding format in which the DC component is excluded, in the encoding and decoding processes during transmission, The present invention has an excellent effect, for example, by making it possible to avoid a situation in which misalignment noise or the like is erroneously determined as transmission failure noise.

実施の形態１
以下本発明の実施の形態を、図面を参照して説明する。図１は伝送システムの概要を示す模式図である。伝送システムはキー局に設けられる送受信装置１、中継局に設けられる送受信装置１、送受信装置１を含んで構成される。制作された映像データ及び音声音響信号からなる番組素材（以下、音声データという）は、複数の中継局を経てキー局に伝送される。その後、放送局で映像データ及び音声データが加工され、キー局から複数の中継局を経て図示しないユーザのチューナに伝送される。キー局及び中継局に設けられる送受信装置１は放送データ中の音声データを分析し、音声データの特徴量である加算データ及び減算データ（以下、場合によりまとめてメタデータという）を算出する。送受信装置１（以下、メタデータ算出器１）は受信した音声データからメタデータを算出する。メタデータ算出器１は算出したメタデータを音声データに付加し、後段の中継局のメタデータ算出器１へ送信する。以下では、音声データを送信するメタデータ算出器１を前段とし、当該メタデータ算出器１から音声データを受信するメタデータ算出器１を後段とする。 Embodiment 1
Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a schematic diagram showing an outline of a transmission system. The transmission system includes a transmission / reception device 1 provided in a key station, a transmission / reception device 1 provided in a relay station, and a transmission / reception device 1. Program material (hereinafter referred to as audio data) made up of the produced video data and audio sound signals is transmitted to the key station via a plurality of relay stations. Thereafter, the video data and audio data are processed at the broadcasting station, and transmitted from the key station to a user tuner (not shown) via a plurality of relay stations. The transmission / reception device 1 provided in the key station and the relay station analyzes the audio data in the broadcast data, and calculates addition data and subtraction data (hereinafter, collectively referred to as metadata in some cases), which are feature amounts of the audio data. The transmission / reception device 1 (hereinafter, metadata calculator 1) calculates metadata from the received audio data. The metadata calculator 1 adds the calculated metadata to the audio data, and transmits it to the metadata calculator 1 of the subsequent relay station. In the following, the metadata calculator 1 that transmits audio data is the front stage, and the metadata calculator 1 that receives the audio data from the metadata calculator 1 is the rear stage.

図２はメタデータ算出器１のハードウェア構成を示すブロック図である。メタデータ算出器１は、デマルチプレクサ１１、抽出部１２、メタデータ保持部１３、メタデータ算出部１４、メタデータ付加部１５、付加部１７、送信部１８、ハイパスフィルタ（以下、ＨＰＦという）１１０、及び、変換部１１１等を含んで構成される。メタデータ算出器１にはＭＰＥＧ（Moving Pictures Experts Group）、ＡＡＣまたはdolby-E等により圧縮されたＡＶストリームが入力される。なお、ＡＶストリームには映像データ及び音声データの双方が含まれるが、本実施の形態においては映像データの記載を省略する。音声データはＡＡＣまたはdolby-E形式等によりエンコードされており、図示しないデコーダによりデコードされた音声データ及び後述する特定データ（識別情報）がメタデータ算出器１へ入力される。また、制作した映像データ及び音声データをキー局（放送局）へ伝送する際には、非圧縮のまま音声データを伝送することもある。 FIG. 2 is a block diagram showing a hardware configuration of the metadata calculator 1. The metadata calculator 1 includes a demultiplexer 11, an extraction unit 12, a metadata holding unit 13, a metadata calculation unit 14, a metadata addition unit 15, an addition unit 17, a transmission unit 18, and a high-pass filter (hereinafter referred to as HPF) 110. And the conversion unit 111 and the like. An AV stream compressed by MPEG (Moving Pictures Experts Group), AAC, dolby-E or the like is input to the metadata calculator 1. The AV stream includes both video data and audio data, but the description of the video data is omitted in this embodiment. The audio data is encoded in the AAC or dolby-E format or the like, and the audio data decoded by a decoder (not shown) and specific data (identification information) described later are input to the metadata calculator 1. In addition, when the produced video data and audio data are transmitted to a key station (broadcast station), the audio data may be transmitted without being compressed.

メタデータ算出器１へ入力された音声データ及び特定データはデマルチプレクサ１１へ入力される。デマルチプレクサ１１は音声データに付加されたメタデータ及び特定データを抽出し、抽出したメタデータ及び特定データ、並びに、メタデータ及び特定データが取り除かれた音声データを分離して出力する。なお、この付加されたメタデータは前段のキー局または中継局のメタデータ算出器１にて算出されたメタデータである。このメタデータの算出処理及び特定データの内容については後述する。デマルチプレクサ１１にて分離された音声データは付加部１７及び抽出部１２にそれぞれ出力される。デマルチプレクサ１１にて分離されたメタデータ及び特定データはメタデータ保持部１３へ出力される。 The audio data and specific data input to the metadata calculator 1 are input to the demultiplexer 11. The demultiplexer 11 extracts the metadata and specific data added to the audio data, and separates and outputs the extracted metadata and specific data and the audio data from which the metadata and specific data have been removed. Note that the added metadata is metadata calculated by the metadata calculator 1 of the preceding key station or relay station. The metadata calculation process and the contents of the specific data will be described later. The audio data separated by the demultiplexer 11 is output to the adding unit 17 and the extracting unit 12, respectively. The metadata and specific data separated by the demultiplexer 11 are output to the metadata holding unit 13.

抽出部１２は入力された音声データに係る第１音声データ（以下、左音声データ）及び第２音声データ（以下、右音声データ）を抽出し、左音声データ及び右音声データを、ＨＰＦ１１０を介してメタデータ算出部１４へ出力する。すなわち抽出部１２は、音声データが左及び右の２ｃｈから構成される場合は左音声データ及び右音声データをそれぞれ抽出し、抽出した左音声データ及び右音声データを、ＨＰＦ１１０を介してメタデータ算出部１４へ出力する。 The extraction unit 12 extracts first sound data (hereinafter, left sound data) and second sound data (hereinafter, right sound data) related to the input sound data, and the left sound data and the right sound data are passed through the HPF 110. To the metadata calculation unit 14. That is, the extraction unit 12 extracts left audio data and right audio data, respectively, when the audio data is composed of left and right 2ch, and calculates metadata of the extracted left audio data and right audio data via the HPF 110. To the unit 14.

抽出部１２は音声データが２ｃｈの場合、上述した処理を行うが、音声データが２ｃｈを超える３ｃｈ以上の場合は、この３以上の複数チャンネルからなる音声データを、変換部１２１により左音声データ及び右音声データにより構成される２ｃｈの音声データへ変換（ダウンミックス）する。出力部１２２は変換後の２ｃｈに係る左音声データ及び右音声データをＨＰＦ１１０へ出力する。変換部１２１には３ｃｈ以上の音声データを２ｃｈの音声データへ変換するための数式が記憶されており、当該数式に従い変換を行う。本実施の形態においては音声データが例えば５．１ｃｈである例を説明する。 The extraction unit 12 performs the above-described processing when the audio data is 2ch. However, when the audio data is 3ch or more exceeding 2ch, the extraction unit 12 converts the audio data composed of three or more channels into left audio data and Conversion (downmix) into 2ch audio data composed of right audio data. The output unit 122 outputs the left audio data and the right audio data according to 2ch after conversion to the HPF 110. The conversion unit 121 stores mathematical formulas for converting audio data of 3ch or more into audio data of 2ch, and performs conversion according to the mathematical formulas. In the present embodiment, an example in which audio data is, for example, 5.1 ch will be described.

入力される音声データが、左音声データＬ、右音声データＲ、センター音声データＣ、左サラウンドデータＬｓ、及び、右サラウンドデータＲｓとした場合、変換後の左音声データＬ’、変換後の右音声データＲ’は、ＩＳＯ／ＩＥＣ１３８１８−７に従い、下記式（１）で表すことができる。 When the input audio data is left audio data L, right audio data R, center audio data C, left surround data Ls, and right surround data Rs, the converted left audio data L ′ and the converted right audio The audio data R ′ can be expressed by the following formula (1) according to ISO / IEC 13818-7.

また、式（１）は.１ｃｈの低域効果データＬＦＥを含んでいないが、ＬＦＥが存在する場合は、下記式（２）にて変換後の左音声データＬ’、変換後の右音声データＲ’を算出するようにすれば良い。 Further, equation (1) does not include .1ch low-frequency effect data LFE. However, when LFE is present, left audio data L ′ after conversion and right audio data after conversion according to equation (2) below. R ′ may be calculated.

図３は係数Ａの値を示すテーブルである。この値はＩＳＯ／ＩＥＣ１３８１８−７の８．３．７．５の記載に基づくものであり、matrix_mixdown_idxの値によりＡの値が決定される。なお図３に示すテーブルも変換部１２１に記憶されている。以上の如く、変換部１２１により左音声データ及び右音声データに変換された音声データは出力部１２２を介してＨＰＦ１１０へ出力される。なお、本実施の形態においては５．１ｃｈの例を説明したが７．１ｃｈ等の音声データを変換する形態であっても良い。また、ＩＳＯ／ＩＥＣ１３８１８−７の例を用いて、５．１ｃｈからなる音声データを、左音声データ及び右音声データにより構成される２ｃｈの音声データへダウンミックスする例を挙げたが、これに限るものではない。例えばＡＲＩＢ／ＳＴＤ−Ｂ２１等で規定された他のダウンミックスの数式を用いても良い。 FIG. 3 is a table showing the value of the coefficient A. This value is based on the description of 8.3.7.5 of ISO / IEC 13818-7, and the value of A is determined by the value of matrix_mixdown_idx. The table shown in FIG. 3 is also stored in the conversion unit 121. As described above, the audio data converted into the left audio data and the right audio data by the conversion unit 121 is output to the HPF 110 via the output unit 122. In the present embodiment, an example of 5.1 ch has been described, but audio data such as 7.1 ch may be converted. In addition, using the example of ISO / IEC 13818-7, an example was given in which 5.1ch audio data was downmixed to 2ch audio data composed of left audio data and right audio data. It is not limited. For example, other downmix formulas defined by ARIB / STD-B21 or the like may be used.

ＨＰＦ１１０は出力部１２２から出力された左音声データ及び右音声データのそれぞれの所定周波数以上の周波数成分を抽出する。ＨＰＦ１１０は所定周波数以上の周波数成分を抽出した後の左音声データ及び右音声データをメタデータ算出部１４へ出力する。ＨＰＦ１１０における所定周波数は内部のメモリ（図示せず）に遮断周波数として記憶されている。この遮断周波数は図示しない入力部により適宜値を変更することが可能である。遮断周波数は、例えば２０Ｈｚ以下の値とすれば良く、好ましくは６Ｈｚ程度とすればよい。 The HPF 110 extracts frequency components of the left audio data and the right audio data output from the output unit 122 that are equal to or higher than a predetermined frequency. The HPF 110 outputs the left audio data and the right audio data after extracting frequency components of a predetermined frequency or higher to the metadata calculation unit 14. The predetermined frequency in the HPF 110 is stored as a cutoff frequency in an internal memory (not shown). This cut-off frequency can be appropriately changed by an input unit (not shown). The cut-off frequency may be a value of 20 Hz or less, for example, preferably about 6 Hz.

図４はＨＰＦ１１０の構成を示すブロック図である。ＨＰＦ１１０は加算器１１０１、加算器１１０２、遅延回路１１０３及び遅延回路１１０４等を含む。ここで入力をx[n]、出力をy[n]、中間出力をv[n]とした場合、中間出力v[n]は以下の式（３）で表すことができる。
v[n] = x[n] + a₁v[n-1] + a₂v[n-2] ・・・（３） FIG. 4 is a block diagram showing the configuration of the HPF 110. The HPF 110 includes an adder 1101, an adder 1102, a delay circuit 1103, a delay circuit 1104, and the like. When the input is x [n], the output is y [n], and the intermediate output is v [n], the intermediate output v [n] can be expressed by the following equation (3).
v [n] = x [n] + a ₁ v [n-1] + a ₂ v [n-2] (3)

そして、出力y[n]は以下の式（４）で表すことができる。
y[n] = b₀v[n] + b₁v[n-1] + b₂v[n-2] ・・・（４） The output y [n] can be expressed by the following equation (4).
y [n] = b ₀ v [n] + b ₁ v [n-1] + b ₂ v [n-2] (4)

ここで係数b₀は0.999439161786443、係数a₁は-1.998878015320690、係数b₁は-1.998878323572890、係数a₂は0.998878631825079、係数b₂は0.999439161786443である。なお図４に示したＨＰＦ１１０の構成は一例であり他のフィルタを適用しても良い。例えば、図４に示すＨＰＦ１１０を複数段カスケード接続したものであっても良い。直流成分がＨＰＦ１１０により除去された右音声データ及び左音声データはメタデータ算出部１４へ出力される。 Here, the coefficient b ₀ is 0.999439161786443, the coefficient a ₁ is -1.998878015320690, the coefficient b ₁ is -1.998878323572890, the coefficient a ₂ is 0.998878631825079, and the coefficient b ₂ is 0.999439161786443. The configuration of the HPF 110 shown in FIG. 4 is an example, and other filters may be applied. For example, the HPF 110 shown in FIG. The right audio data and the left audio data from which the DC component has been removed by the HPF 110 are output to the metadata calculation unit 14.

メタデータ算出部１４は加算部１４１及び減算部１４２を含んで構成される。加算部１４１は左音声データ及び右音声データの時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する。また減算部１４２は左音声データ及び右音声データの時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する。算出された加算データ及び減算データはメタデータとして、変換部１１１を介してメタデータ付加部１５へ出力される。以下に詳細を説明する。 The metadata calculation unit 14 includes an addition unit 141 and a subtraction unit 142. The adder 141 calculates a value related to the time-series sum signal of the left audio data and the right audio data, and calculates addition data based on a cumulative addition value for a predetermined time of the calculated value. The subtracting unit 142 calculates a value related to a time-series difference signal between the left audio data and the right audio data, and calculates subtraction data based on a cumulative addition value for a predetermined time of the calculated value. The calculated addition data and subtraction data are output as metadata to the metadata adding unit 15 via the conversion unit 111. Details will be described below.

図５は左音声データ及び右音声データの時間的変化を模式的に示すグラフである。図５（ａ）は左音声データの振幅の時間的変化を模式的に示すグラフであり、図５（ｂ）は右音声データの振幅の時間的変化を模式的に示すグラフである。何れも横軸は時間、縦軸は振幅である。入力される音声データは所定時間毎（例えば、NTSC（National Television Standards Committee）映像の１フレームの時間である33.3msの整数倍、または、ＡＡＣの１フレームの符号化時間である42.6msの整数倍）に分割される。以下ではこの所定時間の一単位をフレームという。図５の例では音声データがフレーム１、フレーム２、・・・フレームｊの如く分割され、フレーム毎に加算データ及び減算データが算出される。 FIG. 5 is a graph schematically showing temporal changes of the left audio data and the right audio data. FIG. 5A is a graph schematically showing temporal changes in the amplitude of left audio data, and FIG. 5B is a graph schematically showing temporal changes in the amplitude of right audio data. In either case, the horizontal axis represents time, and the vertical axis represents amplitude. Input audio data is an integer multiple of 33.3ms, which is the time of one frame of NTSC (National Television Standards Committee) video, or an integral multiple of 42.6ms, which is the encoding time of one frame of AAC. ). Hereinafter, one unit of the predetermined time is referred to as a frame. In the example of FIG. 5, the audio data is divided into frame 1, frame 2,... Frame j, and addition data and subtraction data are calculated for each frame.

フレーム１において左音声データはサンプリング周波数に応じて時系列順にＬｉ、Ｌｉ＋１、・・・Ｌｎと表すことができる。同様にフレーム１における右音声データは、時系列順にＲｉ、Ｒｉ＋１、・・・Ｒｎと表すことができる。加算部１４１は左音声データの特定の時間におけるデータと、右音声データの特定の時間におけるデータとを加算し、和信号を算出する。例えばＲｉとＬｉとの和信号を算出する。次に、加算部１４１は加算値を２で除すことにより和信号に関する値を算出する。つまり当該特定時間における左音声データ及び右音声データの平均値を算出する。加算部１４１は、当該処理を１フレーム内に存在する全ての時間のデータに対して行う。つまり、ｉからｎまですべての時系列の組み合わせに対して演算処理を行う。そして加算部１４１は、フレーム内に存在する左音声データ及び右音声データ全ての組み合わせにおける平均値の総和を算出する。加算部１４１はその総和をフレーム内の左音声データ及び右音声データの組み合わせ数で除すことにより、総和の平均値を算出する。具体的には、フレーム１における加算データＡＩＩ（１）は式（５）により表すことができる。なお、以下では加算データを場合によりＡＩＩ（Audio in-phase information）と称する。 In the frame 1, the left audio data can be expressed as Li, Li + 1,... Ln in chronological order according to the sampling frequency. Similarly, the right audio data in frame 1 can be expressed as Ri, Ri + 1,... Rn in time series order. The adder 141 adds the data of the left audio data at a specific time and the data of the right audio data at a specific time to calculate a sum signal. For example, the sum signal of Ri and Li is calculated. Next, the adding unit 141 calculates a value related to the sum signal by dividing the added value by 2. That is, the average value of the left audio data and the right audio data at the specific time is calculated. The adder 141 performs the processing on all time data existing in one frame. That is, arithmetic processing is performed on all time series combinations from i to n. Then, the adding unit 141 calculates the sum of the average values in the combination of all the left audio data and right audio data existing in the frame. The adding unit 141 calculates the average value of the sum by dividing the sum by the number of combinations of the left audio data and the right audio data in the frame. Specifically, the addition data AII (1) in frame 1 can be expressed by equation (5). Hereinafter, the added data is sometimes referred to as AII (Audio in-phase information).

このように、特定時間における平均値、及び、フレーム内の総和の平均値を用いることにより加算データは音声データの最大振幅以下の値となることからデータ量の低減をも図ることが可能となる。なお、本実施の形態においては、加算データに関し右音声データと左音声データとの和の平均値を算出することとしたが、平均値を算出することなく加算値を利用しても良い。つまり式（５）の１／２を１に代えて演算し和信号に関する値としても良い。この場合、加算値の算出が、１フレーム内に存在する左音声データ及び右音声データ全ての組み合わせに対して行われる。この加算値の総和を算出し、さらにその総和の平均値を算出するようにしても良い。さらには、加算データに関し最後に総和の平均値を算出する例につき説明するが、平均値を算出することなく総和を加算データとして算出するようにしても良い。つまり式（５）の１／ｎを１とする演算を行う。加算部１４１は全てのフレーム１〜ｊについて同様の処理を行い加算データＡＩＩ（１）〜加算データＡＩＩ（ｊ）を算出する。 As described above, by using the average value in the specific time and the average value of the sum total in the frame, the added data becomes a value less than or equal to the maximum amplitude of the audio data, so that the data amount can be reduced. . In the present embodiment, the average value of the sum of the right audio data and the left audio data is calculated with respect to the addition data, but the addition value may be used without calculating the average value. That is, 1/2 of equation (5) may be calculated instead of 1 to obtain a value related to the sum signal. In this case, the addition value is calculated for all combinations of left audio data and right audio data existing in one frame. A total sum of the added values may be calculated, and an average value of the total sum may be calculated. Furthermore, although an example in which the average value of the sum is finally calculated regarding the addition data will be described, the sum may be calculated as addition data without calculating the average value. That is, an operation is performed in which 1 / n in the equation (5) is 1. The adder 141 performs the same processing for all frames 1 to j to calculate addition data AII (1) to addition data AII (j).

次いで減算部１４２について説明する。減算部１４２は左音声データの特定の時間におけるデータから、右音声データの特定の時間におけるデータを減算し、差信号を算出する。なお、減算部１４２は右音声データの特定の時間におけるデータから、左音声データの特定の時間におけるデータを減算しても良い。次に、減算部１４２は減算値を２で除すことにより差信号に関する値を算出する。つまり減算部１４２は当該特定時間における減算値の平均値を算出する。減算部１４２は、当該処理を１フレーム内に存在する左音声データ及び右音声データ全ての組み合わせに対して行う。そして減算部１４２はこの平均値の総和を算出し、さらにその総和の平均値を算出する。具体的には、フレーム１における減算データＡＯＩ（１）は式（６）により表すことができる。なお、以下では減算データを場合によりＡＯＩ（Audio out of phase information）と称する。 Next, the subtraction unit 142 will be described. The subtracting unit 142 subtracts the data at the specific time of the right audio data from the data at the specific time of the left audio data to calculate a difference signal. Note that the subtracting unit 142 may subtract data at a specific time of the left audio data from data at a specific time of the right audio data. Next, the subtraction unit 142 calculates a value related to the difference signal by dividing the subtraction value by 2. That is, the subtraction unit 142 calculates an average value of the subtraction values at the specific time. The subtracting unit 142 performs the processing for all combinations of the left audio data and the right audio data existing in one frame. Then, the subtracting unit 142 calculates the sum of the average values, and further calculates the average value of the sums. Specifically, the subtraction data AOI (1) in frame 1 can be expressed by equation (6). Hereinafter, the subtraction data is sometimes referred to as AOI (Audio out of phase information).

減算部１４２においても加算部１４１と同様に、減算データに関し減算値の平均値の算出、及び、総和の平均値の算出を必ずしも実行しなくても良い。すなわち、本実施の形態においては、減算データに関し右音声データと左音声データと差の平均値を算出することとしたが、平均値を算出することなく差の値を差信号に関する値として利用しても良い。つまり式（６）の１／２を１に代えて演算する。この場合、差の値の算出が、１フレーム内に存在する左音声データ及び右音声データ全ての組み合わせに対して行われる。この差の値の総和を算出し、さらにその総和の平均値を算出するようにしても良い。さらには、減算データに関し最後に総和の平均値を算出する例につき説明するが、平均値を算出することなく総和を減算データとして算出するようにしても良い。つまり式（６）の１／ｎを１とする演算を行う。減算部１４２は全てのフレーム１〜ｊについて同様の処理を行い減算データＡＯＩ（１）〜減算データＡＯＩ（ｊ）を算出する。加算部１４１及び減算部１４２は予め記憶した式（５）及び式（６）に基づき、全てのフレームに対して演算が行われた加算データ及び減算データ群をメタデータとして、変換部１１１を介してメタデータ付加部１５へ出力する。 Similarly to the adding unit 141, the subtracting unit 142 may not necessarily calculate the average value of the subtracted values and the average value of the sum for the subtracted data. That is, in the present embodiment, the average value of the difference between the right audio data and the left audio data is calculated for the subtraction data, but the difference value is used as the value for the difference signal without calculating the average value. May be. That is, the calculation is performed by replacing 1/2 of Equation (6) with 1. In this case, the difference value is calculated for all combinations of left audio data and right audio data existing in one frame. The sum of the difference values may be calculated, and an average value of the sum may be calculated. Furthermore, although an example in which the average value of the sum is finally calculated regarding the subtraction data will be described, the sum may be calculated as subtraction data without calculating the average value. That is, an operation is performed in which 1 / n of the equation (6) is 1. The subtraction unit 142 performs the same processing for all the frames 1 to j to calculate subtraction data AOI (1) to subtraction data AOI (j). The adding unit 141 and the subtracting unit 142 use the addition data and the subtraction data group that have been calculated for all the frames as metadata based on the equations (5) and (6) stored in advance as metadata. To the metadata adding unit 15.

続いて、ＨＰＦ１１０をメタデータ算出部１４の前段に適用したことの効果について検討する。図６はＨＰＦ１１０による処理を経ない場合の加算データの時間的変化を示すグラフである。図６のグラフにおける横軸はフレーム数を示し、縦軸は加算データＡＩＩの値を示す。実線は符号化及び復号処理がなされていない音声データに対する加算データの時間的変化を示す。また点線はdolby-Eによる符号化及び復号処理がなされた音声データに対する加算データの時間的変化を示す。なお実験に用いた音声は女性のアナウンス約２０秒であり、ランダムノイズまたはバーストノイズ等の各種ノイズの影響を受けていない。 Next, the effect of applying the HPF 110 to the previous stage of the metadata calculation unit 14 will be examined. FIG. 6 is a graph showing temporal changes in the added data when the process by the HPF 110 is not performed. In the graph of FIG. 6, the horizontal axis indicates the number of frames, and the vertical axis indicates the value of the addition data AII. A solid line indicates a temporal change of the addition data with respect to the audio data that has not been encoded and decoded. The dotted line shows the temporal change of the added data with respect to the audio data that has been encoded and decoded by dolby-E. The voice used in the experiment is about 20 seconds of female announcement, and is not affected by various noises such as random noise or burst noise.

図６の実線及び点線のグラフを比較した場合、dolby-E（点線）が符号化及び復号処理を経ていない実線に対し、全体的にオフセットしていることが理解できる。また図示しないが、ＡＡＣによる符号化及び復号処理を経た音声データは実線で示す符号化及び復号処理を経ていない音声データにその特性がほぼ一致することが確認できた。音声データを伝送する場合、様々な形式の符号化及び復号処理が適用される。その場合、dolby-E等の符号化及び復号処理を経た加算データは、符号化及び復号処理を経ていない音声データに係る加算データとその特性が相違する。その一方で、ＡＡＣ等の符号化及び復号処理を経た加算データは、符号化及び復号処理を経ていない音声データに係る加算データとその特性が近似する。 When comparing the solid line and dotted line graphs in FIG. 6, it can be understood that dolby-E (dotted line) is entirely offset with respect to the solid line that has not undergone encoding and decoding processing. Although not shown, it has been confirmed that the characteristics of the voice data that has undergone the encoding and decoding processes by AAC substantially match the characteristics of the voice data that has not undergone the encoding and decoding processes indicated by the solid lines. When audio data is transmitted, various types of encoding and decoding processes are applied. In that case, the added data that has undergone encoding and decoding processing such as dolby-E is different from the additional data related to the audio data that has not undergone encoding and decoding processing. On the other hand, the addition data that has undergone encoding and decoding processing such as AAC approximates the characteristics of the addition data related to audio data that has not undergone encoding and decoding processing.

本願出願人は、様々な形式の符号化及び復号処理を経た音声データに係る加算データを柔軟に活用すべく鋭意研究を重ねた結果、メタデータ算出部１４の前段にＨＰＦ１１０を設けることにより、この問題を解決した。図７はＨＰＦ１１０による処理を経た場合の加算データの時間的変化を示すグラフである。横軸及び縦軸の値は図６と同様である。実線は符号化及び復号処理がなされていないが、ＨＰＦ１１０による処理がなされた音声データに対する加算データの時間的変化を示す。また点線はdolby-Eによる符号化及び復号処理及びＨＰＦ１１０による処理がなされた音声データに対する加算データの時間的変化を示す。なお、図７の例では、図４に示すＨＰＦ１１０を４段カスケード接続したフィルタを用い、また遮断周波数を６Ｈｚとした例を示す。 The applicant of the present application has conducted extensive research to flexibly utilize the added data related to the audio data that has undergone various types of encoding and decoding processes. As a result, by providing the HPF 110 in the preceding stage of the metadata calculation unit 14, Solved the problem. FIG. 7 is a graph showing a temporal change in the added data when the process by the HPF 110 is performed. The values on the horizontal axis and the vertical axis are the same as in FIG. The solid line shows the temporal change of the added data with respect to the audio data that has not been encoded and decoded but is processed by the HPF 110. A dotted line indicates a temporal change of the addition data with respect to the audio data subjected to the encoding and decoding processing by dolby-E and the processing by HPF 110. 7 shows an example in which a filter in which HPFs 110 shown in FIG. 4 are cascaded in four stages is used and the cutoff frequency is 6 Hz.

図７に示すとおり、ＨＰＦ１１０による周波数成分の抽出処理によりオフセット量が低減され、符号化及び復号処理を経ていない音声データに係る加算データとその特性がほぼ一致していることが理解できる。これにより、様々な符号化及び復号処理を経た音声データを受信した場合でも、オフセット量を低減できることから量子化ノイズ等を誤って伝送障害ノイズと検出する事態を回避することが可能となる。なお、実験で用いた遮断周波数は６Ｈｚであるが２０Ｈｚ以下の値を適宜採用すればよい。これは２０Ｈｚ以下は人間にとって聴感度が低い周波数であり、この周波数帯域に障害が発生したとしても聞こえない、つまり人間が障害と認識できないからである。 As shown in FIG. 7, it can be understood that the offset amount is reduced by the frequency component extraction processing by the HPF 110, and the added data related to the audio data that has not undergone the encoding and decoding processing and the characteristics are almost the same. As a result, even when audio data that has undergone various encoding and decoding processes is received, the amount of offset can be reduced, so that it is possible to avoid a situation in which quantization noise or the like is erroneously detected as transmission failure noise. The cut-off frequency used in the experiment is 6 Hz, but a value of 20 Hz or less may be adopted as appropriate. This is because a frequency of 20 Hz or less is a frequency with low hearing sensitivity for humans, and even if a fault occurs in this frequency band, it cannot be heard, that is, humans cannot recognize it as a fault.

ＨＰＦ１１０をメタデータ算出部１４の前段に設けることによる効果は減算データＡＯＩにおいても実証された。図８はＨＰＦ１１０による処理を経ない場合の減算データの時間的変化を示すグラフである。図８のグラフにおける横軸はフレーム数を示し、縦軸は減算データＡＯＩの値を示す。実線は符号化及び復号処理がなされていない音声データに対する減算データの時間的変化を示す。また点線はdolby-Eによる符号化及び復号処理がなされた音声データに対する減算データの時間的変化を示す。図８に示す如く、dolby-Eによる符号化及び復号処理を経た減算データは、符号化及び復号処理を経ていない減算データに対し大きく相違する。減算データも加算データと同じく、符号化及び復号処理の形式如何によってはオフセット量が相違する。 The effect of providing the HPF 110 in the previous stage of the metadata calculation unit 14 has been demonstrated also in the subtraction data AOI. FIG. 8 is a graph showing temporal changes in subtraction data when the process by the HPF 110 is not performed. In the graph of FIG. 8, the horizontal axis indicates the number of frames, and the vertical axis indicates the value of the subtraction data AOI. A solid line indicates a temporal change of subtraction data with respect to audio data that has not been encoded and decoded. A dotted line indicates a temporal change of subtraction data with respect to audio data that has been encoded and decoded by dolby-E. As shown in FIG. 8, the subtraction data that has undergone the encoding and decoding processing by dolby-E is greatly different from the subtraction data that has not undergone the encoding and decoding processing. Similarly to the addition data, the subtraction data differs in offset amount depending on the format of encoding and decoding processing.

図９はＨＰＦ１１０による処理を経た場合の減算データの時間的変化を示すグラフである。横軸及び縦軸の値は図８と同様である。実線は符号化及び復号処理がなされていないが、ＨＰＦ１１０による処理がなされた音声データに対する減算データの時間的変化を示す。また点線はdolby-Eによる符号化及び復号処理及びＨＰＦ１１０による処理がなされた音声データに対する減算データの時間的変化を示す。なお、ＨＰＦ１１０の構成及び遮断周波数は加算データの実験と同様のものを用いた。図９に示す如く、減算データにおいても加算データと同じくオフセット量が低減され符号化及び復号処理を経た減算データが符号化及び復号処理を経ていない減算データにほぼ一致していることが理解できる。 FIG. 9 is a graph showing temporal changes in the subtraction data when the processing by the HPF 110 is performed. The values on the horizontal and vertical axes are the same as in FIG. The solid line shows the temporal change of the subtraction data with respect to the audio data that has not been encoded and decoded, but has been processed by the HPF 110. A dotted line shows a temporal change of subtraction data with respect to audio data that has been encoded and decoded by dolby-E and processed by HPF 110. The configuration and cutoff frequency of the HPF 110 were the same as those used in the addition data experiment. As shown in FIG. 9, it can be understood that the subtraction data in the subtraction data is substantially the same as the subtraction data in which the offset amount is reduced and the encoding and decoding processes are not performed, as in the addition data.

メタデータ算出部１４により算出された加算データ及び減算データは変換部１１１へ出力される。変換部１１１は上下限変換部１１１０及び整数変換部１１１１を含む。上下限変換部１１１０は加算データ及び減算データの絶対値を求め、内部のメモリ（図示せず）に記憶した上限値及び下限値に基づき加算データ及び減算データの絶対値を変換する。具体的には、下限値として３、上限値として２５５がメモリに記憶されている。なおこれらの数値は一例でありこれに限るものではない。上下限変換部１１１０は、加算データ及び減算データの絶対値が下限値３よりも小さい場合、零へその値を変換する。なお、必ずしも零に変換する必要はなく３より小さい１、または２に変換してもよい。この場合、上下限変換部１１１０は変換前の加算データ及び減算データに予め付与されていた絶対値算出前の符号を、変換後の加算データ及び減算データに付加する。例えば、加算データが−２の場合、絶対値２が算出され、変換処理により１へ変換される。最後に上下限変換部１１１０は元の符号−を付加して−１を得る。 The addition data and subtraction data calculated by the metadata calculation unit 14 are output to the conversion unit 111. The conversion unit 111 includes an upper / lower limit conversion unit 1110 and an integer conversion unit 1111. An upper / lower limit conversion unit 1110 obtains absolute values of addition data and subtraction data, and converts the absolute values of addition data and subtraction data based on an upper limit value and a lower limit value stored in an internal memory (not shown). Specifically, 3 is stored in the memory as the lower limit value and 255 as the upper limit value. In addition, these numerical values are examples and are not limited thereto. When the absolute values of the addition data and the subtraction data are smaller than the lower limit value 3, the upper / lower limit conversion unit 1110 converts the values to zero. It is not always necessary to convert to zero, and it may be converted to 1 or 2 smaller than 3. In this case, the upper / lower limit conversion unit 1110 adds the code before the absolute value calculation, which was previously assigned to the addition data and subtraction data before conversion, to the addition data and subtraction data after conversion. For example, when the addition data is −2, the absolute value 2 is calculated and converted to 1 by the conversion process. Finally, the upper and lower limit conversion unit 1110 adds the original code-to obtain -1.

さらに上下限変換部１１１０は加算データ及び減算データの絶対値が上限値２５５を超える場合、加算データ及び減算データの絶対値を上限値２５５または２５５未満の値へ変換する。次いで上下限変換部１１１０は変換前の加算データ及び減算データに予め付与されていた絶対値算出前の符号を、変換後の加算データ及び減算データに付加する。具体的には、加算データが−３２９である場合、加算データの絶対値が３２９と算出され、上限値を超える。上下限変換部１１１０は、加算データの絶対値を上限値２５５または上限値未満の７３（３２９−２５６）に変換する。上限値未満へ変換する場合、例えば予めメモリに記憶された２５４または２５３とする他、零としても良い。その他、上限値未満へ変換する処理の一例として、加算データまたは減算データの絶対値から２のｎ乗に係る値を減算して、所定ビット（例えば８ビット）以下で表現できる数値に変換しても良い。本例では２の８乗に係る値を減算している。その他、２の７乗に係る値を減算しても良い。 Furthermore, when the absolute values of the addition data and the subtraction data exceed the upper limit value 255, the upper / lower limit conversion unit 1110 converts the absolute values of the addition data and the subtraction data into a value less than the upper limit value 255 or 255. Next, the upper / lower limit conversion unit 1110 adds the code before the absolute value calculation, which is previously given to the addition data and subtraction data before conversion, to the addition data and subtraction data after conversion. Specifically, when the addition data is −329, the absolute value of the addition data is calculated as 329, which exceeds the upper limit value. The upper / lower limit conversion unit 1110 converts the absolute value of the addition data into an upper limit value 255 or 73 (329-256) less than the upper limit value. When converting to less than the upper limit value, for example, 254 or 253 stored in the memory in advance may be used, or zero may be used. In addition, as an example of processing for conversion to less than the upper limit value, a value related to the nth power of 2 is subtracted from the absolute value of the addition data or subtraction data, and converted to a numerical value that can be expressed by a predetermined bit (for example, 8 bits) or less. Also good. In this example, the value related to the power of 2 is subtracted. In addition, a value related to 2 to the seventh power may be subtracted.

上下限変換部１１１０は変換前の加算データ及び減算データに予め付与されていた絶対値算出前の符号を、変換後の加算データ及び減算データに付加する。上述の例では加算データが−３２９であるので、変換後の７３に符号−を付加して−７３を得る。加算データがより小さい場合、例えば−９５６である場合、上限値２５５または上限値未満の１８８（９５６−５１２−２５６）に変換する。本例では、変換後の値を８ビット以下とすべく、２のｎ乗に係る値として、２の９乗及び２の８乗を減算している。次いで上下限変換部１１１０は変換前の加算データ及び減算データに予め付与されていた絶対値算出前の符号を、変換後の加算データ及び減算データに付加する。本例の場合、加算データは−１８８となる。 The upper / lower limit conversion unit 1110 adds the code before the absolute value calculation, which has been previously given to the addition data and subtraction data before conversion, to the addition data and subtraction data after conversion. In the above example, since the addition data is −329, a sign − is added to 73 after conversion to obtain −73. When the addition data is smaller, for example, −956, the upper limit value 255 or less than the upper limit value 188 (956-512-256) is converted. In this example, 2 9 to the 2nd power and 2 to the 8th power are subtracted as values related to the 2 n power so that the converted value is 8 bits or less. Next, the upper / lower limit conversion unit 1110 adds the code before the absolute value calculation, which is previously given to the addition data and subtraction data before conversion, to the addition data and subtraction data after conversion. In this example, the addition data is −188.

また本実施の形態においては、上下限変換部１１１０において、正負の値を持つ加算データ及び減算データの絶対値を求めてから変換する処理を述べるが、これに限るものではない。上下限変換部１１１０はメモリ（図示せず）内に下限値に対応する第１範囲、及び、第２範囲を記憶している。第１範囲は例えば−３より大きく０より小さいと記憶され、第２範囲は例えば０より大きく＋３より小さいと記憶されている。また上下限変換部１１１０はメモリ（図示せず）内に上限値に対応する第３範囲、及び、第４範囲を記憶している。第３範囲は例えば−２５５よりも小さいと記憶されており、第４範囲は＋２５５よりも大きいと記憶されている。 In the present embodiment, the upper / lower limit conversion unit 1110 describes processing for obtaining the absolute values of the addition data and the subtraction data having positive and negative values and then converting them. However, the present invention is not limited to this. The upper / lower limit conversion unit 1110 stores a first range and a second range corresponding to the lower limit value in a memory (not shown). For example, the first range is stored as being larger than −3 and smaller than 0, and the second range is stored as being larger than 0 and smaller than +3, for example. The upper and lower limit conversion unit 1110 stores a third range and a fourth range corresponding to the upper limit value in a memory (not shown). For example, the third range is stored as being smaller than −255, and the fourth range is stored as being larger than +255.

上下限変換部１１１０は加算データ及び減算データを受け付けた場合、加算データ及び減算データが第１範囲乃至第４範囲に属するか否かを判断する。上下限変換部１１１０は第１範囲に属すると判断した場合、加算データ及び減算データを、例えば零、或いは−３より大きい負の値、例えば−１に変換する。上下限変換部１１１０は第２範囲に属すると判断した場合、加算データ及び減算データを、例えば零、或いは＋３より小さい正の値、例えば＋１に変換する。上下限変換部１１１０は第３範囲に属すると判断した場合、−２５５または−２５５よりも大きい負の値へ変換する。例えば−２５４、零、または加算データまたは減算データから２のｎ乗に係る値を加算して、所定ビット（例えば８ビット）以下で表現できる負の数値等とすれば良い。同様に上下限変換部１１１０は第４範囲に属すると判断した場合、＋２５５または２５５より小さい正の値へ変換する。例えば＋２５４、零、または加算データまたは減算データから２のｎ乗に係る値を減算して、所定ビット（例えば８ビット）以下で表現できる正の数値等とすれば良い。 When receiving the addition data and the subtraction data, the upper / lower limit conversion unit 1110 determines whether the addition data and the subtraction data belong to the first range to the fourth range. When it is determined that the upper / lower limit conversion unit 1110 belongs to the first range, the addition data and the subtraction data are converted to, for example, zero or a negative value larger than −3, for example, −1. When it is determined that the upper / lower limit conversion unit 1110 belongs to the second range, the addition data and the subtraction data are converted into, for example, zero or a positive value smaller than +3, for example, +1. When it is determined that the upper / lower limit conversion unit 1110 belongs to the third range, the upper / lower limit conversion unit 1110 converts the value into a negative value larger than −255 or −255. For example, it is possible to add −254, zero, or a value related to the nth power of 2 from addition data or subtraction data to obtain a negative numerical value that can be expressed by a predetermined bit (for example, 8 bits) or less. Similarly, when it is determined that the upper / lower limit conversion unit 1110 belongs to the fourth range, the upper / lower limit conversion unit 1110 converts the value into a positive value smaller than +255 or 255. For example, +254, zero, or a positive numerical value that can be expressed by a predetermined bit (for example, 8 bits) or less may be obtained by subtracting a value of 2 to the nth power from addition data or subtraction data.

整数変換部１１１１は、加算データ及び減算データの小数点以下を切り捨て、切り上げまたは四捨五入等することにより、加算データ及び減算データを整数値とする。なお、変換部１１１へ加算データ及び減算データが入力された場合、上下限変換部１１１０による変換処理を経てから整数変換部１１１１による処理を行っても良い。逆に、整数変換部１１１１により加算データ及び減算データを整数化してから、上下限変換部１１１０による変換処理を行っても良い。本実施の形態においては、先に上下限変換部１１１０による変換を行ってから、整数変換部１１１１により、整数値へ変換する処理を例に挙げて説明する。なお加算データ及び減算データが零の場合、並びに、加算データ及び減算データの絶対値が３以上２５５以下の場合、上下限変換部１１１０は変換処理を実行しない。この場合、メタデータ算出部１４から出力された加算データ及び減算データは変換部１１１の整数変換部１１１１にて整数への変換のみが行われる。 The integer conversion unit 1111 rounds off the decimal point of the addition data and the subtraction data, rounds it up, rounds it off, or the like, thereby converting the addition data and the subtraction data into integer values. In addition, when addition data and subtraction data are input to the conversion unit 111, the processing by the integer conversion unit 1111 may be performed after the conversion processing by the upper and lower limit conversion unit 1110. Conversely, the conversion processing by the upper / lower limit conversion unit 1110 may be performed after the addition data and the subtraction data are converted into integers by the integer conversion unit 1111. In the present embodiment, an example will be described in which the conversion by the upper / lower limit conversion unit 1110 is first performed and then the integer conversion unit 1111 performs conversion to an integer value. When the addition data and the subtraction data are zero, and when the absolute values of the addition data and the subtraction data are 3 or more and 255 or less, the upper / lower limit conversion unit 1110 does not perform the conversion process. In this case, the addition data and the subtraction data output from the metadata calculation unit 14 are only converted into integers by the integer conversion unit 1111 of the conversion unit 111.

図１０は各符号化及び復号処理を経た加算データの時間的変化を示すグラフである。図１０における横軸はフレーム、縦軸は加算データの値を示す。実線ａｂｓは符号化及び復号処理を経ていない加算データからＡＡＣによる符号化及び復号処理を経た加算データを減じた値の時間的変化を示す。点線ａｂｓは符号化及び復号処理を経ていない加算データからdolby-Eによる符号化及び復号処理を経た加算データを減じた絶対値の時間的変化を示す。実験に用いた音声データは図６乃至図９の説明に用いた音声データと同一である。 FIG. 10 is a graph showing temporal changes in the added data that has undergone each encoding and decoding process. In FIG. 10, the horizontal axis indicates the frame, and the vertical axis indicates the value of the added data. A solid line abs indicates a temporal change in a value obtained by subtracting the addition data that has undergone the encoding and decoding processing by AAC from the addition data that has not undergone the encoding and decoding processing. A dotted line abs shows a temporal change in absolute value obtained by subtracting the addition data that has undergone encoding and decoding processing by dolby-E from the addition data that has not undergone encoding and decoding processing. The voice data used in the experiment is the same as the voice data used in the description of FIGS.

図１１は各符号化及び復号処理を経た減算データの時間的変化を示すグラフである。図１１における横軸はフレーム、縦軸は減算データの値を示す。実線ａｂｓは符号化及び復号処理を経ていない減算データからＡＡＣによる符号化及び復号処理を経た減算データを減じた値の時間的変化を示す。点線ａｂｓは符号化及び復号処理を経ていない減算データからdolby-Eによる符号化及び復号処理を経た減算データを減じた絶対値の時間的変化を示す。図１０及び図１１に示す如く、各種符号化及び復号処理を経た加算データ及び減算データは伝送障害ノイズを有さない音声データであっても、符号化及び復号処理を経ていない加算データ及び減算データに対し、約３以下の差分を有する。 FIG. 11 is a graph showing temporal changes in subtraction data that has undergone each encoding and decoding process. In FIG. 11, the horizontal axis indicates the frame, and the vertical axis indicates the value of the subtraction data. A solid line abs indicates a temporal change in a value obtained by subtracting subtracted data that has undergone encoding and decoding processing by AAC from subtracted data that has not undergone encoding and decoding processing. A dotted line abs shows a temporal change in absolute value obtained by subtracting subtracted data that has undergone encoding and decoding processing by dolby-E from subtracted data that has not undergone encoding and decoding processing. As shown in FIGS. 10 and 11, addition data and subtraction data that have undergone various encoding and decoding processes are addition data and subtraction data that have not undergone encoding and decoding processes even if they are audio data that does not have transmission disturbance noise. On the other hand, it has a difference of about 3 or less.

この差分は各種符号化及び復号処理に起因するものであり、伝送障害ノイズと判断する虞がある。そのため、本実施の形態においては上下限変換部１１１０のメモリに下限値３を記憶しておき、３よりも小さい加算データ及び減算データの絶対値を０に変換することで、各符号化及び復号処理を経た音声データの量子化ノイズ等が障害ノイズと誤判断されることを低減するものである。一方、加算データ及び減算データの絶対値は図６乃至図９で示した如く、２５５を超える値は稀であった。そこで、本実施の形態においては上下限変換部１１１０のメモリに上限値２５５を記憶しておき、２５５を超える加算データ及び減算データの絶対値を上限値２５５または２５５未満の値に変換することで、伝送の際の情報量の低減を図ることとしたものである。 This difference is caused by various encoding and decoding processes, and may be determined as transmission failure noise. Therefore, in the present embodiment, the lower limit value 3 is stored in the memory of the upper / lower limit conversion unit 1110, and the absolute values of the addition data and the subtraction data smaller than 3 are converted to 0, whereby each encoding and decoding is performed. It is intended to reduce erroneous determination of quantization noise or the like of processed audio data as failure noise. On the other hand, the absolute values of the addition data and the subtraction data rarely exceed 255 as shown in FIGS. Therefore, in the present embodiment, the upper limit value 255 is stored in the memory of the upper / lower limit conversion unit 1110, and the absolute value of the addition data and subtraction data exceeding 255 is converted into a value less than the upper limit value 255 or 255. Therefore, the amount of information at the time of transmission is reduced.

また、整数変換部１１１１では加算データ及び減算データの絶対値を整数に変換することで情報量の低減を図ることとしたものである。その結果、変換部１１１を経ることで、加算データ及び減算データは０または３から２５５の整数値となり、１または２の数値は存在しないことから、下位の１ビットを無視することとし、０から１２７の７ビット値と符号１ビット値の合計８ビット値に変換され、情報量の低減を図ることが可能になり、かつ、符号化及び復号処理の種類に起因する量子化ノイズ等の誤検出を低減することが可能となる。変換部１１１により変換された後の加算データ及び減算データはメタデータ付加部１５へ出力される。 Further, the integer conversion unit 1111 reduces the amount of information by converting the absolute values of the addition data and the subtraction data into integers. As a result, after passing through the conversion unit 111, the addition data and the subtraction data become 0 or an integer value of 3 to 255, and the numerical value of 1 or 2 does not exist. It is converted into a total of 8-bit value of 127 7-bit value and 1-bit code value, and it is possible to reduce the amount of information, and erroneous detection of quantization noise or the like due to the type of encoding and decoding processing Can be reduced. The addition data and the subtraction data converted by the conversion unit 111 are output to the metadata adding unit 15.

図１２はメタデータ保持部１３のレコードレイアウトを示す説明図である。メタデータ保持部１３はデマルチプレクサ１１から出力される前段のメタデータ算出器１、１・・にて算出されたメタデータ及び特定データを記憶している。メタデータ保持部１３は、局ＩＤフィールド、機器ＩＤフィード、及びメタデータフィールドを含んで構成される。局ＩＤはキー局及び中継局に予め割り当てられる固有の識別子である。局ＩＤは例えば数値が小さいほど前段に存在することを意味している。本例では局ＩＤ０１がキー局であり、その後段に局ＩＤ０２の中継局、その後段に局ＩＤ０３の中継局、さらにその後段に局ＩＤ０４の中継局が存在していることを意味する。 FIG. 12 is an explanatory diagram showing a record layout of the metadata holding unit 13. The metadata holding unit 13 stores the metadata and specific data calculated by the preceding metadata calculators 1, 1... Output from the demultiplexer 11. The metadata holding unit 13 includes a station ID field, a device ID feed, and a metadata field. The station ID is a unique identifier assigned in advance to the key station and the relay station. The station ID, for example, means that the smaller the numerical value, the earlier the station ID. In this example, station ID01 is a key station, meaning that there is a relay station with station ID02 in the subsequent stage, a relay station with station ID03 in the subsequent stage, and a relay station with station ID04 in the subsequent stage.

本例における中継局の局ＩＤはさらにその後段の０５であるものとする。機器ＩＤはキー局及び中継局にそれぞれ設置されるメタデータ算出器１を特定するための予め割り当てられた固有の識別子である。この機器ＩＤは例えばＭＡＣ（Media Access Control）アドレス等を用いればよい。なお、本実施の形態においては局ＩＤ及び機器ＩＤの２種類を設ける形態につき説明するが、いずれか一つを用いても良い。機器ＩＤに関しても、どの機器ＩＤが前段のメタデータ算出器１に係る機器ＩＤであるかの情報が図示しないメモリに記憶されている。特定のメタデータ算出器１にて算出されたメタデータは当該メタデータ算出器１及びメタデータを特定するための局ＩＤ及び機器ＩＤに対応づけられる。以下では、算出したメタデータを特定するための局ＩＤ、及び、機器ＩＤを特定データという。 The station ID of the relay station in this example is assumed to be 05 in the subsequent stage. The device ID is a unique identifier assigned in advance for specifying the metadata calculator 1 installed in each of the key station and the relay station. For example, a MAC (Media Access Control) address may be used as the device ID. In this embodiment, two types of station ID and device ID are described. However, any one of them may be used. Regarding the device ID, information indicating which device ID is the device ID related to the metadata calculator 1 in the previous stage is stored in a memory (not shown). The metadata calculated by the specific metadata calculator 1 is associated with the metadata calculator 1 and the station ID and device ID for specifying the metadata. Hereinafter, the station ID for identifying the calculated metadata and the device ID are referred to as specific data.

メタデータフィールドには前段のメタデータ算出器１にて算出されたメタデータが記憶されている。メタデータ保持部１３は前段のメタデータ算出器１、１、・・・にて算出したメタデータ及びメタデータを特定するための特定データを履歴として記憶している。メタデータ保持部１３は、メタデータ及び特定データを識別情報付加部としてのメタデータ付加部１５へ出力する。なお、図２に示すメタデータ算出器１がキー局に存在する場合は、その前段が存在しないので、メタデータ保持部１３には何もデータが記憶されない。 In the metadata field, metadata calculated by the preceding metadata calculator 1 is stored. The metadata holding unit 13 stores the metadata calculated by the metadata calculators 1, 1,... In the previous stage and specific data for specifying the metadata as a history. The metadata holding unit 13 outputs the metadata and the specific data to the metadata adding unit 15 as an identification information adding unit. When the metadata calculator 1 shown in FIG. 2 exists in the key station, no data is stored in the metadata holding unit 13 because there is no preceding stage.

識別情報付加部として機能するメタデータ付加部１５はメタデータ算出部１４から出力されるメタデータに、局ＩＤ（本例では０５）、及び、機器ＩＤに係る特定データを付加する。さらにメタデータ付加部１５はこのメタデータ及び特定データを、メタデータ保持部１３から出力される前段のメタデータ及び特定データに付加する処理を行う。 The metadata adding unit 15 functioning as an identification information adding unit adds station ID (05 in this example) and specific data related to the device ID to the metadata output from the metadata calculating unit 14. Further, the metadata adding unit 15 performs processing for adding the metadata and specific data to the previous metadata and specific data output from the metadata holding unit 13.

図１３はメタデータ及び特定データのデータ構造を示す説明図である。図１３に示す如くヘッダに各メタデータ算出器１にて算出されたメタデータ及び特定データが伝送順に結合されている。つまり局ＩＤが小さいものから順に各メタデータが結合されている。キー局のメタデータ算出器１にて算出されたメタデータは、局ＩＤ０１の特定データと共に最前段に記憶されている。また本メタデータ算出器１にて算出されたメタデータは、局ＩＤ０５の特定データと共に最後段に記憶されている。メタデータ付加部１５にて前段の履歴が付加されたメタデータ及び特定データは付加部１７に出力される。 FIG. 13 is an explanatory diagram showing the data structure of metadata and specific data. As shown in FIG. 13, the metadata and specific data calculated by each metadata calculator 1 are combined with the header in the order of transmission. That is, the metadata is combined in order from the smallest station ID. The metadata calculated by the key station metadata calculator 1 is stored in the forefront together with the specific data of the station ID01. The metadata calculated by the metadata calculator 1 is stored in the last stage together with the specific data of the station ID 05. The metadata and specific data to which the previous history is added by the metadata adding unit 15 are output to the adding unit 17.

付加部１７はデマルチプレクサ１１から出力された音声データにメタデータ付加部１５から出力されたメタデータ及び特定データを付加し、送信部１８へ出力する。送信部１８は、後段の中継局に設けられるメタデータ算出器１へ映像データと共に、図示しないエンコーダによりエンコードされた音声データ、並びに、これに付加されたメタデータ及び特定データを送信する。これにより、各メタデータ算出器１にて算出されたメタデータ及び特定データが音声データに次々に付加されていくことになる。 The adding unit 17 adds the metadata and specific data output from the metadata adding unit 15 to the audio data output from the demultiplexer 11 and outputs the audio data to the transmitting unit 18. The transmission unit 18 transmits audio data encoded by an encoder (not shown), metadata added thereto, and specific data to the metadata calculator 1 provided in the subsequent relay station. As a result, the metadata and specific data calculated by each metadata calculator 1 are sequentially added to the audio data.

以上のハードウェア構成においてメタデータ算出処理及び付加処理の手順を、フローチャートを用いて説明する。図１４乃至図１６はメタデータ算出処理及び付加処理の手順を示すフローチャートである。デマルチプレクサ１１は入力された音声データにメタデータ及び特定データが付加されているか否かを判断する（ステップＳ７１）。デマルチプレクサ１１はメタデータ及び特定データが付加されていると判断した場合（ステップＳ７１でＹＥＳ）、音声データからメタデータ及び特定データを抽出する（ステップＳ７２）。 The procedure of the metadata calculation process and the addition process in the above hardware configuration will be described using a flowchart. 14 to 16 are flowcharts showing the procedures of the metadata calculation process and the addition process. The demultiplexer 11 determines whether metadata and specific data are added to the input audio data (step S71). If the demultiplexer 11 determines that metadata and specific data are added (YES in step S71), the demultiplexer 11 extracts the metadata and specific data from the audio data (step S72).

デマルチプレクサ１１はメタデータ及び特定データをメタデータ保持部１３へ出力する（ステップＳ７３）。ステップＳ７１において、音声データにメタデータ及び特定データが付加されていないと判断した場合（ステップＳ７１でＮＯ）、ステップＳ７２及びＳ７３の処理をスキップする。またデマルチプレクサ１１はメタデータ及び特定データが付加されていない音声データを抽出部１２及び付加部１７へ出力する（ステップＳ７４）。抽出部１２は音声データが２ｃｈを超えるチャンネル数であるか否かを判断する（ステップＳ７５）。 The demultiplexer 11 outputs the metadata and specific data to the metadata holding unit 13 (step S73). If it is determined in step S71 that metadata and specific data are not added to the audio data (NO in step S71), the processes in steps S72 and S73 are skipped. Further, the demultiplexer 11 outputs the audio data to which the metadata and the specific data are not added to the extraction unit 12 and the addition unit 17 (step S74). The extraction unit 12 determines whether or not the audio data has more than 2 channels (step S75).

抽出部１２は音声データが２ｃｈを超えるチャンネル数であると判断した場合（ステップＳ７５でＹＥＳ）、変換部１２１は式（１）を読み出し、数値を代入することで２ｃｈの音声データに変換し、出力部１２２を介して２ｃｈの音声データを出力する（ステップＳ７６）。抽出部１２はステップＳ７６の処理の後、ステップＳ７７へ移行する。またステップＳ７５において音声データが２ｃｈを超えるチャンネル数でないと判断した場合（ステップＳ７５でＮＯ）、すなわち、２ｃｈの信号であると判断した場合、抽出部１２はステップＳ７６の処理をスキップし、左音声データ及び右音声データを抽出する（ステップＳ７７）。 When the extraction unit 12 determines that the audio data has more than 2ch channels (YES in step S75), the conversion unit 121 reads the equation (1) and converts it into 2ch audio data by substituting numerical values. The 2ch audio data is output via the output unit 122 (step S76). After the process of step S76, the extraction unit 12 proceeds to step S77. If it is determined in step S75 that the audio data is not the number of channels exceeding 2ch (NO in step S75), that is, if it is determined that the signal is a 2ch signal, the extraction unit 12 skips the process of step S76 and the left audio Data and right audio data are extracted (step S77).

抽出部１２は左音声データ及び右音声データをＨＰＦ１１０へ出力する（ステップＳ７８）。ＨＰＦ１１０はメモリに記憶した遮断周波数を読み出す。ＨＰＦ１１０は読み出した遮断周波数を超える周波数成分を抽出する（ステップＳ７９）。具体的にはＨＰＦ１１０は図４に示すフィルタに、符号化及び復号処理の如何にかかわらず加算データ及び減算データの量子化ノイズ等の要因を低減させるべく、左音声データ及び右音声データをそれぞれ入力し、出力を得る。ＨＰＦ１１０は抽出後の左音声データ及び右音声データをメタデータ算出部１４へ出力する（ステップＳ７１０）。加算部１４１は式（５）を読み出し、左音声データ及び右音声データを式（５）へ代入することにより、各フレームの加算データを算出する（ステップＳ７１１）。減算部１４２は式（６）を読み出し、左音声データ及び右音声データを式（６）へ代入することにより、各フレームの減算データを算出する（ステップＳ７１２）。 The extraction unit 12 outputs the left audio data and the right audio data to the HPF 110 (step S78). The HPF 110 reads the cutoff frequency stored in the memory. The HPF 110 extracts a frequency component exceeding the read cutoff frequency (step S79). Specifically, the HPF 110 inputs left audio data and right audio data to the filter shown in FIG. 4 in order to reduce factors such as quantization noise of addition data and subtraction data regardless of encoding and decoding processing, respectively. And get the output. The HPF 110 outputs the extracted left audio data and right audio data to the metadata calculation unit 14 (step S710). The adder 141 reads the equation (5) and substitutes the left audio data and the right audio data into the equation (5) to calculate the addition data of each frame (step S711). The subtracting unit 142 reads the equation (6) and substitutes the left audio data and the right audio data into the equation (6) to calculate the subtraction data for each frame (step S712).

メタデータ算出部１４は各フレームの加算データ及び減算データをメタデータとして変換部１１１へ出力する（ステップＳ７１３）。変換部１１１の上下限変換部１１１０はメモリから下限値を読み出す。上下限変換部１１１０は読み出した下限値よりも小さい加算データ及び減算データの絶対値を零に変換する（ステップＳ７１４）。変換部１１１の上下限変換部１１１０はメモリから上限値を読み出す。上下限変換部１１１０は読み出した上限値を超える加算データ及び減算データの絶対値を上限値または上限値未満の値に変換する（ステップＳ７１５）。上下限変換部１１１０は、ステップＳ７１５にて変換する前の加算データ及び減算データに予め付与されていた絶対値算出前の符号を、ステップＳ７１５にて変換された加算データ及び減算データ（上限値または上限値未満の値）に付加する（ステップＳ７１６）。変換部１１１の整数変換部１１１１は正負の値を持つ加算データ及び減算データを整数に変換する（ステップＳ８１）。 The metadata calculation unit 14 outputs the addition data and subtraction data of each frame as metadata to the conversion unit 111 (step S713). The upper and lower limit conversion unit 1110 of the conversion unit 111 reads the lower limit value from the memory. The upper / lower limit conversion unit 1110 converts the absolute values of the addition data and subtraction data smaller than the read lower limit value to zero (step S714). The upper and lower limit conversion unit 1110 of the conversion unit 111 reads the upper limit value from the memory. The upper / lower limit conversion unit 1110 converts the absolute value of the addition data and subtraction data exceeding the read upper limit value into a value lower than the upper limit value or the upper limit value (step S715). The upper / lower limit conversion unit 1110 converts the addition data and subtraction data (upper limit value or subtraction data) converted in step S715 into the addition data and subtraction data before conversion in step S715, which are given in advance. (Value less than the upper limit value) (step S716). The integer conversion unit 1111 of the conversion unit 111 converts addition data and subtraction data having positive and negative values into integers (step S81).

変換部１１１は変換後の各フレームの加算データ及び減算データをメタデータとして、メタデータ付加部１５へ出力する（ステップＳ８２）。メタデータ付加部１５は図示しないメモリに記憶された当該メタデータ算出器１に係る局ＩＤ及び機器ＩＤを読み出し、メタデータ算出部１４から出力されたメタデータに付加する（ステップＳ８３）。メタデータ付加部１５はステップＳ８３で特定データが付加されたメタデータに、メタデータ保持部１３から出力された前段のメタデータ算出器１に係るメタデータ及び特定データを付加する（ステップＳ８４）。メタデータ付加部１５は前段にある局ＩＤまたは機器ＩＤが上位となるよう、例えば、局ＩＤまたは機器ＩＤの数値が小さい順に各メタデータ及び特定データをソートし、図６に示すメタデータ及び特定データ群を生成する。 The conversion unit 111 outputs the addition data and subtraction data of each frame after conversion to the metadata addition unit 15 as metadata (step S82). The metadata adding unit 15 reads the station ID and device ID related to the metadata calculator 1 stored in a memory (not shown), and adds it to the metadata output from the metadata calculating unit 14 (step S83). The metadata adding unit 15 adds the metadata and the specific data related to the previous metadata calculator 1 output from the metadata holding unit 13 to the metadata to which the specific data is added in step S83 (step S84). For example, the metadata adding unit 15 sorts each metadata and specific data in ascending order of the numerical value of the station ID or device ID so that the station ID or device ID in the preceding stage is higher, and the metadata and specific data shown in FIG. Generate a group of data.

メタデータ付加部１５はメタデータ及び特定データを付加部１７へ出力する（ステップＳ８５）。付加部１７はデマルチプレクサ１１から出力された音声データに、メタデータ付加部１５から出力されたメタデータ及び特定データを付加する（ステップＳ８６）。付加部１７は映像データと共にエンコードされた音声データ、メタデータ及び特定データを送信部１８へ出力する（ステップＳ８７）。送信部１８は映像データ、音声データ、メタデータ及び特定データを後段のメタデータ算出器１へ送信する（ステップＳ８８）。 The metadata adding unit 15 outputs the metadata and the specific data to the adding unit 17 (step S85). The adding unit 17 adds the metadata and specific data output from the metadata adding unit 15 to the audio data output from the demultiplexer 11 (step S86). The adding unit 17 outputs the audio data, metadata, and specific data encoded together with the video data to the transmitting unit 18 (step S87). The transmission unit 18 transmits video data, audio data, metadata, and specific data to the metadata calculator 1 at the subsequent stage (step S88).

実施の形態２
実施の形態２はメタデータ算出部１４が実効値（ＲＭＳＶ：Root Mean Square Value）を算出する形態に関する。図１７は実施の形態２に係るメタデータ算出器１のハードウェア構成を示すブロック図である。実施の形態１に対し、メタデータ算出部１４及び変換部１１１の構成が相違する。メタデータ算出部１４は第１実効値算出部である左実効値算出部１４３及び第２実効値算出部である右実効値算出部１４４を含む。左実効値算出部１４３はＨＰＦ１１０から出力された左音声データの実効値を式（７）に基づき算出し、変換部１１１へ出力する。左音声データの１フレーム目の実効値（ＬＡＲＩ）（１）は、式（７）で表すことができる。なお以下では実効値を場合によりＡＲＩ（Audio root-mean-square information）と称する。 Embodiment 2
The second embodiment relates to a mode in which the metadata calculation unit 14 calculates an effective value (RMSV: Root Mean Square Value). FIG. 17 is a block diagram showing a hardware configuration of the metadata calculator 1 according to the second embodiment. The configuration of the metadata calculation unit 14 and the conversion unit 111 is different from that of the first embodiment. The metadata calculation unit 14 includes a left effective value calculation unit 143 that is a first effective value calculation unit and a right effective value calculation unit 144 that is a second effective value calculation unit. The left effective value calculation unit 143 calculates the effective value of the left audio data output from the HPF 110 based on the equation (7), and outputs it to the conversion unit 111. The effective value (LARI) (1) of the first frame of the left audio data can be expressed by Expression (7). In the following, the effective value is sometimes referred to as ARI (Audio root-mean-square information).

また右音声データの１フレーム目の実効値（ＲＡＲＩ）（１）は、式（８）で表すことができる。 Further, the effective value (RARI) (1) of the first frame of the right audio data can be expressed by Expression (8).

メタデータ算出部１４で算出された各フレームの左音声データ及び右音声データの実効値を変換部１１１へ出力する。変換部１１１は入力された左音声データ及び右音声データの実効値の対数に基づき変換する。変換部１１１は対数変換部１１１２及び整数変換部１１１３を含む。対数変換部１１１２は各フレームの左音声データ及び右音声データの実効値の対数を式（９）及び式（１０）に基づき算出する。
L=ｋ×log₁₀(LARI(i)) ・・・（９）
R=k×log₁₀(RARI(i)) ・・・（１０） The effective values of the left audio data and the right audio data of each frame calculated by the metadata calculation unit 14 are output to the conversion unit 111. The conversion unit 111 performs conversion based on the logarithm of the effective value of the input left audio data and right audio data. The conversion unit 111 includes a logarithmic conversion unit 1112 and an integer conversion unit 1113. The logarithmic conversion unit 1112 calculates the logarithm of the effective value of the left audio data and the right audio data of each frame based on the equations (9) and (10).
L = k × log ₁₀ (LARI (i)) (9)
R = k × log ₁₀ (RARI (i)) (10)

左音声データの対数Ｌは、式（９）に示す如く、各フレームの左音声データの実効値の対数に係数ｋを乗じて得ることができる。また、右音声データの対数Ｒは、式（１０）に示す如く、各フレームの右音声データの実効値の対数に係数ｋを乗じて得ることができる。なおこの係数ｋは例えば５０とすれば良い。整数変換部１１１３は式（９）及び式（１０）で得られた右音声データの対数及び左音声データの対数の小数点以下を切り捨て、切り上げ、または四捨五入等することにより整数化する。変換部１１１により変換された後の左音声データ及び右音声データの対数に係る実効値はメタデータ付加部１５へ出力される。 The logarithm L of the left audio data can be obtained by multiplying the logarithm of the effective value of the left audio data of each frame by a coefficient k, as shown in Equation (9). Also, the logarithm R of the right audio data can be obtained by multiplying the logarithm of the effective value of the right audio data of each frame by the coefficient k as shown in the equation (10). The coefficient k may be 50, for example. The integer conversion unit 1113 converts the logarithm of the right audio data and the logarithm of the left audio data obtained by the equations (9) and (10) to an integer by rounding down, rounding up, or rounding. The effective value related to the logarithm of the left audio data and the right audio data after being converted by the conversion unit 111 is output to the metadata adding unit 15.

図１８は音声データの実効値の時間的変化を示すグラフである。図１８の横軸はフレームであり、縦軸は左音声データの実効値を示す。音声データは女性アナウンスを数秒間録音したものを用いた。実効値は縦軸に示す如く０から約８０００までの値をとるため伝送の際、情報量が大きくなる。そのため本実施の形態においては対数をとり、さらに整数化することで情報量の低減を図ることが可能となる。 FIG. 18 is a graph showing temporal changes in the effective value of audio data. In FIG. 18, the horizontal axis represents a frame, and the vertical axis represents the effective value of left audio data. The audio data was a female announcement recorded for a few seconds. Since the effective value takes a value from 0 to about 8000 as shown on the vertical axis, the amount of information increases during transmission. Therefore, in this embodiment, it is possible to reduce the amount of information by taking a logarithm and further converting it to an integer.

図１９は音声データの実効値に係る対数の時間的変化を示すグラフである。図１９の横軸はフレームであり、縦軸は左音声データの実効値の係数を示す。本実施の形態においては式（７）における係数ｋを５０とした。その結果、実効値の対数は０から２５５の範囲の整数値内に属し、実施の形態１と同じくメタデータを８ビットの情報量に収めることが可能となる。このように、対数変換、係数乗算及び整数化の一連の処理を経ることで、実施の形態１と同じビット数に情報量を設定することが可能となる。 FIG. 19 is a graph showing the logarithmic change in the effective value of the audio data. In FIG. 19, the horizontal axis represents the frame, and the vertical axis represents the coefficient of the effective value of the left audio data. In the present embodiment, the coefficient k in equation (7) is 50. As a result, the logarithm of the effective value belongs to an integer value in the range of 0 to 255, and the metadata can be stored in the 8-bit information amount as in the first embodiment. As described above, the information amount can be set to the same number of bits as in the first embodiment by performing a series of processes of logarithmic conversion, coefficient multiplication, and integer conversion.

なお、係数ｋは変換部１１１が動的に変化させるようにしても良い。対数変換部１１１２で得た左音声データの対数の最大値または右音声データの対数の最大値のｋ倍が、予め定めた上限値、例えば２５５、に最も近づくよう係数ｋを算出する。具体的には、上限値を、対数変換部１１１２から出力される左音声データの対数の最大値または右音声データの対数の最大値で除した値を係数ｋとすれば良い。 The coefficient k may be changed dynamically by the conversion unit 111. The coefficient k is calculated so that the maximum logarithm value of the left audio data or the maximum logarithm value of the right audio data obtained by the logarithmic conversion unit 1112 is closest to a predetermined upper limit value, for example, 255. Specifically, a value obtained by dividing the upper limit value by the maximum logarithm value of the left audio data or the logarithm value of the right audio data output from the logarithmic conversion unit 1112 may be used as the coefficient k.

図２０はメタデータ算出処理及び変換処理の手順を示すフローチャートである。実施の形態１で述べたステップＳ７１０の処理後以下の処理を実行する。左実効値算出部１４３は各フレームの左音声データに係る実効値を算出する（ステップＳ２０１）。同様に右実効値算出部１４４は右音声データに係る実効値を算出する（ステップＳ２０２）。メタデータ算出部１４は、算出した各フレームの左音声データ及び右音声データに係る実効値を変換部１１１へ出力する（ステップＳ２０３）。変換部１１１の対数変換部１１１２は、左音声データ及び右音声データに係る実効値の対数を算出する（ステップＳ２０４）。 FIG. 20 is a flowchart showing the metadata calculation process and the conversion process. After the process of step S710 described in the first embodiment, the following process is executed. The left effective value calculation unit 143 calculates an effective value related to the left audio data of each frame (step S201). Similarly, the right effective value calculation unit 144 calculates an effective value related to the right audio data (step S202). The metadata calculation unit 14 outputs the effective values related to the calculated left audio data and right audio data of each frame to the conversion unit 111 (step S203). The logarithmic conversion unit 1112 of the conversion unit 111 calculates the logarithm of the effective value related to the left audio data and the right audio data (step S204).

対数変換部１１１２はメモリ（図示せず）から予め記憶した係数ｋを読み出す（ステップＳ２０５）。対数変換部１１１２は対数に係数ｋを乗じる（ステップＳ２０６）。整数変換部１１１３は乗算後の左音声データ及び右音声データの実効値に係る対数を整数化する（ステップＳ２０７）。最後に変換部１１１は、整数化した各フレームの左音声データ及び右音声データの実効値に係る対数を、メタデータとしてメタデータ付加部１５へ出力する（ステップＳ２０８）。なお、以降の処理はステップＳ８３以降と同様であるので詳細な説明は省略する。 The logarithmic conversion unit 1112 reads a coefficient k stored in advance from a memory (not shown) (step S205). The logarithmic conversion unit 1112 multiplies the logarithm by a coefficient k (step S206). The integer conversion unit 1113 converts the logarithm related to the effective value of the left audio data and the right audio data after multiplication into an integer (step S207). Finally, the conversion unit 111 outputs the logarithms relating to the effective values of the left audio data and the right audio data of each frame converted to integers as metadata to the metadata adding unit 15 (step S208). Since the subsequent processing is the same as that after step S83, detailed description is omitted.

本実施の形態２は以上の如き構成としてあり、その他の構成及び作用は実施の形態１と同様であるので、対応する部分には同一の参照番号を付してその詳細な説明を省略する。 The second embodiment is configured as described above, and other configurations and operations are the same as those of the first embodiment. Therefore, the corresponding parts are denoted by the same reference numerals, and detailed description thereof is omitted.

実施の形態３
実施の形態３は複数の形式に係るメタデータを送信する形態に関する。図２１は実施の形態３に係るメタデータ算出器１のハードウェア構成を示すブロック図である。ＨＰＦ１１０の出力には選択部１１２の制御に従いオンまたはオフするスイッチＳ１及びスイッチＳ２が設けられている。選択部１１２は例えばメタデータ算出器１の筐体（図示せず）の外面に設けられるスイッチ等である。選択部１１２は加算データ及び減算データをメタデータとして用いるか、或いは、実効値をメタデータとして用いるかのいずれかを択一的に選択する事ができる。選択部１１２により加算データ及び減算データをメタデータとする選択（以下第１選択という）がなされた場合、選択部１１２はスイッチＳ１をオン、スイッチＳ２をオフとなるよう制御する。 Embodiment 3
Embodiment 3 relates to a mode of transmitting metadata according to a plurality of formats. FIG. 21 is a block diagram illustrating a hardware configuration of the metadata calculator 1 according to the third embodiment. The output of the HPF 110 is provided with a switch S1 and a switch S2 that are turned on or off according to the control of the selection unit 112. The selection unit 112 is, for example, a switch or the like provided on the outer surface of the casing (not shown) of the metadata calculator 1. The selection unit 112 can alternatively select either using the addition data and the subtraction data as metadata or using the effective value as metadata. When the selection unit 112 selects the addition data and the subtraction data as metadata (hereinafter referred to as a first selection), the selection unit 112 controls the switch S1 to be turned on and the switch S2 to be turned off.

この場合ＨＰＦ１１０から出力される左音声データ及び右音声データはスイッチＳ１を経由して加算部１４１及び減算部１４２を備えるメタデータ算出部１４へ出力される。以下では、加算部１４１及び減算部１４２を備えるメタデータ算出部１４側を第１選択側Ｓ１０という。一方、左実効値算出部１４３及び右実効値算出部１４４を備えるメタデータ算出部１４へは出力されない。以下では、左実効値算出部１４３及び右実効値算出部１４４を備えるメタデータ算出部１４側を第２選択側Ｓ２０という。 In this case, the left audio data and the right audio data output from the HPF 110 are output to the metadata calculation unit 14 including the addition unit 141 and the subtraction unit 142 via the switch S1. Hereinafter, the metadata calculation unit 14 side including the addition unit 141 and the subtraction unit 142 is referred to as a first selection side S10. On the other hand, it is not output to the metadata calculation unit 14 including the left effective value calculation unit 143 and the right effective value calculation unit 144. Hereinafter, the metadata calculation unit 14 side including the left effective value calculation unit 143 and the right effective value calculation unit 144 is referred to as a second selection side S20.

第１選択側Ｓ１０における変換部１１１により変換処理がなされた加算データ及び減算データはメタデータ付加部１５へ出力される。選択部１１２はメタデータの種類を示す情報をメタデータ付加部１５へ出力する。具体的には「第１」の情報を送信する。選択部１１２により実効値をメタデータとする選択（以下第２選択という）がなされた場合、選択部１１２はスイッチＳ１をオフ、スイッチＳ２をオンとなるよう制御する。 The addition data and the subtraction data converted by the conversion unit 111 in the first selection side S10 are output to the metadata adding unit 15. The selection unit 112 outputs information indicating the type of metadata to the metadata adding unit 15. Specifically, the “first” information is transmitted. When the selection unit 112 selects the effective value as metadata (hereinafter referred to as a second selection), the selection unit 112 controls the switch S1 to be turned off and the switch S2 to be turned on.

この場合ＨＰＦ１１０から出力される左音声データ及び右音声データはスイッチＳ２を経由して第２選択側Ｓ２０のメタデータ算出部１４へ出力される。第２選択側Ｓ２０における変換部１１１により変換処理がなされた左音声データ及び右音声データに係る実効値の対数はメタデータ付加部１５へ出力される。選択部１１２はメタデータの種類を示す情報をメタデータ付加部１５へ出力する。具体的には「第２」の情報を送信する。 In this case, the left audio data and the right audio data output from the HPF 110 are output to the metadata calculation unit 14 of the second selection side S20 via the switch S2. The logarithm of the effective value relating to the left audio data and the right audio data subjected to the conversion process by the conversion unit 111 in the second selection side S20 is output to the metadata adding unit 15. The selection unit 112 outputs information indicating the type of metadata to the metadata adding unit 15. Specifically, “second” information is transmitted.

メタデータ付加部１５には識別子付加部１５１が設けられている。識別子付加部１５１は特定データとして付加されるメタデータの種類を新たに付加する。図２２は実施の形態３に係るメタデータ及び特定データのデータ構造を示す説明図である。図２２（ａ）は第１選択がなされた場合のメタデータ及び特定データのデータ構造を示す説明図である。特定データとして付加される機器ＩＤの後段にメタデータの種類を示す情報が新たに付加される。検証の際にはこの特定データ内に記述されたメタデータの種類を参照し、メタデータの分析を行う。図２２（ａ）の例では、メタデータの種類として第１選択を示す「第１」が付加されている。 The metadata adding unit 15 is provided with an identifier adding unit 151. The identifier adding unit 151 newly adds a type of metadata added as specific data. FIG. 22 is an explanatory diagram showing a data structure of metadata and specific data according to the third embodiment. FIG. 22A is an explanatory diagram showing the data structure of the metadata and specific data when the first selection is made. Information indicating the type of metadata is newly added after the device ID added as the specific data. At the time of verification, the metadata type is analyzed by referring to the type of metadata described in the specific data. In the example of FIG. 22A, “first” indicating the first selection is added as the type of metadata.

図２２（ｂ）は第２選択がなされた場合のメタデータ及び特定データのデータ構造を示す説明図である。図２２（ｂ）の例では、メタデータの種類として第２選択を示す「第２」が付加されている。図２２（ｂ）に示すように、局ＩＤが「０１」のメタデータ算出器１ではメタデータの種類「第１」が付加されており、局ＩＤが「０５」のメタデータ算出器１ではメタデータの種類「第２」が付加されていることが理解できる。加算データ及び減算データに係るメタデータを用いたノイズ分析、または、実効値に係るメタデータを用いたノイズ分析の精度は通信環境、機器特性及び音声の種類等により、優劣が存在するため、いずれかを択一的に選択できる構成としたものである。メタデータ付加部１５はメタデータ及び特定データを付加部１７へ出力する。 FIG. 22B is an explanatory diagram showing the data structure of metadata and specific data when the second selection is made. In the example of FIG. 22B, “second” indicating the second selection is added as the type of metadata. As shown in FIG. 22B, in the metadata calculator 1 with the station ID “01”, the metadata type “first” is added, and with the metadata calculator 1 with the station ID “05”. It can be understood that the metadata type “second” is added. The accuracy of noise analysis using metadata related to addition data and subtraction data, or noise analysis using metadata related to effective values, depends on the communication environment, device characteristics, voice type, etc. It is set as the structure which can select these alternatively. The metadata adding unit 15 outputs the metadata and specific data to the adding unit 17.

図２３及び図２４は実施の形態３に係るメタデータ付加処理の手順を示すフローチャートである。予め選択部１１２は第１選択または第２選択を受け付ける（ステップＳ２３１）。選択部１１２は第１選択を受け付けたか否かを判断する（ステップＳ２３２）。選択部１１２は第１選択を受け付けたと判断した場合（ステップＳ２３２でＹＥＳ）、スイッチＳ１をオン、スイッチＳ２をオフとする（ステップＳ２３３）。続いて実施の形態１のステップＳ７１乃至ステップＳ７９以降以下の処理を実行する。 23 and 24 are flowcharts showing the procedure of the metadata adding process according to the third embodiment. The selection unit 112 accepts the first selection or the second selection in advance (step S231). The selection unit 112 determines whether or not the first selection has been accepted (step S232). If the selection unit 112 determines that the first selection has been received (YES in step S232), the switch S1 is turned on and the switch S2 is turned off (step S233). Subsequently, the following processing from step S71 to step S79 of the first embodiment is executed.

ＨＰＦ１１０はスイッチＳ１を介して、左音声データ及び右音声データを第１選択側Ｓ１０のメタデータ算出部１４へ出力する（ステップＳ２３４）。変換部１１１は変換後の加算データ及び減算データをメタデータとしてメタデータ付加部１５へ出力する（ステップＳ２３５）。なお、ステップＳ２３４及びＳ２３５の詳細は実施の形態１で述べたとおりであるので説明を省略する。選択部１１２はメタデータの種類を示す情報をメタデータ付加部１５へ出力する（ステップＳ２３６）。 The HPF 110 outputs the left audio data and the right audio data to the metadata calculation unit 14 of the first selection side S10 via the switch S1 (step S234). The conversion unit 111 outputs the converted addition data and subtraction data as metadata to the metadata addition unit 15 (step S235). Note that the details of steps S234 and S235 are as described in the first embodiment, and a description thereof will be omitted. The selection unit 112 outputs information indicating the type of metadata to the metadata adding unit 15 (step S236).

選択部１１２は第１選択を受け付けていないと判断した場合（ステップＳ２３２でＮＯ）、スイッチＳ１をオフ、スイッチＳ２をオンとする（ステップＳ２３７）。続いて実施の形態１のステップＳ７１乃至ステップＳ７９以降以下の処理を実行する。ＨＰＦ１１０はスイッチＳ２を介して、左音声データ及び右音声データを第２選択側Ｓ２０のメタデータ算出部１４へ出力する（ステップＳ２３８）。変換部１１１は変換後の左音声データ及び右音声データの実効値に係る対数をメタデータとしてメタデータ付加部１５へ出力する（ステップＳ２３９）。なお、ステップＳ２３８及びＳ２３９の詳細は実施の形態２で述べたとおりであるので説明を省略する。選択部１１２はメタデータの種類を示す情報をメタデータ付加部１５へ出力する（ステップＳ２４１）。 If the selection unit 112 determines that the first selection has not been received (NO in step S232), the switch S1 is turned off and the switch S2 is turned on (step S237). Subsequently, the following processing from step S71 to step S79 of the first embodiment is executed. The HPF 110 outputs the left audio data and the right audio data to the metadata calculation unit 14 on the second selection side S20 via the switch S2 (step S238). The conversion unit 111 outputs logarithms related to the effective values of the converted left audio data and right audio data to the metadata adding unit 15 as metadata (step S239). Note that the details of steps S238 and S239 are the same as described in the second embodiment, and a description thereof will be omitted. The selection unit 112 outputs information indicating the type of metadata to the metadata adding unit 15 (step S241).

メタデータ付加部１５は図示しないメモリに記憶された当該メタデータ算出器１に係る局ＩＤ及び機器ＩＤを読み出す。識別子付加部１５１は、選択部１１２から出力されたメタデータの種類、及び局ＩＤ及び機器ＩＤを特定データとして、メタデータ算出部１４から出力されたメタデータに付加する（ステップＳ２４２）。メタデータ付加部１５はステップＳ２４２で特定データが付加されたメタデータに、メタデータ保持部１３から出力された前段のメタデータ算出器１に係るメタデータ及び特定データを付加する（ステップＳ２４３）。 The metadata adding unit 15 reads the station ID and device ID related to the metadata calculator 1 stored in a memory (not shown). The identifier adding unit 151 adds the type of metadata output from the selection unit 112, the station ID, and the device ID as specific data to the metadata output from the metadata calculation unit 14 (step S242). The metadata adding unit 15 adds the metadata and specific data related to the previous metadata calculator 1 output from the metadata holding unit 13 to the metadata to which the specific data is added in step S242 (step S243).

メタデータ付加部１５は前段にある局ＩＤまたは機器ＩＤが上位となるよう、例えば、局ＩＤまたは機器ＩＤの数値が小さい順に各メタデータ及び特定データをソートし、図２２に示すメタデータ及び特定データ群を生成する。メタデータ付加部１５はメタデータ及び特定データを付加部１７へ出力する（ステップＳ２４４）。以降の処理は実施の形態１のステップＳ８６以降と同様であるので詳細な説明を省略する。このように、複数のメタデータ算出部１４を設け択一的に選択できるようにしたので、通信環境、符号化及び復号処理の種類、機器特性または音声データの種類等に応じてより好適なメタデータ算出部１４を活用することが可能となる。 For example, the metadata adding unit 15 sorts each metadata and specific data in ascending order of the numerical value of the station ID or device ID so that the station ID or device ID in the preceding stage is higher, and the metadata and specific data shown in FIG. Generate a group of data. The metadata adding unit 15 outputs the metadata and the specific data to the adding unit 17 (step S244). Subsequent processing is the same as that after step S86 of the first embodiment, and thus detailed description thereof is omitted. As described above, since a plurality of metadata calculation units 14 are provided so that they can be selected alternatively, a more suitable metadata can be selected according to the communication environment, types of encoding and decoding processes, device characteristics, types of audio data, and the like. The data calculation unit 14 can be used.

本実施の形態３は以上の如き構成としてあり、その他の構成及び作用は実施の形態１及び２と同様であるので、対応する部分には同一の参照番号を付してその詳細な説明を省略する。 The third embodiment is configured as described above, and the other configurations and operations are the same as those of the first and second embodiments. Therefore, the corresponding parts are denoted by the same reference numerals and detailed description thereof is omitted. To do.

実施の形態４
実施の形態４は複数種類のメタデータをあわせて付加する形態に関する。選択部１１２は、第１選択側Ｓ１０、第２選択側Ｓ２０、または、第１選択側Ｓ１０及び第２選択側Ｓ２０のいずれか３つを選択する事ができる。第１選択側Ｓ１０及び第２選択側Ｓ２０を選択した場合、選択部１１２は、スイッチＳ１及びスイッチＳ２が共にオンとなるよう制御する。この場合ＨＰＦ１１０は左音声データ及び右音声データをスイッチＳ１及びＳ２を介して、第１選択側Ｓ１０及び第２選択側Ｓ２０のメタデータ算出部１４双方に出力する。 Embodiment 4
The fourth embodiment relates to a mode in which a plurality of types of metadata are added together. The selection unit 112 can select any three of the first selection side S10, the second selection side S20, or the first selection side S10 and the second selection side S20. When the first selection side S10 and the second selection side S20 are selected, the selection unit 112 performs control so that both the switch S1 and the switch S2 are turned on. In this case, the HPF 110 outputs the left audio data and the right audio data to both the metadata calculators 14 of the first selection side S10 and the second selection side S20 via the switches S1 and S2.

選択部１１２は、メタデータの種類を示す情報をメタデータ付加部１５へ出力する。この場合、選択部１１２は例えば「第１及び第２」の情報を送信する。なお、本実施の形態においては、選択部１１２による選択を経て第１選択側Ｓ１０及び第２選択側Ｓ２０の双方に係るメタデータを付加する形態を説明するがこれに限るものではない。選択部１１２を設けることなく、常時ＨＰＦ１１０から左音声データ及び右音声データを第１選択側Ｓ１０のメタデータ算出部１４及び第２選択側Ｓ２０のメタデータ算出部１４へ出力するようにしても良い。 The selection unit 112 outputs information indicating the type of metadata to the metadata adding unit 15. In this case, the selection unit 112 transmits “first and second” information, for example. In the present embodiment, a mode in which metadata relating to both the first selection side S10 and the second selection side S20 is added through selection by the selection unit 112 will be described, but the present invention is not limited to this. Without providing the selection unit 112, the left audio data and the right audio data may always be output from the HPF 110 to the metadata calculation unit 14 on the first selection side S10 and the metadata calculation unit 14 on the second selection side S20. .

メタデータ付加部１５の識別子付加部１５１は、選択部１１２からメタデータの種類を示す情報として「第１及び第２」が出力された場合、第１選択側Ｓ１０のメタデータ及び第２選択側Ｓ２０のメタデータそれぞれにメタデータの種類を示す情報を付加する。図２５は実施の形態４に係るメタデータ及び特定データのデータ構造を示す説明図である。識別子付加部１５１は第１選択側Ｓ１０の変換部１１１から出力されたメタデータに、メタデータの種類を示す情報「第１」を付加する。図２５の例では加算データ及び減算データに係るメタデータの特定データとして「第１」が付加されている。 The identifier adding unit 151 of the metadata adding unit 15, when “first and second” is output as information indicating the type of metadata from the selection unit 112, the metadata on the first selection side S 10 and the second selection side Information indicating the type of metadata is added to each metadata of S20. FIG. 25 is an explanatory diagram showing a data structure of metadata and specific data according to the fourth embodiment. The identifier adding unit 151 adds information “first” indicating the type of metadata to the metadata output from the converting unit 111 of the first selection side S10. In the example of FIG. 25, “first” is added as the specific data of the metadata related to the addition data and the subtraction data.

同様に、識別子付加部１５１は第２選択側Ｓ２０の変換部１１１から出力されたメタデータに、メタデータの種類を示す情報「第２」を付加する。図２５の例では、左音声データ及び右音声データの対数に係るメタデータの特定データとして「第２」が付加されている。メタデータ付加部１５はメモリから局ＩＤ及び機器ＩＤを読み出し、特定データとしてメタデータに付加する。 Similarly, the identifier adding unit 151 adds information “second” indicating the type of metadata to the metadata output from the conversion unit 111 of the second selection side S20. In the example of FIG. 25, “second” is added as the specific data of the metadata relating to the logarithm of the left audio data and the right audio data. The metadata adding unit 15 reads the station ID and the device ID from the memory and adds them to the metadata as specific data.

メタデータ付加部１５は図２５に示す如く、局ＩＤ及び機器ＩＤの後段に、第１選択側Ｓ１０のメタデータの種類及びメタデータ、さらにその後段に、第２選択側Ｓ２０のメタデータの種類及びメタデータを連結する。なお、本実施の形態で述べた特定データ及びメタデータのデータ構造はあくまで一例であり、これに限るものではない。メタデータの種類及び局ＩＤまたは機器ＩＤを特定できる形態であれば、この順序に限るものではない。 As shown in FIG. 25, the metadata adding unit 15 includes the type and metadata of the first selection side S10 after the station ID and the device ID, and further the type of metadata of the second selection side S20. And concatenate metadata. Note that the data structure of the specific data and metadata described in the present embodiment is merely an example, and the present invention is not limited to this. The order is not limited to this as long as the metadata type and the station ID or device ID can be specified.

図２６乃至２８は実施の形態４に係るメタデータ付加処理の手順を示すフローチャートである。選択部１１２は第１選択、第２選択、または、第１選択及び第２選択のいずれかを受け付ける（ステップＳ２６１）。選択部１１２は第１選択を受け付けたか否かを判断する（ステップＳ２６２）。選択部１１２は第１選択を受け付けたと判断した場合（ステップＳ２６２でＹＥＳ）、スイッチＳ１をオン、スイッチＳ２をオフとする（ステップＳ２６３）。これ以降はステップＳ２３４以降の処理を実行し（ステップＳ２６４）、一連の処理を終了する。 26 to 28 are flowcharts showing the procedure of the metadata adding process according to the fourth embodiment. The selection unit 112 accepts either the first selection, the second selection, or the first selection and the second selection (step S261). The selection unit 112 determines whether the first selection has been accepted (step S262). If the selection unit 112 determines that the first selection has been received (YES in step S262), the switch S1 is turned on and the switch S2 is turned off (step S263). Thereafter, the processing after step S234 is executed (step S264), and the series of processing ends.

選択部１１２は第１選択を受け付けていないと判断した場合（ステップＳ２６２でＮＯ）、第１選択及び第２選択の双方を受け付けたか否かを判断する（ステップＳ２６５）。選択部１１２は第１選択及び第２選択の双方を受け付けていないと判断した場合（ステップＳ２６５でＮＯ）、第２選択を受け付けたとしてスイッチＳ１をオフ、スイッチＳ２をオンとする（ステップＳ２６６）。これ以降はステップＳ２３８以降の処理を実行し（ステップＳ２６７）、一連の処理を終了する。選択部１１２は第１選択及び第２選択の双方を受け付けたと判断した場合（ステップＳ２６５でＹＥＳ）、スイッチＳ１をオン、スイッチＳ２をオンとする（ステップＳ２６８）。続いて実施の形態１のステップＳ７１乃至ステップＳ７９以降以下の処理を実行する。 If the selection unit 112 determines that the first selection is not received (NO in step S262), the selection unit 112 determines whether both the first selection and the second selection are received (step S265). If the selection unit 112 determines that both the first selection and the second selection are not accepted (NO in step S265), the switch S1 is turned off and the switch S2 is turned on because the second selection is accepted (step S266). . Thereafter, the processing after step S238 is executed (step S267), and the series of processing ends. If the selection unit 112 determines that both the first selection and the second selection have been received (YES in step S265), the switch S1 is turned on and the switch S2 is turned on (step S268). Subsequently, the following processing from step S71 to step S79 of the first embodiment is executed.

選択部１１２はメタデータの種類を示す情報をメタデータ付加部１５へ出力する（ステップＳ２６９）。ＨＰＦ１１０はスイッチＳ１を介して、左音声データ及び右音声データを第１選択側Ｓ１０のメタデータ算出部１４へ出力する（ステップＳ２７１）。変換部１１１は変換後の加算データ及び減算データをメタデータとしてメタデータ付加部１５へ出力する（ステップＳ２７２）。識別子付加部１５１はステップＳ２６９で出力されたメタデータの種類を参照し、メタデータにメタデータの種類を特定データとして付加する（ステップＳ２７３）。具体的には、識別子付加部１５１は加算データ及び減算データに係るメタデータの前段に特定データとして第１選択側Ｓ１０を示す「第１」を付加する。 The selection unit 112 outputs information indicating the type of metadata to the metadata adding unit 15 (step S269). The HPF 110 outputs the left audio data and the right audio data to the metadata calculation unit 14 of the first selection side S10 via the switch S1 (step S271). The conversion unit 111 outputs the converted addition data and subtraction data as metadata to the metadata addition unit 15 (step S272). The identifier adding unit 151 refers to the metadata type output in step S269, and adds the metadata type as specific data to the metadata (step S273). Specifically, the identifier adding unit 151 adds “first” indicating the first selection side S10 as the specific data before the metadata related to the addition data and the subtraction data.

さらに、ＨＰＦ１１０はスイッチＳ２を介して、左音声データ及び右音声データを第２選択側Ｓ２０のメタデータ算出部１４へ出力する（ステップＳ２７４）。変換部１１１は変換後の左音声データ及び右音声データの実効値に係る対数をメタデータとしてメタデータ付加部１５へ出力する（ステップＳ２７５）。識別子付加部１５１はステップＳ２６９で出力されたメタデータの種類を参照し、メタデータにメタデータの種類を特定データとして付加する（ステップＳ２７６）。具体的には、識別子付加部１５１は左音声データ及び右音声データの対数に係るメタデータの前段に特定データとして第２選択側Ｓ２０を示す「第２」を付加する。 Further, the HPF 110 outputs the left audio data and the right audio data to the metadata calculation unit 14 on the second selection side S20 via the switch S2 (step S274). The conversion unit 111 outputs logarithms related to the effective values of the converted left audio data and right audio data as metadata to the metadata adding unit 15 (step S275). The identifier adding unit 151 refers to the metadata type output in step S269, and adds the metadata type as specific data to the metadata (step S276). Specifically, the identifier adding unit 151 adds “second” indicating the second selection side S20 as specific data to the preceding stage of the metadata relating to the logarithm of the left audio data and the right audio data.

識別子付加部１５１はステップＳ２７３で付加した第１選択側Ｓ１０のメタデータの種類及びメタデータ後段に、ステップＳ２７６で付加した第２選択側Ｓ２０のメタデータの種類及びメタデータを連結する（ステップＳ２７７）。メタデータ付加部１５はメモリから局ＩＤ及び機器ＩＤを読み出す（ステップＳ２７８）。メタデータ付加部１５は読み出した局ＩＤ及び機器ＩＤを特定データとしてステップＳ２７７で連結した特定データ及びメタデータの前段に付加する（ステップＳ２７９）。これにより、図２５で示した局ＩＤ（０５）の特定データ及びメタデータが完成する。 The identifier adding unit 151 connects the metadata type and metadata of the second selection side S20 added in step S276 to the metadata type and metadata of the first selection side S10 added in step S273 (step S277). ). The metadata adding unit 15 reads the station ID and the device ID from the memory (step S278). The metadata adding unit 15 adds the read station ID and device ID as specific data to the preceding stage of the specific data and metadata connected in step S277 (step S279). Thereby, the specific data and metadata of the station ID (05) shown in FIG. 25 are completed.

メタデータ付加部１５はさらに、ステップＳ２７９で特定データが付加されたメタデータに、メタデータ保持部１３から出力された前段のメタデータ算出器１に係るメタデータ及び特定データを付加する（ステップＳ２８１）。メタデータ付加部１５はメタデータ及び特定データを付加部１７へ出力する（ステップＳ２８２）。このように、変換部１１１により情報量を削減したので、複数種類のメタデータをあわせて送信することが可能となる。その結果、通信環境、符号化及び復号処理の種類、機器特性または音声データの種類等にかかわらず、複数のメタデータを利用してより精度良くノイズを検出することが可能となる。 The metadata adding unit 15 further adds the metadata and the specific data related to the metadata calculator 1 in the previous stage output from the metadata holding unit 13 to the metadata to which the specific data is added in step S279 (step S281). ). The metadata adding unit 15 outputs the metadata and the specific data to the adding unit 17 (step S282). As described above, since the amount of information is reduced by the conversion unit 111, a plurality of types of metadata can be transmitted together. As a result, it is possible to detect noise more accurately using a plurality of metadata regardless of the communication environment, the types of encoding and decoding processes, the device characteristics, the types of audio data, and the like.

本実施の形態４は以上の如き構成としてあり、その他の構成及び作用は実施の形態１乃至３と同様であるので、対応する部分には同一の参照番号を付してその詳細な説明を省略する。 The fourth embodiment is configured as described above, and the other configurations and operations are the same as those of the first to third embodiments. Therefore, the corresponding parts are denoted by the same reference numerals and detailed description thereof is omitted. To do.

実施の形態５
実施の形態１乃至４に係る処理を図２９で示したコンピュータを用いてソフトウェア処理として実現するようにしても良い。図２９は実施の形態５に係るメタデータ算出器１のハードウェア構成を示すブロック図である。コンピュータ１０はＣＰＵ（Central Processing Unit）１０１、ＲＡＭ（Random Access Memory）１０２、ハードディスク等の記憶部１０５、インターフェースたるＩ／Ｆ１０６、１０８、及び通信部１０９等を含んで構成される。ＣＰＵ１０１はバス１０７を介して各ハードウェアに接続されており、記憶部１０５に記憶した処理プログラム１０５Ｐに従い、上述した各種ソフトウェア処理を実行する。 Embodiment 5
The processing according to Embodiments 1 to 4 may be realized as software processing using the computer shown in FIG. FIG. 29 is a block diagram showing a hardware configuration of the metadata calculator 1 according to the fifth embodiment. The computer 10 includes a CPU (Central Processing Unit) 101, a RAM (Random Access Memory) 102, a storage unit 105 such as a hard disk, I / Fs 106 and 108 as interfaces, a communication unit 109, and the like. The CPU 101 is connected to each hardware via the bus 107, and executes the various software processes described above according to the processing program 105P stored in the storage unit 105.

コンピュータ１０を動作させるためのプログラムは、ＣＤ−ＲＯＭ、ＭＯ、またはＤＶＤ−ＲＯＭ等の可搬型記録媒体１Ａで提供することも可能である。さらに、当該プログラムを、無線ＬＡＮカード等の通信部１０９を介して図示しないサーバコンピュータからダウンロードすることも可能である。以下に、その内容を説明する。 A program for operating the computer 10 can also be provided by a portable recording medium 1A such as a CD-ROM, MO, or DVD-ROM. Furthermore, the program can be downloaded from a server computer (not shown) via the communication unit 109 such as a wireless LAN card. The contents will be described below.

図２９に示すコンピュータ１０の図示しないリーダ／ライタに、加算データを算出させ、減算データを算出させ、メタデータを付加させる等のプログラムが記録された可搬型記録媒体１Ａ（ＣＤ−ＲＯＭ、ＭＯ又はＤＶＤ−ＲＯＭ等）を、挿入して記憶部１０５の処理プログラム１０５Ｐ内にこのプログラムをインストールする。または、かかるプログラムを、通信部１０９を介して外部の図示しないサーバコンピュータからダウンロードし、記憶部１０５にインストールするようにしても良い。かかるプログラムはＲＡＭ１０２にロードして実行される。これにより、デマルチプレクサ１１からＩ／Ｆ１０６を介して音声データ、メタデータ及び特定データが入力され、実施の形態１乃至４で述べた処理が実行される。処理後のメタデータ及び特定データが付加された音声データは、Ｉ／Ｆ１０８を介して送信部１８へ出力される。 A portable recording medium 1A (CD-ROM, MO, or the like) on which a program for causing a reader / writer (not shown) of the computer 10 shown in FIG. 29 to calculate addition data, calculate subtraction data, and add metadata is recorded. DVD-ROM or the like) is inserted, and this program is installed in the processing program 105P of the storage unit 105. Alternatively, such a program may be downloaded from an external server computer (not shown) via the communication unit 109 and installed in the storage unit 105. Such a program is loaded into the RAM 102 and executed. Thus, audio data, metadata, and specific data are input from the demultiplexer 11 via the I / F 106, and the processes described in the first to fourth embodiments are executed. The audio data to which the processed metadata and specific data are added is output to the transmission unit 18 via the I / F 108.

本実施の形態５は以上の如き構成としてあり、その他の構成及び作用は実施の形態１乃至４と同様であるので、対応する部分には同一の参照番号を付してその詳細な説明を省略する。 The fifth embodiment has the above-described configuration, and the other configurations and operations are the same as those of the first to fourth embodiments. Therefore, the corresponding parts are denoted by the same reference numerals and detailed description thereof is omitted. To do.

以上の実施の形態１乃至５を含む実施形態に関し、さらに以下の付記を開示する。 With respect to the embodiments including the first to fifth embodiments, the following additional notes are disclosed.

（付記１）
音声音響信号を受信し、受信した音声音響信号を送信する送受信装置において、
受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出部と、
該抽出部により抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出するハイパスフィルタと、
該ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する加算部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する減算部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する第１実効値算出部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する第２実効値算出部と、
加算データ及び減算データに基づくメタデータ、または、第１実効値及び第２実効値に基づくメタデータのいずれかを選択する選択部と、
前記選択部により加算データ及び減算データに基づくメタデータが選択された場合に、前記加算部及び減算部により算出した加算データ及び減算データをメタデータとして受信した音声音響信号に付加し、前記選択部により第１実効値及び第２実効値に基づくメタデータが選択された場合に、前記第１実効値算出部及び第２実効値算出部により算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する付加部と、
該付加部によりメタデータが付加された音声音響信号を外部へ送信する送信部と
を備えることを特徴とする送受信装置。 (Appendix 1)
In the transmission / reception device that receives the audio sound signal and transmits the received audio sound signal,
An extraction unit for extracting the first audio sound signal and the second audio sound signal according to the received audio sound signal;
A high-pass filter that extracts a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction unit;
A value related to a time-series sum signal of the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and addition data based on a cumulative addition value for a predetermined time of the calculated value is obtained. An adder to calculate;
A value related to a time-series difference signal between the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and subtraction data based on a cumulative addition value for a predetermined time of the calculated value is calculated. Subtracting part to calculate,
A first effective value calculation unit for calculating a first effective value in a time series of the first audio-acoustic signal related to the frequency component extracted by the high-pass filter;
A second effective value calculation unit for calculating a second effective value in a time series of the second audio-acoustic signal related to the frequency component extracted by the high-pass filter;
A selection unit that selects either metadata based on addition data and subtraction data, or metadata based on the first effective value and the second effective value;
When the selection unit selects metadata based on the addition data and the subtraction data, the addition unit and the subtraction data calculated by the addition unit and the subtraction unit are added to the received audio-acoustic signal as metadata, and the selection unit When metadata based on the first effective value and the second effective value is selected by the above, the first effective value and the second effective value calculated by the first effective value calculation unit and the second effective value calculation unit are metadata. An adding unit for adding to the received sound signal;
A transmission / reception apparatus comprising: a transmission unit that transmits the audio-acoustic signal to which metadata is added by the addition unit to the outside.

（付記２）
音声音響信号を受信し、受信した音声音響信号を送信する送受信装置において、
受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出部と、
該抽出部により抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出するハイパスフィルタと、
該ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する加算部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する減算部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する第１実効値算出部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する第２実効値算出部と、
前記加算部及び減算部により算出した加算データ及び減算データ、並びに、前記第１実効値算出部及び第２実効値算出部により算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する付加部と、
該付加部によりメタデータが付加された音声音響信号を外部へ送信する送信部と
を備えることを特徴とする送受信装置。 (Appendix 2)
In the transmission / reception device that receives the audio sound signal and transmits the received audio sound signal,
An extraction unit for extracting the first audio sound signal and the second audio sound signal according to the received audio sound signal;
A high-pass filter that extracts a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction unit;
A value related to a time-series sum signal of the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and addition data based on a cumulative addition value for a predetermined time of the calculated value is obtained. An adder to calculate;
A value related to a time-series difference signal between the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and subtraction data based on a cumulative addition value for a predetermined time of the calculated value is calculated. Subtracting part to calculate,
A first effective value calculation unit for calculating a first effective value in a time series of the first audio-acoustic signal related to the frequency component extracted by the high-pass filter;
A second effective value calculation unit for calculating a second effective value in a time series of the second audio-acoustic signal related to the frequency component extracted by the high-pass filter;
Audio that has received the addition data and subtraction data calculated by the addition unit and the subtraction unit, and the first effective value and the second effective value calculated by the first effective value calculation unit and the second effective value calculation unit as metadata. An additional unit for adding to the acoustic signal;
A transmission / reception apparatus comprising: a transmission unit that transmits the audio-acoustic signal to which metadata is added by the addition unit to the outside.

（付記３）
音声音響信号を受信し、受信した音声音響信号を送信する送受信装置において、
受信した音声音響信号に係る第１音声音響信号及び第２音声音響信号を抽出する抽出部と、
該抽出部により抽出した第１音声音響信号及び第２音声音響信号から所定周波数以上の周波数成分を抽出するハイパスフィルタと、
該ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の和信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく加算データを算出する加算部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号及び第２音声音響信号の時系列の差信号に関する値を算出し、算出した値の所定時間分の累積加算値に基づく減算データを算出する減算部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第１音声音響信号の時系列の第１実効値を算出する第１実効値算出部と、
前記ハイパスフィルタにより抽出された周波数成分に係る第２音声音響信号の時系列の第２実効値を算出する第２実効値算出部と、
加算データ及び減算データに基づくメタデータ、第１実効値及び第２実効値に基づくメタデータ、または、加算データ及び減算データに基づくメタデータ並びに第１実効値及び第２実効値に基づくメタデータ、のいずれかを選択する選択部と、
前記選択部により加算データ及び減算データに基づくメタデータが選択された場合に、前記加算部及び減算部により算出した加算データ及び減算データをメタデータとして受信した音声音響信号に付加し、前記選択部により第１実効値及び第２実効値に基づくメタデータが選択された場合に、前記第１実効値算出部及び第２実効値算出部により算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加し、前記選択部により加算データ及び減算データに基づくメタデータ並びに第１実効値及び第２実効値に基づくメタデータが選択された場合に、前記加算部及び減算部により算出した加算データ及び減算データ並びに前記第１実効値算出部及び第２実効値算出部により算出した第１実効値及び第２実効値をメタデータとして受信した音声音響信号に付加する付加部と、
該付加部によりメタデータが付加された音声音響信号を外部へ送信する送信部と
を備えることを特徴とする送受信装置。 (Appendix 3)
In the transmission / reception device that receives the audio sound signal and transmits the received audio sound signal,
An extraction unit for extracting the first audio sound signal and the second audio sound signal according to the received audio sound signal;
A high-pass filter that extracts a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction unit;
A value related to a time-series sum signal of the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and addition data based on a cumulative addition value for a predetermined time of the calculated value is obtained. An adder to calculate;
A value related to a time-series difference signal between the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and subtraction data based on a cumulative addition value for a predetermined time of the calculated value is calculated. Subtracting part to calculate,
A first effective value calculation unit for calculating a first effective value in a time series of the first audio-acoustic signal related to the frequency component extracted by the high-pass filter;
A second effective value calculation unit for calculating a second effective value in a time series of the second audio-acoustic signal related to the frequency component extracted by the high-pass filter;
Metadata based on addition data and subtraction data, metadata based on first effective value and second effective value, or metadata based on addition data and subtraction data, and metadata based on first effective value and second effective value, A selection section for selecting one of
When the selection unit selects metadata based on the addition data and the subtraction data, the addition unit and the subtraction data calculated by the addition unit and the subtraction unit are added to the received audio-acoustic signal as metadata, and the selection unit When metadata based on the first effective value and the second effective value is selected by the above, the first effective value and the second effective value calculated by the first effective value calculation unit and the second effective value calculation unit are metadata. When the metadata based on the addition data and the subtraction data and the metadata based on the first effective value and the second effective value are selected by the selection unit, the addition unit and the subtraction unit And the first and second effective values calculated by the first and second effective value calculating units and the first effective value calculating unit. And adding unit for adding the speech sound signal received as,
A transmission / reception apparatus comprising: a transmission unit that transmits the audio-acoustic signal to which metadata is added by the addition unit to the outside.

伝送システムの概要を示す模式図である。It is a schematic diagram which shows the outline | summary of a transmission system. メタデータ算出器のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of a metadata calculator. 係数Ａの値を示すテーブルである。It is a table which shows the value of coefficient A. ＨＰＦの構成を示すブロック図である。It is a block diagram which shows the structure of HPF. 左音声データ及び右音声データの時間的変化を模式的に示すグラフである。It is a graph which shows typically a time change of left voice data and right voice data. ＨＰＦによる処理を経ない場合の加算データの時間的変化を示すグラフである。It is a graph which shows the time change of the addition data when not passing through the process by HPF. ＨＰＦによる処理を経た場合の加算データの時間的変化を示すグラフである。It is a graph which shows the time change of the addition data at the time of passing through the process by HPF. ＨＰＦによる処理を経ない場合の減算データの時間的変化を示すグラフである。It is a graph which shows the time change of the subtraction data when not passing through the process by HPF. ＨＰＦによる処理を経た場合の減算データの時間的変化を示すグラフである。It is a graph which shows the time change of the subtraction data at the time of passing through the process by HPF. 各符号化及び復号処理を経た加算データの時間的変化を示すグラフである。It is a graph which shows the time change of the addition data which passed through each encoding and decoding process. 各符号化及び復号処理を経た減算データの時間的変化を示すグラフである。It is a graph which shows the time change of the subtraction data which passed through each encoding and decoding process. メタデータ保持部のレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of a metadata holding part. メタデータ及び特定データのデータ構造を示す説明図である。It is explanatory drawing which shows the data structure of metadata and specific data. メタデータ算出処理及び付加処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a metadata calculation process and an addition process. メタデータ算出処理及び付加処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a metadata calculation process and an addition process. メタデータ算出処理及び付加処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a metadata calculation process and an addition process. 実施の形態２に係るメタデータ算出器のハードウェア構成を示すブロック図である。6 is a block diagram showing a hardware configuration of a metadata calculator according to Embodiment 2. FIG. 音声データの実効値の時間的変化を示すグラフである。It is a graph which shows the time change of the effective value of audio | voice data. 音声データの実効値に係る対数の時間的変化を示すグラフである。It is a graph which shows the time change of the logarithm which concerns on the effective value of audio | voice data. メタデータ算出処理及び変換処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a metadata calculation process and a conversion process. 実施の形態３に係るメタデータ算出器のハードウェア構成を示すブロック図である。FIG. 10 is a block diagram illustrating a hardware configuration of a metadata calculator according to a third embodiment. 実施の形態３に係るメタデータ及び特定データのデータ構造を示す説明図である。10 is an explanatory diagram illustrating a data structure of metadata and specific data according to Embodiment 3. FIG. 実施の形態３に係るメタデータ付加処理の手順を示すフローチャートである。10 is a flowchart illustrating a procedure of metadata addition processing according to the third embodiment. 実施の形態３に係るメタデータ付加処理の手順を示すフローチャートである。10 is a flowchart illustrating a procedure of metadata addition processing according to the third embodiment. 実施の形態４に係るメタデータ及び特定データのデータ構造を示す説明図である。It is explanatory drawing which shows the data structure of the metadata which concerns on Embodiment 4, and specific data. 実施の形態４に係るメタデータ付加処理の手順を示すフローチャートである。14 is a flowchart illustrating a procedure of metadata addition processing according to the fourth embodiment. 実施の形態４に係るメタデータ付加処理の手順を示すフローチャートである。14 is a flowchart illustrating a procedure of metadata addition processing according to the fourth embodiment. 実施の形態４に係るメタデータ付加処理の手順を示すフローチャートである。14 is a flowchart illustrating a procedure of metadata addition processing according to the fourth embodiment. 実施の形態５に係るメタデータ算出器のハードウェア構成を示すブロック図である。FIG. 10 is a block diagram illustrating a hardware configuration of a metadata calculator according to a fifth embodiment.

Explanation of symbols

１メタデータ算出器
１Ａ可搬型記録媒体
１１デマルチプレクサ
１２抽出部
１３メタデータ保持部
１４メタデータ算出部
１５メタデータ付加部
１７付加部
１８送信部
１１０ＨＰＦ
１１２選択部
１４１加算部
１４２減算部
１４３左実効値算出部
１４４右実効値算出部
１５１識別子付加部
１１１０上下限変換部
１１１１、１１１３整数変換部
１１１２対数変換部
Ｓ１スイッチ
Ｓ２スイッチ DESCRIPTION OF SYMBOLS 1 Metadata calculator 1A Portable recording medium 11 Demultiplexer 12 Extraction part 13 Metadata holding part 14 Metadata calculation part 15 Metadata addition part 17 Addition part 18 Transmission part 110 HPF
112 selecting unit 141 adding unit 142 subtracting unit 143 left effective value calculating unit 144 right effective value calculating unit 151 identifier adding unit 1110 upper / lower limit converting unit 1111, 1113 integer converting unit 1112 logarithmic converting unit S1 switch S2 switch

Claims

In the transmission / reception device that receives the audio sound signal and transmits the received audio sound signal,
An extraction unit for extracting the first audio sound signal and the second audio sound signal according to the received audio sound signal;
A high-pass filter that extracts a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction unit;
A value related to a time-series sum signal of the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and addition data based on a cumulative addition value for a predetermined time of the calculated value is obtained. An adder to calculate;
A value related to a time-series difference signal between the first audio acoustic signal and the second audio acoustic signal related to the frequency component extracted by the high-pass filter is calculated, and subtraction data based on a cumulative addition value for a predetermined time of the calculated value is calculated. Subtracting part to calculate,
An addition unit for adding the addition data and the subtraction data calculated by the addition unit and the subtraction unit to the audio-acoustic signal received as metadata;
A transmission / reception apparatus comprising: a transmission unit that transmits the audio-acoustic signal to which metadata is added by the addition unit to the outside.

A conversion unit that converts the addition data calculated by the addition unit and the subtraction data calculated by the subtraction unit based on a predetermined lower limit value and an upper limit value;
The additional part is
The transmission / reception apparatus according to claim 1, wherein the addition data and the subtraction data converted by the conversion unit are added to the audio-acoustic signal received as metadata.

The converter is
Of the absolute value of the addition data calculated by the addition unit and the subtraction data calculated by the subtraction unit, the addition data and subtraction data smaller than the lower limit value are converted to zero, and the addition data exceeding the upper limit value and After converting the absolute value of the subtraction data to the upper limit value or a value less than the upper limit value, an upper and lower limit conversion unit that adds the sign before the absolute value calculation to the converted addition data and subtraction data,
The transmission / reception apparatus according to claim 2, further comprising: an integer conversion unit that converts addition data and subtraction data into an integer.

The transmission / reception apparatus according to claim 3, wherein the lower limit value is 3 and the upper limit value is 255.

In the transmission / reception device that receives the audio sound signal and transmits the received audio sound signal,
An extraction unit for extracting the first audio sound signal and the second audio sound signal according to the received audio sound signal;
A high-pass filter that extracts a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction unit;
A first effective value calculation unit for calculating a first effective value in a time series of the first audio-acoustic signal related to the frequency component extracted by the high-pass filter;
A second effective value calculation unit for calculating a second effective value in a time series of the second audio-acoustic signal related to the frequency component extracted by the high-pass filter;
An adding unit that adds the first effective value and the second effective value calculated by the first effective value calculating unit and the second effective value calculating unit to the audio-acoustic signal received as metadata;
A transmission / reception apparatus comprising: a transmission unit that transmits the audio-acoustic signal to which metadata is added by the addition unit to the outside.

A conversion unit that performs conversion based on the logarithm of the first effective value calculated by the first effective value calculation unit and the second effective value calculated by the second effective value calculation unit;
The said addition part is comprised so that the 1st effective value and 2nd effective value which concern on the logarithm converted by the said conversion part may be added to the audio | voice sound signal received as metadata. The transmitter / receiver described.

The transmission / reception apparatus according to claim 1, wherein the predetermined frequency is 20 Hz.

The extraction unit includes:
When the received audio-acoustic signal has a plurality of types of audio-acoustic signals exceeding the first audio-acoustic signal and the second audio-acoustic signal, the plurality of types of audio-acoustic signals are converted into the first audio-acoustic signal and the second audio-acoustic signal. The transmission / reception apparatus according to claim 1, wherein the transmission / reception apparatus is configured as described above.

In the transmission / reception method of receiving the audio / acoustic signal by the transmission / reception device and transmitting the received audio / acoustic signal from the transmission / reception device to the outside,
An extraction step of extracting a first audio sound signal and a second audio sound signal according to the received audio sound signal;
A component extraction step of extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step;
A value related to the time-series sum signal of the first audio sound signal and the second audio sound signal related to the frequency component extracted by the component extraction step is calculated, and the addition data based on the accumulated addition value for a predetermined time of the calculated value An adding step for calculating
Subtract data based on a cumulative addition value for a predetermined time of the calculated value for a time-series difference signal between the first audio sound signal and the second audio sound signal related to the frequency component extracted in the component extraction step. Subtracting step for calculating
An addition step of adding the addition data and the subtraction data calculated by the addition step and the subtraction step to the audio-acoustic signal received as metadata;
And a transmitting step of transmitting the audio-acoustic signal to which the metadata is added in the adding step to the outside.

In the transmission / reception method of receiving the audio / acoustic signal by the transmission / reception device and transmitting the received audio / acoustic signal from the transmission / reception device to the outside,
An extraction step of extracting a first audio sound signal and a second audio sound signal according to the received audio sound signal;
A component extraction step of extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step;
A first effective value calculating step of calculating a first effective value of a time series of the first audio-acoustic signal related to the frequency component extracted by the component extracting step;
A second effective value calculating step of calculating a second effective value in a time series of the second audio-acoustic signal related to the frequency component extracted by the component extracting step;
An adding step of adding the first effective value and the second effective value calculated in the first effective value calculating step and the second effective value calculating step to the audio-acoustic signal received as metadata;
And a transmitting step of transmitting the audio-acoustic signal to which the metadata is added in the adding step to the outside.

In a program used in a computer that receives a sound sound signal and transmits the sound sound signal to the outside,
On the computer,
An extraction step of extracting a first audio sound signal and a second audio sound signal according to the received audio sound signal;
A component extraction step of extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step;
A value related to the time-series sum signal of the first audio sound signal and the second audio sound signal related to the frequency component extracted by the component extraction step is calculated, and the addition data based on the accumulated addition value for a predetermined time of the calculated value An adding step for calculating
Subtract data based on a cumulative addition value for a predetermined time of the calculated value for a time-series difference signal between the first audio sound signal and the second audio sound signal related to the frequency component extracted in the component extraction step. Subtracting step for calculating
An addition step of adding the addition data and the subtraction data calculated by the addition step and the subtraction step to the audio-acoustic signal received as metadata;
And a transmission step of transmitting the audio-acoustic signal to which the metadata is added in the addition step to the outside.

In a program used in a computer that receives a sound sound signal and transmits the sound sound signal to the outside,
On the computer,
An extraction step of extracting a first audio sound signal and a second audio sound signal according to the received audio sound signal;
A component extraction step of extracting a frequency component of a predetermined frequency or higher from the first audio sound signal and the second audio sound signal extracted by the extraction step;
A first effective value calculating step of calculating a first effective value of a time series of the first audio-acoustic signal related to the frequency component extracted by the component extracting step;
A second effective value calculating step of calculating a second effective value in a time series of the second audio-acoustic signal related to the frequency component extracted by the component extracting step;
An adding step of adding the first effective value and the second effective value calculated in the first effective value calculating step and the second effective value calculating step to the audio-acoustic signal received as metadata;
And a transmission step of transmitting the audio-acoustic signal to which the metadata is added in the addition step to the outside.