JP2022089106A

JP2022089106A - Automatic voice adjustment device

Info

Publication number: JP2022089106A
Application number: JP2020201374A
Authority: JP
Inventors: 渉波多野; Wataru Hatano
Original assignee: Tamura Corp
Current assignee: Tamura Corp
Priority date: 2020-12-03
Filing date: 2020-12-03
Publication date: 2022-06-15

Abstract

To provide an automatic voice adjustment device with which it is possible to output a voice signal that is easy to hear without being affected by the skill level of the person who adjusts voice.SOLUTION: The automatic voice adjustment device comprises: a plurality of voice signal input units 1a, 1b-1n; a plurality of voice signal adjustment units 2a, 2b-2n for adjusting the voice signal inputted from each voice signal input unit to a prescribed frequency characteristic; a voice signal mixing unit 3 for mixing the voice signals having been adjusted by each voice signal adjustment unit; and a voice signal output unit 4 for outputting the voice signal having been mixed by the sound signal mixing unit. A frequency analysis arithmetic unit 5 analyzes and compares the frequency characteristic of each of voice signals A-N. When the comparison results are the same or similar, the frequencies f1a, f2a, fna of the first spectra of the compared voice signals are detected, and the center frequency feq of one of the voice signal adjustment units 2a, 2b-2n is moved to become greater than or equal to a preset threshold.SELECTED DRAWING: Figure 1

Description

本発明は、複数の音声信号の周波数特性を調整する自動音声調整装置に関する。 The present invention relates to an automatic voice adjusting device that adjusts the frequency characteristics of a plurality of voice signals.

従来より、ラジオやテレビジョンなどの放送スタジオや、音楽の録音スタジオなどでは、マイクなどから入力された複数の音声信号について、音質や音量等を調整した後、各音声信号をミキシングして出力する音声調整装置が使用されている。音声調整装置は、ミキシング装置、オーディオミキシングコンソール、コンソール、ミキシングボード、ミキサー、オーディオミキサー、音声調整卓、音響調整卓などとも呼ばれている。 Conventionally, in broadcasting studios such as radios and televisions, music recording studios, etc., after adjusting the sound quality and volume of multiple audio signals input from microphones, etc., each audio signal is mixed and output. A voice regulator is used. The audio adjustment device is also called a mixing device, an audio mixing console, a console, a mixing board, a mixer, an audio mixer, an audio adjustment console, an acoustic adjustment console, and the like.

音声調整装置に実装されているイコライザには、音声信号の周波数特性を複数の項目にわたってきめ細かく調整することが可能なパラメトリックイコライザ（PEQ）がある。このパラメトリックイコライザは、入力信号の周波数特性が均一になるように、又は、音楽的に好適な周波数になるように中心周波数、等価量、Q値（Quality Factor）などの設定パラメータを調整して使用する。 The equalizer mounted on the voice adjustment device includes a parametric equalizer (PEQ) that can finely adjust the frequency characteristics of a voice signal over a plurality of items. This parametric equalizer is used by adjusting setting parameters such as center frequency, equivalent amount, and Q value (Quality Factor) so that the frequency characteristics of the input signal become uniform or musically suitable. do.

パラメトリックイコライザの操作は、音声の調整を行う音声調整者（ミキシングエンジニア）によって、装置上に多数配置されているボタンスイッチ、調整つまみ、フェーダー及びタッチパネル（音量調整装置）等の各種操作子を操作することにより行われる。そのため、音声調整者には、限られた時間の中で迅速、且つ正確に多数の操作子を操作することが要求される。しかし、音声調整者は、実際に音を聞きながら音声調整装置を操作して、各音声信号のミキシングバランスが最適となるように手動で調整していることから、その調整には音声調整者の技術、経験などの熟練度が大きく影響し、放送番組、レコーディング、ビデオ編集など制作現場で作られる作品にはばらつきが生じてしまう。 The parametric equalizer is operated by a voice coordinator (mixing engineer) who adjusts the sound, and operates various controls such as button switches, adjustment knobs, faders, and a touch panel (volume control device) arranged on the device. It is done by. Therefore, the voice coordinator is required to operate a large number of controls quickly and accurately within a limited time. However, since the voice adjuster operates the voice adjustment device while actually listening to the sound and manually adjusts the mixing balance of each voice signal to be optimal, the voice adjuster adjusts the adjustment. Skills such as technique and experience have a great influence, and the works produced at the production site such as broadcast programs, recordings, and video editing will vary.

また、パラメトリックイコライザの操作に長けた音声調整者の育成には時間がかかる一方で、近年高齢化により熟練度の高い音声調整者が退職し、彼らが有するスキルも消失してしまうため、制作現場で作られる作品の質の低下が懸念されている。 In addition, while it takes time to train voice coordinators who are good at operating parametric equalizers, highly skilled voice coordinators will retire due to the aging of the population in recent years, and their skills will disappear. There is concern that the quality of the works made in Japan will deteriorate.

このような観点から、音声調整者の熟練度に関わらず、簡単な操作により聞き手にとって違和感のない音声信号を出力することができるようにするため、複数の音声信号を自動的にミックスする機能、いわゆるオートミキサーを搭載した自動音声調整装置に関する提案が、たとえば下記の特許文献に示されている。 From this point of view, a function that automatically mixes multiple audio signals so that the listener can output audio signals that are not uncomfortable for the listener with simple operations, regardless of the skill level of the audio adjuster. Proposals for an automatic audio regulator equipped with a so-called automixer are shown, for example, in the following patent documents.

特開２０１６－１１１６７３号公報Japanese Unexamined Patent Publication No. 2016-111673 特開２０１２－１０１５４号公報Japanese Unexamined Patent Publication No. 2012-10154

しかし、特許文献１は、ミックスしたい音声信号をレベル制御のみで調整するため、ミックスしたい音声信号の周波数特性が同一又は類似する場合には、入力される複数の信号が相互に干渉してしまい、それぞれの音が聞きづらくなってしまう場合があった。このような場合、音声調整者が、各入力信号の特徴に合わせてパラメトリックイコライザにより音質を変えて、それぞれの入力信号をミックスした時でも聞きやすくなるように手動で調整しているが、音声調整者の調整は非常に複雑であり、音声調整者の熟練度によって調整後の音声の聞き取りやすさが左右されてしまう。また、特許文献２は、特定のチャンネルの音声を大きくすることはできるものの、入力レベルが小さい音声の音声出力が低下してしまい、聞き取りにくくなってしまうという問題があった。 However, in Patent Document 1, since the audio signal to be mixed is adjusted only by level control, if the frequency characteristics of the audio signal to be mixed are the same or similar, a plurality of input signals interfere with each other. In some cases, it became difficult to hear each sound. In such a case, the voice adjuster manually adjusts the sound quality by using a parametric equalizer according to the characteristics of each input signal so that it is easy to hear even when each input signal is mixed. The adjustment of the person is very complicated, and the audibility of the adjusted voice depends on the skill level of the voice adjuster. Further, Patent Document 2 has a problem that although it is possible to increase the sound of a specific channel, the sound output of the sound having a small input level is lowered and it becomes difficult to hear.

本発明は前記のような従来技術の問題点を解決するために提案されたものである。本発明の目的は、複数の周波数特性が同一又は類似する音声が入力された場合に、ピークの周波数が異なるようにパラメトリックイコライザの設定を調整することにより、音声調整者の熟練度に左右されず、聞き取りやすい音声信号を出力することができる自動音声調整装置を提供することにある。 The present invention has been proposed to solve the above-mentioned problems of the prior art. An object of the present invention is to adjust the parametric equalizer setting so that when a plurality of voices having the same or similar frequency characteristics are input, the peak frequencies are different, so that the skill level of the voice adjuster is not affected. It is an object of the present invention to provide an automatic voice adjusting device capable of outputting an easy-to-hear voice signal.

前記の目的を達成するために、本発明の自動音声調整装置は、次のような構成を有することを特徴とする。
（１）複数の音声信号入力部。
（２）前記各音声信号入力部から入力された音声信号を、設定された中心周波数に基づいて所定の周波数特性に調整する複数の音声信号調整部。
（３）任意に選択された複数の前記音声信号の周波数特性を比較し、解析演算処理を施す周波数解析演算部。
（４）前記各音声信号調整部により調整処理済みの音声信号を混合する音声信号混合部。
（５）前記音声信号混合部により混合された音声信号を出力する音声信号出力部。
（６）前記周波数解析演算部は、前記各音声信号の周波数特性の比較結果が同一又は類似する場合に、前記音声信号のいずれかについて、その音声信号の周波数特性を調整するために設定された前記音声信号調整部の中心周波数を、その音声信号の第１スペクトルの周波数に対して、予め設定された閾値以上になるように移動させる。 In order to achieve the above object, the automatic voice adjusting device of the present invention is characterized by having the following configuration.
(1) Multiple audio signal input units.
(2) A plurality of audio signal adjusting units that adjust the audio signal input from each of the audio signal input units to a predetermined frequency characteristic based on a set center frequency.
(3) A frequency analysis calculation unit that compares the frequency characteristics of a plurality of arbitrarily selected audio signals and performs analysis calculation processing.
(4) An audio signal mixing unit that mixes audio signals that have been adjusted by each of the audio signal adjusting units.
(5) An audio signal output unit that outputs an audio signal mixed by the audio signal mixing unit.
(6) The frequency analysis calculation unit is set to adjust the frequency characteristics of the audio signal for any of the audio signals when the comparison results of the frequency characteristics of the audio signals are the same or similar. The center frequency of the voice signal adjusting unit is moved so as to be equal to or higher than a preset threshold value with respect to the frequency of the first spectrum of the voice signal.

本発明において、次のような構成を採用することができる。
（１）前記波数解析演算部は、前記各音声信号の周波数特性の比較結果が同一又は類似する場合に、各音声信号について第２スペクトルの周波数を検出し、前記第１スペクトルの周波数と前記第２スペクトルの周波数の差異が大きい音声信号について、前記音声信号調整部の中心周波数を予め設定された閾値以上になるように移動させる。
（２）前記音声信号調整部の中心周波数を、予め設定された閾値以上になるように、前記第２スペクトル側に移動させる。
（３）前記周波数解析演算部は、前記音声信号調整部から出力された調整後の音声信号を入力して、その音声信号の周波数特性を調整するために設定された前記音声信号調整部の中心周波数を、その音声信号の第１スペクトルの周波数に対して、予め設定された閾値以上になるように移動させる。 In the present invention, the following configurations can be adopted.
(1) The wave number analysis calculation unit detects the frequency of the second spectrum for each voice signal when the comparison result of the frequency characteristics of each voice signal is the same or similar, and the frequency of the first spectrum and the first. For an audio signal having a large difference in frequency between the two spectra, the center frequency of the audio signal adjusting unit is moved so as to be equal to or higher than a preset threshold value.
(2) The center frequency of the audio signal adjusting unit is moved to the second spectrum side so as to be equal to or higher than a preset threshold value.
(3) The frequency analysis calculation unit is the center of the audio signal adjustment unit set to input the adjusted audio signal output from the audio signal adjustment unit and adjust the frequency characteristics of the audio signal. The frequency is moved so as to be equal to or higher than a preset threshold value with respect to the frequency of the first spectrum of the audio signal.

本発明によれば、複数の周波数特性が同一又は類似する音声が入力された場合に、ピークの周波数が異なるようにパラメトリックイコライザの設定を調整するため、音声調整者の熟練度に左右されず、聞き取りやすい音声信号を出力することができる効果を発揮することができる。 According to the present invention, when a plurality of voices having the same or similar frequency characteristics are input, the parametric equalizer setting is adjusted so that the peak frequencies are different, so that it is not affected by the skill level of the voice adjuster. It is possible to exert the effect of being able to output an audio signal that is easy to hear.

第１実施形態の構成を示すブロック図。The block diagram which shows the structure of 1st Embodiment. 第１実施形態の作用を示すフローチャート。The flowchart which shows the operation of 1st Embodiment. 第１実施例の音声信号Ａ，Ｂについて、（ａ）第１スペクトルの周波数ｆ１ａ，ｆ２ａの値が予め定められた閾値以上に異なる場合を示すグラフ、（ｂ）１スペクトルの周波数ｆ１ａ，ｆ２ａの値が予め定められた閾値内で同一或いは近接している場合を示すグラフ。For the audio signals A and B of the first embodiment, (a) a graph showing the case where the values of the frequencies f1a and f2a of the first spectrum differ by more than a predetermined threshold value, and (b) the frequencies f1a and f2a of the first spectrum. A graph showing the case where the values are the same or close to each other within a predetermined threshold value.

［１．第１実施形態］
［１－１．第１実施形態の構成］
以下、本発明の第１実施形態を図１に従って具体的に説明する。図１に示すとおり、本実施形態の装置は、複数の音声信号入力部１ａ，１ｂ，１ｎと、各音声信号入力部１ａ，１ｂ，１ｎから入力された音声信号を所定の周波数特性に調整する複数の音声信号調整部２ａ，２ｂ，２ｎと、各音声信号調整部２ａ，２ｂ，２ｎにより調整処理済みの音声信号を混合する音声信号混合部３と、音声信号混合部３により混合された音声信号を出力する音声信号出力部４を有する。 [1. First Embodiment]
[1-1. Configuration of the first embodiment]
Hereinafter, the first embodiment of the present invention will be specifically described with reference to FIG. As shown in FIG. 1, the apparatus of the present embodiment adjusts the audio signals input from the plurality of audio signal input units 1a, 1b, 1n and the respective audio signal input units 1a, 1b, 1n to predetermined frequency characteristics. The audio signal mixing unit 3 that mixes the audio signals that have been adjusted by the plurality of audio signal adjusting units 2a, 2b, 2n, the audio signal adjusting units 2a, 2b, 2n, and the audio signal mixing unit 3 It has an audio signal output unit 4 that outputs a signal.

音声信号入力部１ａ，１ｂ，１ｎは、各チャンネルから音声信号が入力される。例えば、放送スタジオにおいては、男性アナウンサー、女性アナウンサー、複数のコメンテーターの音声など、また、バンド演奏の録音スタジオにおいては、ボーカルとその他の楽器の音声などが、各チャンネルの音声信号Ａ、音声信号Ｂ、・・・、音声信号Ｎとして、それぞれ音声信号入力部１ａ，１ｂ，１ｎに入力される。 Audio signals are input from each channel in the audio signal input units 1a, 1b, and 1n. For example, in a broadcasting studio, the voices of male announcers, female announcers, and multiple commentators, and in a recording studio for band performances, the voices of vocals and other musical instruments are the voice signals A and B of each channel. , ..., The voice signal N is input to the voice signal input units 1a, 1b, and 1n, respectively.

音声信号調整部２ａ，２ｂ，２ｎは、各音声信号入力部１ａ，１ｂ，１ｎから入力された音声信号を所定の周波数特性に調整する。音声信号調整部２ａ，２ｂ，２ｎは、いわゆるパラメトリックイコライザ（PEQ）であり、入力された音声信号Ａ，Ｂ，Ｎの周波数特性が均一になるように、又は、音楽的に好適な周波数になるように中心周波数、等価量、Q値（Quality Factor）などの設定パラメータを調整する。特に、本実施形態において音声信号調整部２ａ，２ｂ，２ｎは、それぞれの音声信号調整部２ａ，２ｂ，２ｎごとに予め設定された中心周波数ｆｅｑａ，ｆｅｑｂ，ｆｅｑｎに基づいて、入力された各音声信号Ａ，Ｂ，Ｎの周波数特性を調整するもので、中心周波数ｆｅｑａ，ｆｅｑｂ，ｆｅｑｎを移動させて、各音声信号のどの帯域の周波数のゲイン及びQ値を増減するかの調整が可能である。 The audio signal adjusting units 2a, 2b, 2n adjust the audio signals input from the respective audio signal input units 1a, 1b, 1n to predetermined frequency characteristics. The audio signal adjusting units 2a, 2b, and 2n are so-called parametric equalizers (PEQs) so that the frequency characteristics of the input audio signals A, B, and N become uniform or have a frequency suitable for music. Adjust the setting parameters such as center frequency, equivalent quantity, and Q value (Quality Factor). In particular, in the present embodiment, the audio signal adjusting units 2a, 2b, 2n are input voices based on the center frequencies feqa, pheqb, feqn preset for each of the audio signal adjusting units 2a, 2b, 2n. It adjusts the frequency characteristics of the signals A, B, and N, and it is possible to adjust the frequency gain and Q value of which band of each audio signal by moving the center frequencies feqa, feqb, and feqn. ..

各音声信号入力部１ａ，１ｂ，１ｎと音声信号調整部２ａ，２ｂ，２ｎには、音声調整者により任意に選択された複数の音声信号Ａ，Ｂ，Ｎの周波数特性を比較し、解析演算処理を施す周波数解析演算部５が接続されている。本実施形態において、周波数解析演算部５は、音声信号調整部２ａ，２ｂ，２ｎから出力されたゲイン等が調整された後の各音声信号を入力して、その周波数特性を解析している。すなわち、各音声信号調整部２ａ，２ｂ，２ｎから出力された調整済みの音声信号は、音声信号混合部３によって混合された後、音声信号出力部４から出力されるが、複数の音声が干渉したり、聞き取り難くなったりするのは、調整後の音声信号に起因している。そこで、本実施形態では、調整後の音声信号を解析することで、音声の干渉や聞き取りが困難になることを防止している。 The frequency characteristics of a plurality of audio signals A, B, N arbitrarily selected by the audio adjuster are compared between the audio signal input units 1a, 1b, 1n and the audio signal adjusting units 2a, 2b, 2n, and an analysis calculation is performed. A frequency analysis calculation unit 5 for processing is connected. In the present embodiment, the frequency analysis calculation unit 5 inputs each audio signal after the gain and the like output from the audio signal adjusting units 2a, 2b, and 2n have been adjusted, and analyzes the frequency characteristics thereof. That is, the adjusted audio signals output from the respective audio signal adjusting units 2a, 2b, and 2n are mixed by the audio signal mixing unit 3 and then output from the audio signal output unit 4, but a plurality of audios interfere with each other. It is due to the adjusted audio signal that it becomes difficult to hear. Therefore, in the present embodiment, by analyzing the adjusted voice signal, it is possible to prevent voice interference and difficulty in hearing.

周波数解析演算部５は、入力周波数解析部５１と、入力周波数比較部５２と、スペクトル間隔比較部５３と、中心周波数設定部５４とを有する。 The frequency analysis calculation unit 5 includes an input frequency analysis unit 51, an input frequency comparison unit 52, a spectrum interval comparison unit 53, and a center frequency setting unit 54.

周波数解析演算部５は、音声信号Ａ，Ｂ，Ｎの周波数特性を解析し、その特性を比較する。その比較結果が同一又は類似する場合に、各音声信号Ａ，Ｂ，Ｎの第１スペクトルの周波数ｆ１ａ，ｆ２ａ，ｆｎａを検出し、音声信号調整部２ａ，２ｂ，２ｎの中心周波数ｆｅｑａ，ｆｅｑｂ，ｆｅｑｎを予め設定された閾値以上になるように移動させる。そのため、周波数解析演算部５の出力側は音声信号調整部２ａ，２ｂ，２ｎに接続され、各音声信号調整部２ａ，２ｂ，２ｎが移動後の中心周波数に基づいて、音声信号のゲイン調整などを実行するように構成されている。 The frequency analysis calculation unit 5 analyzes the frequency characteristics of the audio signals A, B, and N, and compares the characteristics. When the comparison results are the same or similar, the frequencies f1a, f2a, fna of the first spectrum of each audio signal A, B, N are detected, and the center frequencies feqa, pheqb, of the audio signal adjusting units 2a, 2b, 2n, The frequency is moved so as to be equal to or higher than a preset threshold value. Therefore, the output side of the frequency analysis calculation unit 5 is connected to the audio signal adjustment units 2a, 2b, 2n, and each audio signal adjustment unit 2a, 2b, 2n adjusts the gain of the audio signal based on the center frequency after movement. Is configured to run.

入力周波数解析部５１は、任意に選択された複数の音声信号、例えば、音声信号Ａ，Ｂについて、ＦＦＴ（高速フーリエ変換：Fast Fourier Transform）により解析し、それぞれの周波数を分析する。 The input frequency analysis unit 51 analyzes a plurality of arbitrarily selected voice signals, for example, voice signals A and B by FFT (Fast Fourier Transform), and analyzes each frequency.

入力周波数比較部５２は、入力周波数解析部５１により解析された音声信号Ａ，Ｂの周波数スペクトルが同一又は類似するか否か、両者を比較する。すなわち、図３（ａ）に示すように、音声信号Ａ，Ｂの第１スペクトルの周波数ｆ１ａ，ｆ２ａ及び第２スペクトルの周波数ｆ１ｂ，ｆ２ｂを検出し、これらの周波数を比較する。 The input frequency comparison unit 52 compares whether or not the frequency spectra of the audio signals A and B analyzed by the input frequency analysis unit 51 are the same or similar. That is, as shown in FIG. 3A, the frequencies f1a and f2a of the first spectrum of the audio signals A and B and the frequencies f1b and f2b of the second spectrum are detected and these frequencies are compared.

スペクトル間隔比較部５３は、各音声信号Ａ，Ｂについて、その第１スペクトルの周波数ｆ１ａ、ｆ２ａと第２スペクトルの周波数ｆ１ｂ，ｆ２ｂの差異を比較する。例えば、図３（ｂ）に示すように、音声信号Ａにおける第１スペクトルの周波数ｆ１ａと第２スペクトルの周波数ｆ１ｂの距離Ｌ１と、音声信号Ｂにおける第１スペクトルの周波数ｆ２ａと第２スペクトルの周波数ｆ２ｂの距離Ｌ２とを比較する。図３（ｂ）では、Ｌ１＜Ｌ２であるため、本実施形態では音声信号Ｂが、第１スペクトルの周波数ｆ１と第２スペクトルの周波数ｆ２の周波数の差異が大きい。 The spectrum interval comparison unit 53 compares the difference between the frequencies f1a and f2a of the first spectrum and the frequencies f1b and f2b of the second spectrum for each of the audio signals A and B. For example, as shown in FIG. 3B, the distance L1 between the frequency f1a of the first spectrum and the frequency f1b of the second spectrum in the voice signal A, the frequency f2a of the first spectrum and the frequency of the second spectrum in the voice signal B. Compare with the distance L2 of f2b. In FIG. 3B, since L1 <L2, in the present embodiment, the frequency difference between the frequency f1 of the first spectrum and the frequency f2 of the second spectrum of the audio signal B is large.

中心周波数設定部５４は、入力周波数比較部５２及びスペクトル間隔比較部５３の比較結果に従って、各音声信号調整部２ａ，２ｂ，２ｎに出力する中心周波数ｆｅｑａ，ｆｅｑｂ，ｆｅｑｎの値を設定する。この場合、例えば、音声信号Ａ，Ｂについて、第１スペクトルの周波数ｆ１ａ，ｆ２ａの値が一定の閾値を越えて離れている場合には、各中心周波数ｆｅｑａ，ｆｅｑｂの値を第１スペクトルの周波数ｆ１ａ，ｆ２ａの値と一致させる。 The center frequency setting unit 54 sets the values of the center frequencies feqa, pheqb, and feqn to be output to the respective audio signal adjustment units 2a, 2b, and 2n according to the comparison results of the input frequency comparison unit 52 and the spectral interval comparison unit 53. In this case, for example, for the audio signals A and B, when the values of the frequencies f1a and f2a in the first spectrum are separated by more than a certain threshold value, the values of the respective center frequencies feqa and feqb are set to the frequencies of the first spectrum. Match the values of f1a and f2a.

一方、音声信号Ａ，Ｂについて、第１スペクトルの周波数ｆ１ａ，ｆ２ａの値が一定の閾値内で同一或いは近接している場合には、一方の音声信号を調整する音声信号調整部２ａ，２ｂ，２ｎの中心周波数を移動させる。例えば、図３（ｂ）のように、音声信号Ａと音声信号Ｂを比較した場合に、第１スペクトルの周波数ｆ２ａと第２スペクトルの周波数ｆ２ｂの周波数の差異が大きい音声信号Ｂについて、その音声信号調整部２ｂの中心周波数ｆｅｑｂを予め設定された閾値以上になるように第２スペクトル側ｆ２ｂに移動させる。ここで閾値とは、相互のピークとなる周波数が任意のレベル差になる値であり、聞き手にとって違和感が生じない程度のレベル差をいう。 On the other hand, regarding the audio signals A and B, when the values of the frequencies f1a and f2a of the first spectrum are the same or close to each other within a certain threshold value, the audio signal adjusting units 2a and 2b that adjust one of the audio signals, Move the center frequency of 2n. For example, as shown in FIG. 3B, when the audio signal A and the audio signal B are compared, the audio signal B having a large difference in frequency between the frequency f2a of the first spectrum and the frequency f2b of the second spectrum is the audio. The center frequency pheqb of the signal adjusting unit 2b is moved to the second spectrum side f2b so as to be equal to or higher than a preset threshold value. Here, the threshold value is a value at which the frequencies that become mutual peaks have an arbitrary level difference, and is a level difference to the extent that the listener does not feel uncomfortable.

音声信号混合部３は、各音声信号調整部２ａ，２ｂ，２ｎにより調整処理済みの音声信号を混合する。音声信号出力部４は、音声信号混合部３により混合された音声信号を出力する。 The audio signal mixing unit 3 mixes the audio signals that have been adjusted by the audio signal adjusting units 2a, 2b, and 2n. The audio signal output unit 4 outputs the audio signal mixed by the audio signal mixing unit 3.

［１－２．第１実施形態の作用］
図２は、前記のような構成を有する第１実施形態の作用を説明するフローチャートである。図２に示すように、ステップＳ０１では、音声信号入力部１ａ，１ｂ，１ｎに対して各チャンネルの音声信号Ａ，Ｂ，Ｎが入力される。 [1-2. Action of the first embodiment]
FIG. 2 is a flowchart illustrating the operation of the first embodiment having the above configuration. As shown in FIG. 2, in step S01, the audio signals A, B, and N of each channel are input to the audio signal input units 1a, 1b, and 1n.

ステップＳ０２では、複数の音声が被ってしまって聞き取りにくい場合や、男性アナウンサーと女性アナウンサーのように音声のピークが異なる周波数について増幅処理したい場合など、音声調整者の様々な要望に応じて、音声調整者が音声信号調整部２ａ，２ｂ，２ｎを操作することにより、音声調整を実施する任意のチャンネルの音声信号Ａ，Ｂ，Ｎが選択され、音声信号Ａ，Ｂ，Ｎの調整が実施される。この場合、音声調整者が手動で音声調整を実施する代わりに、従来技術として示したような自動音声調整装置、例えば、入力された音声信号の態様に応じて自動的に音声信号調整部２ａ，２ｂ，２ｎが調整を実行してもよい。 In step S02, the voice is responded to various requests of the voice adjuster, such as when a plurality of voices are covered and difficult to hear, or when it is desired to perform amplification processing for frequencies having different voice peaks such as a male announcer and a female announcer. When the adjuster operates the voice signal adjusting units 2a, 2b, 2n, the voice signals A, B, N of any channel for performing the voice adjustment are selected, and the voice signals A, B, N are adjusted. To. In this case, instead of manually performing the voice adjustment by the voice adjuster, an automatic voice adjustment device as shown in the prior art, for example, the voice signal adjustment unit 2a, automatically according to the mode of the input voice signal, 2b, 2n may perform the adjustment.

ステップＳ０３では、音声調整者により選択されたチャンネルの周波数特性を解析及び比較する。すなわち、各音声信号入力部１ａ，１ｂ，１ｎから出力された各音声信号Ａ，Ｂ，Ｎは、周波数解析演算部５に入力され、その入力周波数解析部５１によってＦＦＴ解析される。 In step S03, the frequency characteristics of the channel selected by the voice coordinator are analyzed and compared. That is, each audio signal A, B, N output from each audio signal input unit 1a, 1b, 1n is input to the frequency analysis calculation unit 5, and FFT analysis is performed by the input frequency analysis unit 51.

また、ステップＳ０３では、入力周波数解析部５１より解析された各音声信号Ａ，Ｂ，Ｎの解析結果は、入力周波数比較部５２に出力され、入力周波数比較部５２において各音声信号Ａ，Ｂ，Ｎの周波数特性が比較される。比較された音声信号の周波数特性が予め定められた閾値以上に異なる場合（ステップＳ０４のＹＥＳ）、すなわち図３（ａ）に示すように、例えば、音声信号Ａ，Ｂが比較された場合において、それらの第１スペクトルの周波数ｆ１ａ，ｆ２ａが予め定められた閾値以上に異なる場合（ｆ１ａ≠ｆ２ａ）は、各音声信号Ａ，Ｂの第１スペクトルの周波数ｆ１ａ，ｆ２ａと一致するように、中心周波数設定部５４において各音声信号調整部２ａ，２ｂの中心周波数ｆｅｑａ，ｆｅｑｂが決定される。 Further, in step S03, the analysis results of the voice signals A, B, and N analyzed by the input frequency analysis unit 51 are output to the input frequency comparison unit 52, and the voice signals A, B, respectively in the input frequency comparison unit 52. The frequency characteristics of N are compared. When the frequency characteristics of the compared audio signals differ by more than a predetermined threshold value (YES in step S04), that is, when the audio signals A and B are compared, for example, as shown in FIG. 3A. When the frequencies f1a and f2a of the first spectrum differ from each other by a predetermined threshold value or more (f1a ≠ f2a), the center frequency is matched with the frequencies f1a and f2a of the first spectrum of the audio signals A and B. The setting unit 54 determines the center frequencies feqa and feqb of the audio signal adjusting units 2a and 2b.

中心周波数設定部５４で決定された中心周波数ｆｅｑａ，ｆｅｑｂは各音声信号調整部２ａ，２ｂに出力され、各音声信号調整部２ａ，２ｂはその中心周波数ｆｅｑａ，ｆｅｑｂに基づいて、音声信号入力部１ａ，１ｂから入力された音声信号Ａ，Ｂについて、音声調整者が入力したパラメータや、自動音声調整装置により決定されたパラメータに従って、音声調整が実施される（ステップＳ０６）。 The center frequencies feqa and feqb determined by the center frequency setting unit 54 are output to the respective audio signal adjustment units 2a and 2b, and the respective audio signal adjustment units 2a and 2b are based on the center frequencies feqa and feqb. With respect to the audio signals A and B input from 1a and 1b, audio adjustment is performed according to the parameters input by the audio adjuster and the parameters determined by the automatic audio adjustment device (step S06).

前記のステップＳ０６において、各音声信号調整部２ａ，２ｂにより音声調整された音声信号は、その後段に設けられた音声信号混合部３によって混合された後（ステップＳ０７）、混合された音声信号は音声信号出力部４からスピーカや録音装置などの外部機器に出力される（ステップＳ０８）。 In step S06, the audio signal adjusted by the audio signal adjusting units 2a and 2b is mixed by the audio signal mixing unit 3 provided in the subsequent stage (step S07), and then the mixed audio signal is generated. It is output from the audio signal output unit 4 to an external device such as a speaker or a recording device (step S08).

ステップＳ０３において比較された音声信号の周波数特性が同一又は類似する場合（ステップＳ０４のＮＯ）、すなわち図３（ｂ）に示すように、例えば、音声信号Ａ，Ｂが比較された場合において、それらの第１スペクトルの周波数ｆ１ａ，ｆ２ａが一定の閾値内で同一或いは近接している場合（ｆ１ａ≒ｆ２ａ）は、スペクトル間隔比較部５３において、それぞれの音声信号Ａ，Ｂについて、第１スペクトルの周波数ｆ１ａ，ｆ２ａと第２スペクトルの周波数ｆ１ｂ，ｆ２ｂの差異を比較する（ステップＳ０９）。図３（ｂ）では、音声信号Ａの第１スペクトルの周波数ｆ１ａと第２スペクトルの周波数ｆ１ｂの差Ｌ１と、音声信号Ｂの第１スペクトルの周波数ｆ２ａと第２スペクトルの周波数ｆ２ｂの差Ｌ２とでは、Ｌ２＞Ｌ１になっている。 When the frequency characteristics of the voice signals compared in step S03 are the same or similar (NO in step S04), that is, when the voice signals A and B are compared, for example, as shown in FIG. 3 (b), they. When the frequencies f1a and f2a of the first spectrum of the above are the same or close to each other within a certain threshold value (f1a≈f2a), the frequency of the first spectrum is used for the respective voice signals A and B in the spectrum interval comparison unit 53. The difference between f1a and f2a and the frequencies f1b and f2b of the second spectrum are compared (step S09). In FIG. 3B, the difference L1 between the frequency f1a of the first spectrum and the frequency f1b of the second spectrum of the voice signal A, and the difference L2 between the frequency f2a of the first spectrum and the frequency f2b of the second spectrum of the voice signal B. Then, L2> L1.

次のステップＳ１０では、中心周波数設定部５４により、第１スペクトルの周波数ｆ２ａと第２スペクトルの周波数ｆ２ｂの差異が大きい音声信号Ｂの音声信号調整部２ｂの中心周波数ｆｅｑｂを、差異が小さい音声信号Ａの音声信号調整部２ａの中心周波数ｆｅｑａに対して、予め設定された閾値以上になるように移動させる。移動の方向はいずれでもよいが、本実施形態では、差異が大きい音声信号Ｂの音声信号調整部２ｂの中心周波数ｆｅｑｂを、差異が小さい音声信号Ａの音声信号調整部２ａの中心周波数ｆｅｑａから、予め設定された閾値以上離れるように、第２スペクトルの周波数ｆ２ｂ側に移動させる。 In the next step S10, the center frequency setting unit 54 sets the center frequency pheqb of the audio signal adjusting unit 2b of the audio signal B having a large difference between the frequency f2a of the first spectrum and the frequency f2b of the second spectrum to the audio signal having a small difference. The center frequency feqa of the audio signal adjusting unit 2a of A is moved so as to be equal to or higher than a preset threshold value. The direction of movement may be any, but in the present embodiment, the center frequency feqb of the audio signal adjusting unit 2b of the audio signal B having a large difference can be obtained from the center frequency feqa of the audio signal adjusting unit 2a of the audio signal A having a small difference. It is moved to the frequency f2b side of the second spectrum so as to be separated by a preset threshold value or more.

このようにして中心周波数設定部５４により設定された移動後の中心周波数ｆｅｑｂは、該当する音声信号Ｂの音声信号調整部２ｂに送られる。一方、周波数解析演算部５によって中心周波数ｆｅｑａ，ｆｅｑｎを移動させることがなかった音声信号Ａ，Ｎについては、音声信号調整部２ａ，２ｎに予め設定されている中心周波数ｆｅｑａ，ｆｅｑｎに基づいて、音声調整者や自動音声調整装置によって設定されたパラメータに従って、ゲインの調整などが実行される（ステップＳ０６）。 The moved center frequency pheqb set by the center frequency setting unit 54 in this way is sent to the audio signal adjusting unit 2b of the corresponding audio signal B. On the other hand, for the voice signals A and N for which the center frequencies feqa and feqn were not moved by the frequency analysis calculation unit 5, the center frequencies feqa and feqn preset in the voice signal adjustment units 2a and 2n were used. The gain adjustment and the like are executed according to the parameters set by the voice adjuster and the automatic voice adjustment device (step S06).

［１－３．第１実施形態の効果］
（１）本実施形態における自動音声調整装置によれば、複数の周波数特性が同一又は類似する音声が入力された場合に、ピークの周波数が異なるように音声信号調整部２ａ，２ｂ，２ｎの設定を調整するため、音声調整者の熟練度に左右されず、聞き取りやすい音声信号を出力することができる。 [1-3. Effect of the first embodiment]
(1) According to the automatic voice adjusting device in the present embodiment, the voice signal adjusting units 2a, 2b, 2n are set so that the peak frequencies are different when a plurality of voices having the same or similar frequency characteristics are input. Therefore, it is possible to output an easy-to-hear voice signal regardless of the skill level of the voice adjuster.

（２）本実施形態における自動音声調整装置によれば、差異が大きい音声信号Ｂの音声信号調整部２ｂの中心周波数ｆｅｑｂを、差異が小さい音声信号Ａの音声信号調整部２ａの中心周波数ｆｅｑａから、予め設定された閾値以上離れるように移動させるので、複数の周波数特性が同一又は類似する音声が入力された場合でも、相互に干渉することがなく、聞き取りやすい音声信号を出力することができる。 (2) According to the automatic voice adjusting device in the present embodiment, the center frequency feqb of the voice signal adjusting unit 2b of the voice signal B having a large difference is derived from the center frequency feqa of the voice signal adjusting unit 2a of the voice signal A having a small difference. Since the audio signals are moved so as to be separated by a preset threshold value or more, even when a plurality of audio signals having the same or similar frequency characteristics are input, they can output audio signals that are easy to hear without interfering with each other.

（３）本実施形態における自動音声調整装置によれば、差異が大きい音声信号Ｂの音声信号調整部２ｂの中心周波数ｆｅｑｂを、差異が小さい音声信号Ａの音声信号調整部２ａの中心周波数ｆｅｑａから、予め設定された閾値以上離れるように、第２スペクトルの周波数ｆ２ｂ側に移動させるので、強調されたスペクトルと第２スペクトルが近接することになり、違和感がなくなり、より聞き取りやすい音声信号を出力することが可能となる。 (3) According to the automatic voice adjusting device in the present embodiment, the center frequency feqb of the voice signal adjusting unit 2b of the voice signal B having a large difference is derived from the center frequency feqa of the voice signal adjusting unit 2a of the voice signal A having a small difference. Since it is moved to the frequency f2b side of the second spectrum so as to be separated by a preset threshold value or more, the emphasized spectrum and the second spectrum are close to each other, so that there is no discomfort and an audio signal that is easier to hear is output. It becomes possible.

（４）本実施形態における自動音声調整装置によれば、周波数解析演算部５は、音声信号調整部２ａ，２ｂ，２ｎから出力されたゲインの調整後の各音声信号を入力して、その周波数特性を解析しているため、複数の周波数特性が同一又は類似する音声が入力された場合でも、ピークの周波数が異なるように音声信号調整部２ａ，２ｂ，２ｎの設定を移動させている。そのため、音声信号混合部３で音声信号がミックスされる前の状態において、それぞれの音声信号Ａ，Ｂ，Ｎのピークの周波数が異なるよう調整されているため、各音声の干渉や聞き取りが困難になることを防止することができる。 (4) According to the automatic voice adjustment device in the present embodiment, the frequency analysis calculation unit 5 inputs each gain-adjusted voice signal output from the voice signal adjustment units 2a, 2b, 2n, and inputs the frequency thereof. Since the characteristics are analyzed, the settings of the audio signal adjusting units 2a, 2b, and 2n are moved so that the peak frequencies are different even when a plurality of voices having the same or similar frequency characteristics are input. Therefore, in the state before the audio signals are mixed by the audio signal mixing unit 3, the peak frequencies of the respective audio signals A, B, and N are adjusted to be different, which makes it difficult for each audio to interfere or be heard. It can be prevented from becoming.

［２．第２実施形態］
［２－１．第２実施形態の構成］
以下、本発明の第２実施形態について説明する。第２実施形態の構成は、第１実施形態の構成と以下の点で異なり、その他の点は同一である。異なる点は、周波数解析演算部５は、任意に選択された複数の音声信号Ａ，Ｂ，Ｎの周波数特性を比較するのではなく、複数の音声信号Ａ，Ｂ，Ｎのうち、ゲインが最大の音声信号と、ゲインが２番目に大きい音声信号を自動で選択し、周波数特性を比較し、解析演算処理を施す点である。 [2. Second Embodiment]
[2-1. Configuration of the second embodiment]
Hereinafter, a second embodiment of the present invention will be described. The configuration of the second embodiment is different from the configuration of the first embodiment in the following points, and is the same in other points. The difference is that the frequency analysis calculation unit 5 does not compare the frequency characteristics of a plurality of arbitrarily selected audio signals A, B, N, but has the maximum gain among the plurality of audio signals A, B, N. The point is that the audio signal of No. 1 and the audio signal having the second largest gain are automatically selected, the frequency characteristics are compared, and the analysis calculation process is performed.

［２－２．第２実施形態の作用効果］
本実施形態によれば、音声調整者が任意に選択をすることなく、ゲインが最大の音声信号と、ゲインが２番目に大きい音声信号を自動で選択し、周波数特性の調整を施すため、より簡単に音声調整が可能となり、音声調整者の熟練度に左右されず、聞き取りやすい音声信号を出力することができる。 [2-2. Action effect of the second embodiment]
According to the present embodiment, the voice signal having the maximum gain and the voice signal having the second largest gain are automatically selected and the frequency characteristics are adjusted without the voice adjuster making any selection. The voice can be easily adjusted, and an easy-to-hear voice signal can be output regardless of the skill level of the voice adjuster.

［３．他の実施形態］
本発明は上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態にわたる構成要素を適宜組み合わせてもよい。以下は、その一例である。 [3. Other embodiments]
The present invention is not limited to the above embodiment as it is, and at the implementation stage, the components can be modified and embodied within a range that does not deviate from the gist thereof. In addition, various inventions can be formed by an appropriate combination of the plurality of components disclosed in the above-described embodiment. For example, some components may be removed from all the components shown in the embodiments. Furthermore, components over different embodiments may be combined as appropriate. The following is an example.

（１）周波数解析演算部は、各音声信号の周波数特性のスペクトルの比較結果が同一又は類似する場合に、各音声信号の第２スペクトルの周波数を検出し、第１スペクトルの周波数と第２スペクトルの周波数の差異が小さい音声信号について、音声信号調整部の中心周波数を予め設定された閾値以上になるように移動させることができる。その場合、差異が小さい方の音声信号を処理する音声信号調整部の中心周波数を、第２スペクトルと反対側に移動させてもよい。 (1) The frequency analysis calculation unit detects the frequency of the second spectrum of each voice signal when the comparison results of the frequencies of the frequency characteristics of each voice signal are the same or similar, and the frequency of the first spectrum and the second spectrum. For an audio signal having a small difference in frequency, the center frequency of the audio signal adjusting unit can be moved so as to be equal to or higher than a preset threshold value. In that case, the center frequency of the audio signal adjusting unit that processes the audio signal having the smaller difference may be moved to the side opposite to the second spectrum.

（２）周波数解析演算部に入力する各音声信号としては、各音声信号調整部２ａ，２ｂ，２ｎを通過した調整後の音声信号の代わりに、音声信号入力部１ａ，１ｂ，１ｎからの調整前の音声信号を使用することができる。 (2) As each audio signal to be input to the frequency analysis calculation unit, adjustment from the audio signal input units 1a, 1b, 1n instead of the adjusted audio signal that has passed through the audio signal adjustment units 2a, 2b, 2n. The previous audio signal can be used.

（３）音声信号入力部、音声信号調整部は、入力チャンネル数に応じて適宜増減することができる。 (3) The audio signal input unit and the audio signal adjustment unit can be increased or decreased as appropriate according to the number of input channels.

（４）周波数解析演算部の構成は図示のものに限らず、各音声信号調整部内に周波数解析演算部を設けることができる。また、図示の実施形態では、周波数解析演算部で入力された音声信号の周波数特性の比較をしたが、音声信号調整部において周波数特性の比較を行い、周波数解析演算部はその結果に従って、音声信号処理部の中心周波数を予め設定された閾値以上になるように移動させてもよい。 (4) The configuration of the frequency analysis calculation unit is not limited to that shown in the figure, and a frequency analysis calculation unit can be provided in each audio signal adjustment unit. Further, in the illustrated embodiment, the frequency characteristics of the audio signal input by the frequency analysis calculation unit are compared, but the frequency characteristics are compared by the audio signal adjustment unit, and the frequency analysis calculation unit compares the frequency characteristics according to the result. The center frequency of the processing unit may be moved so as to be equal to or higher than a preset threshold value.

（５）閾値の設定部は、音声調整者が手動で設定する以外に、予めプログラムによって閾値を設定することも可能である。例えば、放送内容が複数の出演者が出演する時間帯にのみ本発明の処理を適用し、アナウンサーが１人で話している時間帯では本発明の処理を行わないように設定したり、複数の出演者が出演する主体の時間帯でも出演者の特性に応じて、閾値を自動的に変化させたりするように予め閾値変更用のプログラムを設定しておくこともできる。 (5) The threshold value setting unit can be set in advance by a program in addition to being manually set by the voice coordinator. For example, the processing of the present invention may be applied only to a time zone in which a plurality of performers appear in the broadcast content, and the processing of the present invention may not be performed in a time zone in which the announcer is speaking alone. It is also possible to set a program for changing the threshold in advance so that the threshold is automatically changed according to the characteristics of the performer even in the time zone of the main body in which the performer appears.

（６）音声信号が、男女、人数の増減などによって異なる場合には、例えば、聴感補正フィルタの逆数を手動で設定した閾値に乗じて補正後の閾値を決定するなど、各音声信号の特性に合わせた閾値を設定することで、男性の野太い声や女性の高い声を聞き取りやすくすることも可能である。 (6) When the voice signal differs depending on the gender and the number of people, for example, the inverse number of the hearing correction filter is multiplied by the manually set threshold value to determine the corrected threshold value. By setting a set threshold, it is possible to make it easier to hear the thick voice of men and the high voice of women.

（７）閾値を音声信号入力部ごとに異なる値に設定することもできる。例えば、第１実施形態において、音声信号Ａの閾値をバックグラウンドノイズとなる値に設定することにより、音声信号Ａで突発的に発生する大きな音声によるノイズの影響をなくすことができ、音声信号Ｂ、音声信号Ｎの音声に対し、効率的にバックグラウンドノイズをマスクことができる。 (7) The threshold value can be set to a different value for each audio signal input unit. For example, in the first embodiment, by setting the threshold value of the voice signal A to a value that becomes background noise, it is possible to eliminate the influence of noise caused by a large voice suddenly generated in the voice signal A, and the voice signal B can be eliminated. , Background noise can be efficiently masked with respect to the voice of the voice signal N.

Ａ，Ｂ，Ｎ…音声信号
１ａ，１ｂ，１ｎ…音声信号入力部
２ａ，２ｂ，２ｎ…音声信号調整部
３…音声信号混合部
４…音声信号出力部
５…周波数解析演算部
５１…入力周波数解析部
５２…入力周波数比較部
５３…スペクトル間隔比較部
５４…中心周波数設定部 A, B, N ... Audio signals 1a, 1b, 1n ... Audio signal input units 2a, 2b, 2n ... Audio signal adjustment unit 3 ... Audio signal mixing unit 4 ... Audio signal output unit 5 ... Frequency analysis calculation unit 51 ... Input frequency Analysis unit 52 ... Input frequency comparison unit 53 ... Spectral interval comparison unit 54 ... Center frequency setting unit

Claims

With multiple audio signal input units,
A plurality of audio signal adjusting units that adjust the audio signal input from each audio signal input unit to a predetermined frequency characteristic based on a set center frequency, and
A frequency analysis calculation unit that compares the frequency characteristics of a plurality of arbitrarily selected audio signals and performs analysis calculation processing, and a frequency analysis calculation unit.
An audio signal mixing unit that mixes audio signals that have been adjusted by each audio signal adjusting unit,
An audio signal output unit that outputs an audio signal mixed by the audio signal mixing unit, and an audio signal output unit.
Equipped with
The frequency analysis calculation unit is set to adjust the frequency characteristics of the audio signal for any of the audio signals when the comparison results of the frequency characteristics of the audio signals are the same or similar. An automatic voice adjustment device characterized in that the center frequency of the adjustment unit is moved so as to be equal to or higher than a preset threshold value with respect to the frequency of the first spectrum of the voice signal.

The wave number analysis calculation unit detects the frequency of the second spectrum for each voice signal when the comparison result of the frequency characteristics of each voice signal is the same or similar, and the frequency of the first spectrum and the frequency of the second spectrum. The automatic voice adjusting device according to claim 1, wherein the center frequency of the voice signal adjusting unit is moved so as to be equal to or higher than a preset threshold value for a voice signal having a large frequency difference.

The automatic voice adjustment device according to claim 2, wherein the center frequency of the voice signal adjustment unit is moved to the second spectrum side so as to be equal to or higher than a preset threshold value.

The wave number analysis calculation unit inputs the adjusted audio signal output from the audio signal adjustment unit, and sets the center frequency of the audio signal adjustment unit set to adjust the frequency characteristics of the audio signal. The automatic audio adjustment device according to any one of claims 1 to 3, wherein the frequency of the first spectrum of the audio signal is moved so as to be equal to or higher than a preset threshold value.