JP2019062322A

JP2019062322A - Audio control console controllable by camera

Info

Publication number: JP2019062322A
Application number: JP2017184231A
Authority: JP
Inventors: 源太友部; Genta Tomobe
Original assignee: Tamura Corp
Current assignee: Tamura Corp
Priority date: 2017-09-25
Filing date: 2017-09-25
Publication date: 2019-04-18
Anticipated expiration: 2037-09-25
Also published as: JP6917847B2

Abstract

To provide an audio control console that automatically switches microphone audio based on a face image captured by a camera.SOLUTION: A face image recognition unit 22 takes in images of cameras C1 to C3 and recognizes face images included in the images. A person's name and its face image are registered in a database 23. Person's names are assigned to audio adjustment faders 131 to 133. A person's name specification unit 24 specifies person's names of face images captured by the cameras C1 to C3 based on the face image input from the face image recognition unit 22 and the information registered in the database 23. A fader control unit 14 turns on the faders 131 to 133 to which the person's name specified by the person's name specification unit 24 is assigned.SELECTED DRAWING: Figure 1

Description

本発明は、カメラによって撮影した画像に基づいて，音声出力を制御することが可能な音声調整卓に関する。 The present invention relates to an audio control console capable of controlling audio output based on an image captured by a camera.

テレビ放送局の音声調整システムにおいて、放送もしくは収録する番組の進行により使用するマイク音声が頻繁に切り替わる。マイク音声とは、アナウンサーや出演者などが発する音声をマイクに入力した音声である。例えば、スタジオ内でシーン１からシーン２に切り替わるときに出演者が変わるタイミングで出演者が使用しているマイク音声を切り替える。また、スタジオから中継画像へ切り替わる時にアナウンサーが切り替わるタイミングで使用するマイク音声を切り替えることがある。 In a voice control system of a television station, microphone sound to be used is frequently switched as the program to be broadcast or recorded advances. The microphone sound is a sound obtained by inputting a sound emitted by an announcer or a performer to a microphone. For example, when the performer switches from scene 1 to scene 2 in the studio, the microphone sound used by the performer is switched at the timing when the performer changes. Also, the microphone sound to be used may be switched at the timing when the announcer switches when switching from the studio to the relay image.

通常の運用においては、番組中に使用するマイク音声の全てを音声調整卓に入力しておき、番組進行に合わせてフェーダーと呼ばれる操作部を手動操作することで使用するマイク音声を切り替えている。このような音声調整卓は、例えば下記のような特許文献に記載されている。 In normal operation, all the microphone voices used during the program are input to the voice control console, and the microphone voices used are switched by manually operating the operation unit called a fader according to the program progress. Such an audio control console is described, for example, in the following patent documents.

特開２０１５−２１３２４６号公報JP, 2015-213246, A 特開２０１５−９５１２７号公報JP, 2015-95127, A 特開２０１４−９３６６５号公報JP, 2014-93665, A

しかし、従来技術のように音声調整卓のフェーダーを手動で操作すると、上げ忘れや下げ忘れなどの誤操作によりマイク音声の切り替えが行われず音声が出力されないとか、不要な音声が出力されてしまうなどの問題が発生する。また、使用するフェーダー全てをＯＮ状態まで上げたままにした場合、不要な音声が出力されてしまったり、すべてのマイクが拾っているノイズが加算され放送音声のＳ／Ｎ比の低下を引き起こしてしまう。 However, when the faders of the audio control console are manually operated as in the prior art, switching of the microphone audio is not performed due to an erroneous operation such as forgetting to raise or forget to lower the audio, or unnecessary audio is output. I have a problem. Also, if all the faders used are kept ON, unnecessary audio may be output, or noise picked up by all the microphones will be added to cause a drop in the S / N ratio of the broadcast audio. I will.

本発明の目的は、カメラによってマイク音声の切り替え対象となる人の顔を認識し、その顔画像が登録された音声調整卓のフェーダーを制御して音量自動調整することにより、マイク音声の切り替えを確実に行うことを可能とした音声調整卓を提供することにある。 An object of the present invention is to switch a microphone voice by recognizing a face of a person who is a target of switching a microphone voice by a camera and controlling a fader of a voice control console registered with the face image to automatically adjust the volume. It is to provide an audio control console that can be reliably performed.

本発明のカメラによる制御可能な音声調整卓は、次のような構成を有することを特徴とする。
（１）カメラの映像を取り込んで、前記映像中に含まれる顔画像を認識する顔画像認識部。
（２）人名とその顔画像を登録したデータベース。
（３）人名が割り当てられている音声調整用のフェーダー。
（４）前記顔画像認識部から入力された顔画像と前記データベースに登録された情報に基づいて、カメラに写っている顔画像の人名を特定する人名特定部。
（５）前記人名特定部によって特定された人名が割り当てられているフェーダーをオンにするフェーダー制御部。 The camera-controllable audio control console of the present invention is characterized by having the following configuration.
(1) A face image recognition unit that captures an image of a camera and recognizes a face image included in the image.
(2) A database in which personal names and their face images are registered.
(3) Faders for audio adjustment to which personal names are assigned.
(4) A personal name specifying unit for specifying a personal name of a face image captured by a camera based on the face image input from the face image recognition unit and the information registered in the database.
(5) A fader control unit for turning on a fader to which the personal name specified by the personal name specifying unit is assigned.

本発明は、次のような態様も包含する。
（６）前記顔画像認識部は、複数のカメラの映像を取り込んで、各カメラの映像中に含まれる顔画像を前記複数のカメラごとに認識し、前記人名特定部が複数のカメラに同一人の顔画像が写っていることを特定した場合に、前記フェーダー制御部は、前記複数のカメラの中のいずれか1つに写っている顔画像の人名が割り当てられているフェーダーの音量をオンとする。
（７）観客用のマイクの音量を増減する観客用のフェーダーを備え、前記顔画像認識部は人の顔の複数種類のパーツを判定するパーツ判定部を更に備え、前記フェーダー制御部は前記パーツ判定部が認識できたパーツ数が閾値を越えた場合に観客用のフェーダーをオンとするように制御する。
（８）前記フェーダー制御部は、オンエアーまたは録画中のカメラを示すタリー信号の入力を判定する回路を備え、前記タリー信号によってアクティブと判定されたカメラからの情報に基づいて、アクティブなカメラに写っている人の人名に対応するマイクのフェーダーをオンとするように制御する。 The present invention also includes the following embodiments.
(6) The face image recognition unit takes in the images of a plurality of cameras, recognizes the face image contained in the images of each camera for each of the plurality of cameras, and the personal name identification unit is identical to the plurality of cameras. When it is determined that the face image of the face image is captured, the fader control unit turns on the volume of the fader to which the name of the face image captured in any one of the plurality of cameras is assigned. Do.
(7) A spectator fader is provided to increase or decrease the volume of a spectator microphone, the face image recognition unit further includes a part determination unit that determines a plurality of types of human face parts, and the fader control unit is the part Control is performed to turn on the spectator fader when the number of parts recognized by the determination unit exceeds a threshold.
(8) The fader control unit includes a circuit that determines an input of a tally signal indicating a camera that is on air or recording, and the image is displayed on the active camera based on information from the camera determined to be active by the tally signal. Control to turn on the microphone fader corresponding to the person's name.

本発明によれば、カメラで撮影した画像から顔認識し、自動でフェーダーのオン・オフ制御を行うことが可能になり、手動による誤操作の防止やマイク集音ノイズの低減を図ることができる。 According to the present invention, it is possible to perform face recognition from an image captured by a camera and to automatically perform on / off control of a fader, thereby preventing manual erroneous operation and reducing microphone sound collection noise.

本発明の第１実施形態を示す機能ブロック図。FIG. 1 is a functional block diagram showing a first embodiment of the present invention. 本発明の第２実施形態を示す機能ブロック図。The functional block diagram which shows 2nd Embodiment of this invention. 本発明の他の実施形態におけるカメラの切り替えの状態を示す図。The figure which shows the state of the switch of the camera in other embodiment of this invention.

以下、本発明の実施形態を、図面に従って具体的に説明する。本実施形態においては、カメラや音声調整卓が本来備えている機能に関する構成については省略し、オン・オフの切り替えの対象となるマイクとその音量を調整するフェーダーに関する機能および構成について説明する。なお、本実施形態においてカメラ及びマイクを３台として説明するが、これらの台数は単なる例示であり、限定されるものではない。 Hereinafter, embodiments of the present invention will be specifically described with reference to the drawings. In the present embodiment, the configuration relating to the functions originally provided for the camera and the audio control console will be omitted, and the functions and configurations relating to the microphone to be switched on / off and the fader for adjusting the volume will be described. Although three cameras and microphones are described in the present embodiment, the number of these is merely an example and is not limited.

［１．第１実施形態］
（１）構成
本実施形態の音声調整卓は、図１に示すように、公知の音声調整卓が備えている複数のマイクＭ１〜Ｍ３に対する音声制御部１０と、カメラＣ１〜Ｃ３に写っている人の名前を特定する顔情報分析部２０とから構成される。音声制御部１０と顔情報分析部２０にはそれぞれ通信部１１，２１が設けられ、この通信部１１，２１を介して顔情報分析部２０において特定された人名やカメラ映像中のパーツの数などのデータが交信される。 [1. First embodiment]
(1) Configuration As shown in FIG. 1, the audio control console of the present embodiment is reflected in the audio control unit 10 for a plurality of microphones M1 to M3 provided in a known audio control console and cameras C1 to C3. And a face information analysis unit 20 for specifying a person's name. The voice control unit 10 and the face information analysis unit 20 are respectively provided with communication units 11 and 21. The name of the person specified in the face information analysis unit 20 via the communication units 11 and 21, the number of parts in the camera image, etc. Data is communicated.

音声制御部１０は、音声調整卓に接続された複数のマイクＭ１〜Ｍ３からの音声信号を独立して受信する入力部１２１〜１２３と、入力部１２１〜１２３に入力された複数の音声信号をそれぞれ独立してオン・オフおよびその音量を調整する複数のフェーダー１３１〜１３３を備えている。これら複数のフェーダー１３１〜１３３は、従来の音声調整卓と同様に手動あるいは予め設定されているコンピュータプログラムにより、別々にあるいは同期してその音量が制御される。各フェーダー１３１〜１３３には、そのフェーダーが制御するマイクＭ１〜Ｍ３を装着する人の名前が割り当てられている。 The audio control unit 10 has input units 121 to 123 for independently receiving audio signals from a plurality of microphones M1 to M3 connected to the audio control console, and a plurality of audio signals input to the input units 121 to 123. A plurality of faders 131 to 133 are provided to independently turn on / off and adjust the volume. The volume of each of the plurality of faders 131 to 133 is controlled separately or in synchronization by a computer program which is manually or preset as in the conventional audio control console. The names of persons wearing the microphones M1 to M3 controlled by the faders are assigned to the respective faders 131 to 133.

音声制御部１０には、前記のような従来技術によるフェーダーの制御とは別に、顔情報分析部２０からの情報に従いフェーダー１３１〜１３３のオン・オフ制御を行うフェーダー制御部１４が設けられている。フェーダー制御部１４は、ハードウェアとしてはＣＰＵおよびＦＰＧＡ（プログラマブル・ロジック・デバイス）によって構成される。フェーダー制御部１４は、顔情報分析部２０からカメラＣ１〜Ｃ３に写っている人の名前が入力された場合に、その人名に割り当てられているマイクＭ１〜Ｍ３のフェーダー１３１〜１３３をオンとすると共に、顔情報分析部２０から人名の入力がない場合にはその人名に割り当てられているマイクＭ１〜Ｍ３のフェーダー１３１〜１３３をオフとする。 The audio control unit 10 is provided with a fader control unit 14 that performs on / off control of the faders 131 to 133 according to the information from the face information analysis unit 20, separately from the control of the fader according to the conventional technology as described above. . The fader control unit 14 is configured by a CPU and an FPGA (programmable logic device) as hardware. The fader control unit 14 turns on the faders 131 to 133 of the microphones M1 to M3 assigned to the person's name when the face information analysis unit 20 inputs the name of a person appearing in the cameras C1 to C3. At the same time, when there is no input of a personal name from the face information analysis unit 20, the faders 131 to 133 of the microphones M1 to M3 assigned to the personal name are turned off.

フェーダー制御部１４には、音声調整卓に接続されている複数のカメラＣ１〜Ｃ３のいずれか１つから同じ人名が入力された場合に、その人名に割り当てられているマイクＭ１〜Ｍ３のフェーダー１３１〜１３３をオンとするオア回路が設けられている。すなわち、フェーダー制御部１４は、いずれか１つのカメラＣ１〜Ｃ３から特定の人名が入力される限り、その人名に対応するマイクＭ１〜Ｍ３のフェーダー１３１〜１３３のオン状態を維持する。 When the same person's name is input from any one of the plurality of cameras C1 to C3 connected to the audio control console, the fader control unit 14 faders 131 of the microphones M1 to M3 assigned to the person's name. An OR circuit is provided to turn on. That is, as long as a specific personal name is input from any one of the cameras C1 to C3, the fader control unit 14 maintains the on state of the faders 131 to 133 of the microphones M1 to M3 corresponding to the personal name.

顔情報分析部２０には、顔画像認識部２２と、データベース２３と、人名特定部２４が設けられている。これらの各部は、ハードウェアとしてはＣＰＵおよびメモリなどの記録手段によって構成される。顔画像認識部２２は、音声調整卓に接続された１つあるいは複数のカメラＣ１〜Ｃ３の映像を取り込んで、映像中に含まれる顔画像を認識する。すなわち、顔画像認識部２２は、各カメラＣ１〜Ｃ３に写し出されている映像を取り込んで顔画像が存在するか否かを判定し、顔画像が存在した場合には、その顔画像とカメラＣ１〜Ｃ３のＩＤと対応付けた状態で人名特定部２４へ出力する。データベース２３は、人名とその顔画像を対応付けて予め記録手段に登録したものである。人名特定部２４は、顔画像認識部２２から入力された顔画像とデータベース２３に登録された顔画像を比較することで、カメラＣ１〜Ｃ３に写っている顔画像の人名を特定する。 The face information analysis unit 20 is provided with a face image recognition unit 22, a database 23, and a personal name identification unit 24. These units are configured by hardware and recording means such as a CPU and a memory. The face image recognition unit 22 captures an image of one or more cameras C1 to C3 connected to the audio control console, and recognizes a face image included in the image. That is, the face image recognition unit 22 takes in the video captured to each of the cameras C1 to C3 and determines whether or not a face image exists. If a face image exists, the face image and the camera C1 are determined. It outputs to the personal name identification part 24 in the state matched with ID of -C3. The database 23 associates a personal name with its face image and is registered in advance in the recording means. The personal name specification unit 24 specifies the personal name of the face image captured by the cameras C1 to C3 by comparing the face image input from the face image recognition unit 22 with the face image registered in the database 23.

人名特定部２４は、通信部１１，２１を介して音声制御部１０のフェーダー制御部１４に接続され、カメラＣ１〜Ｃ３のＩＤとそのカメラＣ１〜Ｃ３に写し出されている顔画像の人名とがフェーダー制御部１４に出力される。 The personal name identification unit 24 is connected to the fader control unit 14 of the audio control unit 10 via the communication units 11 and 21 and the IDs of the cameras C1 to C3 and the personal names of face images displayed on the cameras C1 to C3 are displayed. It is output to the fader control unit 14.

（２）作用
本実施形態においては、顔情報分析部２０に接続されているいずれかのカメラＣ１〜Ｃ３にマイクＭ１〜Ｍ３を装着している人の映像が写し出されると、その映像は顔画像認識部２２に入力される。顔画像認識部２２ではカメラＣ１〜Ｃ３からの映像中に顔画像が含まれるか否かを判別し、顔画像が含まれている場合にはその顔画像に相当する画像データを人名特定部２４に出力する。人名特定部２４では、入力された顔画像と予めデータベース２３に登録されている顔画像と比較し、同一の顔画像が存在した場合にはカメラＣ１〜Ｃ３に写し出された人の名前を特定する。 (2) Operation In the present embodiment, when an image of a person wearing the microphones M1 to M3 is taken out to one of the cameras C1 to C3 connected to the face information analysis unit 20, the image is a face image It is input to the recognition unit 22. The face image recognition unit 22 determines whether or not a face image is included in the images from the cameras C1 to C3. If the face image is included, the image data corresponding to the face image is identified by the person name identification unit 24. Output to The personal name specifying unit 24 compares the input face image with the face image registered in advance in the database 23, and when there is the same face image, specifies the name of the person shown on the cameras C1 to C3. .

人名特定部２４はカメラＣ１〜Ｃ３に写し出された人の名前を特定した後、カメラＣ１〜Ｃ３のＩＤとそこに写し出された人の名前を対応付けてフェーダー制御部１４に出力する。複数のカメラＣ１〜Ｃ３に同じ人あるいは異なる人が写っている場合も、人名特定部２４は各カメラＣ１〜Ｃ３のＩＤとそこに写し出された人の名前を対応付けてフェーダー制御部１４に出力する。 After identifying the names of the persons copied to the cameras C1 to C3, the personal name identification unit 24 associates the IDs of the cameras C1 to C3 with the names of the persons extracted there and outputs them to the fader control unit 14. Even when the same person or different people are shown in a plurality of cameras C1 to C3, the personal name specifying unit 24 outputs the ID of each camera C1 to C3 to the fader control unit 14 in association with the names of the persons photographed there. Do.

カメラＣ１〜Ｃ３のＩＤと対応する人名とを受信したフェーダー制御部１４は、その人名に対応するフェーダー１３１〜１３３の音量をオンとする。複数のカメラＣ１〜Ｃ３から異なる人名がフェーダー制御部１４に入力された場合は、対応するすべてのフェーダー１３１〜１３３のすべてについてその音量をオンとする。 The fader control unit 14 having received the IDs of the cameras C1 to C3 and the corresponding personal names turns on the volume of the faders 131 to 133 corresponding to the personal names. When different personal names are input to the fader control unit 14 from the plurality of cameras C1 to C3, the volume of all the corresponding faders 131 to 133 is turned on.

複数のカメラＣ１〜Ｃ３から同一の人名が入力された場合は、フェーダー制御部１４はそこに設けられたオア回路に従い、いずれかのカメラＣ１〜Ｃ３に同一の人が写っている限り、その人名に対応するマイクＭ１〜Ｍ３のフェーダー１３１〜１３３のオン状態を維持する。 When the same person's name is input from a plurality of cameras C1 to C3, the fader control unit 14 follows the OR circuit provided there, and as long as the same person appears on any of the cameras C1 to C3, the person's name The on-states of the faders 131 to 133 of the microphones M1 to M3 corresponding to V.

一方、マイクＭ１〜Ｍ３を装着しているにもかかわらず、いずれのカメラＣ１〜Ｃ３にも写っていない人は、顔画像認識部２２および人名特定部２４において人名が特定されることがない。そのため、人名特定部２４からフェーダー制御部１４に対して、カメラＣ１〜Ｃ３のＩＤと対応する人名が出力されることがない。その結果、フェーダー制御部１４は人名に対応するフェーダー１３１〜１３３をオンにできないため、そのフェーダー１３１〜１３３はオフとなり、そのマイクＭ１〜Ｍ３から入力された音声信号は音声調整卓から出力されることがない。 On the other hand, a person who is not shown in any of the cameras C1 to C3 despite wearing the microphones M1 to M3 does not have a person name specified in the face image recognition unit 22 and the person name specification unit 24. Therefore, the personal name corresponding to the ID of the cameras C1 to C3 is not output from the personal name specifying unit 24 to the fader control unit 14. As a result, since the fader control unit 14 can not turn on the faders 131 to 133 corresponding to personal names, the faders 131 to 133 are turned off, and the audio signals input from the microphones M1 to M3 are output from the audio control console I have not.

このことは、マイクＭ１〜Ｍ３を装着した人がカメラＣ１〜Ｃ３に写っており、フェーダー１３１〜１３３がオンになっている場合において、カメラＣ１〜Ｃ３あるいは写っている人が移動してカメラＣ１〜Ｃ３の映像中に顔画像認識ができなくなった時も同様である。カメラＣ１〜Ｃ３の映像中から顔画像が認識できなくなり、人名の特定が不可能になると、フェーダー制御部１４は人名に対応するフェーダーをオフとするので、映像に写っていない人に装着されたマイクＭ１〜Ｍ３の音量がオフになる。 This means that when a person wearing the microphones M1 to M3 is shown on the cameras C1 to C3 and the faders 131 to 133 are turned on, the camera C1 to C3 or a person on the camera C1 moves. The same is true when face image recognition can not be performed during the video of ~ C3. When the face image can not be recognized from the images of the cameras C1 to C3 and the identification of the person's name becomes impossible, the fader control unit 14 turns off the fader corresponding to the person's name. The volume of the microphones M1 to M3 is turned off.

（３）効果
本実施形態によれば、次のような効果が発揮される。
（ａ）カメラＣ１〜Ｃ３に写っている人が装着しているマイクＭ１〜Ｍ３のフェーダー１３１〜１３３のみをオンにすることが可能となるので、カメラＣ１〜Ｃ３に映っていない人のマイクＭ１〜Ｍ３から入力されるノイズを音声調整卓から出力することがなくなる。 (3) Effects According to the present embodiment, the following effects are exhibited.
(A) Since it becomes possible to turn on only the faders 131 to 133 of the microphones M1 to M3 worn by the people photographed on the cameras C1 to C3, the microphones M1 of the people not photographed on the cameras C1 to C3 The noise input from ~ M3 is not output from the audio control console.

（ｂ）マイクＭ１〜Ｍ３を装着した人がカメラに写っているか否かを顔画像の認識によって自動的に行い、しかもその顔画像に基づいて人名を判定するので、人名に対応付けたフェーダー１３１〜１３３もオン・オフを自動的に行うことができる。その結果、フェーダー１３１〜１３３のオン・オフを手動で行っていた場合のような操作ミスがなくなり、音声調整卓の操作者の負担も大幅に軽減する。 (B) It is automatically judged by the face image whether a person wearing the microphones M1 to M3 appears in the camera, and a person's name is determined based on the face image. ~ 133 can also be turned on and off automatically. As a result, there is no operation error as in the case where the faders 131 to 133 are manually turned on and off, and the burden on the operator of the audio control console is also greatly reduced.

（ｃ）フェーダー制御部１４に設けられたオア回路により、異なるカメラＣ１〜Ｃ３が同じ人の顔画像を認識した場合であっても、いずれかのカメラＣ１〜Ｃ３に人名を特定できる人が写っている限りは、その人名に対応するマイクＭ１〜Ｍ３のフェーダー１３１〜１３３をオンに維持することができる。 (C) Even if different cameras C1 to C3 recognize face images of the same person by the OR circuit provided in the fader control unit 14, a person who can specify a person's name appears in any of the cameras C1 to C3 As long as it is, it is possible to keep the faders 131 to 133 of the microphones M1 to M3 corresponding to the person's name on.

（ｄ）複数台カメラがある場合、その内の1台がタリー信号によってアクティブになり、そのカメラで写された映像のみが放映あるいは録画される場合があるが、本実施形態においては、タリー信号に関わらず、いずれかのカメラに写っている人の人名が特定された場合には、その人名に対応するマイクのフェーダーはオン制御される利点がある。 (D) When there are a plurality of cameras, one of them is activated by the tally signal, and only the video taken by the camera may be broadcast or recorded. In this embodiment, the tally signal is used. Regardless of the above, there is an advantage that the fader of the microphone corresponding to the person's name is controlled to be on when the person's name shown in any camera is identified.

［２．第２実施形態］
図２は本発明の第２実施形態を示す。第２実施形態は予め装着している人の名前が分っているマイクではなく、観客のように人名が不明な人が使用するマイクをカメラの映像によってオン・オフ制御する。そのため、第２実施形態の音声調整卓は、人の名前が分っているマイクＭ１，Ｍ２に加えて観客用のマイクＭ４と、その音量を増減する観客用のフェーダー１３４を備える。また、観客用のマイクＭ４が向けられている人を撮影するカメラＣ４の映像が顔画像認識部２２に入力される。顔画像認識部２２には、第１実施形態の顔画像認識処理を行う部分に加えて、人の顔の複数種類のパーツを判定するパーツ判定部２５を備える。 [2. Second embodiment]
FIG. 2 shows a second embodiment of the present invention. In the second embodiment, a microphone used by a person whose name is unknown, such as a spectator, is on / off controlled by an image of a camera, not a microphone whose name is known in advance. Therefore, in addition to the microphones M1 and M2 whose names are known, the audio control console of the second embodiment is provided with a microphone M4 for the audience and a fader 134 for the audience to increase / decrease the volume. In addition, an image of the camera C4 that captures a person to whom the audience microphone M4 is directed is input to the face image recognition unit 22. The face image recognition unit 22 includes, in addition to the part performing the face image recognition processing of the first embodiment, a parts determination unit 25 that determines a plurality of types of parts of a human face.

パーツ判定部２５は、人名特定部２４がデータベース２３にアクセスをし、データベース２３にカメラＣ４からの顔画像に対応する人名が登録されていなかった場合は、人名特定部２４では顔画像認識部２２からの顔画像はあるが、データベース２３上に登録がないため不一致となり、人名を特定することができず、結果としてフェーダー制御部１４はいずれのフェーダーに対してもオン・オフ制御を行うことができない。この場合の不一致に対して、第２実施形態では観客用のフェーダー１３４がオンになるよう処理が行えるようにする。 In the part determination unit 25, when the personal name identification unit 24 accesses the database 23 and the personal name corresponding to the face image from the camera C 4 is not registered in the database 23, the personal name identification unit 24 recognizes the face image recognition unit 22. There is a face image from, but there is no registration on the database 23 and there is a mismatch, and it is not possible to specify a person's name, and as a result the fader control unit 14 performs on / off control for any fader. Can not. In the second embodiment, it is possible to perform processing so that the audience fader 134 is turned on in response to the mismatch in this case.

すなわち、顔画像が入力された人名特定部２４がその顔画像に基づいて人名の特定を行うことができなかった場合には、人名特定部２４はパーツ判定部２５からの情報に基づいて、観客用のフェーダー１３４のオン・オフ制御を行う。観客用のフェーダー１３４のオン・オフ制御部の判断基準は、認識できた顔のパーツ（目、眉、鼻、口、耳、頭など）の数を用いる。 That is, when the personal name identification unit 24 in which the face image is input can not identify the personal name based on the face image, the personal name identification unit 24 determines the audience based on the information from the parts determination unit 25. Control the on / off control of the fader 134 for The judgment criteria of the on / off control unit of the audience fader 134 uses the number of recognized face parts (eyes, eyebrows, nose, mouth, ears, head, etc.).

フェーダー制御部１４は、パーツ判定部２５が認識できたパーツ数が予め定めた閾値（例えば３）を越えた場合に観客用のフェーダー１３４をオンとする。例えば、フェーダー制御部１４における観客用のフェーダー１３４の制御判断基準は、下記の左側の表の〇に示すとおり、認識できたパーツ数が閾値３を越えている場合にはフェーダーをオンとし、右側の表の〇に示すとおり、認識できたパーツ数が閾値３以下である場合には、フェーダーをオフとする。

The fader control unit 14 turns on the spectator fader 134 when the number of parts recognized by the parts determination unit 25 exceeds a predetermined threshold (for example, 3). For example, as shown in the left side of the table on the left side of the table below, the control judgment criteria for the spectator fader 134 in the fader control unit 14 turn on the fader when the number of recognized parts exceeds the threshold 3, As indicated by the circle in the table, when the number of recognized parts is equal to or less than the threshold 3, the fader is turned off.

本実施形態によれば、観客用マイクＭ４のように特定の人名と対応づけられていないマイクであっても、カメラＣ４が観客用マイクＭ４を向けられた人の顔画像を認識した場合には、観客用マイクＭ４を制御するフェーダー１３４をオン・オフすることが可能となる。しかも、カメラの映像の中に顔画像が存在することを認識した場合であっても、顔を構成する複数のパーツが明確に判定できる場合のみ観客用フェーダー１３４をオンにするので、観客がカメラ中に明確に映し出されている場合のみ観客用マイクＭ４をオンにすることができる。 According to the present embodiment, even when the microphone C4 does not correspond to a specific person's name like the audience microphone M4, when the camera C4 recognizes the face image of the person directed to the audience microphone M4, The fader 134 for controlling the audience microphone M4 can be turned on / off. Moreover, even when it is recognized that a face image is present in the camera image, the audience fader 134 is turned on only when a plurality of parts constituting the face can be clearly determined. The audience microphone M4 can be turned on only when it is clearly shown in the inside.

［３．他の実施形態］
本発明は、前記の実施形態に限定されるものではなく、例えば、次のような他の実施形態も包含する。
（１）前記各実施形態においては１つのカメラの映像に１つの顔画像が写っている場合について説明したが、１つのカメラに複数の顔画像が写っており、複数の人名が特定された場合には、複数のフェーダーを１つのカメラからの情報によって同時に制御することも可能である。 [3. Other embodiments]
The present invention is not limited to the above embodiment, and includes, for example, the following other embodiments.
(1) In the above embodiments, the case where one face image appears in the image of one camera has been described, but a plurality of face images appear in one camera, and a plurality of names of people are specified. It is also possible to simultaneously control multiple faders with information from one camera.

（２）複数人で１つのマイクを共用する場合のように、１つのマイクのフェーダーに対して複数の人名を対応付けておき、人名特定部２４によってカメラの映像中に複数の人名中のいずれか１つが特定された場合には、その人名に対応付けられたマイクをオンとすることもできる。 (2) As in the case where one microphone is shared by a plurality of persons, a plurality of personal names are associated with the fader of one microphone, and the personal name identification unit 24 selects one of the plural personal names in the video of the camera. If one is identified, the microphone associated with that person's name may be turned on.

（３）通常、スタジオカメラシステムには、タリーランプが搭載されており、オンエアー時や記録時にタリーランプを点灯させ、被写体（演者）側がアクティブなカメラの運用状態を確認できる構成となっている。このようなシステムにおいては、フェーダー制御部１４に各カメラのタリーランプの点灯および未点灯を検出するなどの、オンエアーまたは録画中のカメラを示すタリー信号の入力を判定する回路を設け、タリーランプの点灯したアクティブなカメラからの情報に基づいて各フェーダーを制御することができる。 (3) In general, a tally lamp is mounted in a studio camera system, and the tally lamp is turned on at the time of on-air or at the time of recording so that the subject (performer) can confirm the operation state of the active camera. In such a system, the fader control unit 14 is provided with a circuit for determining the input of a tally signal indicating a camera that is on air or recording, such as detecting lighting and non-lighting of the tally lamp of each camera. Each fader can be controlled based on information from the lighted active camera.

すなわち、図３（ａ）に示すように、タリーランプが点灯したカメラをアクティブとし、未点灯のカメラをスタンバイとすることで、アクティブのカメラからの映像によって特定された人名に対応するマイクのフェーダーのみをオン状態とし、スタンバイのカメラによって特定された人名については、たとえマイクを装着した人がカメラに写っていてもその人名に対応するマイクのフェーダーをオフ状態とする。その後、図３（ｂ）に示すように、オンエアー状態となるカメラが切り替わったことを、タリー信号の入力に基づいてフェーダー制御部１４が検出すると、新たにアクティブとなったカメラからの情報に基づいて、そのカメラによって特定された人名に対応するマイクのフェーダーをオン状態とし、他のスタンバイのカメラによって特定された人名に対応するマイクについてはそのフェーダーをオフ状態とする。 That is, as shown in FIG. 3A, by setting the camera with the tally lamp activated and setting the unlit camera as the standby, the fader of the microphone corresponding to the name specified by the image from the active camera For the personal name specified by the standby camera only, even if the person wearing the microphone is on the camera, the microphone fader corresponding to the personal name is turned off. Thereafter, as shown in FIG. 3B, when the fader control unit 14 detects that the camera in the on-air state has been switched based on the input of the tally signal, the information from the newly activated camera is used. Then, the fader of the microphone corresponding to the personal name specified by the camera is turned on, and the fader is turned off for the microphone corresponding to the personal name specified by the other standby camera.

Ｃ１〜Ｃ４…カメラ
Ｍ１〜Ｍ４…マイク
１０…音声制御部１１…通信部
１２１〜１２４…入力部
１３１〜１３４…フェーダー
１４…フェーダー制御部
２０…顔情報分析部
２１…通信部
２２…顔画像認識部
２３…データベース
２４…人名特定部
２５…パーツ判定部 C1 to C4: Cameras M1 to M4: Microphone 10: Voice control unit 11: Communication unit 121 to 124: Input unit 131 to 134: Fader 14: Fader control unit 20: Face information analysis unit 21: Communication unit 22: Face image recognition Part 23 ... Database 24 ... Person name identification part 25 ... Parts judgment part

Claims

A face image recognition unit for capturing an image of a camera and recognizing a face image included in the image;
A database in which people's names and their face images are registered,
Voice adjustment faders to which personal names are assigned,
A personal name specifying unit for specifying a personal name of a face image captured by a camera based on the face image input from the face image recognition unit and the information registered in the database;
A fader control unit for turning on a fader to which the personal name specified by the personal name specifying unit is assigned;
An audio control console that can be controlled by a camera equipped with.

The face image recognition unit takes in images of a plurality of cameras, and recognizes face images included in the images of each camera for each of the plurality of cameras.
When the personal name specifying unit specifies that face images of the same person are captured by a plurality of cameras, the fader control unit may determine the name of the face image captured in any one of the plurality of cameras. The camera-controllable audio console according to claim 1, wherein the faders assigned are turned on.

Has a fader for the audience to increase or decrease the volume of the audience microphone
The face image recognition unit further includes a parts determination unit that determines a plurality of types of parts of a human face,
The audio controllable by the camera according to claim 1 or 2, wherein the fader control unit performs control to turn on the spectator fader when the number of parts recognized by the part determination unit exceeds a threshold. Adjustment table.

The said fader control part is provided with a circuit which determines the input of the tally signal which shows the camera on air or recording, and the person who is reflected to the active camera based on the information from the camera judged to be active by the said tally signal. The camera-controllable voice control console according to any one of claims 1 to 3, which is controlled to turn on a microphone fader corresponding to a person's name.