JP2008236397A

JP2008236397A - Acoustic control system

Info

Publication number: JP2008236397A
Application number: JP2007073423A
Authority: JP
Inventors: Kenji Hirata; 健二平田
Original assignee: Fujifilm Corp
Current assignee: Fujifilm Corp
Priority date: 2007-03-20
Filing date: 2007-03-20
Publication date: 2008-10-02

Abstract

PROBLEM TO BE SOLVED: To provide an acoustic control system capable of performing exact acoustic control in accordance with the position of a person by accurately detecting the position and movement of the person listening to sound. SOLUTION: When a sound reproduction start instruction is inputted, a sound reproducing device 20 starts to reproduce sound, and an acoustic control processing part 18 starts acoustic control of the sound reproducing device 20. When sound reproduction is started, a camera 12 photographs a person in a vehicle. A face detection processing part 14 detects a face area of the person from an image photographed by the camera 12. A face position operating part 16 operates the position of the face of the person detected by the face detection processing part 14. An acoustic control processing part 18 adjusts sound outputting timing and performs acoustic control (for example, equalization) on the basis of the position of the face operated by the face position operating part 16. COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は音響調整システムに係り、特に複数のスピーカを備えた音声再生装置の音響を調整する音響調整システムに関する。 The present invention relates to an acoustic adjustment system, and more particularly to an acoustic adjustment system that adjusts the sound of a sound reproducing device including a plurality of speakers.

特許文献１には、車内の人物の声を検出する複数のマイクからの検出信号の時間差を検出して人物の位置を演算し、人物の位置に応じて音響調整を行う車載用音響装置の自動調整装置が開示されている。
特開平１１−２６８５９３号公報 Japanese Patent Application Laid-Open No. 2004-151867 describes an automatic on-vehicle acoustic device that detects time differences among detection signals from a plurality of microphones that detect a person's voice in a vehicle, calculates the position of the person, and performs acoustic adjustment according to the position of the person. An adjustment device is disclosed.
JP-A-11-268593

特許文献１では、複数のマイクにより検出した音声により人物の位置を演算しているが、車外からのノイズ、車内の再生中の音響等がマイクによって検出された場合に、人物の位置が正確に検出されないという問題があった。また、乗車時又は乗車中に体を動かした場合に、声を発しなければ人物の位置の検出が行われないという問題があった。 In Patent Document 1, the position of a person is calculated from sounds detected by a plurality of microphones. However, when noise from the outside of the vehicle, sound being reproduced in the vehicle, or the like is detected by the microphone, the position of the person is accurately determined. There was a problem that it was not detected. In addition, there is a problem that the position of a person cannot be detected unless a voice is produced when the body is moved during or upon boarding.

本発明はこのような事情に鑑みてなされたもので、音声を聴取する聴取者の位置及び移動を正確に検出して、聴取者の位置に応じて的確な音響調整を行うことができる音響調整システムを提供することを目的とする。 The present invention has been made in view of such circumstances, and it is possible to accurately detect the position and movement of a listener who listens to sound, and to perform accurate acoustic adjustment according to the position of the listener. The purpose is to provide a system.

上記課題を解決するために、本願発明１に係る音響調整システムは、音声を再生出力する音声出力手段と、前記音声出力手段によって出力される音声の聴取者の画像を撮影する撮影手段と、前記撮影手段によって撮影された画像から聴取者の顔を検出する顔検出手段と、前記顔検出手段によって検出された聴取者の顔の位置を演算する顔位置演算手段と、前記聴取者の顔の位置に応じて前記音声出力手段から出力される音声の音響調整を行う音響調整手段とを備えることを特徴とする。 In order to solve the above problems, an acoustic adjustment system according to the first aspect of the present invention includes an audio output unit that reproduces and outputs audio, an imaging unit that captures an image of a listener of the audio output by the audio output unit, Face detecting means for detecting a listener's face from an image taken by the photographing means, face position calculating means for calculating the position of the listener's face detected by the face detecting means, and position of the listener's face And a sound adjusting means for adjusting the sound of the sound output from the sound output means.

本願発明１によれば、聴取者の画像を撮影して顔検出処理を行うことにより、聴取者の顔の位置を正確に検出することができるので、聴取者の顔の位置に応じて的確な音響調整を行うことができる。 According to the first aspect of the present invention, the position of the listener's face can be accurately detected by photographing the image of the listener and performing the face detection process, so that it is accurate according to the position of the listener's face. Acoustic adjustment can be performed.

本願発明２は、本願発明１の音響調整システムにおいて、前記顔検出手段によって検出された顔の画像から前記聴取者の耳を検出する耳検出手段を更に備え、前記顔位置演算手段は、前記聴取者の耳の位置を演算し、前記音響調整手段は、前記聴取者の耳の位置に応じて前記音声出力手段から出力される音声の音響調整を行うことを特徴とする。 Invention 2 of the present application further comprises ear detection means for detecting an ear of the listener from the face image detected by the face detection means in the sound adjustment system of Invention 1 of the present application, and the face position calculation means includes the listening position The position of the user's ear is calculated, and the sound adjustment means adjusts the sound of the sound output from the sound output means according to the position of the listener's ear.

本願発明２によれば、聴取者の顔及び耳の位置に応じて的確な音響調整を行うことができる。 According to the present invention 2, accurate acoustic adjustment can be performed according to the position of the listener's face and ear.

本願発明３は、本願発明１又は２の音響調整システムにおいて、前記聴取者が着席している座席の正面を向いているかどうかを検出する第１の検出手段を更に備え、前記音響調整手段は、前記聴取者が座席の後方又は横方向を向いたことが検出された場合に、前記音声出力手段から出力される音声の音量を下げるか又は音声の再生を停止することを特徴とする。 Invention 3 of the present application further comprises first detection means for detecting whether or not the listener is facing the front of the seating seat in the sound adjustment system of Invention 1 or 2, wherein the sound adjustment means comprises: When it is detected that the listener has turned to the back or side of the seat, the volume of the sound output from the sound output means is reduced or the sound reproduction is stopped.

本願発明４は、本願発明１から３の音響調整システムにおいて、前記聴取者が前後の座席に分かれて着席している場合に、後ろの座席の聴取者が前の座席側に乗り出してきたかどうかを検出する第２の検出手段を更に備え、前記音響調整手段は、後ろの座席の聴取者が前の座席側に乗り出してきたことが検出された場合に、前記音声出力手段から出力される音声の音量を下げるか又は音声の再生を停止することを特徴とする。 Invention 4 of the present application is the acoustic adjustment system according to Inventions 1 to 3 of the present application, in which, when the listener is seated separately in the front and rear seats, whether or not the listener in the back seat has entered the front seat side. A second detecting means for detecting the sound; and the sound adjusting means detects the sound output from the sound output means when it is detected that the listener in the rear seat has entered the front seat side. The volume is lowered or the reproduction of the sound is stopped.

本願発明５は、本願発明１から４の音響調整システムにおいて、前記顔検出手段によって検出された聴取者が所定時間以上目を閉じているかどうかを検出する第３の検出手段を更に備え、前記音響調整手段は、前記聴取者が所定時間以上目を閉じていることが検出された場合に、前記音声出力手段から出力される音声の音量を下げるか又は音声の再生を停止することを特徴とする。 Invention 5 of the present application further comprises third detection means for detecting whether or not the listener detected by the face detection means has closed eyes for a predetermined time or more in the sound adjustment system of Inventions 1 to 4 of the present invention, The adjusting means lowers the volume of the sound output from the sound output means or stops the sound reproduction when it is detected that the listener has closed his eyes for a predetermined time or more. .

本願発明６は、本願発明５の音響調整システムにおいて、前記音響調整システムは、自動車に設置されており、前記聴取者が所定時間以上目を閉じていることが検出された場合に、前記自動車のエンジンがかかっているかどうかを検出する第４の検出手段を更に備え、前記音響調整手段は、前記聴取者が所定時間以上目を閉じており、且つ、前記自動車のエンジンがかかっていることが検出された場合に、前記音声出力手段から出力される音声の音量を上げることを特徴とする。 Invention 6 of the present application is the sound adjustment system of Invention 5 of the present application, wherein the sound adjustment system is installed in an automobile, and when it is detected that the listener has closed eyes for a predetermined time or more, It further comprises fourth detection means for detecting whether or not the engine is running, and the acoustic adjustment means detects that the listener has closed his eyes for a predetermined time or more and the car engine is running. In this case, the volume of the sound output from the sound output means is increased.

本願発明３から６によれば、聴取者の状態（例えば、車内の聴取者が会話している場合や、眠っている場合）に応じて的確な音響調整を行うことができる。 According to the present inventions 3 to 6, accurate acoustic adjustment can be performed according to the state of the listener (for example, when the listener in the vehicle is talking or sleeping).

本発明によれば、聴取者の画像を撮影して顔検出処理を行うことにより、聴取者の顔及び耳の位置を正確に検出することができるので、聴取者の顔及び耳の位置に応じて的確な音響調整を行うことができる。 According to the present invention, it is possible to accurately detect the position of the listener's face and ear by photographing the image of the listener and performing face detection processing. And accurate acoustic adjustment.

以下、添付図面に従って本発明に係る音響調整システムの好ましい実施の形態について説明する。 Hereinafter, preferred embodiments of an acoustic adjustment system according to the present invention will be described with reference to the accompanying drawings.

図１は、本発明の一実施形態に係る音響調整システムの主要構成を示すブロック図である。本実施形態の音響調整システム１０は、カメラ１２、顔検出処理部１４、顔位置演算部１６、音響調整処理部１８、音声再生装置２０、スピーカシステム２２、操作部２４及び状態検出部２６を備えている。本実施形態の音響調整システム１０は、自動車の車内に設置されるカーオーディオシステムの音響調整を行うものである。カメラ１２及びスピーカシステム２２は、自動車の内部の乗車スペースに配置される。 FIG. 1 is a block diagram showing the main configuration of an acoustic adjustment system according to an embodiment of the present invention. The acoustic adjustment system 10 of the present embodiment includes a camera 12, a face detection processing unit 14, a face position calculation unit 16, an acoustic adjustment processing unit 18, an audio reproduction device 20, a speaker system 22, an operation unit 24, and a state detection unit 26. ing. The acoustic adjustment system 10 according to the present embodiment performs acoustic adjustment of a car audio system installed in a car. The camera 12 and the speaker system 22 are arranged in a boarding space inside the automobile.

音声再生装置２０は、例えば、カセットテープ、コンパクトディスク（登録商標）、ミニディスク（登録商標）等の記録メディアから音声信号を読み出すスピーカ２２に供給するミュージックプレーヤ、又はラジオ電波を受信して音声信号に変換してスピーカ２２に供給するラジオ受信機を含んでいる。 The audio reproducing device 20 receives, for example, a music player supplied to a speaker 22 that reads an audio signal from a recording medium such as a cassette tape, a compact disc (registered trademark), a mini disc (registered trademark), or a radio wave and receives an audio signal A radio receiver that converts the signal into the speaker 22 and supplies it to the speaker 22.

スピーカシステム２２は、車内の所定の位置に配置された複数のスピーカを含んでおり、音声再生装置２０から供給された音声信号を音声に変換して出力する。 The speaker system 22 includes a plurality of speakers arranged at predetermined positions in the vehicle, and converts the audio signal supplied from the audio reproduction device 20 into sound and outputs the sound.

操作部２４は、音声再生装置２０に音声の再生開始指示、再生終了指示、ラジオの受信周波数の調整、音量の調整等の操作入力を行うための手段である。 The operation unit 24 is a means for performing operation inputs such as an audio playback start instruction, a playback end instruction, a radio reception frequency adjustment, a volume adjustment, and the like to the audio playback device 20.

音声の再生開始指示が入力されると、音声再生装置２０により音声の再生が開始されるとともに、音響調整処理部１８により音声再生装置２０の音響調整が開始される。 When a sound reproduction start instruction is input, sound reproduction by the sound reproduction device 20 is started and acoustic adjustment of the sound reproduction device 20 is started by the sound adjustment processing unit 18.

カメラ１２は、音声の再生が開始されると、自動車１００の車内の人物（聴取者）の画像を撮影する。顔検出処理部１４は、カメラ１２によって撮影された画像から聴取者の顔領域を検出する。顔位置演算部１６は、顔検出処理部１４によって検出された聴取者の顔の位置を演算する。音響調整処理部１８は、顔位置演算部１６によって求められた顔の位置に基づいて音声の出力のタイミングの調整及び音響調整（例えば、イコライズ）を行う。 The camera 12 captures an image of a person (listener) in the car 100 when the sound reproduction is started. The face detection processing unit 14 detects the listener's face area from the image taken by the camera 12. The face position calculation unit 16 calculates the position of the listener's face detected by the face detection processing unit 14. The sound adjustment processing unit 18 performs adjustment of sound output timing and sound adjustment (for example, equalization) based on the face position obtained by the face position calculation unit 16.

上記の顔及び顔の位置の検出は、音声再生装置２０による音声再生が継続している間、所定時間間隔で繰り返される。状態検出部２６は、顔検出処理部１４によって検出された顔の状態及び顔位置演算部１６によって検出された顔の位置を記憶する。そして、状態検出部２６は、最新の顔の状態及び顔の位置と、以前に記憶した顔の状態及び顔の位置に基づいて聴取者の状態を検出し、音響調整処理部１８に伝達する。音響調整処理部１８は、聴取者の状態に応じて音響調整を行う。例えば、後部座席３０に着席している聴取者が前に乗り出した場合（例えば、後部座席３０の聴取者の顔領域が大きくなったことが検出された場合）、又は前部座席２８又は後部座席３０に着席している聴取者が自動車１００の後方又は横を向いたことが検出された場合に、音響調整処理部１８は、音量を下げるか又は音声の再生を停止する。また、音響調整処理部１８は、検出された聴取者のいずれかが所定時間以上目を閉じている場合（即ち、当該聴取者が眠っている場合）に音量を下げるか又は音声の再生を停止する。また、状態検出部２６は、運転席の聴取者が所定時間以上目を閉じている場合（即ち、運転手が眠っている場合）に、自動車１００のエンジンがかかっているかどうかを検出する。そして、音響調整処理部１８は、運転席の聴取者が所定時間以上目を閉じており、且つ、自動車１００のエンジンがかかっている場合に、音量を上げる。これにより、車内の聴取者の状態に応じて適切な音響調整を行うことができる。 The detection of the face and the position of the face is repeated at predetermined time intervals while the sound reproduction by the sound reproduction device 20 continues. The state detection unit 26 stores the face state detected by the face detection processing unit 14 and the face position detected by the face position calculation unit 16. Then, the state detection unit 26 detects the listener's state based on the latest face state and face position and the previously stored face state and face position, and transmits them to the sound adjustment processing unit 18. The sound adjustment processing unit 18 performs sound adjustment according to the state of the listener. For example, when a listener seated in the rear seat 30 starts in front (for example, when it is detected that the face area of the listener in the rear seat 30 has increased), or the front seat 28 or the rear seat When it is detected that the listener seated at 30 is facing the back or side of the automobile 100, the sound adjustment processing unit 18 decreases the volume or stops the reproduction of the sound. The sound adjustment processing unit 18 lowers the volume or stops the sound reproduction when any of the detected listeners closes their eyes for a predetermined time or longer (that is, when the listener is asleep). To do. In addition, the state detection unit 26 detects whether the engine of the automobile 100 is running when the listener at the driver's seat closes his eyes for a predetermined time or longer (that is, when the driver is sleeping). Then, the sound adjustment processing unit 18 increases the sound volume when the listener at the driver's seat has closed his eyes for a predetermined time or more and the engine of the automobile 100 is running. Thereby, appropriate acoustic adjustment can be performed according to the state of the listener in the vehicle.

次に、聴取者の顔の位置に基づいて音響を制御する方法について具体的に説明する。図２は、車内におけるカメラの配置の第１の実施形態を示す自動車の平面図である。図２に示すように、自動車１００には、複数のスピーカ２２０Ｒから２２６Ｒ、２２０Ｌから２２６Ｌ、前部座席２８Ｒ及び２８Ｌ、後部座席３０が配置されている。 Next, a method for controlling sound based on the position of the listener's face will be specifically described. FIG. 2 is a plan view of the automobile showing the first embodiment of the camera arrangement in the vehicle. As shown in FIG. 2, a plurality of speakers 220R to 226R, 220L to 226L, front seats 28R and 28L, and a rear seat 30 are arranged in the automobile 100.

図２に示すように、自動車１００の前部には、カメラ１２０Ｒ及び１２０Ｌが配置されている。また、前部座席２８Ｒ及び２８Ｌの後部には、それぞれカメラ１２２Ｒ及び１２２Ｌが配置されている。カメラ１２０Ｒ及び１２０Ｌは前部座席２８Ｒ及び２８Ｌに着席している聴取者を撮影し、カメラ１２２Ｒ及び１２２Ｌは後部座席３０に着席している聴取者を撮影する。顔検出処理部１４は、カメラ１２０Ｒ及び１２０Ｌによって撮影された画像から前部座席２８Ｒ及び２８Ｌに着席している聴取者の顔領域を検出する。また、顔検出処理部１４は、検出した顔領域から耳の位置を検出する。 As shown in FIG. 2, cameras 120 R and 120 L are arranged in the front portion of the automobile 100. In addition, cameras 122R and 122L are arranged at the rear of the front seats 28R and 28L, respectively. The cameras 120R and 120L photograph the listener seated in the front seats 28R and 28L, and the cameras 122R and 122L photograph the listener seated in the rear seat 30. The face detection processing unit 14 detects the face area of the listener seated on the front seats 28R and 28L from images taken by the cameras 120R and 120L. Further, the face detection processing unit 14 detects the position of the ear from the detected face area.

ここで、顔検出処理の方式としては、例えば、肌色に予め指定された色と近い色を持つ画素を原画像から取り出し、取り出した領域を顔領域として検出するものがある。この顔検出処理の方式では、例えば、肌色を他の色と区別するための色空間上で、予めサンプリングした肌色の情報から色空間上の肌色の範囲を定め、各画素の色が定めた範囲に入っているか否かを判定することにより行われる。また、耳の検出処理の方式としては、例えば、上記顔検出処理によって検出された肌色の領域の中から左右に２つ並ぶ黒色を含む領域（目の領域）を検出し、こうして検出した目の領域の位置を基準として肌色領域の端部付近を耳の領域として検出するものがある。 Here, as a face detection processing method, for example, there is a method in which a pixel having a color close to a color specified in advance as a skin color is extracted from an original image, and the extracted region is detected as a face region. In this face detection processing method, for example, in a color space for distinguishing skin color from other colors, a skin color range on the color space is determined from pre-sampled skin color information, and a range in which the color of each pixel is determined. This is done by determining whether or not it has entered. In addition, as a method of ear detection processing, for example, a region (eye region) including two black colors arranged on the left and right is detected from the skin color regions detected by the face detection processing, and thus the detected eye Some detect the vicinity of the edge of the skin color area as an ear area with reference to the position of the area.

顔位置演算部１６は、カメラ１２０Ｒ及び１２０Ｌによって撮影された画像中の耳の領域の位置について三角測量を行って、前部座席２８Ｒ及び２８Ｌに着席している聴取者の左右の耳の位置の座標（Ｘ，Ｙ，Ｚ）を演算する。 The face position calculation unit 16 performs triangulation on the position of the ear region in the images photographed by the cameras 120R and 120L, and determines the positions of the left and right ears of the listener sitting on the front seats 28R and 28L. Coordinates (X, Y, Z) are calculated.

同様に、顔検出処理部１４は、カメラ１２２Ｒ及び１２２Ｌによって撮影された画像から後部座席３０に着席している聴取者の耳の領域を検出する。顔位置演算部１６は、カメラ１２２Ｒ及び１２２Ｌによって撮影された画像中の聴取者の耳の領域について三角測量を行って、後部座席３０に着席している聴取者の左右の耳の位置の座標（Ｘ，Ｙ，Ｚ）を演算する。 Similarly, the face detection processing unit 14 detects the ear region of the listener seated on the rear seat 30 from images taken by the cameras 122R and 122L. The face position calculation unit 16 performs triangulation on the area of the listener's ear in the images taken by the cameras 122R and 122L, and coordinates of the positions of the left and right ears of the listener seated in the rear seat 30 ( X, Y, Z) is calculated.

音響調整処理部１８は、左側のスピーカ２２０Ｌ、…、２２６Ｌから出力された音声が聴取者の左耳に、右側のスピーカ２２０Ｒ、…、２２６Ｒから出力された音声が聴取者の右耳にそれぞれ同時に到達するように、音声の出力のタイミングを調整するとともに、音量の調整を行う。これにより、車内の聴取者は、各スピーカから等距離の位置にいるのと同様の音声を聴取することができる。 The sound adjustment processing unit 18 simultaneously outputs the sound output from the left speaker 220L,... 226L to the listener's left ear and the sound output from the right speaker 220R,. The sound output timing and the volume are adjusted so as to reach the target. Thereby, the listener in a vehicle can listen to the same sound as being at an equal distance from each speaker.

なお、検出された聴取者のうちのどの聴取者に音響を最適化するかを選択可能にしてもよい。この場合、例えば、前部座席（運転席、助手席）、後部座席のどの席に音響を最適化するかを選択するための操作手段を設けるようにすればよい。また、カメラ１２によって撮影された聴取者の画像を表示する表示手段と、画像中のどの聴取者に音響を最適化するかを選択するための操作手段を設けるようにしてもよい。 Note that it may be possible to select which listener among the detected listeners to optimize the sound. In this case, for example, an operating means for selecting which seat of the front seat (driver seat, front passenger seat) and rear seat to optimize the sound may be provided. In addition, display means for displaying the image of the listener photographed by the camera 12 and operation means for selecting which listener in the image to optimize the sound may be provided.

図３は、車内におけるカメラの配置の第２の実施形態を示す自動車の平面図である。図３に示す例では、自動車１００の前部（例えば、ルームミラー）に、カメラ１２４が配置されており、自動車１００の天蓋の中央付近（例えば、ルームライトの近傍）に、カメラ１２６が配置されている。カメラ１２４は、前部座席２８Ｒ、２８Ｌ及び後部座席３０に着席している聴取者を撮影する。カメラ１２６は、広角レンズ又は魚眼レンズを備えた広角撮影が可能なカメラであり、車内の聴取者の頭部を上から撮影する。 FIG. 3 is a plan view of an automobile showing a second embodiment of the camera arrangement in the car. In the example shown in FIG. 3, the camera 124 is arranged at the front part (for example, a room mirror) of the automobile 100, and the camera 126 is arranged near the center of the canopy of the automobile 100 (for example, near the room light). ing. The camera 124 photographs the listener who is seated in the front seats 28R and 28L and the rear seat 30. The camera 126 is a camera capable of wide-angle photographing including a wide-angle lens or a fish-eye lens, and photographs the listener's head in the vehicle from above.

顔検出処理部１４は、カメラ１２４によって撮影された画像から聴取者の前部座席２８Ｒ、２８Ｌ及び後部座席３０に着席している聴取者の顔領域及び耳の領域を検出する。なお、顔及び耳の検出方式は上記と同様である。 The face detection processing unit 14 detects the face area and the ear area of the listener seated in the front seats 28R and 28L and the rear seat 30 of the listener from the images taken by the camera 124. The face and ear detection methods are the same as described above.

顔位置演算部１６は、カメラ１２６によって撮影された画像から聴取者の頭部を検出し、その座標（Ｘ，Ｙ）を演算する。そして、顔位置演算部１６は、聴取者の頭部の座標（Ｘ，Ｙ）と、顔検出処理部１４によって求められた耳の領域の位置から、車内の聴取者の耳の位置の座標（Ｘ，Ｙ，Ｚ）を演算する。これにより、車内の聴取者の耳の位置に合わせた音響調整が可能になる。 The face position calculation unit 16 detects the listener's head from the image captured by the camera 126 and calculates the coordinates (X, Y). Then, the face position calculation unit 16 uses the coordinates (X, Y) of the listener's head and the position of the ear of the listener in the vehicle (from the position of the ear region determined by the face detection processing unit 14). X, Y, Z) is calculated. As a result, it is possible to adjust the sound in accordance with the position of the listener's ear in the vehicle.

図４は、車内におけるカメラの配置の第３の実施形態を示す自動車の平面図である。図４に示す例では、自動車１００の前部（例えば、ルームミラー）に、カメラ１３０が配置されている。また、自動車１００左右の天蓋付近には、カメラ１３２Ｒ、１３４Ｒ、１３２Ｌ及び１３４Ｌが配置されている。カメラ１３０は、前部座席２８Ｒ、２８Ｌ及び後部座席３０に着席している聴取者を撮影する。カメラ１３２Ｒ及び１３４Ｒは、それぞれ前部座席２８Ｒ及び後部座席３０の右側に着席している聴取者を撮影する。カメラ１３２Ｌ及び１３４Ｌは、それぞれ前部座席２８Ｌ及び後部座席３０の左側に着席している聴取者を撮影する。 FIG. 4 is a plan view of an automobile showing a third embodiment of the arrangement of cameras in the car. In the example illustrated in FIG. 4, the camera 130 is disposed in the front part (for example, a room mirror) of the automobile 100. Cameras 132R, 134R, 132L, and 134L are arranged near the left and right canopies of the automobile 100. The camera 130 photographs a listener who is seated in the front seats 28R and 28L and the rear seat 30. The cameras 132R and 134R capture a listener who is seated on the right side of the front seat 28R and the rear seat 30, respectively. The cameras 132L and 134L photograph the listener seated on the left side of the front seat 28L and the rear seat 30, respectively.

顔検出処理部１４は、カメラ１３０によって撮影された画像から聴取者の前部座席２８Ｒ、２８Ｌ及び後部座席３０に着席している聴取者の顔領域及び耳の領域を検出する。なお、顔及び耳の検出方式は上記と同様である。 The face detection processing unit 14 detects the face area and the ear area of the listener seated in the front seats 28R and 28L and the rear seat 30 of the listener from the images taken by the camera 130. The face and ear detection methods are the same as described above.

顔位置演算部１６は、カメラ１３２Ｒ、１３４Ｒ、１３２Ｌ及び１３４Ｌによって撮影された画像から聴取者の頭部を検出し、その座標（Ｘ，Ｙ，Ｚ）を演算する。そして、顔位置演算部１６は、聴取者の頭部の座標（Ｘ，Ｙ，Ｚ）と、顔検出処理部１４によって求められた耳の領域の位置から、車内の聴取者の耳の位置の座標（Ｘ，Ｙ，Ｚ）を演算する。これにより、車内の聴取者の耳の位置に合わせた音響調整が可能になる。 The face position calculation unit 16 detects the listener's head from images taken by the cameras 132R, 134R, 132L, and 134L, and calculates the coordinates (X, Y, Z). Then, the face position calculation unit 16 calculates the position of the listener's ear in the vehicle from the coordinates (X, Y, Z) of the listener's head and the position of the ear region obtained by the face detection processing unit 14. Coordinates (X, Y, Z) are calculated. As a result, it is possible to adjust the sound in accordance with the position of the listener's ear in the vehicle.

なお、図４に示す例では、聴取者の頭部の位置を検出するためのカメラが左右に２対配置されているが、１対であってもよい。また、聴取者の頭部の位置を検出するためのカメラ１２６、１３０Ｒ及び１３０Ｌ、１３２Ｒ及び１３２Ｌは、例えば、赤外線カメラであってもよい。また、聴取者の頭部の位置を検出するためのカメラに代えて、測距センサ、赤外線センサを用いてもよい。 In the example shown in FIG. 4, two pairs of cameras for detecting the position of the listener's head are arranged on the left and right, but one pair may be used. The cameras 126, 130R and 130L, 132R and 132L for detecting the position of the listener's head may be, for example, an infrared camera. Further, a distance measuring sensor or an infrared sensor may be used instead of the camera for detecting the position of the listener's head.

また、カメラ１３０を顔までの測距機能を有するカメラ（例えば、ステレオカメラ）として、聴取者の頭部の位置を検出するためのカメラを備えない構成としてもよい。 Further, the camera 130 may be configured as a camera (for example, a stereo camera) having a function of measuring the distance to the face, without a camera for detecting the position of the listener's head.

図５は、音響調整処理の流れを示すフローチャートである。まず、音声再生が開始されると（ステップＳ１０）、車内に配置されたカメラ１２により車内の聴取者の撮影が開始され、撮影された画像に対して顔領域及び耳の領域の検出処理が行われる（ステップＳ１２）。次に、耳の領域の位置の座標（Ｘ，Ｙ，Ｚ）が演算されて、記録される（ステップＳ１４）。そして、耳の位置に合わせた音響調整処理が行われる（ステップＳ１６）。 FIG. 5 is a flowchart showing the flow of the sound adjustment process. First, when sound reproduction is started (step S10), photographing of a listener inside the vehicle is started by the camera 12 arranged in the vehicle, and detection processing of a face region and an ear region is performed on the captured image. (Step S12). Next, the coordinates (X, Y, Z) of the position of the ear region are calculated and recorded (step S14). Then, an acoustic adjustment process is performed according to the position of the ear (step S16).

そして、所定の時間ごとに上記ステップＳ１２からステップＳ１６の工程が繰り返される。ステップＳ１６では、例えば、後部座席３０に着席している聴取者が前に乗り出した場合（例えば、顔領域が大きくなったことが検出された場合）、又は前部座席２８又は後部座席３０に着席している聴取者が自動車１００の後方又は横を向いたことが検出された場合に音量を下げるか又は音声の再生を停止する。また、検出された聴取者のいずれかが所定時間以上目を閉じている場合（即ち、当該聴取者が眠っている場合）に音量を下げるか又は音声の再生を停止する。また、自動車のエンジンがかかっている状態で運転席の聴取者が所定時間以上目を閉じている場合（即ち、運転手が眠っている場合）に音量を上げる。これにより、車内の聴取者が会話している場合や、眠っている場合に応じて的確な音響調整を行うことができる。 And the process of said step S12 to step S16 is repeated for every predetermined time. In step S 16, for example, when a listener seated in the rear seat 30 starts in front (for example, when it is detected that the face area has increased), or seats in the front seat 28 or the rear seat 30. When it is detected that the listener who is listening is facing the back or side of the automobile 100, the sound volume is reduced or the sound reproduction is stopped. Also, if any of the detected listeners has closed their eyes for a predetermined time or longer (that is, if the listener is asleep), the volume is reduced or the sound reproduction is stopped. Further, the volume is increased when the listener of the driver's seat closes his eyes for a predetermined time or longer with the engine of the automobile running (that is, when the driver is sleeping). Thereby, an accurate acoustic adjustment can be performed according to a case where a listener in the vehicle is talking or asleep.

本実施形態によれば、聴取者の画像を撮影して顔検出処理を行うことにより、聴取者の顔及び耳の位置を正確に検出することができるので、聴取者の顔及び耳の位置に応じて的確な音響調整を行うことができる。 According to the present embodiment, the position of the listener's face and ear can be accurately detected by photographing the listener's image and performing face detection processing. Accordingly, accurate acoustic adjustment can be performed.

なお、上記の説明では、本発明の音響調整システムをカーオーディオシステムの音響調整に適用した例について説明したが、例えば、ホームシアターのスピーカシステムの音響調整にも適用可能である。 In the above description, the example in which the sound adjustment system of the present invention is applied to the sound adjustment of the car audio system has been described. However, for example, the sound adjustment system can also be applied to the sound adjustment of the speaker system of a home theater.

本発明の一実施形態に係る音響調整システムの主要構成を示すブロック図The block diagram which shows the main structures of the acoustic adjustment system which concerns on one Embodiment of this invention. 車内におけるカメラの配置の第１の実施形態を示す自動車の平面図The top view of the motor vehicle which shows 1st Embodiment of arrangement | positioning of the camera in a vehicle 車内におけるカメラの配置の第２の実施形態を示す自動車の平面図The top view of the motor vehicle which shows 2nd Embodiment of arrangement | positioning of the camera in a vehicle 車内におけるカメラの配置の第３の実施形態を示す自動車の平面図The top view of the motor vehicle which shows 3rd Embodiment of arrangement | positioning of the camera in a vehicle 音響調整処理の流れを示すフローチャートFlow chart showing the flow of acoustic adjustment processing

Explanation of symbols

１０…音響調整システム、１２…カメラ、１４…顔検出処理部、１６…顔位置演算部、１８…音響調整処理部、２０…音声再生装置、２２…スピーカシステム、２４…操作部、２６…状態検出部 DESCRIPTION OF SYMBOLS 10 ... Acoustic adjustment system, 12 ... Camera, 14 ... Face detection process part, 16 ... Face position calculation part, 18 ... Sound adjustment process part, 20 ... Sound reproduction apparatus, 22 ... Speaker system, 24 ... Operation part, 26 ... State Detection unit

Claims

Audio output means for reproducing and outputting audio;
Photographing means for photographing an image of a listener of the sound output by the sound output means;
Face detection means for detecting the face of the listener from the image photographed by the photographing means;
Face position calculating means for calculating the position of the listener's face detected by the face detecting means;
Sound adjusting means for adjusting the sound of the sound output from the sound output means according to the position of the listener's face;
An acoustic adjustment system comprising:

Ear detection means for detecting the listener's ear from the face image detected by the face detection means;
The face position calculating means calculates the position of the listener's ear,
The acoustic adjustment system according to claim 1, wherein the acoustic adjustment unit performs acoustic adjustment of a sound output from the sound output unit according to a position of an ear of the listener.

First detecting means for detecting whether the listener is facing the front of the seated seat,
The sound adjusting means may reduce the volume of the sound output from the sound output means or stop the sound reproduction when it is detected that the listener is facing the back or side of the seat. The acoustic adjustment system according to claim 1 or 2, characterized in that

When the listener is seated separately in the front and rear seats, further comprising second detection means for detecting whether the listener in the back seat has entered the front seat side,
The sound adjusting means lowers the volume of the sound output from the sound output means or stops the sound reproduction when it is detected that the listener in the back seat has entered the front seat side. The acoustic adjustment system according to any one of claims 1 to 3, wherein

Further comprising third detection means for detecting whether or not the listener detected by the face detection means has closed their eyes for a predetermined time or more;
The sound adjustment means lowers the volume of the sound output from the sound output means or stops the sound reproduction when it is detected that the listener has closed his eyes for a predetermined time or more. The acoustic adjustment system according to any one of claims 1 to 4.

The acoustic adjustment system is installed in an automobile,
When it is detected that the listener has closed his eyes for a predetermined time or more, further comprising fourth detection means for detecting whether the engine of the automobile is running,
The sound adjustment means increases the volume of the sound output from the sound output means when it is detected that the listener has closed his eyes for a predetermined time or more and the automobile engine is running. The acoustic adjustment system according to claim 5.