JP2011217328A

JP2011217328A - Audio device

Info

Publication number: JP2011217328A
Application number: JP2010086128A
Authority: JP
Inventors: Takashi Tsutsui; 崇筒井
Original assignee: Alpine Electronics Inc
Current assignee: Alpine Electronics Inc
Priority date: 2010-04-02
Filing date: 2010-04-02
Publication date: 2011-10-27

Abstract

PROBLEM TO BE SOLVED: To provide an audio device capable of easily setting desired acoustic properties.SOLUTION: When a still image is being displayed by a video source device 1 in a pause state (in (a)), a designation of range 401 is accepted by an input device 3 from a user (in (b)), and a recognition object (female vocalist) is identified by applying image recognition processing to an image (c) in the range 401, so that equalizer characteristics registered in association with the identified recognition object in an equalizer characteristic table 13 are set in an equalizer 6. In the equalizer characteristic table 13, equalizer characteristics for picking out sound generated by the recognition object are preliminarily registered in association with the respective recognition objects.

Description

本発明は、オーディオ装置において出力音に与える音響特性を設定する技術に関するものである。 The present invention relates to a technique for setting acoustic characteristics to be given to output sound in an audio apparatus.

オーディオ装置において出力音に与える音響特性を設定する技術としては、予め用意しておいた複数の音響特性の名称のリストを表示し、表示したリスト上で、ユーザが名称を指示した音響特性を表すグラフを表示しつつ、ユーザによって名称を選択された音響特性を、出力音に与える音響特性として設定する技術が知られている（たとえば、特許文献１）。 As a technique for setting the acoustic characteristics to be given to the output sound in the audio device, a list of names of a plurality of acoustic characteristics prepared in advance is displayed, and the acoustic characteristics indicated by the user on the displayed list are represented. A technique is known in which an acoustic characteristic whose name is selected by a user is set as an acoustic characteristic given to an output sound while displaying a graph (for example, Patent Document 1).

特開２０００-１８１４６２号公報JP 2000-181462 A

上述した音響特性の名称のリスト上で、出力音に与える音響特性の選択を受け付ける技術によれば、ユーザが、出力中のオーディオコンテンツに無関係な音響特性を含む多数の音響特性のうちから、所望の音響特性を選択する煩雑な操作を行う必要がある。 According to the technology for accepting selection of the acoustic characteristics to be given to the output sound on the above-described list of acoustic characteristics names, the user can select a desired acoustic characteristic from among a large number of acoustic characteristics including an acoustic characteristic irrelevant to the audio content being output. Therefore, it is necessary to perform a complicated operation for selecting the acoustic characteristics.

また、音響特性の名称や音響特性を表すグラフによっては、その音響特性の効果を把握し難い場合もあり、このような場合には、直ちに所望の音響特性を設定することが困難となる。
そこで、本発明は、より容易に所望の音響特性を設定できるオーディオ装置を提供することを課題とする。 Further, depending on the name of the acoustic characteristic and the graph representing the acoustic characteristic, it may be difficult to grasp the effect of the acoustic characteristic. In such a case, it is difficult to immediately set the desired acoustic characteristic.
Therefore, an object of the present invention is to provide an audio apparatus that can set desired acoustic characteristics more easily.

前記課題達成のために、本発明は、映像の表示と共に、音声の出力を行うオーディオ装置に、前記出力する音声を、設定された特性値が表す音響特性で調整するイコライザと、複数の対象物について、当該対象物の識別と前記特性値との対応を登録した特性値テーブルと、前記ユーザから前記映像の表示上で範囲の指定を受け付けて、前記表示されている映像中の前記指定された範囲内の部分に画像認識処理を施して、当該範囲内の部分に像が含まれる前記対象物を認識する画像認識手段と、前記画像認識手段が認識した前記対象物の識別との対応が前記特性値テーブルに登録されている前記特性値を前記イコライザに設定するイコライザ特性設定手段とを備えたものである。 In order to achieve the above object, the present invention provides an audio device that outputs a sound together with a video display, an equalizer that adjusts the sound to be output with an acoustic characteristic represented by a set characteristic value, and a plurality of objects The characteristic value table in which the correspondence between the identification of the object and the characteristic value is registered, and the specification of the range on the display of the video from the user is received, and the specified in the displayed video Image recognition processing is performed on a portion within the range, and the correspondence between the image recognition means for recognizing the object whose image is included in the portion within the range and the identification of the object recognized by the image recognition means is the Equalizer characteristic setting means for setting the characteristic value registered in the characteristic value table in the equalizer is provided.

このようなオーディオ装置によれば、映像の表示上で所望の対象物が写り込んでいる画像部分を範囲指定するだけで、その対象物の識別との対応が特性値テーブルに登録されている特性値が前記イコライザに設定される。
よって、対象物の識別との対応を特性値テーブルに登録する特性値を適当に設定しておくことにより、より直感的かつ容易に所望の音響特性を設定できるようになる。すなわち、たとえば、前記複数の対象物を、各々音の発生源とし、前記特性値テーブルに登録されている、前記対象物の識別と対応する前記特性値は、当該対象物の発生する音を際立たせる音響特性を表すものとすれば、ユーザは、映像の表示上で、当該映像に含まれている、より良く音を聴きたい対象物の像の範囲を指定するだけで、当該対象物の発生する音を際立たせる音響特性の特性値をイコライザに設定することができる。 According to such an audio apparatus, a characteristic whose correspondence with the identification of the target object is registered in the characteristic value table only by specifying a range of the image portion in which the desired target object is reflected on the video display. A value is set in the equalizer.
Therefore, it is possible to set a desired acoustic characteristic more intuitively and easily by appropriately setting a characteristic value registered in the characteristic value table for correspondence with identification of the object. That is, for example, each of the plurality of objects is a sound generation source, and the characteristic value corresponding to the identification of the object registered in the characteristic value table stands out from the sound generated by the object. If the user expresses the acoustic characteristics to be generated, the user only has to specify the range of the image of the target object that he / she wants to hear better on the video display. The characteristic value of the acoustic characteristic that makes the sound to stand out stand out can be set in the equalizer.

ここで、以上のようなオーディオ装置は、前記イコライザ特性設定手段において、画像認識手段が複数の対象物を認識した場合に、当該複数の対象物の内からの一つの対象物の選択をユーザから受け付け、選択を受け付けた対象物の識別に対応して前記特性値テーブルに登録されている前記特性値を前記イコライザに設定するように構成することもできる。 Here, in the audio device as described above, when the image recognition unit recognizes a plurality of objects in the equalizer characteristic setting unit, the user selects one object from the plurality of objects. The characteristic value registered in the characteristic value table may be set in the equalizer corresponding to the identification of the object that has been received and selected.

以上のように、本発明によれば、より容易に所望の音響特性を設定できるオーディオ装置を提供することができる。 As described above, according to the present invention, it is possible to provide an audio apparatus that can set desired acoustic characteristics more easily.

本発明の実施形態に係るオーディオ装置の構成を示すブロック図である。It is a block diagram which shows the structure of the audio apparatus which concerns on embodiment of this invention. 本発明の実施形態に係るイコライザ特性テーブルの内容を示す図である。It is a figure which shows the content of the equalizer characteristic table which concerns on embodiment of this invention. 本発明の実施形態に係るイコライザ特性設定制御処理を示すフローチャートである。It is a flowchart which shows the equalizer characteristic setting control process which concerns on embodiment of this invention. 本発明の実施形態に係るイコライザ特性設定制御処理の処理例を示す図である。It is a figure which shows the process example of the equalizer characteristic setting control process which concerns on embodiment of this invention. 本発明の実施形態に係るイコライザ特性設定制御処理の処理例を示す図である。It is a figure which shows the process example of the equalizer characteristic setting control process which concerns on embodiment of this invention.

以下、本発明に係るオーディオ装置の実施形態について説明する。
図１に、本実施形態に係るオーディオ装置の構成を示す。
図示するように、オーディオ装置は、ビデオ再生装置やＴＶ受信機などのオーディオデータと映像データとを出力するビデオソース機器１、スピーカ２、入力装置３、表示装置４、制御部５、イコライザ６、アンプ７、ビデオメモリ８、表示制御部９、画像認識辞書１０、画像認識部１１、イコライザ特性設定制御部１２、イコライザ特性テーブル１３とを備えている。 Hereinafter, embodiments of an audio apparatus according to the present invention will be described.
FIG. 1 shows the configuration of an audio apparatus according to this embodiment.
As shown in the figure, the audio device includes a video source device 1 that outputs audio data and video data such as a video playback device and a TV receiver, a speaker 2, an input device 3, a display device 4, a control unit 5, an equalizer 6, An amplifier 7, a video memory 8, a display control unit 9, an image recognition dictionary 10, an image recognition unit 11, an equalizer characteristic setting control unit 12, and an equalizer characteristic table 13 are provided.

但し、本オーディオ装置は、マイクロプロセッサや、メモリや、その他の周辺デバイスなどの一般的なハードウエア構成を備えたコンピュータを利用して構成されるものであってよく、この場合、以上に示した制御部５、画像認識辞書１０、画像認識部１１、イコライザ特性設定制御部１２、イコライザ特性テーブル１３、イコライザ６などの各部、または、その一部は、コンピュータが所定のコンピュータプログラムを実行することにより実現されるリソースやプロセスであってよい。 However, the audio apparatus may be configured using a computer having a general hardware configuration such as a microprocessor, a memory, and other peripheral devices. The control unit 5, the image recognition dictionary 10, the image recognition unit 11, the equalizer characteristic setting control unit 12, the equalizer characteristic table 13, the equalizer 6, and the like, or a part thereof, is executed by the computer executing a predetermined computer program. It may be a resource or process that is realized.

このような構成において、制御部５は、入力装置３で受け付けたユーザ操作に応じて、ビデオソース機器１の動作を制御する。
また、ビデオソース機器１から出力されたオーディオデータは、イコライザ６においてイコライザ６に設定されているイコライザ特性に従って音響特性が調整された後、アンプ７を介してスピーカ２に出力される。一方、ビデオソース機器１から出力された映像データは、一旦、ビデオメモリ８に格納された後、表示制御部９によって読み出され、映像データが表す映像が表示装置４に表示される。 In such a configuration, the control unit 5 controls the operation of the video source device 1 in accordance with a user operation received by the input device 3.
The audio data output from the video source device 1 is output to the speaker 2 via the amplifier 7 after the acoustic characteristics are adjusted in accordance with the equalizer characteristics set in the equalizer 6 in the equalizer 6. On the other hand, the video data output from the video source device 1 is temporarily stored in the video memory 8 and then read out by the display control unit 9, and the video represented by the video data is displayed on the display device 4.

次に、画像認識辞書１０には、画像認識部１１において行う画像認識処理において画像認識する対象である認識対象の各々に対応づけて、当該認識対象を表す画像の特徴を表す特徴データが格納されている。ここで、画像認識部１１において行う画像認識処理では、たとえば、「男性ボーカル」、「女性ボーカル」、「ギター」、「ベース」などの、音の発生源となる歌唱者や楽器の種別を認識の対象とする。 Next, the image recognition dictionary 10 stores feature data representing the features of the image representing the recognition target in association with each recognition target that is an image recognition target in the image recognition processing performed in the image recognition unit 11. ing. Here, in the image recognition processing performed in the image recognition unit 11, for example, the type of a singer or instrument that is a sound source, such as “male vocal”, “female vocal”, “guitar”, “bass”, or the like is recognized. The target of.

そして、イコライザ特性テーブル１３には、図２に示すように、上述した画像認識処理において認識する各認識対象に対応づけて、イコライザ特性を登録する。ここで、イコライザ特性テーブル１３には、イコライザ特性として、各チャネル毎にFc,Q,Gainなどによって表される周波数特性を登録する。また、イコライザ特性テーブル１３に登録するイコライザ特性が表す周波数特性は、基本的には、対応づけられた認識対象の発生する音を際だたせるものとする。すなわち、たとえば、「女性ボーカル」に対応づけられてイコライザ特性テーブル１３に登録されたイコライザ特性が表す周波数特性は女性の声の音域を際だたせるものとし、「ギター」に対応づけけられてイコライザ特性テーブル１３登録されたイコライザ特性が表す周波数特性はギターの音域を際だたせるものとする。 In the equalizer characteristic table 13, as shown in FIG. 2, the equalizer characteristic is registered in association with each recognition target recognized in the image recognition process described above. Here, in the equalizer characteristic table 13, frequency characteristics represented by Fc, Q, Gain, etc. are registered for each channel as equalizer characteristics. In addition, the frequency characteristic represented by the equalizer characteristic registered in the equalizer characteristic table 13 basically emphasizes the sound generated by the associated recognition target. That is, for example, the frequency characteristic represented by the equalizer characteristic associated with “female vocal” and registered in the equalizer characteristic table 13 is to distinguish the female voice range, and is associated with “guitar” and the equalizer characteristic. It is assumed that the frequency characteristic represented by the equalizer characteristic registered in the table 13 highlights the sound range of the guitar.

以下、このようなオーディオ装置において行う、イコライザ６へのイコライザ特性の設定動作について説明する。
イコライザ特性設定制御部１２は、イコライザ６へのイコライザ特性の設定のために、図３に示すイコライザ特性設定制御処理を行う。
図示するように、この処理では、制御部５がユーザ操作に従ってビデオソース機器１の出力をポーズ状態（映像を構成する１フレームを静止画として表示している状態）に制御しているときに（ステップ３０２）、ユーザによる入力装置３を介した、表示装置４の表示画面上の範囲指定が発生するのを監視する（ステップ３０４）。 Hereinafter, the setting operation of the equalizer characteristic to the equalizer 6 performed in such an audio apparatus will be described.
The equalizer characteristic setting control unit 12 performs an equalizer characteristic setting control process shown in FIG. 3 in order to set the equalizer characteristic to the equalizer 6.
As shown in the figure, in this process, when the control unit 5 controls the output of the video source device 1 to a pause state (a state in which one frame constituting a video is displayed as a still image) according to a user operation ( Step 302), monitoring for the occurrence of range designation on the display screen of the display device 4 via the input device 3 by the user (step 304).

そして、範囲指定が発生したならば、画像認識部１１を用いて、当該時点で表示装置４で表示している映像の、指定された範囲内の部分の画像の画像認識を行って、指定された範囲内の部分の画像に像が写り込んでいる認識対象を識別する（ステップ３０６）。ここで、ステップ３０６において、画像認識部１１は、ビデオメモリ８に格納されている静止画表示中の映像の映像データが表す画像中の、指定された範囲内の部分の画像に対して、画像認識辞書１０を用いて画像認識処理を施して、上述した認識対象のいずれの像が、当該入力された範囲内の部分の画像に含まれているかを識別し、識別結果をイコライザ特性設定制御部１２に通知する。 When the range designation occurs, the image recognition unit 11 is used to perform image recognition of the image within the designated range of the video displayed on the display device 4 at that time, and the designated range is designated. The recognition target whose image is reflected in the image of the portion within the range is identified (step 306). Here, in step 306, the image recognizing unit 11 performs image processing on an image of a portion within a specified range in the image represented by the video data of the still image display video stored in the video memory 8. An image recognition process is performed using the recognition dictionary 10 to identify which image of the recognition target described above is included in the image of the portion within the input range, and the identification result is an equalizer characteristic setting control unit 12 is notified.

すなわち、たとえば、ポーズ状態とされたビデオソース機器１によって図４ａに示す画像が静止画表示されているときに、図４ｂに示す範囲４０１がユーザによって入力装置３によって指定された場合には、画像認識部１１は、指定された範囲４０１内の画像である図４ｃに示す画像に対して画像認識処理を施して、図４ｃに示す画像に含まれるパターンの特徴とマッチする特徴データと対応づけて画像認識辞書１０に登録されている認識対象（ここでは、「女性ボーカル」）を識別し、イコライザ特性設定制御部１２に通知する。 That is, for example, when the image shown in FIG. 4A is displayed as a still image by the video source device 1 in the paused state, if the range 401 shown in FIG. The recognition unit 11 performs image recognition processing on the image shown in FIG. 4c that is an image in the designated range 401, and associates it with feature data that matches the feature of the pattern included in the image shown in FIG. 4c. A recognition target (here, “female vocal”) registered in the image recognition dictionary 10 is identified and notified to the equalizer characteristic setting control unit 12.

次に、このようにして画像認識を行ったならば、イコライザ特性テーブル１３より、画像認識で識別した認識対象に対応づけて登録されているイコライザ特性を取得し（ステップ３０８）、取得したイコライザ特性をイコライザ６に設定する（ステップ３１０）。
すなわち、図４ｂに示す範囲４０１がユーザによって入力装置３によって指定され、「女性ボーカル」を画像認識した場合には、「女性ボーカル」に対応づけてイコライザ特性テーブル１３に登録されているイコライザ特性をイコライザ６に設定する。
ここで、イコライザ６は、イコライザ特性が設定されたならば、以降、設定されたイコライザ特性に従ってオーディオデータの音響特性の調整を行う。
この結果、図４ｂに示すように表示装置４の表示上で「女性ボーカル」が写り込んでいる画像部分４０１をユーザが範囲指定するだけで、「女性ボーカル」の音域を際だたせるオーディオデータの音響特性の調整をイコライザ６に行わせるイコライザ特性が、自動的にイコライザ６に設定されることになる。 Next, when the image recognition is performed in this way, the equalizer characteristic registered in association with the recognition target identified by the image recognition is acquired from the equalizer characteristic table 13 (step 308), and the acquired equalizer characteristic is acquired. Is set in the equalizer 6 (step 310).
That is, when the range 401 shown in FIG. 4B is designated by the input device 3 by the user and “female vocal” is image-recognized, the equalizer characteristics registered in the equalizer characteristic table 13 in association with “female vocal” are displayed. Set to equalizer 6.
Here, if the equalizer characteristic is set, the equalizer 6 thereafter adjusts the acoustic characteristic of the audio data according to the set equalizer characteristic.
As a result, as shown in FIG. 4 b, the audio data sound that highlights the range of “female vocal” simply by the user specifying the range of the image portion 401 in which “female vocal” is reflected on the display of the display device 4. The equalizer characteristic that causes the equalizer 6 to adjust the characteristic is automatically set in the equalizer 6.

以上、イコライザ特性設定制御処理について説明した。
ところで、以上のイコライザ特性設定制御処理のステップ３０６の画像認識において、指定範囲内の画像に対する画像認識処理によって複数の認識対象が識別された場合には、以下のようにしてよい。
すなわち、識別された複数の認識対象のうちのいずれか一つの認識対象を所定の規則に従って選択して、選択認識対象とし、選択認識対象に対応づけて登録されているイコライザ特性をイコライザ特性テーブル１３より取得し（ステップ３０８）、取得したイコライザ特性をイコライザ６に設定するようにする（ステップ３１０）。ここで、上記所定の規則としては、画像認識辞書１０に登録されている当該認識対象の特徴データと指定範囲内の画像中のパターンとのマッチ度が最大の認識対象を選択認識対象とする規則や、識別した認識対象に対応する指定範囲内の画像中のパターンのサイズが最も大きい認識対象を選択認識対象とする規則や、識別した認識対象に対応する指定範囲内の画像中のパターンの中心位置が、指定範囲の中心に最も近い認識対象を選択認識対象とする規則などを用いることができる。 The equalizer characteristic setting control process has been described above.
By the way, in the image recognition in step 306 of the equalizer characteristic setting control process described above, when a plurality of recognition targets are identified by the image recognition process for the image within the specified range, the following may be performed.
That is, any one recognition target among a plurality of identified recognition targets is selected according to a predetermined rule to be a selection recognition target, and an equalizer characteristic registered in association with the selection recognition target is the equalizer characteristic table 13. (Step 308), and the obtained equalizer characteristic is set in the equalizer 6 (step 310). Here, as the predetermined rule, a rule for selecting a recognition target having a maximum degree of matching between the feature data of the recognition target registered in the image recognition dictionary 10 and the pattern in the image within the specified range. Or a rule that selects and recognizes the recognition target having the largest pattern size in the image within the specified range corresponding to the identified recognition target, or the center of the pattern in the image within the specified range corresponding to the identified recognition target. A rule or the like in which the recognition target whose position is closest to the center of the specified range can be used.

または、イコライザ特性テーブル１３に、複数の認識対象の組み合わせに対応づけたイコライザ特性も登録しておくようにし、識別された複数の認識対象の組み合わせに対応づけられたイコライザ特性をイコライザ特性テーブル１３より取得し（ステップ３０８）、取得したイコライザ特性をイコライザ６に設定するようにしてもよい（ステップ３１０）。 Alternatively, an equalizer characteristic associated with a plurality of combinations of recognition targets is also registered in the equalizer characteristic table 13, and the equalizer characteristics associated with a plurality of identified recognition target combinations are stored in the equalizer characteristic table 13. It is possible to acquire (step 308) and set the acquired equalizer characteristic in the equalizer 6 (step 310).

または、識別された複数の認識対象のリストを表示装置４に表示し、表示したリスト中からの一つの認識対象の選択をユーザから受け付け、選択された認識対象を、選択認識対象として、選択認識対象に対応づけて登録されているイコライザ特性をイコライザ特性テーブル１３より取得し（ステップ３０８）、取得したイコライザ特性をイコライザ６に設定するようにしてもよい（ステップ３１０）。 Alternatively, a list of a plurality of recognized recognition targets is displayed on the display device 4, a selection of one recognition target from the displayed list is received from the user, and the selected recognition target is selected and recognized as a selection recognition target. The equalizer characteristic registered in association with the target may be acquired from the equalizer characteristic table 13 (step 308), and the acquired equalizer characteristic may be set in the equalizer 6 (step 310).

すなわち、たとえば、図５ａに示すようにユーザから指定された範囲５０１に対する画像認識によって「女性ボーカル」と「ギター」が識別された認識対象として得られた場合には、図５ｂに示すように、選択ウインドウ５０２を表示して、識別された認識対象「女性ボーカル」、「ギター」のリストを表示して、「女性ボーカル」、「ギター」のいずれかの選択を受け付けて、選択認識対象とするようにする。 That is, for example, when “female vocal” and “guitar” are obtained as recognition targets identified by image recognition for a range 501 designated by the user as shown in FIG. 5a, as shown in FIG. 5b, A selection window 502 is displayed to display a list of identified recognition targets “female vocal” and “guitar”, and accepts a selection of either “female vocal” or “guitar” to be a selection recognition target. Like that.

または、図５ｃに示すように、ユーザから指定された範囲５０３に対する画像認識によって「女性ボーカル」と「男性ボーカル」が識別された認識対象として得られた場合には、選択ウインドウ５０２を表示して、識別された認識対象「女性ボーカル」、「男性ボーカル」のリストを表示して、「女性ボーカル」、「男性ボーカル」のいずれかの選択を受け付けて、選択認識対象とするようにする。 Alternatively, as shown in FIG. 5c, when “female vocal” and “male vocal” are obtained as the recognition targets identified by the image recognition for the range 503 designated by the user, a selection window 502 is displayed. Then, a list of identified recognition targets “female vocals” and “male vocals” is displayed, and selection of either “female vocals” or “male vocals” is accepted and selected.

または、図５ｄに示すように、以上のイコライザ特性設定制御処理のステップ３０６の画像認識において、指定範囲５０４内の画像に対する画像認識処理によって、認識対象として複数の大きさが異なる人物であるところの認識対象（「女性ボーカル」や「男性ボーカル」等）が、識別された場合には、同図に示すように選択ウインドウ５０５を表示し、識別した認識対象と、奥行き感のいずれかの選択を受け付け、識別した認識対象が選択された場合には、識別した認識対象に対応づけて登録されているイコライザ特性をイコライザ特性テーブル１３より取得し（ステップ３０８）、取得したイコライザ特性をイコライザ６に設定し、奥行き感が選択された場合には、イコライザ特性テーブル１３に予め登録しておいた出力音像の奥行き感を増加させる音響特性を与えるイコライザ特性を取得して、取得したイコライザ特性をイコライザ６に設定するようにしてもよい。 Alternatively, as shown in FIG. 5d, in the image recognition in step 306 of the equalizer characteristic setting control process described above, the image recognition process for the image in the designated range 504 is a person having a plurality of different sizes as recognition targets. When a recognition target (such as “female vocal” or “male vocal”) is identified, a selection window 505 is displayed as shown in the figure to select either the identified recognition target or a sense of depth. When the recognized and identified recognition target is selected, the equalizer characteristic registered in association with the identified recognition target is acquired from the equalizer characteristic table 13 (step 308), and the acquired equalizer characteristic is set in the equalizer 6 When the sense of depth is selected, the sense of depth of the output sound image registered in advance in the equalizer characteristic table 13 is selected. Acquires the equalizer characteristic to provide an acoustic characteristic of increasing, it may be set the acquired equalizer characteristics to the equalizer 6.

ところで、以上の実施形態では画像認識辞書１０に、認識対象を表す画像の特徴を表す特徴データを認識対象と対応づけて登録したが、画像認識辞書１０には認識対象を表す典型的な画像を参照画像として認識対象と対応づけて登録するようにしてもよい。ここで、この場合、画像認識部１１は、指定された範囲内の画像とパターンや特徴がマッチする参照画像と対応づけて画像認識辞書１０に登録されている認識対象を識別する。 In the above embodiment, the feature data representing the feature of the image representing the recognition target is registered in the image recognition dictionary 10 in association with the recognition target. However, a typical image representing the recognition target is registered in the image recognition dictionary 10. The reference image may be registered in association with the recognition target. Here, in this case, the image recognition unit 11 identifies a recognition target registered in the image recognition dictionary 10 in association with a reference image whose pattern or feature matches with an image within the specified range.

なお、以上のオーディオ装置には、画像認識辞書１０やイコライザ特性テーブル１３の内容をユーザ操作に応じて編集する編集機能を設けるようにしてもよい。また、この場合において、画像認識辞書１０に上述のように参照画像を登録する場合には、ビデオソース機器１が出力するビデオデータが表す画像を参照画像として画像認識辞書１０に登録できるようにしてもよい。 Note that the above audio device may be provided with an editing function for editing the contents of the image recognition dictionary 10 and the equalizer characteristic table 13 in accordance with a user operation. In this case, when the reference image is registered in the image recognition dictionary 10 as described above, an image represented by the video data output from the video source device 1 can be registered in the image recognition dictionary 10 as a reference image. Also good.

以上、本発明の実施形態について説明した。 The embodiment of the present invention has been described above.

１…ビデオソース機器、２…スピーカ、３…入力装置、４…表示装置、５…制御部、６…イコライザ、７…アンプ、８…ビデオメモリ、９…表示制御部、１０…画像認識辞書、１１…画像認識部、１２…イコライザ特性設定制御部、１３…イコライザ特性テーブル。 DESCRIPTION OF SYMBOLS 1 ... Video source device, 2 ... Speaker, 3 ... Input device, 4 ... Display apparatus, 5 ... Control part, 6 ... Equalizer, 7 ... Amplifier, 8 ... Video memory, 9 ... Display control part, 10 ... Image recognition dictionary, DESCRIPTION OF SYMBOLS 11 ... Image recognition part, 12 ... Equalizer characteristic setting control part, 13 ... Equalizer characteristic table.

Claims

An audio device that outputs audio along with video display,
An equalizer that adjusts the sound to be output with an acoustic characteristic represented by a set characteristic value;
For a plurality of objects, a characteristic value table in which correspondence between the identification of the object and the characteristic value is registered,
The specification of a range is received from the user on the display of the video, and image recognition processing is performed on a portion within the specified range in the displayed video, and an image is included in the portion within the range. Image recognition means for recognizing the object;
An audio apparatus comprising: equalizer characteristic setting means for setting, in the equalizer, the characteristic value registered in the characteristic value table so as to correspond to the identification of the object recognized by the image recognition means.

The audio device according to claim 1,
Each of the plurality of objects is a sound source,
The audio apparatus, wherein the characteristic value corresponding to the identification of the object registered in the characteristic value table represents an acoustic characteristic that makes a sound generated by the object stand out.

The audio device according to claim 1 or 2,
When the image recognition means recognizes a plurality of objects, the equalizer characteristic setting means accepts selection of one object from the plurality of objects, and identifies the object that has received the selection. The audio device is characterized in that the characteristic value registered in the characteristic value table is set in the equalizer.

A computer program that is read and executed by a computer having an equalizer that adjusts the sound to be output with an acoustic characteristic represented by a set characteristic value, together with displaying video and outputting sound.
The computer,
For a plurality of objects, a characteristic value table in which correspondence between the identification of the object and the characteristic value is registered,
The specification of a range is received from the user on the display of the video, and image recognition processing is performed on a portion within the specified range in the displayed video, and an image is included in the portion within the range. Image recognition means for recognizing the object;
A computer program for causing a function corresponding to identification of the object recognized by the image recognition means to function as an equalizer characteristic setting means for setting the characteristic value registered in the characteristic value table in the equalizer.