JP2008209703A

JP2008209703A - Karaoke machine

Info

Publication number: JP2008209703A
Application number: JP2007046691A
Authority: JP
Inventors: Akira Ouchi; 亮大内
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2007-02-27
Filing date: 2007-02-27
Publication date: 2008-09-11

Abstract

PROBLEM TO BE SOLVED: To provide a karaoke machine 1 outputting the sound of a directional beam 6a composed of a singing voice to a singer 6 while tracking the singing position of the singer 6 by detecting a microphone position. SOLUTION: The karaoke machine 1 outputs the sound of the singing voice of the singer and accompaniment sound 81 picked up by a microphone 2 from a speaker array 3. The karaoke machine 1 outputs the measurement sound 83 from the speakers SP1, SPn at both ends of the speaker array 3 simultaneously with the sound production of the maskers or immediately thereafter. The karaoke machine 1 detects the microphone position based on the time elapsed from the production of the measurement sound 83 till picking up of sound by the microphone 2. At this time, the measurement sounds 83 are generated by the harmonics of the fundamental frequencies of the masker. The karaoke machine 1 tracks the microphone position by outputting the periodic measurement sounds 83. Namely, the karaoke machine 1 can continuously output the sound of the directional beam 6a composed of the singing voice to the singer 6 by tracking the microphone position. COPYRIGHT: (C)2008,JPO&INPIT

Description

この発明は、歌唱音声の指向性を制御できるカラオケ装置に関する。 The present invention relates to a karaoke apparatus capable of controlling the directivity of singing voice.

従来のカラオケ装置は、カラオケボックスの１室等のような単一のグループが占有する場所に設置されるほか、スナック店舗等の飲食店等、不特定の顧客が集まる場所に設置される場合も多い。 A conventional karaoke device is installed in a place occupied by a single group such as a room of a karaoke box, or in a place where unspecified customers gather such as a restaurant such as a snack shop. Many.

従来のカラオケ装置は、ステレオスピーカを用いて、設置される場所内全体に伴奏音及び歌唱音声を拡声していた。この場合、上記不特定の顧客が集まる店舗に設置されると、誰が歌唱した歌唱音声でも、店内の全てに聞こえるようになっていた。スナック等の飲食店では、他のグループの歌唱は必ずしも聞きたいものではなく、場合によっては耳障りなものである。これを解決するために、例えば、歌唱者に向けた指向性スピーカを設置し、歌唱者には歌唱音声を聞かせ、歌唱者以外にはガイドボーカルを聞かせたり、予め指定したグループと歌唱者には歌唱音声をきかせ、それ以外には、ガイドボーカルを聞かせたりしていた（特許文献１参照）。 A conventional karaoke apparatus uses a stereo speaker to amplify accompaniment sounds and singing sounds throughout the place where the karaoke apparatus is installed. In this case, when installed in a store where the unspecified customers gather, the singing voice sung by anyone can be heard by all in the store. In restaurants such as snacks, the singings of other groups are not necessarily something you want to hear, and in some cases they are annoying. In order to solve this, for example, a directional speaker for singers is installed, the singers are allowed to listen to the singing voice, the non-singers are allowed to listen to the guide vocals, Other than that, the singing voice was heard and the guide vocal was heard (see Patent Document 1).

特許文献１のカラオケ装置では、ユーザの操作入力により予め設定されたグループと、指定した歌唱位置（カラオケ装置のモニタ付近）に向けて指向性を持たせて歌唱音声を放音し、他には指向性を持たせてガイドボーカルを放音していた。
特開２００５−１７３１３７公報 In the karaoke device of Patent Document 1, a singing sound is emitted with directivity toward a group set in advance by a user's operation input and a designated singing position (near the monitor of the karaoke device), The guide vocal was emitted with directivity.
JP 2005-173137 A

しかしながら、特許文献１の発明では、歌唱者が歌唱中に位置を変更すると、その都度、その場にいる誰かが歌唱音声の放音方向を指定しなければならないという問題があった。 However, in the invention of Patent Document 1, when the singer changes the position during singing, there is a problem that every time the singer has to specify the sound emission direction of the singing voice.

そこで、この発明は、歌唱者に歌唱音声を放音するために、歌唱者の歌唱位置を追尾するカラオケ装置を提供することを目的とする。 Then, this invention aims at providing the karaoke apparatus which tracks a singer's singing position in order to emit a singing voice to a singer.

請求項１の発明は、歌唱者の歌唱音声を含む周囲からの音声をマイクで収音し、音声信号を生成する収音手段と、複数のスピーカを有するスピーカアレイの２つのスピーカから、マスカーの基本周波数の倍音で構成される測定音を、該マスカーの発音と同時またはその直後に放音する放音手段と、該放音手段による前記測定音の放音から、前記収音手段による該測定音の収音までの経過時間に基づいて、前記マイク位置を検出するマイク位置検出手段と、を備え、前記放音手段は、前記マイク位置検出手段が検出した前記マイク位置に向けて、歌唱者に与えるべき放音音声を含む指向性ビームを放音することを特徴とする。 According to the first aspect of the present invention, the sound from the surroundings including the singing voice of the singer is picked up by the microphone, the sound collecting means for generating the sound signal, and the two speakers of the speaker array having a plurality of speakers, A measurement sound composed of harmonics of the fundamental frequency is emitted at the same time or immediately after the masker's pronunciation, and from the sound emission of the measurement sound by the sound emission means, the measurement by the sound collection means Microphone position detection means for detecting the microphone position based on the elapsed time until sound collection, and the sound emission means is a singer toward the microphone position detected by the microphone position detection means. The directional beam including the sound to be emitted is emitted.

この構成では、カラオケ装置は、スピーカアレイの中の２つのスピーカから測定音をマスカーの発音と同時またはその直後に放音する。カラオケ装置は、スピーカアレイの中の２つのスピーカから放音した測定音をマイクで収音するまでの経過時間から、マイク位置を検出する。この際、測定音は、マスカーの基本周波数の倍音で構成される。また、カラオケ装置は、検出したマイク位置（歌唱者）に向けて、歌唱音声を含む指向性ビームを放音する。これにより、カラオケ装置は、歌唱者の歌唱位置を知ることができ、歌唱者に向けて、歌唱音声を含む指向性ビームを放音することができる。また、測定音は、マスカーの基本周波数の倍音で構成されるので、マスカーによりマスキングされる。このため、カラオケ装置は、人に知覚されることなく、測定音を放音して、歌唱者の歌唱位置を知ることができ、歌唱者に向けて、歌唱音声を含む指向性ビームを放音することができる。また、所定の間隔で測定音を放音することで、カラオケ装置は、歌唱者を追尾することができる。これにより、カラオケ装置は、歌唱者が移動しても、歌唱者に向けて歌唱音声を含む指向性ビームを放音することができる。 In this configuration, the karaoke apparatus emits the measurement sound from the two speakers in the speaker array at the same time as or immediately after the masker's pronunciation. The karaoke apparatus detects the microphone position from the elapsed time until the measurement sound emitted from the two speakers in the speaker array is collected by the microphone. At this time, the measurement sound is composed of harmonics of the fundamental frequency of the masker. In addition, the karaoke apparatus emits a directional beam including the singing voice toward the detected microphone position (singer). Thereby, the karaoke apparatus can know a singer's singing position, and can emit a directional beam including a singing voice toward the singer. Moreover, since the measurement sound is composed of overtones of the basic frequency of the masker, it is masked by the masker. For this reason, the karaoke apparatus emits the measurement sound without being perceived by a person and can know the singing position of the singer, and emits the directional beam including the singing voice toward the singer. can do. Moreover, the karaoke apparatus can track the singer by emitting the measurement sound at predetermined intervals. Thereby, even if a singer moves, the karaoke apparatus can emit a directional beam including a singing voice toward the singer.

請求項２の発明は、前記放音手段は、カラオケ曲の伴奏音を構成する１又は複数の楽器音をマスカーとして、予めカラオケ曲のデータに含まれる前記測定音を放音することを特徴とする。 The invention according to claim 2 is characterized in that the sound emitting means emits the measurement sound included in the data of the karaoke song in advance, using one or a plurality of instrument sounds constituting the accompaniment sound of the karaoke song as a masker. To do.

この構成では、カラオケ曲の伴奏音の中から１又は複数の楽器音をマスカーとして、予めカラオケ曲に含まれた測定音を放音する。これにより、カラオケ演奏中、定期的に演奏される楽器音をマスカーとすることで、マスカーの発音と同時に、測定音を定期的に放音することができる。また、マスカーとなる楽器音が複数の場合は、測定音を放音回数を増すことができ、より定期的に測定音を放音することができる。 In this configuration, one or a plurality of musical instrument sounds are used as maskers from accompaniment sounds of karaoke songs, and the measurement sounds included in the karaoke songs are emitted in advance. Thereby, during the karaoke performance, the musical instrument sound that is regularly played is used as a masker, so that the measurement sound can be periodically emitted simultaneously with the pronunciation of the masker. In addition, when there are a plurality of instrument sounds serving as maskers, the number of measurement sounds can be increased, and the measurement sounds can be emitted more regularly.

請求項３の発明は、前記放音手段は、カラオケ曲の伴奏音を構成する１又は複数の楽器音の放音のタイミング毎に、その楽器音をマスカーとする前記測定音を生成して放音することを特徴とする。 According to a third aspect of the present invention, the sound emitting means generates and releases the measurement sound having the instrument sound as a masker at each timing of sound emission of one or more instrument sounds constituting the accompaniment sound of the karaoke song. It is characterized by sound.

この構成では、カラオケ装置は、カラオケ曲の伴奏音を解析して、マスカーとなる楽器音を決定し、マスカーの放音のタイミングで、測定音を生成して放音する。これにより、カラオケ曲に予め測定音が含まれていなくても、カラオケ装置は自動で測定音を生成して放音することができる。 In this configuration, the karaoke apparatus analyzes the accompaniment sound of the karaoke song, determines an instrument sound to be a masker, generates a measurement sound at the timing of the masker sound emission, and emits it. Thereby, even if the measurement sound is not included in the karaoke song in advance, the karaoke apparatus can automatically generate and emit the measurement sound.

請求項４の発明は、前記放音手段は、歌唱音声の音圧レベルの上昇を検知し、その歌唱音声をマスカーとする前記測定音を生成して放音することを特徴とする。 The invention of claim 4 is characterized in that the sound emitting means detects an increase in the sound pressure level of the singing voice, and generates and emits the measurement sound using the singing voice as a masker.

この構成では、カラオケ装置は、マスカーである歌唱音声の音圧レベルの上昇を検知して、測定音を生成して放音する。これにより、アカペラ等の演目で、カラオケ曲に伴奏音が含まれない場合であっても、測定音を発音することができる。 In this configuration, the karaoke device detects an increase in the sound pressure level of the singing voice that is a masker, generates a measurement sound, and emits the sound. As a result, even if the accompaniment is not included in the karaoke music piece such as a cappella, the measurement sound can be generated.

この発明によれば、マスカーの基本周波数の倍音で測定音を構成することで、カラオケ装置は、人に知覚されることなく、測定音をスピーカアレイの２つのスピーカから放音して、マイクで収音することができる。これにより、カラオケ装置は、マイクの位置を検出することができるので、歌唱者の歌唱位置を知ることができ、歌唱者に向けて歌唱音声を含む指向性ビームを放音することができる。更に、測定音を随時放音することで、歌唱者の歌唱位置を追尾することができ、歌唱者が移動しても、歌唱者に向けて歌唱音声を含む指向性ビームを放音することができる。 According to the present invention, the measurement sound is composed of overtones of the fundamental frequency of the masker, so that the karaoke apparatus emits the measurement sound from the two speakers of the speaker array without being perceived by a person and uses the microphone. Sound can be collected. Thereby, since the karaoke apparatus can detect the position of a microphone, it can know a singer's singing position and can emit a directional beam containing a singing voice toward a singer. Furthermore, it is possible to track the singing position of the singer by emitting the measurement sound at any time, and even if the singer moves, the directional beam including the singing voice can be emitted toward the singer. it can.

［第１実施形態］
本発明の実施形態に係るカラオケ装置について、図１，２を参照して説明する。図１は、飲食店の店内を説明する図である。図１（Ａ）は、歌唱者がモニタの前で、歌唱している様子を示す。図１（Ｂ）は、歌唱者が、自身のグループの前で歌唱している様子を示す。図２は、マイク位置検出方法の説明図である。 [First Embodiment]
A karaoke apparatus according to an embodiment of the present invention will be described with reference to FIGS. FIG. 1 is a diagram illustrating the inside of a restaurant. FIG. 1A shows a state where a singer is singing in front of a monitor. FIG. 1B shows a state where a singer is singing in front of his / her group. FIG. 2 is an explanatory diagram of a microphone position detection method.

図１（Ａ）に示すように、飲食店の店内５には、カラオケ装置１が設置されている。カラオケ装置１は、マイク２とスピーカアレイ３とモニタ４を有する。更に、店内５には、テーブル７（７ａ〜７ｄ）が配置され、各テーブル７ａ〜７ｄには、それぞれ顧客が着席している。また、テーブル７ａの顧客である歌唱者６は、カラオケ装置１を利用して歌唱する。なお、説明の簡単化のため、本実施形態では、歌唱者６の歌唱音声は、自身と自身が着席しているテーブル７ａとに聞かせ、他のテーブル７ｂ〜７ｄには歌唱音声を聞かせずに、ガイドボーカルを聞かせる場合について説明する。 As shown in FIG. 1A, a karaoke apparatus 1 is installed in a restaurant 5 of a restaurant. The karaoke apparatus 1 includes a microphone 2, a speaker array 3, and a monitor 4. Furthermore, tables 7 (7a to 7d) are arranged in the store 5, and customers are seated on the tables 7a to 7d, respectively. Moreover, the singer 6 who is a customer of the table 7a sings using the karaoke apparatus 1. For the sake of simplification of explanation, in this embodiment, the singing voice of the singer 6 is heard on the table 7a on which the singer 6 is seated and the singing voice is not heard on the other tables 7b to 7d. The case where the guide vocal is heard will be described.

歌唱者６が歌唱すると、カラオケ装置１は、歌唱音声を含む指向性ビーム７０ａを生成し、歌唱者６のグループが着席しているテーブル７ａに向けて放音するとともに、歌唱者６の位置を検出し、歌唱音声を含む指向性ビーム６ａを生成して、歌唱者６に放音する。図１（Ｂ）に示すように、歌唱者６が移動した場合、カラオケ装置１は、歌唱者６の位置を追尾し、歌唱音声を含む指向性ビーム６ａを生成して、歌唱者６に放音する。また、カラオケ装置１は、ガイドボーカルを含む指向性ビーム７０ｂ〜７０ｄを生成して、他のテーブル７ｂ〜７ｄへ放音する。この際、カラオケ装置１は、歌唱者６の操作入力を受け付け、歌唱音声を放音するテーブル７ａを指定させる。 When the singer 6 sings, the karaoke apparatus 1 generates a directional beam 70a including the singing voice, emits the sound toward the table 7a on which the group of the singer 6 is seated, and the position of the singer 6 The directional beam 6 a including the singing voice is detected and emitted to the singer 6. As shown in FIG. 1B, when the singer 6 moves, the karaoke apparatus 1 tracks the position of the singer 6, generates a directional beam 6a including the singing voice, and releases it to the singer 6. Sound. Moreover, the karaoke apparatus 1 produces | generates the directional beams 70b-70d containing a guide vocal, and emits sound to the other tables 7b-7d. At this time, the karaoke apparatus 1 accepts the operation input of the singer 6 and designates the table 7a for emitting the singing voice.

本発明では、カラオケ装置１は、予めカラオケ曲に含まれる測定音を、スピーカアレイ３の両端のスピーカから放音し、マイク２で収音する。カラオケ装置１は、測定音の放音から収音までの時間を計測し、三角法を用いて、マイク２の位置を検出する。カラオケ装置１は、定期的に測定音を放音することで、マイク２を追尾し、マイク２に向けて、歌唱音声を含む指向性ビーム６ａを放音する。更に、測定音は、カラオケ曲の伴奏音に含まれる楽器音をマスカーとして、マスカーの基本周波数の倍音から構成される。カラオケ装置１は、マスカーの発音と同時又はマスカーの発音の直後に測定音を発音することで、測定音を同時マスキング又は経時マスキングしながら放音することができる。これにより、本発明では、人に知覚されることなく測定音を放音して、マイク２の位置を検出することができるので、歌唱者６を追尾しながら、歌唱音声を含む指向性ビーム６ａを歌唱者６に放音することができる。なお、本発明では、マスカーとは、測定音の発音を隠す音のことを言う。 In the present invention, the karaoke apparatus 1 emits the measurement sound included in the karaoke song in advance from the speakers at both ends of the speaker array 3 and collects the sound with the microphone 2. The karaoke apparatus 1 measures the time from sound emission to sound collection and detects the position of the microphone 2 using trigonometry. The karaoke apparatus 1 tracks the microphone 2 by periodically emitting the measurement sound, and emits the directional beam 6 a including the singing sound toward the microphone 2. Further, the measurement sound is composed of overtones of the fundamental frequency of the masker, with the instrumental sound included in the accompaniment sound of the karaoke song as the masker. The karaoke apparatus 1 can emit the measurement sound while simultaneously masking or aging masking by generating the measurement sound simultaneously with the masker sounding or immediately after the masker sounding. Accordingly, in the present invention, since the measurement sound can be emitted without being perceived by a person and the position of the microphone 2 can be detected, the directional beam 6a including the singing voice can be tracked while tracking the singer 6. Can be released to the singer 6. In the present invention, the masker means a sound that hides the pronunciation of the measurement sound.

以下に、マイク位置の検出方法について、図２を参照して説明する。図２に示すように、カラオケ装置１は、スピーカアレイ３（スピーカＳＰ１〜ＳＰｎ）の両端のスピーカＳＰ１とＳＰｎとから、測定音８３を放音する。測定音８３は、スピーカＳＰ１とスピーカＳＰｎとから放音されると、マイク２により収音される。ここで、スピーカＳＰ１から放音した測定音８３をマイク２で収音するまでの経過時間をＴａ、スピーカＳＰｎから放音した測定音８３をマイク２で収音するまでの経過時間をＴｂ、スピーカＳＰ１からの距離をＬａ、スピーカＳＰｎからの距離をＬｂとする。スピーカＳＰ１及びＳＰｎからの経過時間（Ｔａ＜Ｔｂ）から、スピーカＳＰ１及びＳＰｎからマイク２までの距離（Ｌａ＜Ｌｂ）が求まる。これに、三角法を利用して、マイク２の位置を算出する（（Ａ）参照）。また、経過時間がＴａ≒Ｔｂの場合（（Ｂ）参照）、経過時間がＴａ＞Ｔｂの場合（（Ｃ）参照）も同様の方法で、マイク２の位置を算出する。これにより、スピーカＳＰ１，ＳＰｎから定期的に測定音８３を放音することで、カラオケ装置１は、マイク２の位置を検出し、マイク２の位置を追尾して、歌唱音声を放音することができる。 Hereinafter, a method for detecting the microphone position will be described with reference to FIG. As shown in FIG. 2, the karaoke apparatus 1 emits the measurement sound 83 from the speakers SP1 and SPn at both ends of the speaker array 3 (speakers SP1 to SPn). The measurement sound 83 is picked up by the microphone 2 when sound is emitted from the speakers SP1 and SPn. Here, the elapsed time until the measurement sound 83 emitted from the speaker SP1 is collected by the microphone 2 is Ta, and the elapsed time until the measurement sound 83 emitted from the speaker SPn is collected by the microphone 2 is Tb. The distance from SP1 is La, and the distance from speaker SPn is Lb. From the elapsed time from the speakers SP1 and SPn (Ta <Tb), the distance from the speakers SP1 and SPn to the microphone 2 (La <Lb) is obtained. For this, the position of the microphone 2 is calculated using trigonometry (see (A)). Also, the position of the microphone 2 is calculated by the same method when the elapsed time Ta≈Tb (see (B)) and when the elapsed time Ta> Tb (see (C)). Thereby, the karaoke apparatus 1 detects the position of the microphone 2, tracks the position of the microphone 2, and emits the singing voice by periodically emitting the measurement sound 83 from the speakers SP1 and SPn. Can do.

次に、スピーカＳＰ１，ＳＰｎから放音する測定音８３について、図３，４を参照して説明する。図３は、マスカーの選択についての説明図である。図３（Ａ）は、マスカーに適している例を示す。図３（Ｂ）は、マスカーに適さない例を示す。図４は、測定音の加算についての説明図である。 Next, the measurement sound 83 emitted from the speakers SP1 and SPn will be described with reference to FIGS. FIG. 3 is an explanatory diagram for selecting a masker. FIG. 3A shows an example suitable for a masker. FIG. 3B shows an example that is not suitable for a masker. FIG. 4 is an explanatory diagram regarding the addition of the measurement sound.

測定音８３は、カラオケ曲の伴奏音に含まれる楽器音をマスカーとして、マスカーの基本周波数の倍音を用いて生成される。測定音８３は、マスカーの発音と同時に発音されたり、マスカーの発音の直後に発音されたりすることで、同時マスキング、又は経時マスキングされる。また、測定音８３は、楽器音の種類とレベルに応じて、音圧レベルが変更される。例えば、楽器音の音圧が上昇すると、測定音８３の音圧を上げ、楽器音の音圧が下降すると、測定音８３の音圧を下げる。これにより、歌唱者６や店内５の顧客は、測定音８３を知覚せずに、カラオケを楽しむことができる。 The measurement sound 83 is generated using overtones of the fundamental frequency of the masker with the instrument sound included in the accompaniment sound of the karaoke song as the masker. The measurement sound 83 is simultaneously masked or masked over time by being pronounced simultaneously with the masker's pronunciation or by being pronounced immediately after the masker's pronunciation. The sound pressure level of the measurement sound 83 is changed according to the type and level of the instrument sound. For example, when the sound pressure of the instrument sound increases, the sound pressure of the measurement sound 83 is increased, and when the sound pressure of the instrument sound decreases, the sound pressure of the measurement sound 83 is decreased. Thereby, the singer 6 and the customers in the store 5 can enjoy karaoke without perceiving the measurement sound 83.

マスカーに適している楽器音は、図３（Ａ）に示すように、低域から高域まで、音の成分がある楽器音である。例えば、ハープシーコード、グロッケン、シロホン等の楽器音や、波形がノコギリ波になる楽器音である。また、マスカーに適さない楽器音は、図３（Ｂ）に示すように、低域のみに音の成分があり、高域は音の成分がない楽器音である。例えば、オルガン、ホルン等の楽器音である。 As shown in FIG. 3A, the instrument sound suitable for the masker is an instrument sound having sound components from a low range to a high range. For example, instrument sounds such as harpsichord, glocken, and xylophone, and instrument sounds that have a sawtooth waveform. In addition, as shown in FIG. 3B, the instrument sound that is not suitable for a masker is a musical instrument sound that has a sound component only in the low range and no sound component in the high range. For example, instrument sounds such as organs and horns.

ここで、一般的に、人の聴覚が知覚できる周波数帯域は、２０Ｈｚ〜２０ｋＨｚ程度であり、１５ｋＨｚ以上の周波数帯域は、人によって聞こえたり聞こえなかったりする。そこで、図４に示すように、音階のある楽器をマスカーとする場合は、マスカーとなる楽器音の基本周波数の倍音で、かつ、人が聞き取り難い周波数帯域（１５ｋＨｚ〜）に測定音８３を生成する。また、音階のない楽器をマスカーとする場合は、マスカーとなる楽器音の周波数成分がある帯域で、かつ、人が聞き取り難い帯域（１５ｋＨｚ〜）に測定音８３を生成する。これにより、測定音８３は、マスカーによりマスキングされ、かつ、人が聞き取り難い周波数帯域からなる音なので、歌唱者６や店内５の顧客により知覚されなくなる。 Here, in general, the frequency band in which human hearing can be perceived is about 20 Hz to 20 kHz, and the frequency band of 15 kHz or higher may or may not be heard by humans. Therefore, as shown in FIG. 4, when a musical instrument with a scale is used as a masker, a measurement sound 83 is generated in a frequency band (15 kHz to) that is a harmonic overtone of the fundamental frequency of the instrumental sound that is a masker and that is difficult for humans to hear. To do. When a musical instrument without a scale is used as a masker, the measurement sound 83 is generated in a band having a frequency component of the musical instrument sound that becomes a masker and in a band (15 kHz to) that is difficult for humans to hear. Accordingly, the measurement sound 83 is masked by a masker and is a sound having a frequency band that is difficult for humans to hear, so that the measurement sound 83 is not perceived by the singer 6 or the customer in the store 5.

具体的に、例えば、この測定音８３は、伴奏音８１の人が聞き取り難い周波数帯域（１５ｋＨｚ〜）に予め含まれており、伴奏音８１とともにカラオケ装置１から放音される。カラオケ装置１は、伴奏音８１をスピーカアレイ３から放音する際に、ローパスフィルタに伴奏音８１と測定音８３とを通過させる。カラオケ装置１は、人が聞き取り難い周波数帯域（１５ｋＨｚ〜）をカットすることで、測定音８３を取り除いた伴奏音８１のみをスピーカアレイ３から放音する。次に、カラオケ装置１は、ローパスフィルタで取り除いた帯域（１５ｋＨｚ〜）から測定音８３が存在する帯域を取得するバンドパスフィルタに伴奏音８１と測定音８３とを通過させることで、測定音８３を取得して、両端のスピーカＳＰ１，ＳＰｎから放音する。これにより、スピーカアレイ３の各スピーカＳＰ１〜ＳＰｎから伴奏音８１を放音し、両端のスピーカＳＰ１，ＳＰｎは、伴奏音８１と一緒に測定音８３を放音することができる。 Specifically, for example, the measurement sound 83 is included in advance in a frequency band (15 kHz to) that is difficult for a person of the accompaniment sound 81 to hear, and is emitted from the karaoke apparatus 1 together with the accompaniment sound 81. The karaoke apparatus 1 allows the accompaniment sound 81 and the measurement sound 83 to pass through the low-pass filter when the accompaniment sound 81 is emitted from the speaker array 3. The karaoke apparatus 1 emits only the accompaniment sound 81 from which the measurement sound 83 is removed from the speaker array 3 by cutting a frequency band (15 kHz to) that is difficult for humans to hear. Next, the karaoke apparatus 1 passes the accompaniment sound 81 and the measurement sound 83 through a bandpass filter that acquires a band in which the measurement sound 83 exists from the band (15 kHz to) removed by the low-pass filter, so that the measurement sound 83 is obtained. And sound is emitted from the speakers SP1 and SPn at both ends. Thereby, the accompaniment sound 81 can be emitted from each speaker SP1-SPn of the speaker array 3, and the speakers SP1 and SPn at both ends can emit the measurement sound 83 together with the accompaniment sound 81.

なお、測定音８３は、必ずしも人が聞き取り難い周波数帯域（１５ｋＨｚ〜）で生成される必要はなく、マスカーとなる楽器が音階を有する場合は、マスカーの基本周波数の倍音で生成されればよく、マスカーとなる楽器が音階を有しない場合は、マスカーの音の周波数成分のある帯域で生成されればよい。この場合、カラオケ装置１は、伴奏音８１やマイク２が収音した歌唱音声等から測定音８３を検出できればよい。 Note that the measurement sound 83 does not necessarily have to be generated in a frequency band (15 kHz to) that is difficult for humans to hear. If the musical instrument to be a masker has a scale, it may be generated with a harmonic of the basic frequency of the masker. If the musical instrument to be a masker does not have a scale, it may be generated in a band having a frequency component of the masker sound. In this case, the karaoke apparatus 1 only needs to be able to detect the measurement sound 83 from the accompaniment sound 81 or the singing sound collected by the microphone 2.

次に、カラオケ装置１の機能について、図５を参照して説明する。図５は、カラオケ装置の機能ブロック図である。カラオケ装置１は、操作部１００、制御部１０、記憶部８、ＭＩＤＩ音源９１、ガイドボーカル再生部９２、マイク２、スピーカアレイ３（スピーカＳＰ１〜ＳＰｎ）、Ａ／Ｄコンバータ１１，１６、ビーム形成部１３，１８、ローパスフィルタ１２，１７、バンドパスフィルタ１４（１４ａ〜１４ｄ），１９（１９ａ〜１９ｄ）、マイク位置検出部１５、ミキサ２０、Ｄ／Ａコンバータ２１（２１−１〜２１−ｎ）及びＡＭＰ２２（２２−１〜２２−ｎ）から構成される。以下、説明の簡単化のため、本実施形態で用いるマイク２の収音範囲は、２０ｋＨｚ以下とし、測定音８３は、１５ｋＨｚ〜２０ｋＨｚの周波数帯域で生成されるものとして以下に説明する。 Next, the function of the karaoke apparatus 1 will be described with reference to FIG. FIG. 5 is a functional block diagram of the karaoke apparatus. The karaoke apparatus 1 includes an operation unit 100, a control unit 10, a storage unit 8, a MIDI sound source 91, a guide vocal reproduction unit 92, a microphone 2, a speaker array 3 (speakers SP1 to SPn), A / D converters 11 and 16, and beam forming. Units 13 and 18, low-pass filters 12 and 17, band-pass filters 14 (14a to 14d) and 19 (19a to 19d), microphone position detection unit 15, mixer 20, D / A converter 21 (21-1 to 21-n) ) And AMP22 (22-1 to 22-n). Hereinafter, for simplification of description, the sound collection range of the microphone 2 used in the present embodiment is 20 kHz or less, and the measurement sound 83 is described below as being generated in a frequency band of 15 kHz to 20 kHz.

操作部１００は、歌唱者６等の操作入力を受け、操作入力内容を制御部１０へ出力する。例えば、操作部１００は、カラオケ曲の選曲や、歌唱者６の歌唱音声を放音するテーブル７ａの指定や、ガイドメロディ８２を放音する／放音しない等の各種設定が入力される。また、説明の簡単化のため、ガイドメロディ８２を放音しないよう設定されたものとする。 The operation unit 100 receives an operation input from the singer 6 or the like and outputs the operation input content to the control unit 10. For example, the operation unit 100 is input with various settings such as selection of a karaoke song, specification of the table 7a that emits the singing voice of the singer 6, and whether or not the guide melody 82 is emitted. For simplicity of explanation, it is assumed that the guide melody 82 is set not to be emitted.

制御部１０は、操作部１００の操作入力を受け、以下に説明するカラオケ装置１の各機能部を制御する。各機能部の制御方法については、後述する。 The control part 10 receives the operation input of the operation part 100, and controls each function part of the karaoke apparatus 1 demonstrated below. A method for controlling each functional unit will be described later.

記憶部８は、複数のカラオケ曲を記憶しており、カラオケ曲毎に、伴奏音８１のデータ、ガイドメロディ８２のデータ、測定音８３のデータ、ガイドボーカル８４のデータを記憶する。 The storage unit 8 stores a plurality of karaoke songs, and stores accompaniment sound 81 data, guide melody 82 data, measurement sound 83 data, and guide vocal 84 data for each karaoke song.

ＭＩＤＩ音源９１は、制御部１０の指示により、記憶部８から伴奏音８１のデータ、ガイドメロディ８２のデータ、測定音８３のデータを逐次取得し、Ａ／Ｄコンバータ１１に出力する。伴奏音８１は、色々な楽器音から構成される。ガイドメロディ８２は、伴奏音８１の主旋律であり、歌唱者６の歌唱を支援するものである。測定音８３は、伴奏音８１の中から、１つ又は複数の楽器音をマスカーとして、マスカーの基本周波数の倍音で生成されている。この際、マスカーとなる楽器音は、カラオケ曲に応じて適切に選択される。また、測定音８３は、定期的（例えば１小節毎等）に、放音されるようになっている。更に、測定音８３は、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから放音される。この際、測定音８３は、スピーカＳＰ１，ＳＰｎ毎に、異なる周波数帯域で生成され、スピーカＳＰ１，ＳＰｎから放音される。これにより、カラオケ装置１は、マイク２で収音した測定音８３がスピーカＳＰ１，ＳＰｎのどちらから放音されたか判別することができる。 The MIDI sound source 91 sequentially acquires the data of the accompaniment sound 81, the data of the guide melody 82, and the data of the measurement sound 83 from the storage unit 8 according to an instruction from the control unit 10, and outputs the acquired data to the A / D converter 11. The accompaniment sound 81 is composed of various instrument sounds. The guide melody 82 is the main melody of the accompaniment sound 81 and supports the singing of the singer 6. The measurement sound 83 is generated from the accompaniment sound 81 by using one or a plurality of instrument sounds as a masker and a harmonic of the fundamental frequency of the masker. At this time, the musical instrument sound to be a masker is appropriately selected according to the karaoke song. The measurement sound 83 is emitted periodically (for example, every bar). Further, the measurement sound 83 is emitted from the speakers SP1 and SPn at both ends of the speaker array 3. At this time, the measurement sound 83 is generated in different frequency bands for each of the speakers SP1 and SPn and emitted from the speakers SP1 and SPn. Thereby, the karaoke apparatus 1 can determine from which of the speakers SP1 and SPn the measurement sound 83 collected by the microphone 2 is emitted.

また、カラオケ装置１は、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから同時又は別々に測定音８３を放音しても、マイク２の位置を検出することができる。この際、スピーカＳＰ１とスピーカＳＰｎとから別々に測定音８３を放音する場合には、同じ周波数の測定音８３を放音してもよい。また、スピーカＳＰ１とスピーカＳＰ２とから同時に測定音８３を放音する場合には、それぞれ周波数を変える必要がある。更に、歌唱者が連続的に移動している場合には、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから同時に測定音８３を放音した方が、別々に放音するより、マイク２の位置を正確に検出することができる。 Further, the karaoke apparatus 1 can detect the position of the microphone 2 even if the measurement sound 83 is emitted from the speakers SP1 and SPn at both ends of the speaker array 3 simultaneously or separately. At this time, when the measurement sound 83 is separately emitted from the speaker SP1 and the speaker SPn, the measurement sound 83 having the same frequency may be emitted. Further, when the measurement sound 83 is simultaneously emitted from the speaker SP1 and the speaker SP2, it is necessary to change the frequency. In addition, when the singer is continuously moving, the position of the microphone 2 is more effective when the measurement sound 83 is simultaneously emitted from the speakers SP1 and SPn at both ends of the speaker array 3 than when the sound is emitted separately. It can be detected accurately.

ガイドボーカル再生部９２は、制御部１０の指示により、記憶部８からガイドボーカル８４のデータを逐次取得して、Ａ／Ｄコンバータ１１に出力する。ガイドボーカル８４は、手本となる歌唱音声から構成され、歌唱者６の歌唱を支援するためのものである。 The guide vocal reproducing unit 92 sequentially acquires the data of the guide vocal 84 from the storage unit 8 according to an instruction from the control unit 10 and outputs the data to the A / D converter 11. The guide vocal 84 is composed of a singing voice serving as a model, and is for supporting the singing of the singer 6.

Ａ／Ｄコンバータ１１は、ＭＩＤＩ音源９１やガイドボーカル再生部９２から入力されたこれらのデータをアナログ形式からデジタル形式に変換して、オーディオ信号を生成する。 The A / D converter 11 converts these data input from the MIDI sound source 91 and the guide vocal reproducing unit 92 from an analog format to a digital format, and generates an audio signal.

ローパスフィルタ１２は、Ａ／Ｄコンバータ１１に入力されたオーディオ信号から、測定音８３のオーディオ信号が存在しない周波数帯域（〜１５ｋＨｚ）だけを通過させ、後述するビーム形成部１３に入力する。また、バンドパスフィルタ１４（１４ａ〜１４ｄ）は、Ａ／Ｄコンバータ１１に入力されたオーディオ信号から、測定音８３のオーディオ信号が存在する帯域の周波数成分（１５〜２０ｋＨｚの一部の周波数成分）だけを通過させ、後述するマイク位置検出部１５に入力する。この際、バンドパスフィルタ１４ａ〜１４ｄは、それぞれ異なる周波数成分を取り出す。 The low-pass filter 12 passes only the frequency band (˜15 kHz) in which the audio signal of the measurement sound 83 does not exist from the audio signal input to the A / D converter 11 and inputs it to the beam forming unit 13 described later. Further, the band pass filter 14 (14a to 14d) is a frequency component (a part of a frequency component of 15 to 20 kHz) in a band in which the audio signal of the measurement sound 83 exists from the audio signal input to the A / D converter 11. And the signal is input to the microphone position detector 15 described later. At this time, the band-pass filters 14a to 14d extract different frequency components.

マイク２は、歌唱者６の歌唱音声を収音するとともに、スピーカＳＰ１〜ＳＰｎから放音された放音音声についても収音する。マイク２は、収音した歌唱者６の歌唱音声とともに、スピーカアレイ３からの放音音声（伴奏音８１、測定音８３、ガイドボーカル８４等）をＡ／Ｄコンバータ１６、各フィルタ１７，１９（１９ａ〜１９ｄ）を介して、ビーム形成部１８、後述するマイク位置検出部１５に入力する。この際、歌唱音声とスピーカアレイ３からの放音音声と（以下、歌唱音声とスピーカアレイ３からの放音音声とを、収音音声と称す。）は、Ａ／Ｄコンバータ１６にてＡ／Ｄ変換され、収音音声信号として生成される。また、ローパスフィルタ１７は、測定音８３を含まない収音音声信号の低域部分（〜１５ｋＨｚ）だけを通過させ、ビーム形成部１８に入力する。この際、ローパスフィルタ１７は、ローパスフィルタ１２に対応する周波数成分を収音音声信号から取り出す。また、バンドパスフィルタ１９（１９ａ〜１９ｄ）は、測定音８３を含む収音音声信号の周波数成分（１５〜２０ｋＨｚの一部の周波数成分）だけを通過させ、後述するマイク位置検出部１５に入力する。この際、バンドパスフィルタ１９ａ〜１９ｄは、それぞれがバンドパスフィルタ１４ａ〜１４ｄに対応した周波数成分を収音音声信号から取り出す。 The microphone 2 picks up the singing voice of the singer 6 and picks up the sound emitted from the speakers SP1 to SPn. The microphone 2 collects the singing voice of the singer 6 that has collected the sound and the sound emitted from the speaker array 3 (accompaniment sound 81, measurement sound 83, guide vocal 84, etc.), A / D converter 16, and filters 17, 19 ( 19a to 19d) and input to the beam forming unit 18 and a microphone position detecting unit 15 described later. At this time, the singing voice and the sound emitted from the speaker array 3 (hereinafter, the singing voice and the sound emitted from the speaker array 3 are referred to as collected sound) are converted into A / D by the A / D converter 16. D-converted and generated as a collected sound signal. The low-pass filter 17 passes only the low frequency portion (˜15 kHz) of the collected sound signal that does not include the measurement sound 83 and inputs the low-frequency filter 17 to the beam forming unit 18. At this time, the low-pass filter 17 extracts a frequency component corresponding to the low-pass filter 12 from the collected sound signal. Further, the band pass filter 19 (19a to 19d) passes only the frequency components (a part of the frequency components of 15 to 20 kHz) of the collected sound signal including the measurement sound 83 and inputs them to the microphone position detection unit 15 described later. To do. At this time, each of the bandpass filters 19a to 19d extracts frequency components corresponding to the bandpass filters 14a to 14d from the collected sound signal.

ビーム形成部１３，１８は、スピーカアレイ３から指向性を持たせて、指向性ビーム６ａ，７０ａ〜７０ｄを放音するとともに、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから指向性を持たせずに伴奏音８１を放音するよう、各スピーカＳＰ１〜ＳＰｎに対応する放音音声信号を形成する。具体的には、ビーム形成部１３は、制御部１０の指示により、ローパスフィルタ１２によってフィルタリングされた伴奏音８１のオーディオ信号とガイドボーカル８４のオーディオ信号とから、スピーカアレイ３を構成する各スピーカＳＰ１〜ＳＰｎのそれぞれに対応した放音音声信号を形成して、ミキサ２０へ放音音声信号を出力する。また、ビーム形成部１８は、ローパスフィルタ１７によって測定音８３を除去した収音音声信号から、スピーカアレイ３を構成する各スピーカＳＰ１〜ＳＰｎのそれぞれに対応した放音音声信号を形成して、ミキサ２０へ出力する。この際、ビーム形成部１３，１８は、後述するマイク位置検出部１５からビーム形成係数が入力されると、このビーム形成係数に基づいて、指向性ビーム６ａの放音方向を決定し、対応する各スピーカＳＰ１〜ＳＰｎの放音音声信号を形成して、ミキサ２０へ出力する。 The beam forming units 13 and 18 emit directivity beams 6a and 70a to 70d with directivity from the speaker array 3, and do not have directivity from the speakers SP1 and SPn at both ends of the speaker array 3. The sound emission sound signals corresponding to the speakers SP1 to SPn are formed so that the accompaniment sound 81 is emitted. Specifically, the beam forming unit 13 is configured by each speaker SP1 constituting the speaker array 3 from the audio signal of the accompaniment sound 81 and the audio signal of the guide vocal 84 filtered by the low-pass filter 12 according to an instruction from the control unit 10. A sound emission sound signal corresponding to each of .about.SPn is formed, and the sound emission sound signal is output to the mixer 20. Further, the beam forming unit 18 forms sound emission sound signals corresponding to the speakers SP1 to SPn constituting the speaker array 3 from the collected sound signal from which the measurement sound 83 is removed by the low-pass filter 17, and the mixer 18 20 output. At this time, when a beam forming coefficient is input from a microphone position detecting unit 15 to be described later, the beam forming units 13 and 18 determine the sound emitting direction of the directional beam 6a based on the beam forming coefficient and respond accordingly. Sound output sound signals of the speakers SP1 to SPn are formed and output to the mixer 20.

ミキサ２０は、ビーム形成部１３，１８から入力された放音音声信号（伴奏音８１、ガイドボーカル８４、収音音声）に対して、ミキシングを行う。具体的には、ミキサ２０は、両端のスピーカＳＰ１，ＳＰｎに入力される放音音声信号に対して、バンドパスフィルタ１４から入力された測定音８３を加算する。この際、ミキサ２０は、歌唱者６、歌唱者６のグループが着席するテーブル７ａに対する指向性ビーム６ａ，７０ａは、歌唱音声の放音音声信号と伴奏音８１の放音音声信号等を加算して生成する。また、他のテーブル７ｂ〜７ｄに対する指向性ビーム７０ｂ〜７０ｄは、ガイドボーカル８４の放音音声信号と伴奏音８１の放音音声信号等を加算して生成する。ミキサ２０は、放音音声信号をＤ／Ａコンバータ２１（２１−１〜２１−ｎ）及びＡＭＰ２２（２２−１〜２２−ｎ）を介して、スピーカＳＰ１〜ＳＰｎに入力する。ここで、Ｄ／Ａコンバータ２１、ＡＭＰ２２は、放音音声信号に対してＤ／Ａ変換や増幅等を行い、スピーカＳＰ１〜ＳＰｎは、指向性ビーム６ａ，７０ａ〜７０ｄを放音する。 The mixer 20 performs mixing on the sound emission sound signals (accompaniment sound 81, guide vocal 84, and sound collection sound) input from the beam forming units 13 and 18. Specifically, the mixer 20 adds the measurement sound 83 input from the band pass filter 14 to the sound emission sound signal input to the speakers SP1 and SPn at both ends. At this time, the mixer 20 adds the sound output sound signal of the singing sound and the sound output sound signal of the accompaniment sound 81 to the directional beams 6a and 70a with respect to the table 7a on which the singer 6 and the group of the singer 6 are seated. To generate. The directional beams 70b to 70d for the other tables 7b to 7d are generated by adding the sound output sound signal of the guide vocal 84, the sound output sound signal of the accompaniment sound 81, and the like. The mixer 20 inputs the emitted sound signal to the speakers SP1 to SPn via the D / A converter 21 (21-1 to 21-n) and the AMP22 (22-1 to 22-n). Here, the D / A converter 21 and the AMP 22 perform D / A conversion, amplification, and the like on the emitted sound signal, and the speakers SP1 to SPn emit the directional beams 6a and 70a to 70d.

マイク位置検出部１５は、レベル検出部１５１（１５１ａ〜１５１ｄ），１５３（１５３ａ〜１５３ｄ）、タイマ部１５２（１５２ａ〜１５２ｄ）、マイク位置算出部１５４及びビーム形成係数算出部１５５から構成される。マイク位置検出部１５は、歌唱者６に放音する指向性ビーム６ａの放音方向を決定するビーム形成係数を算出する。 The microphone position detection unit 15 includes level detection units 151 (151a to 151d) and 153 (153a to 153d), a timer unit 152 (152a to 152d), a microphone position calculation unit 154, and a beam forming coefficient calculation unit 155. The microphone position detection unit 15 calculates a beam forming coefficient that determines the sound emission direction of the directional beam 6 a emitted to the singer 6.

具体的には、レベル検出部１５１は、バンドパスフィルタ１４を介して入力されたオーディオ信号に含まれる測定音８３のオーディオ信号を検出すると、タイマ部１５２にタイマの起動を指示する。レベル検出部１５３は、バンドパスフィルタ１９を介して入力された収音音声信号に含まれる測定音８３のオーディオ信号を検出すると、タイマ部１５２にタイマの終了を指示する。タイマ部１５２は、タイマの起動指示を受けてから終了指示を受けるまでの時間を計時して、マイク位置算出部１５４にこの時間情報を出力する。 Specifically, when the level detection unit 151 detects the audio signal of the measurement sound 83 included in the audio signal input via the bandpass filter 14, the level detection unit 151 instructs the timer unit 152 to start the timer. When the level detection unit 153 detects the audio signal of the measurement sound 83 included in the collected sound signal input via the band pass filter 19, the level detection unit 153 instructs the timer unit 152 to end the timer. The timer unit 152 counts the time from receiving the timer start instruction to receiving the end instruction, and outputs this time information to the microphone position calculation unit 154.

この際、バンドパスフィルタ１４ａ〜１４ｄの各々とバンドパスフィルタ１９ａ〜１９ｄの各々とは、同じ周波数成分を取り出すので、スピーカＳＰ１，ＳＰｎから放音された測定音８３（レベル検出部１５１にて検出）と、マイク２により収音された測定音８３（レベル検出部１５３にて検出）とを対応付けて検出することができる。このため、タイマ部１５２は、タイマの起動から終了までの時間を求めることで、スピーカＳＰ１，ＳＰｎから測定音８３を放音してから、マイク２で測定音８３を収音するまでの時間（以下、経過時間と称す。）を求めることができる。 At this time, since each of the bandpass filters 14a to 14d and each of the bandpass filters 19a to 19d extract the same frequency component, the measurement sound 83 (detected by the level detection unit 151) emitted from the speakers SP1 and SPn. ) And the measurement sound 83 (detected by the level detection unit 153) collected by the microphone 2 can be detected in association with each other. Therefore, the timer unit 152 obtains the time from the start to the end of the timer, so that the time from when the measurement sound 83 is emitted from the speakers SP1 and SPn until the measurement sound 83 is collected by the microphone 2 ( Hereinafter, it is referred to as elapsed time).

また、スピーカＳＰ１，ＳＰｎは、異なる周波数成分を取り出すバンドパスフィルタ１４ａ〜１４ｄを通過させて、測定音８３を放音する。これにより、レベル検出部１５１，１５３に入力される測定音８３は、スピーカＳＰ１，ＳＰｎのどちらに対応しているか分かる。このため、タイマ部１５２は、スピーカＳＰ１，ＳＰｎ毎に、経過時間を求めることができる。 Further, the speakers SP1 and SPn emit the measurement sound 83 through the band pass filters 14a to 14d that extract different frequency components. As a result, the measurement sound 83 input to the level detectors 151 and 153 can be identified as to which of the speakers SP1 and SPn corresponds. For this reason, the timer part 152 can obtain | require elapsed time for every speaker SP1, SPn.

マイク位置算出部１５４は、スピーカＳＰ１，ＳＰｎ毎の経過時間に基づいて、マイク２の位置を算出する。マイク位置算出部１５４で算出したマイク２の位置に基づいて、ビーム形成係数算出部１５５は、マイク２の位置の方向に指向性を持たせたビーム形成係数を算出する。ビーム形成係数は、ビーム形成部１３，１８に出力される。 The microphone position calculation unit 154 calculates the position of the microphone 2 based on the elapsed time for each of the speakers SP1 and SPn. Based on the position of the microphone 2 calculated by the microphone position calculation unit 154, the beam forming coefficient calculation unit 155 calculates a beam forming coefficient having directivity in the direction of the position of the microphone 2. The beam forming coefficient is output to the beam forming units 13 and 18.

次に、歌唱者６に向けた指向性ビーム６ａの生成時の処理の流れについて、図６を参照して説明する。図６は、カラオケ曲に測定音が含まれる場合における指向性ビームの生成手順を示すフローチャートである。なお、説明の簡単化のため、各テーブル７ａ〜７ｄに対する指向性ビーム７０ａ〜７０ｄの生成方法を除いて、歌唱者６に対する指向性ビーム６ａの生成方法についてのみ記載する。 Next, the flow of processing when generating the directional beam 6a directed toward the singer 6 will be described with reference to FIG. FIG. 6 is a flowchart showing a procedure for generating a directional beam when a measurement sound is included in a karaoke song. For simplification of description, only the method of generating the directional beam 6a for the singer 6 will be described except for the method of generating the directional beams 70a to 70d for the tables 7a to 7d.

まず、カラオケ演奏時の処理の流れについて説明する。図６に示すように、ステップＳ１０１にて、ＭＩＤＩ音源９１は、制御部１０の指示により、伴奏音８１のデータとガイドメロディ８２のデータと測定音８３のデータを、記憶部８から読み出して逐次Ａ／Ｄコンバータ１１へ出力する。この際、各データは、Ａ／Ｄ変換され、それぞれに対応したオーディオ信号が生成されて、ステップＳ１０２へ進む。 First, the flow of processing during karaoke performance will be described. As shown in FIG. 6, in step S 101, the MIDI sound source 91 reads the accompaniment sound 81 data, the guide melody 82 data, and the measurement sound 83 data from the storage unit 8 in accordance with an instruction from the control unit 10. Output to the A / D converter 11. At this time, each data is A / D converted, an audio signal corresponding to each data is generated, and the process proceeds to step S102.

ステップＳ１０２にて、伴奏音８１のオーディオ信号とガイドメロディ８２のオーディオ信号と測定音８３のオーディオ信号は、ローパスフィルタ１２へ出力される。この際、これらのオーディオ信号から、測定音８３のオーディオ信号が取り除かれる。ローパスフィルタ１２を通過した伴奏音８１のオーディオ信号とガイドメロディ８２のオーディオ信号とは、ビーム形成部１３に出力され（Ｓ１０３）、ステップＳ１０４へ進む。 In step S102, the audio signal of the accompaniment sound 81, the audio signal of the guide melody 82, and the audio signal of the measurement sound 83 are output to the low-pass filter 12. At this time, the audio signal of the measurement sound 83 is removed from these audio signals. The audio signal of the accompaniment sound 81 and the audio signal of the guide melody 82 that have passed through the low-pass filter 12 are output to the beam forming unit 13 (S103), and the process proceeds to step S104.

ステップＳ１０４にて、ビーム形成部１３に、ビーム形成係数が入力されているかどうか調べる。ビーム形成係数が入力されている場合（歌唱途中）（Ｓ１０４：Ｙｅｓ）は、ステップＳ１０６へ進む。 In step S104, it is checked whether or not a beam forming coefficient is input to the beam forming unit 13. When the beam forming coefficient is input (in the middle of singing) (S104: Yes), the process proceeds to step S106.

ビーム形成係数が入力されていない場合（歌唱開始時）（Ｓ１０４：Ｎｏ）、制御部１０の指示により、ビーム形成部１３は、モニタ４に向けて指向性ビーム６ａを放音するように、伴奏音８１のオーディオ信号から放音音声信号を生成する（Ｓ１０５）。このように、歌唱者６の歌唱開始時は、歌唱位置の検出を開始していないので、ビーム形成係数が入力されていない。そこで、ビーム形成部１３は、モニタ４に向けて指向性ビーム６ａを放音するよう放音音声信号を生成する。また、ビーム形成部１３は、ビーム形成係数が入力されている場合のみ、放音音声信号を生成してもよい。 When the beam forming coefficient is not input (at the time of singing) (S104: No), the beam forming unit 13 emits the directional beam 6a toward the monitor 4 according to an instruction from the control unit 10 to accompaniment. A sound emission sound signal is generated from the audio signal of the sound 81 (S105). Thus, since the detection of the singing position is not started when the singer 6 starts singing, the beam forming coefficient is not input. Therefore, the beam forming unit 13 generates a sound emission sound signal so as to emit the directional beam 6 a toward the monitor 4. Further, the beam forming unit 13 may generate a sound emission sound signal only when a beam forming coefficient is input.

ステップＳ１０６にて、制御部１０の指示により、ビーム形成部１３は、ビーム形成係数に基づいて指向性制御を行い、伴奏音８１のオーディオ信号から放音音声信号を生成して、ステップＳ１０７へ進む。 In step S106, in response to an instruction from the control unit 10, the beam forming unit 13 performs directivity control based on the beam forming coefficient, generates a sound emission sound signal from the audio signal of the accompaniment sound 81, and proceeds to step S107. .

ステップＳ１０７にて、ビーム形成部１３は、生成した放音音声信号をミキサ２０へ出力して、ステップＳ１０８へ進む。 In step S107, the beam forming unit 13 outputs the generated sound emission sound signal to the mixer 20 and proceeds to step S108.

ステップＳ１０８にて、伴奏音８１のオーディオ信号とガイドメロディ８２のオーディオ信号と測定音８３のオーディオ信号は、バンドパスフィルタ１４へ出力される。バンドパスフィルタ１４は、これらのオーディオ信号から、測定音８３のオーディオ信号のみを通過させて、レベル検出部１５１へ出力する。そして、レベル検出部１５１にて、測定音８３のオーディオ信号が検出される（Ｓ１０９：Ｙｅｓ）と、タイマ部１５２は、タイマを起動して（Ｓ１１０）、ステップＳ１１１へ進む。 In step S 108, the audio signal of the accompaniment sound 81, the audio signal of the guide melody 82, and the audio signal of the measurement sound 83 are output to the bandpass filter 14. The band pass filter 14 passes only the audio signal of the measurement sound 83 from these audio signals, and outputs it to the level detection unit 151. When the level detection unit 151 detects the audio signal of the measurement sound 83 (S109: Yes), the timer unit 152 activates the timer (S110), and proceeds to step S111.

ステップＳ１１１にて、バンドパスフィルタ１４から出力された測定音８３のオーディオ信号は、ミキサ２０において放音音声信号と加算される。この際、測定音８３は、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから放音されるように、放音音声信号に加算される。 In step S 111, the audio signal of the measurement sound 83 output from the bandpass filter 14 is added to the sound output sound signal in the mixer 20. At this time, the measurement sound 83 is added to the emitted sound signal so as to be emitted from the speakers SP1 and SPn at both ends of the speaker array 3.

ステップＳ１１２にて、これらの放音音声信号は、対応するＤ／Ａコンバータ２１、ＡＭＰ２２を介して、スピーカＳＰ１〜ＳＰｎから放音され、ステップＳ１１３へ進む。この放音音声信号は、指向性ビーム６ａとなり、歌唱者６に向けて放音される。 In step S112, these sound emission signals are emitted from the speakers SP1 to SPn via the corresponding D / A converter 21 and AMP 22, and the process proceeds to step S113. This sound emission sound signal becomes a directional beam 6 a and is emitted toward the singer 6.

ステップＳ１１３にて、マイク２は、歌唱者６の歌唱音声とスピーカアレイ３から放音された放音音声と（以下、収音音声と称す。）を収音する。これら収音音声は、Ａ／Ｄコンバータ１１へ入力されて、ステップＳ１１４へ進む。この際、収音音声は、Ａ／Ｄ変換され、収音音声信号として生成される。 In step S113, the microphone 2 collects the singing voice of the singer 6 and the emitted voice emitted from the speaker array 3 (hereinafter referred to as the collected voice). These collected sounds are input to the A / D converter 11, and the process proceeds to step S114. At this time, the collected sound is A / D converted and generated as a collected sound signal.

ステップＳ１１４にて、収音音声信号は、ローパスフィルタ１７へ出力される。この際、収音音声信号から、測定音８３のオーディオ信号（スピーカアレイ３から放音された放音音声に含まれる）が取り除かれる。ローパスフィルタ１７を通過した収音音声信号は、ビーム形成部１８に出力され（Ｓ１１５）、ステップＳ１１６へ進む。 In step S 114, the collected sound signal is output to the low pass filter 17. At this time, the audio signal of the measurement sound 83 (included in the sound emitted from the speaker array 3) is removed from the collected sound signal. The collected sound signal that has passed through the low-pass filter 17 is output to the beam forming unit 18 (S115), and the process proceeds to step S116.

ステップＳ１１６にて、ビーム形成部１８に、ビーム形成係数が入力されているかどうか調べる。ビーム形成係数が入力されている場合（歌唱途中）（Ｓ１１６：Ｙｅｓ）は、ステップＳ１１８へ進む。 In step S116, it is checked whether or not a beam forming coefficient is input to the beam forming unit 18. When the beam forming coefficient is input (in the middle of singing) (S116: Yes), the process proceeds to step S118.

ビーム形成係数が入力されていない場合（歌唱開始時）（Ｓ１１６：Ｎｏ）、制御部１０の指示により、ビーム形成部１８は、モニタ４に向けて指向性ビーム６ａを放音するように、収音音声信号から放音音声信号を生成する（Ｓ１１７）。このように、歌唱者６の歌唱開始時は、歌唱位置の検出を開始していないので、ビーム形成係数が入力されていない。そこで、ビーム形成部１８は、モニタ４に向けて指向性ビーム６ａを放音するよう放音音声信号を生成する。また、ビーム形成部１８は、ビーム形成係数が入力されている場合のみ、放音音声信号を生成してもよい。 When the beam forming coefficient is not input (at the time of singing) (S116: No), the beam forming unit 18 collects the directional beam 6a toward the monitor 4 according to the instruction of the control unit 10 so as to emit sound. A sound emitting sound signal is generated from the sound sound signal (S117). Thus, since the detection of the singing position is not started when the singer 6 starts singing, the beam forming coefficient is not input. Therefore, the beam forming unit 18 generates a sound emission sound signal so as to emit the directional beam 6 a toward the monitor 4. Further, the beam forming unit 18 may generate a sound emission sound signal only when a beam forming coefficient is input.

ステップＳ１１８にて、制御部１０の指示により、ビーム形成部１８は、ビーム形成係数に基づいて指向性制御を行い、収音音声信号から放音音声信号を生成して、ステップＳ１１９へ進む。 In step S118, in response to an instruction from the control unit 10, the beam forming unit 18 performs directivity control based on the beam forming coefficient, generates a sound output sound signal from the collected sound signal, and proceeds to step S119.

ステップＳ１１９にて、ビーム形成部１８は、放音音声信号をミキサ２０へ出力して、ステップＳ１２０へ進む。 In step S119, the beam forming unit 18 outputs the sound output sound signal to the mixer 20, and proceeds to step S120.

ステップＳ１２０にて、収音音声信号は、バンドパスフィルタ１９へ出力される。バンドパスフィルタ１９は、収音音声信号から、測定音８３のオーディオ信号を取得して、レベル検出部１５３へ出力する。そして、レベル検出部１５３にて、測定音８３のオーディオ信号が検出される（Ｓ１２１：Ｙｅｓ）と、タイマ部１５２は、タイマを停止して（Ｓ１２２）、ステップＳ１２３へ進む。 In step S 120, the collected sound signal is output to the band pass filter 19. The band pass filter 19 acquires the audio signal of the measurement sound 83 from the collected sound signal and outputs it to the level detection unit 153. When the level detection unit 153 detects the audio signal of the measurement sound 83 (S121: Yes), the timer unit 152 stops the timer (S122) and proceeds to step S123.

ステップＳ１２３にて、マイク位置算出部１５４は、タイマの起動から停止までの計測時間に基づいて、マイク２の位置を算出して、ステップＳ１２４へ進む。 In step S123, the microphone position calculation unit 154 calculates the position of the microphone 2 based on the measurement time from the start to the stop of the timer, and proceeds to step S124.

ステップＳ１２４にて、ビーム形成係数算出部１５５は、マイク２の位置に指向性ビーム６ａがスピーカアレイ３から放音されるように、ビーム形成係数を算出する。カラオケ装置１は、算出したビーム形成係数をビーム形成部１３，１８に入力して、ステップＳ１０１へ戻る。 In step S124, the beam forming coefficient calculation unit 155 calculates the beam forming coefficient so that the directional beam 6a is emitted from the speaker array 3 at the position of the microphone 2. The karaoke apparatus 1 inputs the calculated beam forming coefficient to the beam forming units 13 and 18, and returns to step S101.

カラオケ装置１は、以上に示すステップＳ１０１〜Ｓ１２４の処理を繰り返し行い、カラオケ曲が終了するまで、スピーカアレイ３から伴奏音８１と測定音８３とマイク２で収音した収音音声とを放音する。 The karaoke apparatus 1 repeats the processes of steps S101 to S124 described above, and emits the accompaniment sound 81, the measurement sound 83, and the sound collected by the microphone 2 from the speaker array 3 until the karaoke song is finished. To do.

以上より、第１実施形態に係るカラオケ装置１は、スピーカアレイ３から指向性を持たせて歌唱音声、ガイドボーカル、伴奏音８１を放音するとともに、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから伴奏音８１と測定音８３を放音することができる。カラオケ装置１は、この測定音８３をマイク２で収音するまでの経過時間を求めることにより、マイク位置、つまり歌唱者６の位置を検出することができ、歌唱者６に指向性ビーム６ａを放音することができる。また、測定音８３は、楽器音をマスカーとして、マスカーの基本周波数の倍音で構成され、それらの発音のタイミングで発音される。これにより、歌唱者６や店内５の顧客は、測定音８３を知覚せずに、カラオケを楽しむことができる。更に、測定音８３は、人が知覚し難い周波数帯域を用いて生成されているので、歌唱者６や店内５の顧客は、測定音８３をより知覚することがない。 As described above, the karaoke apparatus 1 according to the first embodiment emits the singing voice, the guide vocal, and the accompaniment sound 81 with directivity from the speaker array 3, and from the speakers SP1 and SPn at both ends of the speaker array 3. Accompaniment sound 81 and measurement sound 83 can be emitted. The karaoke apparatus 1 can detect the microphone position, that is, the position of the singer 6 by obtaining the elapsed time until the measurement sound 83 is picked up by the microphone 2, and the directional beam 6 a is provided to the singer 6. Sound can be emitted. The measurement sound 83 is composed of overtones of the fundamental frequency of the masker, using the instrument sound as a masker, and is generated at the timing of their pronunciation. Thereby, the singer 6 and the customers in the store 5 can enjoy karaoke without perceiving the measurement sound 83. Furthermore, since the measurement sound 83 is generated using a frequency band that is difficult for humans to perceive, the singer 6 or the customer in the store 5 does not perceive the measurement sound 83 more.

［第２実施形態］
次に、本発明の第２実施形態について、図７，８を参照して説明する。本発明の第２実施形態のカラオケ装置１は、測定音８３のデータがカラオケ曲に含まれない点が第１実施形態と異なる。そこで、カラオケ装置１は、伴奏音８１のデータを解析し、マスカーとなる楽器音を決定する。カラオケ装置１は、マスカーとなる楽器音の発音のタイミングで測定音８３を生成して発音する。この際、測定音８３は、伴奏音８１から選択された楽器音（例えば、ハープシーコード）をマスカーとして、このマスカーの基本周波数の倍音で生成される。図７は、カラオケ装置の機能ブロック図である。図８は、伴奏音に基づいて測定音を生成する場合における指向性ビームの生成手順を示すフローチャートである。 [Second Embodiment]
Next, a second embodiment of the present invention will be described with reference to FIGS. The karaoke apparatus 1 according to the second embodiment of the present invention is different from the first embodiment in that the data of the measurement sound 83 is not included in the karaoke song. Therefore, the karaoke apparatus 1 analyzes the data of the accompaniment sound 81 and determines an instrument sound to be a masker. The karaoke apparatus 1 generates a measurement sound 83 at the timing of sounding a musical instrument sound that becomes a masker and generates a sound. At this time, the measurement sound 83 is generated as an overtone of the fundamental frequency of the masker using the instrument sound (for example, harpsichord) selected from the accompaniment sound 81 as a masker. FIG. 7 is a functional block diagram of the karaoke apparatus. FIG. 8 is a flowchart showing a procedure for generating a directional beam when a measurement sound is generated based on an accompaniment sound.

図７に示すように、第２実施形態のカラオケ装置１は、カラオケ曲に測定音８３が含まれない。また、カラオケ装置１に、ＭＩＤＩ信号解析部２３、測定音ＭＩＤＩ信号生成部２４及びＭＩＤＩ信号併合部２５が更に備えられる。これらの機能部について、以下に説明する。 As shown in FIG. 7, in the karaoke apparatus 1 of the second embodiment, the measurement sound 83 is not included in the karaoke song. The karaoke apparatus 1 further includes a MIDI signal analyzing unit 23, a measurement sound MIDI signal generating unit 24, and a MIDI signal merging unit 25. These functional units will be described below.

ＭＩＤＩ信号解析部２３は、伴奏音８１のＭＩＤＩデータを解析して、マスカーとなる楽器音をハープシーコードに決定する。ＭＩＤＩ信号解析部２３は、マスカーの基本周波数の倍音で、かつ、マスカーと同じタイミングで測定音８３を生成するように測定音ＭＩＤＩ信号生成部２４に指示する。具体的には、一定の時間（例えば１小節）内における伴奏音８１のＭＩＤＩデータから検出されるハープシーコードの音符のうち、ベロシティの値とボリュームの値とエクスプレッションの値を読み取ることにより、音圧レベルが最も大きい音符を検出し、マスカーとして選択する。測定音８３のＭＩＤＩデータは、マスカーの周波数（マスカーのノートナンバーの値とピッチベンドの値などから算出される）の整数倍となるように、ノートナンバーの値とピッチベンドの値が決定され、マスカーのノートオンの値と同じ値でノートオンの値が決定され、マスカーのベロシティの値とボリュームの値とエクスプレッションの値に基づいて適切にベロシティの値とボリュームの値とエクスプレッションの値が決定される。測定音ＭＩＤＩ信号生成部２４は、これらの値をＭＩＤＩ信号解析部２３から受け取り、その値に基づいて測定音８３のＭＩＤＩデータを生成する。 The MIDI signal analyzing unit 23 analyzes the MIDI data of the accompaniment sound 81 and determines a musical instrument sound to be a masker as a harpsichord. The MIDI signal analysis unit 23 instructs the measurement sound MIDI signal generation unit 24 to generate the measurement sound 83 that is a harmonic of the fundamental frequency of the masker and at the same timing as the masker. Specifically, by reading the value of the velocity, the value of the volume, and the value of the expression among the notes of the harpsichord code detected from the MIDI data of the accompaniment sound 81 within a certain time (for example, one measure), The note with the highest pressure level is detected and selected as a masker. The MIDI data of the measurement sound 83 has a note number value and a pitch bend value determined so as to be an integral multiple of the masker frequency (calculated from the masker note number value and the pitch bend value). The note-on value is determined by the same value as the note-on value, and the velocity value, the volume value, and the expression value are appropriately determined based on the masker velocity value, the volume value, and the expression value. The measurement sound MIDI signal generation unit 24 receives these values from the MIDI signal analysis unit 23 and generates MIDI data of the measurement sound 83 based on the values.

また、ＭＩＤＩ信号解析部２３は、１又は複数の基本周波数に基づいて測定音８３を生成する。複数の基本周波数に基づいて測定音８３を生成した場合、特定の基本周波数の放音が中断しても、他の基本周波数の放音に基づいて測定音８３を生成することで、定期的に測定音８３を放音することができる。また、マスカーとなる楽器音は、１つに限らないので、マスカーに適した楽器音であれば、複数の楽器音（ハープシーコード，グロッケン）を用いてもよい。これにより、一方の楽器音の放音が中断しても、他の楽器音の放音に基づいて測定音８３を生成することで、定期的に測定音８３を放音することができる。また、マスカーとして、音階を有しない楽器を用いた場合は、マスカーの音の周波数成分のある帯域に測定音８３を生成する。 Further, the MIDI signal analyzing unit 23 generates the measurement sound 83 based on one or a plurality of fundamental frequencies. When the measurement sound 83 is generated based on a plurality of fundamental frequencies, even if the sound emission of a specific fundamental frequency is interrupted, the measurement sound 83 is periodically generated based on the sound emission of other fundamental frequencies. The measurement sound 83 can be emitted. In addition, since the musical instrument sound that becomes a masker is not limited to one, a plurality of musical instrument sounds (harpsy chord, glocken) may be used as long as the musical instrument sound is suitable for a masker. Thereby, even if the sound emission of one instrument sound is interrupted, the measurement sound 83 can be periodically emitted by generating the measurement sound 83 based on the sound emission of the other instrument sound. When a musical instrument having no musical scale is used as the masker, the measurement sound 83 is generated in a band having a frequency component of the masker sound.

この測定音８３は、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから同時に放音されても、別々に放音されてもよい。スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから測定音８３が同時に放音される場合には、スピーカＳＰ１，ＳＰｎ毎に異なる周波数にて測定音８３を生成する。 The measurement sound 83 may be emitted simultaneously from the speakers SP1 and SPn at both ends of the speaker array 3, or may be emitted separately. When the measurement sound 83 is simultaneously emitted from the speakers SP1 and SPn at both ends of the speaker array 3, the measurement sound 83 is generated at a different frequency for each of the speakers SP1 and SPn.

測定音ＭＩＤＩ信号生成部２４は、ＭＩＤＩ信号解析部２３の指示を受け、測定音８３のＭＩＤＩ信号を生成して、ＭＩＤＩ信号併合部２５に出力する。具体的には、ＭＩＤＩ信号解析部２３の指示により、ハープシーコードと同じタイミングで発音し、かつ、ハープシーコードの基本周波数の倍音になるような測定音８３のＭＩＤＩ信号を生成し、ＭＩＤＩ信号併合部２５に出力する。 The measurement sound MIDI signal generation unit 24 receives an instruction from the MIDI signal analysis unit 23, generates a MIDI signal of the measurement sound 83, and outputs the MIDI signal to the MIDI signal merging unit 25. Specifically, in response to an instruction from the MIDI signal analysis unit 23, a MIDI signal of the measurement sound 83 that is generated at the same timing as the harpsichord and becomes a harmonic of the fundamental frequency of the harpsichord is generated. Output to the merging unit 25.

なお、マスカーとして伴奏音８１とガイドメロディ８２の両方を用いることができる。しかしながら、伴奏音８１の方がガイドメロディ８２より、音圧レベルが高いので、マスカーには、ガイドメロディ８２より音圧レベルの高い伴奏音８１を用いた方がより適している。 In addition, both the accompaniment sound 81 and the guide melody 82 can be used as a masker. However, since the accompaniment sound 81 has a higher sound pressure level than the guide melody 82, it is more suitable to use the accompaniment sound 81 having a higher sound pressure level than the guide melody 82 for the masker.

ＭＩＤＩ信号併合部２５は、伴奏音８１のＭＩＤＩデータに測定音８３のＭＩＤＩデータを付加して、ＭＩＤＩ音源９１に出力する。ＭＩＤＩ音源９１は、ＭＩＤＩ信号併合部２５で併合された伴奏音８１のＭＩＤＩデータと測定音８３のＭＩＤＩデータをそれぞれオーディオ信号に変換して出力する。以上のように、伴奏音８１を解析して、測定音８３を生成する。 The MIDI signal merging unit 25 adds the MIDI data of the measurement sound 83 to the MIDI data of the accompaniment sound 81 and outputs it to the MIDI sound source 91. The MIDI sound source 91 converts the MIDI data of the accompaniment sound 81 and the MIDI data of the measurement sound 83 merged by the MIDI signal merging unit 25 into audio signals and outputs the audio signals. As described above, the accompaniment sound 81 is analyzed, and the measurement sound 83 is generated.

また、図８に示すように、歌唱者６に向けた指向性ビームの生成時の処理の流れは、第２実施形態では、第１実施形態の処理にステップＳ２０１〜Ｓ２０６の処理が追加される。以下に、追加されたステップＳ２０１〜Ｓ２０６の処理についてのみ説明する。 Moreover, as shown in FIG. 8, the flow of the process at the time of the production | generation of the directional beam toward the singer 6 adds the process of step S201-S206 to the process of 1st Embodiment in 2nd Embodiment. . Only the added steps S201 to S206 will be described below.

ステップＳ２０１にて、制御部１０は、伴奏音８１のＭＩＤＩデータとガイドメロディ８２のＭＩＤＩデータとを記憶部８からＭＩＤＩ信号解析部２３へ入力する。ＭＩＤＩ信号解析部２３は、伴奏音８１のＭＩＤＩデータの解析を行い、伴奏音８１からマスカーになる楽器音（ハープシーコード）を決定（Ｓ２０２）して、ステップＳ２０３へ進む。 In step S 201, the control unit 10 inputs the MIDI data of the accompaniment sound 81 and the MIDI data of the guide melody 82 from the storage unit 8 to the MIDI signal analysis unit 23. The MIDI signal analyzing unit 23 analyzes the MIDI data of the accompaniment sound 81, determines an instrument sound (harpsichord) that becomes a masker from the accompaniment sound 81 (S202), and proceeds to step S203.

ステップＳ２０３にて、ＭＩＤＩ信号解析部２３は、一定の時間（例えば１小節）内に、ハープシーコードの基本周波数の音圧レベルが急激に上昇したかどうか調べる。ハープシーコードの基本周波数の音圧レベルの急激な上昇を検出すると、（Ｓ２０３：Ｙｅｓ）、ステップＳ２０４へ進む。 In step S203, the MIDI signal analyzing unit 23 checks whether or not the sound pressure level of the fundamental frequency of the harpsichord has suddenly increased within a certain time (for example, one measure). If a rapid increase in the sound pressure level of the fundamental frequency of the harpsichord is detected (S203: Yes), the process proceeds to step S204.

ステップＳ２０４にて、測定音ＭＩＤＩ信号生成部２４は、音圧レベルが急激に上昇した基本周波数の倍音で、測定音８３のＭＩＤＩデータを生成して、ステップＳ２０５へ進む。この際、音圧レベルが急激に上昇した基本周波数の音圧レベルに基づいて、測定音８３の音圧レベルを決定する。 In step S204, the measurement sound MIDI signal generation unit 24 generates MIDI data of the measurement sound 83 with harmonics of the fundamental frequency whose sound pressure level has rapidly increased, and proceeds to step S205. At this time, the sound pressure level of the measurement sound 83 is determined based on the sound pressure level of the fundamental frequency at which the sound pressure level has rapidly increased.

ステップＳ２０５にて、ＭＩＤＩ信号併合部２５は、伴奏音８１のＭＩＤＩデータに測定音８３のＭＩＤＩデータを付加して、ステップＳ２０６へ進む。この際、ＭＩＤＩ信号併合部２５は、音圧レベルが急激に上昇した基本周波数により測定音８３が同時マスキングされるように付加する。 In step S205, the MIDI signal merging unit 25 adds the MIDI data of the measurement sound 83 to the MIDI data of the accompaniment sound 81, and proceeds to step S206. At this time, the MIDI signal merging unit 25 adds so that the measurement sound 83 is simultaneously masked by the fundamental frequency at which the sound pressure level has rapidly increased.

ステップＳ２０６にて、ＭＩＤＩ信号解析部２３は、伴奏音８１の解析が終了するまで（Ｓ２０６：Ｎｏ）、ステップＳ２０１〜Ｓ２０５の処理を繰り返し行う。この際、伴奏音８１の解析の終了は、伴奏音８１がＭＩＤＩ信号解析部２３に入力されなくなったことにより分かる。伴奏音８１の解析が完了したら（Ｓ２０６：Ｙｅｓ）、ステップＳ２０７へ進む。なお、ステップＳ２０７以降の処理は、第１実施形態のステップＳ１０１以降と同じ処理を行う。 In step S206, the MIDI signal analyzing unit 23 repeats the processes of steps S201 to S205 until the analysis of the accompaniment sound 81 is completed (S206: No). At this time, the end of the analysis of the accompaniment sound 81 is recognized by the fact that the accompaniment sound 81 is no longer input to the MIDI signal analysis unit 23. When the analysis of the accompaniment sound 81 is completed (S206: Yes), the process proceeds to step S207. In addition, the process after step S207 performs the same process as step S101 after 1st Embodiment.

なお、第２実施形態では、伴奏音８１を解析して、伴奏音８１に測定音８３の付加が完了した後に、伴奏音８１等の放音を開始している。しかしながら、これに限らず、図９に示すように、伴奏音８１を解析して、伴奏音８１に測定音８３を付加しながら、伴奏音８１等の放音を開始してもよい。なお、図９は、図８のＳ２０６の処理を行わずに、すぐにＳ２０７の処理を行うようにしたフローである。 In the second embodiment, the accompaniment sound 81 is analyzed, and after the addition of the measurement sound 83 to the accompaniment sound 81 is completed, sound emission of the accompaniment sound 81 and the like is started. However, the present invention is not limited to this, and as shown in FIG. 9, sounding of the accompaniment sound 81 or the like may be started while analyzing the accompaniment sound 81 and adding the measurement sound 83 to the accompaniment sound 81. FIG. 9 is a flow in which the process of S207 is performed immediately without performing the process of S206 of FIG.

また、第２実施形態では、マスカーをハープシーコードとしたが、これに限らず、マスカーに適した楽器音であれば他の楽器音でもよい。また、マスカーとして複数の楽器音を用いる場合について、ハープシーコードとグロッケンとを用いたが、これに限らず、マスカーに適した楽器音を複数用いればよい。 In the second embodiment, the masker is a harpsichord. However, the present invention is not limited to this, and any other instrument sound may be used as long as the instrument sound is suitable for the masker. Further, in the case where a plurality of instrument sounds are used as a masker, the harpsichord and the glocken are used. However, the present invention is not limited to this, and a plurality of instrument sounds suitable for the masker may be used.

以上より、第２実施形態に係るカラオケ装置１では、カラオケ曲に測定音８３が含まれていなくても、伴奏音８１を解析することで測定音８３を生成して発音することができる。これにより、第１実施形態と同様に、マイク位置の検出ができ、歌唱者６に指向性ビーム６ａを放音することができる。 As described above, in the karaoke apparatus 1 according to the second embodiment, the measurement sound 83 can be generated and generated by analyzing the accompaniment sound 81 even if the measurement sound 83 is not included in the karaoke song. Thereby, similarly to 1st Embodiment, a microphone position can be detected and the directional beam 6a can be emitted to the singer 6.

第１，第２実施形態においては、スピーカアレイ３から、歌唱者６と歌唱者６のグループが着席するテーブル７ａに向けて伴奏音８１と歌唱音声とからなる指向性ビーム６ａ，７０ａを放音し、他のテーブル７ｂ〜７ｄに向けて伴奏音８１とガイドボーカル８４とからなる指向性ビーム７０ｂ〜７０ｄを放音する。更に、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから測定音８３と伴奏音８１とを指向性を持たせずに放音するとして本発明の説明を行った。しかしながら、これに限らず、スピーカアレイ３から、歌唱者６と歌唱者６のグループが着席するテーブル７ａに向けて歌唱音声からなる指向性ビーム６ａ，７０ａを放音し、他のテーブル７ｂ〜７ｄに向けてガイドボーカル８４からなる指向性ビーム７０ｂ〜７０ｄを放音してもよい。更に、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから伴奏音８１と測定音８３とを指向性を持たせずに放音してもよい。また、スピーカアレイ３から、歌唱者６と歌唱者６のグループが着席するテーブル７ａに向けて伴奏音８１と歌唱音声と測定音８３とからなる指向性ビーム６ａ，７０ａを放音し、他のテーブル７ｂ〜７ｄに向けて伴奏音８１とガイドボーカル８４と測定音８３とからなる指向性ビーム７０ｂ〜７０ｄを放音してもよい。つまり、第１，第２実施形態では、マスカーとなる伴奏音８１とともに測定音８３がスピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから放音されればよい。 In the first and second embodiments, the directional beams 6a and 70a composed of the accompaniment sound 81 and the singing sound are emitted from the speaker array 3 toward the table 7a on which the group of the singer 6 and the singer 6 is seated. And the directional beams 70b-70d which consist of the accompaniment sound 81 and the guide vocal 84 are emitted toward the other tables 7b-7d. Further, the present invention has been described on the assumption that the measurement sound 83 and the accompaniment sound 81 are emitted from the speakers SP1 and SPn at both ends of the speaker array 3 without directivity. However, the present invention is not limited thereto, and the directional beams 6a and 70a composed of the singing voice are emitted from the speaker array 3 toward the table 7a on which the singer 6 and the group of the singers 6 are seated, and the other tables 7b to 7d. The directional beams 70b to 70d composed of the guide vocal 84 may be emitted toward the sound source. Further, the accompaniment sound 81 and the measurement sound 83 may be emitted from the speakers SP1 and SPn at both ends of the speaker array 3 without directivity. Further, the speaker array 3 emits directional beams 6a and 70a composed of the accompaniment sound 81, the singing sound, and the measurement sound 83 toward the table 7a on which the singer 6 and the group of the singer 6 are seated. The directional beams 70b to 70d including the accompaniment sound 81, the guide vocal 84, and the measurement sound 83 may be emitted toward the tables 7b to 7d. That is, in the first and second embodiments, the measurement sound 83 may be emitted from the speakers SP1 and SPn at both ends of the speaker array 3 together with the accompaniment sound 81 serving as a masker.

［第３実施形態］
次に、本発明の第３実施形態について、図１０，１１を参照して説明する。本発明の第３実施形態のカラオケ装置１は、伴奏音８１のデータとガイドメロディ８２のデータと測定音８３のデータとがＭＩＤＩ音源９１に含まれない（例えば、アカペラ曲等）点が第１実施形態と異なる。そこで、カラオケ装置１は、歌唱者６の歌唱音声を解析し、歌唱音声の音圧レベルが上昇するタイミングで測定音８３を生成して発音する。図１０は、カラオケ装置の機能ブロック図である。図１１は、歌唱音声に基づいて測定音を生成する場合における指向性ビームの生成手順を示すフローチャートである。 [Third Embodiment]
Next, a third embodiment of the present invention will be described with reference to FIGS. The karaoke apparatus 1 according to the third embodiment of the present invention has the first point that the data of the accompaniment sound 81, the data of the guide melody 82, and the data of the measurement sound 83 are not included in the MIDI sound source 91 (for example, a cappella tune). Different from the embodiment. Therefore, the karaoke apparatus 1 analyzes the singing voice of the singer 6 and generates and generates the measurement sound 83 at the timing when the sound pressure level of the singing voice rises. FIG. 10 is a functional block diagram of the karaoke apparatus. FIG. 11 is a flowchart showing a procedure for generating a directional beam when a measurement sound is generated based on a singing voice.

図１０に示すように、第３実施形態は、カラオケ曲に伴奏音８１のデータとガイドメロディ８２のデータと測定音８３のデータとが含まれない。また、カラオケ装置１に、音声信号解析部２６、測定音生成部２７及び信号併合部２８が更に備えられる。これらの機能部について、以下に説明する。 As shown in FIG. 10, in the third embodiment, the data of the accompaniment sound 81, the data of the guide melody 82, and the data of the measurement sound 83 are not included in the karaoke song. The karaoke apparatus 1 further includes an audio signal analysis unit 26, a measurement sound generation unit 27, and a signal merging unit 28. These functional units will be described below.

音声信号解析部２６は、歌唱者６の歌唱音声を解析して、測定音８３の生成タイミングになると、測定音生成部２７に測定音８３を生成するよう指示する。具体的には、例えば、音声信号解析部２６は、歌唱音声の音声信号の急激な音圧レベルの上昇が検出されると、測定音生成部２７に測定音８３を生成するよう指示する。音声信号解析部２６は、歌唱音声の音声信号の急激な音圧レベルの上昇を、１小節毎に検出し、定期的に測定音８３を生成するよう指示する。この際、測定音８３のレベルは、歌唱音声の音声信号の音圧レベルに応じて決定される。また、この測定音８３は、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから同時に放音されても、別々に放音されてもよい。スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから測定音８３が同時に放音される場合には、スピーカＳＰ１，ＳＰｎ毎に異なる周波数にて測定音８３を生成する。 The voice signal analysis unit 26 analyzes the singing voice of the singer 6 and instructs the measurement sound generation unit 27 to generate the measurement sound 83 when the measurement sound 83 is generated. Specifically, for example, when a sudden increase in the sound pressure level of the singing voice signal is detected, the voice signal analysis unit 26 instructs the measurement sound generation unit 27 to generate the measurement sound 83. The audio signal analysis unit 26 detects an abrupt increase in sound pressure level of the audio signal of the singing voice for each bar, and instructs to generate the measurement sound 83 periodically. At this time, the level of the measurement sound 83 is determined according to the sound pressure level of the voice signal of the singing voice. The measurement sound 83 may be emitted simultaneously from the speakers SP1 and SPn at both ends of the speaker array 3, or may be emitted separately. When the measurement sound 83 is simultaneously emitted from the speakers SP1 and SPn at both ends of the speaker array 3, the measurement sound 83 is generated at a different frequency for each of the speakers SP1 and SPn.

測定音生成部２７は、音声信号解析部２６の指示を受け、測定音８３のオーディオ信号を生成して、信号併合部２８に出力する。具体的には、測定音生成部２７は、歌唱音声の基本周波数の倍音になるよう測定音８３を生成する。 The measurement sound generator 27 receives an instruction from the audio signal analyzer 26, generates an audio signal of the measurement sound 83, and outputs the audio signal to the signal merger 28. Specifically, the measurement sound generator 27 generates the measurement sound 83 so as to be a harmonic of the fundamental frequency of the singing voice.

信号併合部２８は、歌唱音声の音声信号に測定音８３のオーディオ信号を付加して、バンドパスフィルタ２９（２９ａ〜２９ｄ）に出力する。以上のように、歌唱音声を解析して、測定音８３を生成する。 The signal merging unit 28 adds the audio signal of the measurement sound 83 to the voice signal of the singing voice and outputs it to the bandpass filter 29 (29a to 29d). As described above, the measurement sound 83 is generated by analyzing the singing voice.

また、図１１に示すように、歌唱者６に向けた指向性ビーム６ａの生成時の処理の流れは、第３実施形態では、第１実施形態のステップＳ１０１〜Ｓ１１２の処理の削除し、ステップＳ１１９とステップＳ１２０との間にステップＳ３０９〜Ｓ３１７の処理が追加される。以下に、追加されたステップＳ３０９〜Ｓ３１７の処理についてのみ説明する。 Moreover, as shown in FIG. 11, the flow of the process at the time of the production | generation of the directional beam 6a toward the singer 6 deletes the process of step S101-S112 of 1st Embodiment in 3rd Embodiment, Steps S309 to S317 are added between S119 and S120. Only the added processing in steps S309 to S317 will be described below.

図１１に示すように、ステップＳ３０９にて、マイク２で収音された収音音声信号が入力されると、音声信号解析部２６は、収音音声信号の音圧レベルの上昇を検出したかどうか調べる。収音音声信号の音圧レベルの急激な上昇を検出すると（Ｓ３１０：Ｙｅｓ）と、ステップＳ３１１へ進む。 As shown in FIG. 11, when the collected sound signal collected by the microphone 2 is input in step S309, has the sound signal analysis unit 26 detected an increase in the sound pressure level of the collected sound signal? Please check. When a rapid increase in the sound pressure level of the collected sound signal is detected (S310: Yes), the process proceeds to step S311.

ステップＳ３１１にて、測定音生成部２７は、収音音声信号の基本周波数の倍音で、測定音８３のオーディオ信号を生成して、ステップＳ３１２へ進む。この際、収音音声信号の音圧レベルに基づいて、測定音８３の音圧レベルを決定する。 In step S311, the measurement sound generator 27 generates an audio signal of the measurement sound 83 with a harmonic of the fundamental frequency of the collected sound signal, and the process proceeds to step S312. At this time, the sound pressure level of the measurement sound 83 is determined based on the sound pressure level of the collected sound signal.

ステップＳ３１２にて、信号併合部２８は、収音音声信号に、測定音８３のオーディオ信号を付与して、ステップＳ３１３へ進む。この際、信号併合部２８は、収音音声信号により測定音８３のオーディオ信号が経時マスキングされるように加算する。 In step S312, the signal merging unit 28 adds the audio signal of the measurement sound 83 to the collected sound signal, and proceeds to step S313. At this time, the signal merging unit 28 performs addition so that the audio signal of the measurement sound 83 is masked over time by the collected sound signal.

ステップＳ３１３にて、測定音８３のオーディオ信号が付与された収音音声信号は、バンドパスフィルタ２９へ出力される。バンドパスフィルタ２９は、収音音声信号から測定音８３のオーディオ信号だけを通過させて、レベル検出部１５１へ出力する。そして、レベル検出部１５１にて、測定音８３のオーディオ信号が検出される（Ｓ３１４：Ｙｅｓ）と、タイマ部１５２は、タイマを起動（Ｓ３１５）して、ステップＳ３１６へ進む。 In step S 313, the collected sound signal to which the audio signal of the measurement sound 83 is added is output to the band pass filter 29. The band pass filter 29 passes only the audio signal of the measurement sound 83 from the collected sound signal and outputs it to the level detection unit 151. Then, when the level detection unit 151 detects the audio signal of the measurement sound 83 (S314: Yes), the timer unit 152 activates the timer (S315) and proceeds to step S316.

ステップＳ３１６にて、バンドパスフィルタ２９から出力された測定音８３のオーディオ信号をミキサ２０に出力する。ミキサ２０は、測定音８３のオーディオ信号を放音音声信号に加算して、ステップＳ３１７へ進む。この際、測定音８３は、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから放音されるように、放音音声信号に加算される。 In step S 316, the audio signal of the measurement sound 83 output from the band pass filter 29 is output to the mixer 20. The mixer 20 adds the audio signal of the measurement sound 83 to the sound output sound signal, and proceeds to step S317. At this time, the measurement sound 83 is added to the emitted sound signal so as to be emitted from the speakers SP1 and SPn at both ends of the speaker array 3.

ステップＳ３１７にて、これらの放音音声信号は、対応するＤ／Ａコンバータ２１、ＡＭＰ２２を介して、スピーカＳＰ１〜ＳＰｎから放音され、ステップＳ３１８へ進む。この放音音声信号は、指向性ビーム６ａとなり、歌唱者６に向けて放音される。なお、ステップＳ３１８以降の処理は、第１実施形態のステップＳ１２０以降と同じ処理を行う。 In step S317, these sound emission audio signals are emitted from the speakers SP1 to SPn via the corresponding D / A converter 21 and AMP 22, and the process proceeds to step S318. This sound emission sound signal becomes a directional beam 6 a and is emitted toward the singer 6. In addition, the process after step S318 performs the same process as step S120 after 1st Embodiment.

以上のように、カラオケ曲に伴奏音８１が含まれない場合は、カラオケ装置１は、歌唱者６と歌唱者６のグループが着席しているテーブル７ａとに歌唱者６の歌唱音声を放音し、他のテーブル７ｂ〜７ｄにガイドボーカル８４を放音する。また、カラオケ装置１は、歌唱者６の歌唱音声をマスカーとして測定音８３を放音する。また、測定音８３は、人が知覚し難い周波数帯域に生成される。これにより、歌唱者６の歌唱音声をマスカーとして、測定音８３を経時マスキングできる。このため、歌唱者６や店内５の顧客は、測定音８３を知覚しないようにすることができる。 As described above, when the accompaniment sound 81 is not included in the karaoke song, the karaoke apparatus 1 emits the singing voice of the singer 6 to the singer 6 and the table 7a on which the group of the singer 6 is seated. Then, the guide vocal 84 is emitted to the other tables 7b to 7d. Moreover, the karaoke apparatus 1 emits the measurement sound 83 by using the singing voice of the singer 6 as a masker. The measurement sound 83 is generated in a frequency band that is difficult for humans to perceive. Thereby, the measurement sound 83 can be masked over time using the singing voice of the singer 6 as a masker. For this reason, the singer 6 and the customers in the store 5 can prevent the measurement sound 83 from being perceived.

なお、第３実施形態では、歌唱者６の歌唱音声をマスカーとして用いた。しかしながら、これに限らず、ガイドボーカル８４をマスカーとして用いてもよい。 In the third embodiment, the singing voice of the singer 6 is used as a masker. However, the present invention is not limited to this, and the guide vocal 84 may be used as a masker.

以上より、第３実施形態に係るカラオケ装置１では、アカペラ等の伴奏音８１が含まれないカラオケ曲であっても、歌唱音声を解析して測定音８３を発音することができる。これにより、第１，２実施形態と同様に、マイク位置の検出ができ、歌唱者６に指向性ビーム６ａを放音することができる。 As mentioned above, in the karaoke apparatus 1 which concerns on 3rd Embodiment, even if it is a karaoke tune which does not contain accompaniment sounds 81, such as a cappella, a singing voice can be analyzed and the measurement sound 83 can be pronounced. Thereby, like the first and second embodiments, the microphone position can be detected, and the directional beam 6a can be emitted to the singer 6.

次に、ローパスフィルタ１２，１７の代わりに、バンドエリミネーションフィルタ又はノッチフィルタ又はコムフィルタを用いた場合について説明する。なお、説明の簡単化のため、バンドパスフィルタ１４，１９，２９の通過帯域に、測定音８３が存在するものとして説明する。また、第１実施形態に基づいて説明するが、これらのフィルタは第２、第３実施形態にも適応することができる。 Next, a case where a band elimination filter, a notch filter or a comb filter is used instead of the low-pass filters 12 and 17 will be described. For simplification of description, it is assumed that the measurement sound 83 exists in the pass band of the bandpass filters 14, 19, and 29. Although described based on the first embodiment, these filters can also be applied to the second and third embodiments.

バンドエリミネーションフィルタを用いた場合、バンドエリミネーションフィルタの減衰帯域をバンドパスフィルタ１４，１９，２９の通過帯域と同じにすることで、測定音８３をカットすることができる。これにより、ローパスフィルタ１２，１７と比較して伴奏音８１が通過する周波数帯域が広いので、より音質がよい伴奏音８１を放音することができる。 When the band elimination filter is used, the measurement sound 83 can be cut by making the attenuation band of the band elimination filter the same as the pass band of the band pass filters 14, 19, and 29. Thereby, since the frequency band through which the accompaniment sound 81 passes is wider than that of the low-pass filters 12 and 17, the accompaniment sound 81 with better sound quality can be emitted.

また、バンドエリミネーションフィルタの減衰帯域に、ある程度の帯域幅を設けることで、減衰帯域に異なる周波数からなる複数の測定音８３を生成することができる。これにより、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎに対して、周波数の異なる測定音８３を適応することができる。 Further, by providing a certain amount of bandwidth in the attenuation band of the band elimination filter, it is possible to generate a plurality of measurement sounds 83 having different frequencies in the attenuation band. As a result, the measurement sound 83 having a different frequency can be applied to the speakers SP1 and SPn at both ends of the speaker array 3.

また、ノッチフィルタを用いた場合、ノッチフィルタのディップをバンドパスフィルタ１４，１９，２９のピークと同じにすることで、測定音８３をカットすることができる。ノッチフィルタのディップは狭帯域なので、バンドエリミネーションフィルタを用いるより、より音質がよい伴奏音８１を放音することができる。 When the notch filter is used, the measurement sound 83 can be cut by making the dip of the notch filter the same as the peak of the bandpass filters 14, 19, and 29. Since the dip of the notch filter is a narrow band, it is possible to emit the accompaniment sound 81 with better sound quality than using the band elimination filter.

更に、コムフィルタを用いた場合、コムフィルタのディップをバンドパスフィルタ１４，１９，２９のピークと同じにすることで、測定音８３をカットすることができる。コムフィルタは、複数のディップを有するため、複数の異なる周波数からなる測定音８３を生成することができる。これにより、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎに対して、周波数の異なる測定音８３を適応することができ、且つ、伴奏音８１の音質を向上させることができる。 Further, when the comb filter is used, the measurement sound 83 can be cut by making the dip of the comb filter the same as the peak of the bandpass filters 14, 19, and 29. Since the comb filter has a plurality of dips, the measurement sound 83 having a plurality of different frequencies can be generated. As a result, the measurement sound 83 having different frequencies can be applied to the speakers SP1 and SPn at both ends of the speaker array 3, and the sound quality of the accompaniment sound 81 can be improved.

なお、第１〜第３実施形態では、バンドパスフィルタ１４，１９，２９を用いて、４つの異なる周波数成分を取り出しているが、これに限らず、左右のスピーカＳＰ１，ＳＰｎ用に２以上の周波数成分が取り出せればよいので、バンドパスフィルタ１４，１９，２９はそれぞれ１個以上あればよい。 In the first to third embodiments, four different frequency components are extracted by using the bandpass filters 14, 19, and 29. However, the present invention is not limited to this, and two or more frequency components for the left and right speakers SP1 and SPn are used. Since it suffices to extract frequency components, it is sufficient that at least one band pass filter 14, 19, 29 is provided.

また、第１〜第３実施形態では、ローパスフィルタ１２，１７の通過帯域を１５ｋＨｚ以下とし、測定音８３が１５ｋＨｚ〜２０ｋＨｚの範囲内で検出されるとしている。しかしながら、これに限らず、測定音８３を検出する周波数帯域より、低域をローパスフィルタ１２，１７の通過帯域とすればよい。例えば、測定音８３を１７ｋＨｚ〜１８ｋＨｚ等で生成するのであれば、ローパスフィルタ１２，１７の通過帯域は、１７ｋＨｚ以下とする。 In the first to third embodiments, the pass band of the low-pass filters 12 and 17 is set to 15 kHz or less, and the measurement sound 83 is detected within the range of 15 kHz to 20 kHz. However, the present invention is not limited to this, and the low band may be used as the pass band of the low-pass filters 12 and 17 from the frequency band for detecting the measurement sound 83. For example, if the measurement sound 83 is generated at 17 kHz to 18 kHz or the like, the pass bands of the low-pass filters 12 and 17 are set to 17 kHz or less.

また、第１〜第３実施形態では、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから測定音８３を放音する例について説明した。しかしながら、これに限らず、スピーカアレイ３を構成するスピーカＳＰ１〜ＳＰｎのうちの２つから測定音８３を放音すればよい。これにより、三角法を利用して、マイク２の位置を検出することができる。 In the first to third embodiments, the example in which the measurement sound 83 is emitted from the speakers SP1 and SPn at both ends of the speaker array 3 has been described. However, the present invention is not limited to this, and the measurement sound 83 may be emitted from two of the speakers SP1 to SPn constituting the speaker array 3. Thereby, the position of the microphone 2 can be detected using trigonometry.

更に、第１〜第３実施形態では、ＭＩＤＩ音源９１及びガイドボーカル再生部９２からの出力がアナログオーディオ信号なのでＡ／Ｄコンバータ１１を設けた。しかしながら、これに限らず、ＭＩＤＩ音源９１及びガイドボーカル再生部９２からの出力がデジタルオーディオ信号の場合は、Ａ／Ｄコンバータ１１を設けなくてもよい。 Furthermore, in the first to third embodiments, the A / D converter 11 is provided because the outputs from the MIDI sound source 91 and the guide vocal reproducing unit 92 are analog audio signals. However, the present invention is not limited to this, and when the output from the MIDI sound source 91 and the guide vocal reproducing unit 92 is a digital audio signal, the A / D converter 11 may not be provided.

以上より、本発明に係るカラオケ装置１は、スピーカアレイ３から伴奏音８１や歌唱音声を放音し、スピーカアレイ３の両端のスピーカＳＰ１，ＳＰｎから測定音８３を放音する。カラオケ装置１は、この測定音８３をマイク２で収音するまでの経過時間を求めることにより、マイク位置、つまり歌唱者６の位置を検出することができ、歌唱者６に指向性ビーム６ａを常に放音することができる。また、測定音８３は、伴奏音８１や歌唱音声をマスカーとして、マスカーの基本周波数の倍音で構成されることで、カラオケ装置１は、測定音８３を同時マスキングや経時マスキングすることができる。これにより、歌唱者６や店内５の顧客は、測定音８３を知覚せずに、カラオケを楽しむことができる。更に、測定音８３は、人が知覚し難い周波数帯域を用いて構成されているので、歌唱者６や店内５の顧客は、測定音８３をより知覚することがない。 As described above, the karaoke apparatus 1 according to the present invention emits the accompaniment sound 81 and the singing sound from the speaker array 3, and emits the measurement sound 83 from the speakers SP1 and SPn at both ends of the speaker array 3. The karaoke apparatus 1 can detect the microphone position, that is, the position of the singer 6 by obtaining the elapsed time until the measurement sound 83 is picked up by the microphone 2, and the directional beam 6 a is provided to the singer 6. Sound can always be emitted. Further, the measurement sound 83 is composed of the accompaniment sound 81 or the singing voice as a masker and is a harmonic overtone of the basic frequency of the masker, so that the karaoke apparatus 1 can mask the measurement sound 83 simultaneously or with time. Thereby, the singer 6 and the customers in the store 5 can enjoy karaoke without perceiving the measurement sound 83. Furthermore, since the measurement sound 83 is configured using a frequency band that is difficult for humans to perceive, the singer 6 or the customer in the store 5 does not perceive the measurement sound 83 more.

飲食店の店内を説明する図である。It is a figure explaining the inside of a restaurant. マイク位置検出方法の説明図である。It is explanatory drawing of the microphone position detection method. マスカーの選択についての説明図である。It is explanatory drawing about selection of a masker. 測定音の加算についての説明図である。It is explanatory drawing about addition of a measurement sound. カラオケ装置の機能ブロック図である。It is a functional block diagram of a karaoke apparatus. カラオケ曲に測定音が含まれる場合における指向性ビームの生成手順を示すフローチャートである。It is a flowchart which shows the production | generation procedure of a directional beam in case measurement sound is included in karaoke music. 第２実施形態に係るカラオケ装置の機能ブロック図である。It is a functional block diagram of the karaoke apparatus which concerns on 2nd Embodiment. 伴奏音に基づいて測定音を生成する場合における指向性ビームの生成手順を示すフローチャートである。It is a flowchart which shows the production | generation procedure of a directional beam in the case of producing | generating a measurement sound based on an accompaniment sound. 伴奏音に基づいて測定音を生成する場合における指向性ビームの他の生成手順を示すフローチャートである。It is a flowchart which shows the other production | generation procedure of a directional beam in the case of producing | generating a measurement sound based on an accompaniment sound. 第３実施形態に係るカラオケ装置の機能ブロック図である。It is a functional block diagram of the karaoke apparatus which concerns on 3rd Embodiment. 歌唱音声に基づいて測定音を生成する場合における指向性ビームの生成手順を示すフローチャートである。It is a flowchart which shows the production | generation procedure of a directional beam in the case of producing | generating a measurement sound based on a song voice.

Explanation of symbols

１−カラオケ装置，２−マイク，３−スピーカアレイ，４−モニタ，５−店内，６−歌唱者，６ａ，７０ａ〜７０ｄ−指向性ビーム，７（７ａ〜７ｄ）−テーブル，１０−制御部，１１，１６−Ａ／Ｄコンバータ，１２，１７−ローパスフィルタ，１３，１８−ビーム形成部，１４（１４ａ〜１４ｄ），１９（１９ａ〜１９ｄ），２９（２９ａ〜２９ｄ）−バンドパスフィルタ，１５−マイク位置検出部，２０−ミキサ，２１−Ｄ／Ａコンバータ，２２−ＡＭＰ，２３−ＭＩＤＩ信号解析部，２４−測定音ＭＩＤＩ信号生成部，２５−ＭＩＤＩ信号併合部，２６−音声信号解析部，２７−測定音生成部，２８−信号併合部，８１−伴奏音，８２−ガイドメロディ，８３−測定音，８４−ガイドボーカル，９１−ＭＩＤＩ音源，９２−ガイドボーカル再生部，１００−操作部，１５１，１５３−レベル検出部，１５２−タイマ部，１５４−マイク位置算出部，１５５−ビーム形成係数算出部，ＳＰ１〜ＳＰｎ−スピーカ 1-karaoke device, 2-microphone, 3-speaker array, 4-monitor, 5-store, 6-singer, 6a, 70a-70d-directional beam, 7 (7a-7d) -table, 10-control unit 11, 16-A / D converter, 12, 17-low pass filter, 13, 18-beam forming unit, 14 (14a-14d), 19 (19a-19d), 29 (29a-29d) -band pass filter, 15-microphone position detector, 20-mixer, 21-D / A converter, 22-AMP, 23-MIDI signal analyzer, 24-measurement sound MIDI signal generator, 25-MIDI signal merger, 26-audio signal analyzer Section, 27-measurement sound generation section, 28-signal merging section, 81-accompaniment sound, 82-guide melody, 83-measurement sound, 84-guide vocal, 91-MIDI sound source, 92-guide sound Cull reproduction unit, 100 - operating unit, 151,153- level detector, 152- timer, 154- microphone position calculating unit, 155-beam forming coefficient calculation unit, SP1～SPn- speaker

Claims

Sound collecting means for picking up sound from the surroundings including the singing voice of the singer with a microphone and generating a sound signal;
Sound emission means for emitting measurement sound composed of harmonics of the fundamental frequency of a masker from two speakers of a speaker array having a plurality of speakers simultaneously with or immediately after the pronunciation of the masker;
Microphone position detection means for detecting the microphone position based on the elapsed time from the sound emission of the measurement sound by the sound emission means to the sound collection of the measurement sound by the sound collection means,
The said sound emission means is a karaoke apparatus which emits the directional beam containing the sound emission sound which should be given to a singer toward the said microphone position which the said microphone position detection means detected.

2. The karaoke apparatus according to claim 1, wherein the sound emitting unit emits the measurement sound included in the data of the karaoke song in advance by using one or a plurality of instrument sounds constituting the accompaniment sound of the karaoke song as a masker.

2. The sound emission means generates and emits the measurement sound using the instrument sound as a masker at each emission timing of one or a plurality of instrument sounds constituting an accompaniment sound of a karaoke song. Karaoke equipment.

The karaoke apparatus according to claim 1, wherein the sound emitting unit detects an increase in sound pressure level of the singing voice, generates the measurement sound using the singing voice as a masker, and emits the sound.