JP6819236B2

JP6819236B2 - Sound processing device, sound processing method, and program

Info

Publication number: JP6819236B2
Application number: JP2016225546A
Authority: JP
Inventors: 雄太湯山; 加納　真弥; 真弥加納; 良太郎青木; 友明平井
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2016-11-18
Filing date: 2016-11-18
Publication date: 2021-01-27
Anticipated expiration: 2036-11-18
Also published as: JP2018082411A

Description

The present invention relates to a sound processing device, a sound processing method, and a program.

Some devices for enjoying content such as music and movies have a function of reproducing the sound field of a hall by adding pseudo indirect sound components (reverberation components, etc.) to the acoustic signal of the content and emitting the result from speakers (Patent Document 1).

Patent Document 1: Japanese Unexamined Patent Publication No. 2015-50493

For example, if a pseudo indirect sound component could be added to the sound of an instrument played by the user, or to the user's singing, and emitted from the speakers together with music content, the user could enjoy the feeling of playing the instrument or singing as one of the performers of that content. In this case, however, differences between the characteristics of the user's instrument sound (or singing sound) and those of the content sound may prevent the user from fully feeling a sense of unity between the two. For example, the user may not fully feel that unity when the amount of indirect sound component in the user's instrument sound (or singing sound) differs from the amount in the content sound.

The present invention has been made in view of the above problem. Its object is to provide a sound processing device, a sound processing method, and a program that allow a user to perform along with content while feeling a sense of unity between the user's instrument sound or singing sound and the content sound.

To solve the above problem, a sound processing device according to the present invention includes: input means for receiving input of a user's performance sound; adjustment means for adjusting at least one of the performance sound and a content sound, which is a sound obtained based on content data, in order to match the characteristics of the performance sound and the content sound; generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound; and output control means for outputting, to output means, a sound obtained by mixing the performance sound, at least the direct sound component of the content sound, and the indirect sound component.

A sound processing method according to the present invention includes: an adjustment step of adjusting at least one of a user's performance sound and a content sound obtained based on content data, in order to match the characteristics of the performance sound and the content sound; a generation step of generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound; and an output control step of outputting, to output means, a sound obtained by mixing the performance sound, at least the direct sound component of the content sound, and the indirect sound component.

A program according to the present invention causes a computer to function as: adjustment means for adjusting at least one of a user's performance sound and a content sound obtained based on content data, in order to match the characteristics of the performance sound and the content sound; generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound; and output control means for outputting, to output means, a sound obtained by mixing the performance sound, at least the direct sound component of the content sound, and the indirect sound component. An information storage medium according to the present invention is a computer-readable information storage medium on which the above program is recorded.
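The adjust, generate, and output-control arrangement summarized above can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration: the source does not say which characteristic is matched or how, so RMS level matching is used as a stand-in, a single delayed copy stands in for the indirect sound generator, and the names `match_level`, `indirect_component`, and `process` are hypothetical.

```python
import math

def rms(signal):
    """Root-mean-square level of a list of samples (0.0 for an empty list)."""
    return math.sqrt(sum(x * x for x in signal) / len(signal)) if signal else 0.0

def match_level(performance, content):
    """Assumed adjustment step: scale the performance sound to the content's RMS level."""
    level = rms(performance)
    gain = rms(content) / level if level > 0 else 1.0
    return [x * gain for x in performance]

def mix(*signals):
    """Sample-wise sum of equal-length signals."""
    return [sum(samples) for samples in zip(*signals)]

def indirect_component(signal, delay=3, gain=0.5):
    """Toy indirect sound: one delayed, attenuated copy, with no direct sound in it."""
    return [gain * signal[n - delay] if n >= delay else 0.0 for n in range(len(signal))]

def process(performance, content_direct):
    adjusted = match_level(performance, content_direct)      # adjustment
    wet = indirect_component(mix(adjusted, content_direct))  # generation from the mix
    return mix(adjusted, content_direct, wet)                # output control

if __name__ == "__main__":
    print(process([0.5, 0.0, 0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]))
```

The point of the sketch is the ordering: the indirect component is computed once from the mix of both sounds, so the performance sound and the content sound share the same simulated reverberation.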

In the present invention, "performance" refers to the act of producing sound and includes not only playing a musical instrument but also singing. Accordingly, "performance sound" includes singing sound as well as the sound of a played instrument.

According to the present invention, the user can perform along with content while feeling a sense of unity between the user's instrument sound or singing sound and the content sound.

A diagram showing the configuration of a system including a sound processing device according to an embodiment of the present invention.
A diagram showing an example of the user's performance environment.
A functional block diagram of the sound processing device according to the first embodiment.
A flow chart showing the processing executed by the sound processing device according to the first embodiment.
A diagram for explaining an example of a method of generating an indirect sound component.
A functional block diagram of the sound processing device according to the second embodiment.
A flow chart showing the processing executed by the sound processing device according to the second embodiment.
A diagram for explaining the sound emitted from the speakers.
A functional block diagram of the sound processing device according to the third embodiment.
A flow chart showing the processing executed by the sound processing device according to the third embodiment.
A diagram for explaining the sound emitted from the speakers.
A functional block diagram of the sound processing device according to the fourth embodiment.
A flow chart showing the processing executed by the sound processing device according to the fourth embodiment.
A functional block diagram of the sound processing device according to the fifth embodiment.
A flow chart showing the processing executed by the sound processing device according to the fifth embodiment.
A functional block diagram of the sound processing device according to the sixth embodiment.
A flow chart showing the processing executed by the sound processing device according to the sixth embodiment.
A functional block diagram of the sound processing device according to the seventh embodiment.
A flow chart showing the processing executed by the sound processing device according to the seventh embodiment.
A functional block diagram of the sound processing device according to the eighth embodiment.
A flow chart showing the processing executed by the sound processing device according to the eighth embodiment.
A functional block diagram of the sound processing device according to the ninth embodiment.
A flow chart showing the processing executed by the sound processing device according to the ninth embodiment.

Examples of embodiments of the present invention are described below with reference to the drawings.

[First Embodiment] First, the first embodiment will be described. FIG. 1 shows the configuration of a system including a sound processing device according to the first embodiment of the present invention. As shown in FIG. 1, the system includes a sound processing device 1, a content reproduction device 2, a microphone 3, an electronic musical instrument 4, an electric musical instrument 5, speakers 6 (an example of sound emitting means), and a display device 7. The content reproduction device 2 may, for example, reproduce content (music, video, etc.) stored on an optical storage medium, or reproduce content distributed via a network.

The sound processing device 1 is, for example, an AV receiver. The sound processing device 1 includes a CPU 11, a memory 12, an input unit 13, an output unit 14, an acoustic signal processing unit 15, and a video signal processing unit 16.

Based on a program stored in the memory 12, the CPU 11 controls the input unit 13, the output unit 14, the acoustic signal processing unit 15, and the video signal processing unit 16, and executes information processing. Although omitted from FIG. 1, the sound processing device 1 has a network interface for data communication via a network, and the program is downloaded via the network and stored in the memory 12. Alternatively, the sound processing device 1 has a component for reading a program from an information storage medium such as a memory card, and the program is read from the information storage medium and stored in the memory 12.

The input unit 13 can receive, from the content reproduction device 2, an acoustic signal and a video signal based on content data. It supplies the acoustic signal to the acoustic signal processing unit 15 and the video signal to the video signal processing unit 16.

The input unit 13 can also receive input of the user's performance sound. Here, "performance" refers to the act of producing sound and includes not only playing a musical instrument but also singing. "Performance sound" therefore includes singing sound as well as the sound of a played instrument. In the following, the sound of a played instrument is referred to as "instrument sound" for convenience.

For example, the input unit 13 is connected to the microphone 3, can receive the acoustic signal output from the microphone 3, and supplies that signal to the acoustic signal processing unit 15. The microphone 3 picks up sound and outputs it as an acoustic signal. The microphone 3 is used to input the instrument sound of an acoustic instrument played by the user, or the user's singing sound, to the sound processing device 1.

As another example, the input unit 13 can be connected to the electronic musical instrument 4 or the electric musical instrument 5 played by the user, can receive the acoustic signal output from that instrument, and supplies that signal to the acoustic signal processing unit 15.

The input unit 13 may include a wireless network interface so that acoustic signals are input to the input unit 13 via wireless communication. That is, the content sound and the performance sound may be input to the sound processing device 1 wirelessly.

The acoustic signal processing unit 15 is, for example, a DSP (Digital Signal Processor) and executes processing on acoustic signals under the control of the CPU 11. The acoustic signal output from the acoustic signal processing unit 15 is emitted from the speakers 6 via the output unit 14.

The video signal processing unit 16 is, for example, a DSP (Digital Signal Processor) and executes processing on video signals under the control of the CPU 11. The video signal output from the video signal processing unit 16 is displayed on the display device 7 via the output unit 14.

The sound processing device 1 according to the first embodiment lets a user who plays an acoustic instrument or sings at home or elsewhere enjoy the feeling of performing in a hall or similar venue. The configuration that realizes this function is described below. As shown in FIG. 1, the sound processing device 1 also has a function of receiving the instrument sound of the electronic musical instrument 4 or the electric musical instrument 5, and a function of outputting content reproduced by the content reproduction device 2 through the speakers 6 and the display device 7, but these functions are not essential in the first embodiment.

FIG. 2 shows an example of the user's performance environment. In the example shown in FIG. 2, the microphone 3 is placed in front of the user U and is used to pick up the user's performance sound. For example, when the user is playing an acoustic instrument, the instrument sound is picked up by the microphone 3 and input to the input unit 13. Likewise, when the user is singing, the singing sound is picked up by the microphone 3 and input to the input unit 13.

In the example shown in FIG. 2, a plurality of speakers 6A, 6B, 6C, 6D, and 6E are installed. Specifically, the speaker 6A is placed directly in front of the user U. The speakers 6B and 6C are placed to the front left and front right of the user U, and the speakers 6D and 6E to the rear left and rear right. Five speakers 6A to 6E are installed in this example, but four or fewer speakers 6, or six or more speakers 6, may be installed instead. For example, only the speakers 6B and 6C may be installed.

FIG. 3 is a functional block diagram showing the functions realized by the sound processing device 1 according to the first embodiment. As shown in FIG. 3, the sound processing device 1 according to the first embodiment includes a performance sound adjustment unit 101, a pre-processing unit 102 (an example of first processing means), an indirect sound component generation unit 103, a post-processing unit 104 (an example of second processing means), and an output control unit 105. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, they are realized by the CPU 11 controlling the acoustic signal processing unit 15 in accordance with a program.

FIG. 4 is a flow chart showing the processing executed by the sound processing device 1 according to the first embodiment. The function of each functional block is described below with reference to FIG. 4.

First, the performance sound adjustment unit 101 adjusts the performance sound input from the microphone 3 by applying predetermined processing to it (S10). For example, the performance sound adjustment unit 101 applies howling reduction processing to the performance sound in order to reduce howling (acoustic feedback) at the microphone 3. The performance sound adjustment unit 101 may also apply effect processing to the performance sound, for example processing that removes unnecessary frequency bands or evens out the sound pressure level before the indirect sound is generated. The performance sound processed by the performance sound adjustment unit 101 is supplied to the pre-processing unit 102.
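As a rough illustration of the kind of adjustment mentioned in step S10, the following Python sketch removes low-frequency content with a one-pole high-pass filter and then evens out the level by peak normalization. This is only a sketch under stated assumptions: the source names neither a filter type nor a target level, howling reduction itself is not shown, and `high_pass`, `normalize_peak`, `adjust_performance_sound`, `alpha`, and `target_peak` are hypothetical names and values.

```python
def high_pass(signal, alpha=0.95):
    """One-pole high-pass filter: y[n] = alpha * (y[n-1] + x[n] - x[n-1])."""
    out, prev_x, prev_y = [], 0.0, 0.0
    for x in signal:
        y = alpha * (prev_y + x - prev_x)
        out.append(y)
        prev_x, prev_y = x, y
    return out

def normalize_peak(signal, target_peak=0.5):
    """Scale the signal so that its largest absolute sample equals target_peak."""
    peak = max((abs(x) for x in signal), default=0.0)
    if peak == 0.0:
        return list(signal)  # silence stays silence
    return [x * target_peak / peak for x in signal]

def adjust_performance_sound(signal):
    """Assumed S10-style adjustment: band removal followed by level adjustment."""
    return normalize_peak(high_pass(signal))

if __name__ == "__main__":
    print(adjust_performance_sound([0.0, 1.0, -1.0, 0.5]))
```

Both stages work sample by sample on already captured audio, so they add no look-ahead delay before the signal reaches the later stages.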

The pre-processing unit 102 executes pre-processing on the supplied sound (here, the performance sound) (S11). For example, the pre-processing unit 102 applies sound adjustment such as equalization to the supplied sound. The performance sound processed by the pre-processing unit 102 is supplied to the indirect sound component generation unit 103. Although FIG. 3 shows the performance sound adjustment unit 101 and the pre-processing unit 102 as separate functional blocks, they may be configured as a single unit.

The indirect sound component generation unit 103 generates a pseudo indirect sound component corresponding to the performance sound (S12). That is, the indirect sound component generation unit 103 assumes that the performance sound is produced in an acoustic space such as a hall, and generates the indirect sound component (reverberation component, etc.) that would arise in that space. Various known methods can be used to generate the pseudo indirect sound component. For example, the indirect sound component generation unit 103 generates the pseudo indirect sound component corresponding to the performance sound based on information such as the positions at which indirect sound (reverberant sound) arises in the assumed acoustic space, the delay time of the indirect sound relative to the direct sound, and the ratio of the indirect sound level to the sound pressure level of the direct sound.
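One simple way to picture generation from delay times and level ratios is a tapped delay line, sketched below in Python. This is an illustrative assumption rather than the method used by the device: the `Reflection` type, the `generate_indirect` function, and the sample values are hypothetical, and the position at which each reflection arises (which would drive assignment to individual speakers) is left out.

```python
from dataclasses import dataclass

@dataclass
class Reflection:
    delay_ms: float     # delay of this indirect sound relative to the direct sound
    level_ratio: float  # indirect level divided by direct level

def generate_indirect(direct, reflections, sample_rate=48000):
    """Return only the indirect component: a sum of delayed, scaled copies of `direct`."""
    delays = [round(r.delay_ms * sample_rate / 1000) for r in reflections]
    out = [0.0] * (len(direct) + max(delays, default=0))
    for delay, reflection in zip(delays, reflections):
        for n, x in enumerate(direct):
            out[n + delay] += reflection.level_ratio * x
    return out

if __name__ == "__main__":
    hall = [Reflection(delay_ms=2, level_ratio=0.5), Reflection(delay_ms=4, level_ratio=0.25)]
    # A 1 kHz sample rate keeps the numbers small: 2 ms -> 2 samples, 4 ms -> 4 samples.
    print(generate_indirect([1.0, 0.0, 0.0], hall, sample_rate=1000))
```

Because the direct sound is never copied into the output at zero delay, the result contains the indirect component alone, which is what the later output stage expects.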

For example, the indirect sound component generation unit 103 includes an indirect sound component addition unit, which adds to a supplied sound the indirect sound component corresponding to that sound, and the indirect sound component generation unit 103 supplies the performance sound to this addition unit. The indirect sound component generation unit 103 then obtains the indirect sound component alone by removing the original performance sound from the sound output by the addition unit (the performance sound with the indirect sound component added).

FIG. 5 is a diagram for explaining an example of a method of generating an indirect sound component. FIG. 5(A) shows an example of a performance sound. This performance sound corresponds to the direct sound component. For example, the performance sound (direct sound component) shown in FIG. 5(A) is stored in both a first buffer and a second buffer. The indirect sound component addition unit adds, to the performance sound (direct sound component) stored in the first buffer, the indirect sound component corresponding to that performance sound. Various known methods can be used to add the indirect sound component. As a result, the first buffer holds the direct sound component and the indirect sound component of the performance sound, as shown for example in FIG. 5(B). The indirect sound component generation unit 103 then subtracts the direct sound component held in the second buffer (FIG. 5(A)) from the direct and indirect sound components held in the first buffer (FIG. 5(B)), thereby obtaining the indirect sound component alone, as shown in FIG. 5(C).
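The two-buffer procedure of FIG. 5 can be sketched in a few lines of Python. The reverberator here is a deliberately trivial stand-in (a single delayed, attenuated copy), since the source only says that known methods may be used; `add_indirect`, `extract_indirect`, `delay`, and `gain` are hypothetical names and parameters.

```python
def add_indirect(buffer, delay=2, gain=0.5):
    """Stand-in reverberator: adds a delayed, attenuated copy of the dry signal in place."""
    dry = list(buffer)
    for n in range(delay, len(buffer)):
        buffer[n] += gain * dry[n - delay]

def extract_indirect(performance):
    """Return only the indirect component, following the two-buffer procedure."""
    first = list(performance)   # becomes direct + indirect, as in FIG. 5(B)
    second = list(performance)  # stays direct only, as in FIG. 5(A)
    add_indirect(first)
    # Subtracting the untouched copy leaves the indirect part, as in FIG. 5(C).
    return [wet - dry for wet, dry in zip(first, second)]

if __name__ == "__main__":
    print(extract_indirect([1.0, 0.5, 0.0, 0.0]))  # [0.0, 0.0, 0.5, 0.25]
```

The subtraction only works cleanly if the reverberator passes the direct sound through at unity gain and with no delay, which is an assumption of this sketch.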

The method of generating the indirect sound component is not limited to the above example. For example, the performance sound (direct sound component) shown in FIG. 5(A) may be stored in the first buffer, and the indirect sound component corresponding to that performance sound may be generated directly in the second buffer.

The indirect sound component generated by the indirect sound component generation unit 103 is supplied to the post-processing unit 104. The post-processing unit 104 executes post-processing on the supplied sound (here, the indirect sound component) (S13). For example, the post-processing unit 104 applies processing that adjusts the supplied sound to the characteristics of the speakers 6. The indirect sound component processed by the post-processing unit 104 is supplied to the output control unit 105.

The output control unit 105 outputs the supplied indirect sound component to the output unit 14 (an example of output means) (S14). That is, the output control unit 105 outputs the indirect sound component to the output unit 14 while restricting output of the performance sound input from the microphone 3 (the instrument sound of an acoustic instrument, or singing sound) to the output unit 14. The indirect sound component output to the output unit 14 is emitted by the speakers 6.

Here, "restricting output of the performance sound to the output unit 14" means, for example, not outputting the performance sound to the output unit 14 at all. That is, the output control unit 105 outputs only the indirect sound component to the output unit 14, without outputting the performance sound (direct sound component) input from the microphone 3. In other words, the output control unit 105 ensures that the performance sound (direct sound component) input from the microphone 3 is not emitted from the speakers 6 and that only the indirect sound component is.

"Restricting output of the performance sound to the output unit 14" may also mean, for example, outputting the performance sound to the output unit 14 so that it is emitted at a volume considerably lower than that of the indirect sound component. That is, the output control unit 105 may output the performance sound (direct sound component) input from the microphone 3 so that it is emitted at a volume considerably lower than normal (low enough to be hard for the user to hear), while outputting the indirect sound component to the output unit 14 at normal volume. In other words, the output control unit 105 causes the performance sound (direct sound component) input from the microphone 3 to be emitted from the speakers 6 at a volume considerably lower than normal, and the indirect sound component to be emitted from the speakers 6 at normal volume.
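Both readings of "restricting" the direct sound can be expressed as a gain applied before mixing, as in the Python sketch below. The function name `output_mix` and the gain values are assumptions: a gain of 0.0 corresponds to not outputting the direct sound at all, and 0.01 (about -40 dB) is just one arbitrary example of a "considerably lower" volume, not a figure taken from the source.

```python
def output_mix(direct, indirect, direct_gain=0.0, indirect_gain=1.0):
    """Mix the direct and indirect components with separate gains.

    direct_gain=0.0 suppresses the direct sound entirely; a small positive
    value lets it through at a much lower level than the indirect sound.
    """
    length = max(len(direct), len(indirect))

    def pad(signal):
        # Zero-pad so that an indirect tail longer than the direct sound is kept.
        return list(signal) + [0.0] * (length - len(signal))

    return [direct_gain * d + indirect_gain * w
            for d, w in zip(pad(direct), pad(indirect))]

if __name__ == "__main__":
    print(output_mix([1.0, 0.0], [0.0, 0.5]))                    # indirect only
    print(output_mix([1.0, 0.0], [0.0, 0.5], direct_gain=0.01))  # faint direct sound
```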

When the speakers 6 are built into the sound processing device 1, the output control unit 105 outputs the supplied indirect sound component to the speakers 6 (another example of output means).

With the sound processing device 1 according to the first embodiment described above, a pseudo indirect sound component (reverberation component, etc.) corresponding to the user's performance sound (the instrument sound of an acoustic instrument, or singing sound) is emitted from the speakers 6, so the user can enjoy the feeling of playing an acoustic instrument or singing in a hall, a church, or a similar venue. In addition, because emission of the user's performance sound itself from the speakers 6 is restricted, the device avoids the sense of incongruity the user would feel on hearing the performance sound come from a position other than where it is actually produced.

[Second Embodiment] Next, the second embodiment will be described. The hardware configuration of the sound processing device 1 according to the second embodiment is the same as in the first embodiment. The user's performance environment is also basically the same as in the first embodiment. In the second embodiment, however, the user plays the electronic musical instrument 4 or the electric musical instrument 5 connected to the input unit 13 of the sound processing device 1, so the microphone 3 is unnecessary.

The sound processing device 1 according to the second embodiment lets a user who plays the electronic musical instrument 4 or the electric musical instrument 5 at home or elsewhere enjoy the feeling of performing in a hall or similar venue. The configuration that realizes this function is described below. As shown in FIG. 1, the sound processing device 1 also has a function of outputting content reproduced by the content reproduction device 2 through the speakers 6 and the display device 7, but this function is not essential in the second embodiment.

FIG. 6 is a functional block diagram showing the functions realized by the sound processing device 1 according to the second embodiment. As shown in FIG. 6, the sound processing device 1 according to the second embodiment includes a performance sound adjustment unit 111, a pre-processing unit 112, an indirect sound component generation unit 113, a post-processing unit 114, and an output control unit 115. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, they are realized by the CPU 11 controlling the acoustic signal processing unit 15 in accordance with a program.

FIG. 7 is a flowchart showing the processing executed by the sound processing device 1 according to the second embodiment. The function of each functional block is described below with reference to FIG. 7.

First, the performance sound adjustment unit 111 adjusts the performance sound by applying predetermined processing to the performance sound input from the electronic musical instrument 4 or the electric musical instrument 5 (S20). For example, the performance sound adjustment unit 111 applies effect processing (such as distortion applied to a guitar sound) to the performance sound. The performance sound adjustment unit 111 executes only low-latency processing and does not execute processing that would introduce a large delay. The performance sound processed by the performance sound adjustment unit 111 is supplied to the pre-processing unit 112.

The pre-processing unit 112 executes pre-processing on the supplied sound (here, the performance sound) (S21). The indirect sound component generation unit 113 then generates a pseudo indirect sound component corresponding to the performance sound (S22). The post-processing unit 114 executes post-processing on the supplied sound (here, the indirect sound component) (S23). Steps S21 to S23 are basically the same as steps S11 to S13 of the first embodiment, and the pre-processing unit 112, the indirect sound component generation unit 113, and the post-processing unit 114 are basically the same as the pre-processing unit 102, the indirect sound component generation unit 103, and the post-processing unit 104 of the first embodiment, so their description is omitted here.

The performance sound processed by the performance sound adjustment unit 111 is also supplied to the output control unit 115 via a path 119. The path 119 reaches the output control unit 115 without passing through the pre-processing unit 112, the indirect sound component generation unit 113, or the post-processing unit 114. In other words, the path 119 has less delay than the path that reaches the output control unit 115 through those three units. For example, the pre-processing unit 112, the indirect sound component generation unit 113, and the post-processing unit 114 operate on the performance sound stored in a buffer, whereas on the path 119 the performance sound is supplied to the output control unit 115 without being stored in a buffer.

The output control unit 115 mixes the performance sound (direct sound component) supplied via the path 119 with the indirect sound component generated by the indirect sound component generation unit 113, and outputs the resulting mix to the output unit 14 (S24). The mix output to the output unit 14 is emitted from the speaker 6.

FIG. 8 is a diagram for explaining the sound emitted from the speaker 6. Here, as shown in FIG. 8(A), assume that a performance sound B is input after a performance sound A. The performance sounds A and B correspond to direct sound components. In this case, as shown in FIG. 8(B), the indirect sound component generation unit 113 generates an indirect sound component A corresponding to the performance sound A and supplies it to the output control unit 115. After generating the indirect sound component A, the indirect sound component generation unit 113 also generates an indirect sound component B corresponding to the performance sound B, but this is omitted from the figure.

The processing load in the pre-processing unit 112, the indirect sound component generation unit 113, and the post-processing unit 114 is large, and processing in these functional blocks takes time. The indirect sound component A is therefore emitted from the speaker 6 after a delay that corresponds to the time required for that processing. In contrast, the performance sounds A and B (direct sound components) are emitted from the speaker 6 via the low-delay path 119 (a path on which substantially no delay occurs). As a result, as shown in FIG. 8(C), the indirect sound component A corresponding to the performance sound A is delayed relative to where it would occur in a real acoustic space, is mixed with the later performance sound B, and the resulting mix is emitted from the speaker 6.
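The timing relationship of FIG. 8 can be illustrated with a minimal Python sketch. It is not taken from the specification: the function name `mix_with_delayed_indirect`, the frame-based model, the one-frame delay, and the use of a scaled copy of the input as a stand-in for the generated indirect sound component are all assumptions made for illustration.

```python
def mix_with_delayed_indirect(direct_frames, indirect_gain=0.5, delay_frames=1):
    """Mix each direct frame with the indirect component of an earlier frame.

    Assumption: the indirect component is modeled as a scaled copy of the
    frame it was derived from; a real device would use a reverb algorithm.
    """
    output = []
    for i, direct in enumerate(direct_frames):
        j = i - delay_frames
        if j >= 0:
            # Indirect component derived from the frame played delay_frames earlier.
            indirect = [indirect_gain * s for s in direct_frames[j]]
        else:
            # Nothing has come out of the slow path yet.
            indirect = [0.0] * len(direct)
        # The direct sound bypasses the slow path and is mixed immediately.
        output.append([d + r for d, r in zip(direct, indirect)])
    return output


frame_a = [1.0, 0.0]  # performance sound A
frame_b = [0.0, 2.0]  # performance sound B
mixed = mix_with_delayed_indirect([frame_a, frame_b])
print(mixed)
```

In this sketch the first output frame contains only performance sound A, and the second contains performance sound B mixed with the indirect component derived from A, which mirrors the situation described for FIG. 8(C).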

With the sound processing device 1 according to the second embodiment described above, a pseudo indirect sound component (such as a reverberation component) is added to the user's performance sound (the instrument sound of the electronic musical instrument 4 or the electric musical instrument 5) and emitted from the speaker 6, so the user can enjoy the feeling of playing an instrument in a hall or the like. In addition, because the user's performance sound is emitted from the speaker 6 via the low-delay path 119, the delay between the user playing and the performance sound being emitted from the speaker 6 is kept small. This helps prevent the user from experiencing the discomfort that a large delay between playing and sound emission would cause.

In the sound processing device 1 according to the second embodiment, the indirect sound component corresponding to the user's performance sound occurs later than it would if the performance sound were produced in a real acoustic space (see FIG. 8). However, a slight delay in the indirect sound component is unlikely to cause the user discomfort, so this poses no particular problem.

[Third Embodiment] Next, a third embodiment will be described. The hardware configuration of the sound processing device 1 according to the third embodiment is the same as in the first embodiment. The user's performance environment is the same as in the second embodiment.

The sound processing device 1 according to the third embodiment allows a user who is playing the electronic musical instrument 4 or the electric musical instrument 5 at home or the like to enjoy the feeling of performing in a hall or the like as one of the performers of the music content. A configuration for realizing this function is described below.

FIG. 9 is a functional block diagram showing the functions realized by the sound processing device 1 according to the third embodiment. As shown in FIG. 9, the sound processing device 1 according to the third embodiment includes a performance sound adjustment unit 121, a pre-processing unit 122, an indirect sound component generation unit 123, a post-processing unit 124, an output control unit 125, and a content decoding unit 126. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, they are realized by the CPU 11 controlling the acoustic signal processing unit 15 in accordance with a program.

FIG. 10 is a flowchart showing the processing executed by the sound processing device 1 according to the third embodiment. The function of each functional block is described below with reference to FIG. 10.

First, the content decoding unit 126 format-decodes the multichannel content sound input from the content playback device 2 and thereby converts it into a PCM signal (S30).

The performance sound adjustment unit 121 adjusts the performance sound by applying predetermined processing to the performance sound input from the electronic musical instrument 4 or the electric musical instrument 5 (S31). Step S31 is the same as step S20 of the second embodiment, and the performance sound adjustment unit 121 is the same as the performance sound adjustment unit 111 of the second embodiment, so their description is omitted here.

For convenience, FIG. 10 shows steps S30 and S31 as being executed in sequence, but they are in fact executed in parallel.

The content sound converted into a PCM signal is mixed with the performance sound, which has been converted into a PCM signal by an A/D conversion circuit (S32), and the resulting mix is supplied to the pre-processing unit 122. The performance sound is also supplied to the output control unit 125 via a path 129. The path 129 is the same as the path 119 of the second embodiment.

The pre-processing unit 122 executes pre-processing on the mix (S33). For example, the pre-processing unit 122 applies sound adjustment processing using an equalizer or the like to the mix. The mix processed by the pre-processing unit 122 is supplied to the indirect sound component generation unit 123.

The indirect sound component generation unit 123 generates a pseudo indirect sound component corresponding to the mix (S34). That is, it generates an indirect sound component corresponding to both the performance sound (direct sound component) and the content sound. The indirect sound component generation unit 123 assumes that the mix is produced in an acoustic space such as a hall and generates the indirect sound component (such as a reverberation component) that would arise in that space. Any of various known methods can be used to generate the pseudo indirect sound component.

For example, the indirect sound component generation unit 123 applies processing that adds indirect sound to the mix stored in a first buffer, and then subtracts the original mix stored in a second buffer from the sound now held in the first buffer, thereby obtaining the indirect sound component corresponding to the mix. Alternatively, the indirect sound component generation unit 123 may obtain the indirect sound component by executing processing that generates indirect sound from the mix stored in the first buffer and writes that indirect sound directly into the second buffer.
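The first of these two approaches (add indirect sound, then subtract the original) can be sketched as follows. This is only an illustration under stated assumptions: `add_reverb` is a hypothetical single-echo stand-in for whatever reverberation processing the device actually uses, and the function names, delay, and gain are not from the specification.

```python
def add_reverb(dry, delay=2, gain=0.5):
    """Toy single-echo stand-in for reverb: wet[n] = dry[n] + gain * dry[n - delay]."""
    wet = list(dry)
    for n in range(delay, len(dry)):
        wet[n] += gain * dry[n - delay]
    return wet


def extract_indirect(mix):
    """Return only the indirect component by subtracting the dry mix from the wet mix."""
    first_buffer = add_reverb(mix)   # mix with indirect sound added
    second_buffer = list(mix)        # untouched copy of the original mix
    return [w - d for w, d in zip(first_buffer, second_buffer)]


mix = [1.0, 0.0, 0.0, 0.0]           # a single impulse as the mixed input
indirect = extract_indirect(mix)
print(indirect)
```

The subtraction leaves only what the reverb stage added, so the direct sound can be routed separately (here, via the low-delay path) without being doubled in the final output.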

The indirect sound component generated by the indirect sound component generation unit 123 is supplied, together with the content sound, to the output control unit 125 via the post-processing unit 124.

The post-processing unit 124 executes post-processing (S35). Step S35 is basically the same as step S13 of the first embodiment, and the post-processing unit 124 is basically the same as the post-processing unit 104 of the first embodiment, so their description is omitted here.

The output control unit 125 mixes the performance sound (direct sound component) supplied via the path 129 with the content sound and the indirect sound component supplied from the post-processing unit 124, and outputs the resulting mix to the output unit 14 (S36). The mix output to the output unit 14 is emitted from the speaker 6.
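The signal flow of steps S32 to S36 can be summarized in a short Python sketch. It is a simplification under stated assumptions: pre-processing and post-processing are omitted, the processing delay on the slow path is ignored, and `make_indirect` is a hypothetical callable standing in for the indirect sound component generation unit 123.

```python
def third_embodiment_output(performance, content, make_indirect):
    """Sketch of S32 to S36 for one block of samples (delays ignored)."""
    # S32: mix the performance sound with the decoded content sound.
    mix = [p + c for p, c in zip(performance, content)]
    # S33 to S35 (simplified): derive the indirect component from the mix.
    indirect = make_indirect(mix)
    # S36: direct performance sound (low-delay path) + content sound + indirect component.
    return [p + c + r for p, c, r in zip(performance, content, indirect)]


def half_gain(mix):
    """Hypothetical stand-in for an indirect sound generator."""
    return [0.5 * s for s in mix]


out = third_embodiment_output([1.0, 0.0], [0.0, 2.0], half_gain)
print(out)
```

The point of the sketch is that the indirect component is derived from the combined signal, while the performance sound itself reaches the output without passing through that generator.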

FIG. 11 is a diagram for explaining the sound emitted from the speaker 6. Here, as shown in FIG. 11(A), assume that a performance sound B and a content sound B are input after a performance sound A and a content sound A. The performance sounds A and B correspond to direct sound components. For convenience, FIG. 11 shows the performance sound A and the content sound A slightly offset in time, but they are assumed to be input at the same time. The same applies to the performance sound B and the content sound B.

In the example shown in FIG. 11(A), the indirect sound component generation unit 123 generates an indirect sound component A corresponding to the mix of the performance sound A and the content sound A, as shown in FIG. 11(B), and the indirect sound component A is supplied to the output control unit 125 together with the content sound A. After generating the indirect sound component A, the indirect sound component generation unit 123 also generates an indirect sound component B corresponding to the mix of the performance sound B and the content sound B, but this is omitted from the figure.

The processing load in the pre-processing unit 122, the indirect sound component generation unit 123, and the post-processing unit 124 is large, and processing in these functional blocks takes time. The indirect sound component A and the content sound A are therefore emitted from the speaker 6 after a delay that corresponds to the time required for that processing. In contrast, the performance sounds A and B (direct sound components) are emitted from the speaker 6 via the low-delay path 129 (a path on which substantially no delay occurs). As a result, as shown in FIG. 11(C), the indirect sound component A is delayed relative to where it would occur in a real acoustic space, is mixed with the later performance sound B, and the resulting mix is emitted from the speaker 6.

With the sound processing device 1 according to the third embodiment described above, a pseudo indirect sound component (such as a reverberation component) is added to the user's performance sound (the instrument sound of the electronic musical instrument 4 or the electric musical instrument 5) and the multichannel content sound, and the result is emitted from the speaker 6, so the user can enjoy the feeling of performing in a hall or the like as one of the performers of the music content. In addition, because the user's performance sound is emitted from the speaker 6 via the low-delay path 129, the delay between the user playing and the performance sound being emitted from the speaker 6 is kept small. This helps prevent the user from experiencing the discomfort that a large delay between playing and sound emission would cause.

[Fourth Embodiment] Next, a fourth embodiment will be described. The hardware configuration of the sound processing device 1 according to the fourth embodiment is the same as in the first embodiment. The user's performance environment is also the same as in the first embodiment.

The sound processing device 1 according to the fourth embodiment allows the user to enjoy the feeling of singing or playing an acoustic instrument in a hall or the like as one of the performers of the music content. A configuration for realizing this function is described below.

FIG. 12 is a functional block diagram showing the functions realized by the sound processing device 1 according to the fourth embodiment. As shown in FIG. 12, the sound processing device 1 according to the fourth embodiment includes a performance sound adjustment unit 131, a pre-processing unit 132, an indirect sound component generation unit 133, a post-processing unit 134, an output control unit 135, and a content decoding unit 136. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, they are realized by the CPU 11 controlling the acoustic signal processing unit 15 in accordance with a program.

FIG. 13 is a flowchart showing the processing executed by the sound processing device 1 according to the fourth embodiment. The function of each functional block is described below with reference to FIG. 13.

First, the content decoding unit 136 format-decodes the multichannel content sound input from the content playback device 2 and thereby converts it into a PCM signal (S40). Step S40 is basically the same as step S30 of the third embodiment, and the content decoding unit 136 is basically the same as the content decoding unit 126 of the third embodiment.

However, the content decoding unit 136 of the fourth embodiment includes a specific component removal unit 136A, which removes a specific component from the content sound in step S40. Specifically, the specific component removal unit 136A removes from the content sound the component that corresponds to the performance sound input from the microphone 3. For example, when the user's singing is input from the microphone 3, the specific component removal unit 136A removes the vocal component from the content sound. In multichannel content sound the vocal component is often carried in the center channel, so the specific component removal unit 136A removes the vocal component by removing the center channel. The method of removing the vocal component is not limited to this, and any of various known methods can be used. As another example, when the sound of an acoustic instrument is input from the microphone 3, the specific component removal unit 136A may remove the sound component of that acoustic instrument from the content sound. The type of performance sound input from the microphone 3 (for example, singing, guitar, or piano) may be determined automatically by analyzing the performance sound, or may be specified by the user via an input device.
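The center-channel approach described above can be sketched in Python as follows. The representation of a multichannel frame as a dictionary keyed by channel label, the label `"C"` for the center channel, and the function name `remove_center_channel` are assumptions made for this illustration; the specification does not define a data layout.

```python
def remove_center_channel(frame):
    """Return a copy of a multichannel frame with the center channel silenced.

    Assumption: the frame maps channel labels to lists of PCM samples,
    and the center channel is labeled "C".
    """
    cleaned = dict(frame)
    if "C" in cleaned:
        # Silencing the center channel removes most of the vocal component
        # when the vocals are mixed there.
        cleaned["C"] = [0.0] * len(cleaned["C"])
    return cleaned


frame = {"L": [0.1, 0.2], "R": [0.3, 0.4], "C": [0.9, 0.8]}
cleaned = remove_center_channel(frame)
print(cleaned)
```

This simple approach also discards anything else mixed into the center channel, which is one reason the text leaves room for other known vocal removal methods.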

The performance sound adjustment unit 131 adjusts the performance sound by applying predetermined processing to the performance sound input from the microphone 3 (S41). Step S41 is the same as step S10 of the first embodiment, and the performance sound adjustment unit 131 is the same as the performance sound adjustment unit 101 of the first embodiment, so their description is omitted here.

For convenience, FIG. 13 shows steps S40 and S41 as being executed in sequence, but they are in fact executed in parallel.

The content sound converted into a PCM signal is mixed with the performance sound, which has been converted into a PCM signal by an A/D conversion circuit (S42), and the resulting mix is supplied to the pre-processing unit 132. Processing by the pre-processing unit 132, the indirect sound component generation unit 133, and the post-processing unit 134 is then executed on the basis of the mix (S43, S44, S45). Steps S43 to S45 are basically the same as steps S33 to S35 of the third embodiment, and the pre-processing unit 132, the indirect sound component generation unit 133, and the post-processing unit 134 are basically the same as the pre-processing unit 122, the indirect sound component generation unit 123, and the post-processing unit 124 of the third embodiment, so their description is omitted here.

The performance sound is also supplied to the output control unit 135 via a path 139. The path 139 is the same as the path 119 of the second embodiment.

Like the output control unit 125 of the third embodiment, the output control unit 135 mixes the performance sound (direct sound component) supplied via the path 139 with the content sound and the indirect sound component supplied from the post-processing unit 134, and outputs the resulting mix to the output unit 14 (S46). The mix output to the output unit 14 is emitted from the speaker 6.

With the sound processing device 1 according to the fourth embodiment described above, a pseudo indirect sound component (such as a reverberation component) is added to the user's performance sound (singing or the sound of an acoustic instrument) and the multichannel content sound, and the result is emitted from the speaker 6, so the user can enjoy the feeling of singing or playing an acoustic instrument in a hall or the like as one of the performers of the music content. In addition, because the user's performance sound is emitted from the speaker 6 via the low-delay path 139, the delay between the user performing and the performance sound being emitted from the speaker 6 is kept small. This helps prevent the user from experiencing the discomfort that a large delay between performing and sound emission would cause.

Furthermore, with the sound processing device 1 according to the fourth embodiment, when the user is singing, for example, the vocal component contained in the content sound is removed, so the user can enjoy the feeling of singing in a hall or the like as the vocalist of the music content.

The description above assumes that the vocal component of the content sound is removed before the performance sound and the content sound are mixed, but the vocal component of the content sound may instead be removed after the two have been mixed.

[Fifth Embodiment] Next, a fifth embodiment will be described. The hardware configuration of the sound processing device 1 according to the fifth embodiment is the same as in the first embodiment. The user's performance environment is the same as in the first embodiment or the second embodiment.

The sound processing device 1 according to the fifth embodiment also allows the user to enjoy the feeling of performing in a hall or the like as one of the performers of the music content. In particular, it allows the user to perceive a sense of unity between the user's performance sound and the content sound. A configuration for realizing this function is described below.

FIG. 14 is a functional block diagram showing the functions realized by the sound processing device 1 according to the fifth embodiment. As shown in FIG. 14, the sound processing device 1 according to the fifth embodiment includes a performance sound adjustment unit 141, a pre-processing unit 142, an indirect sound component generation unit 143, a post-processing unit 144, an output control unit 145, a content decoding unit 146, and a content sound adjustment unit 147. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, they are realized by the CPU 11 controlling the acoustic signal processing unit 15 in accordance with a program.

FIG. 15 is a flowchart showing the processing executed by the sound processing device 1 according to the fifth embodiment. The function of each functional block is described below with reference to FIG. 15.

First, the content decoding unit 146 format-decodes the multichannel content sound input from the content playback device 2 and thereby converts it into a PCM signal (S50). Step S50 is basically the same as step S30 of the third embodiment, and the content decoding unit 146 is basically the same as the content decoding unit 126 of the third embodiment, so their description is omitted here.

The content sound adjustment unit 147 adjusts the content sound so that the characteristics of the performance sound and the content sound match (S51). The content sound adjustment unit 147 includes an indirect sound component removal unit 147A. The indirect sound component removal unit 147A removes the indirect sound component contained in the content sound so that the amounts of indirect sound component in the performance sound and the content sound match. Any of various known methods can be used to remove the indirect sound component contained in the content sound.

電子楽器４又は電気楽器５から入力される演奏音は直接音成分のみを含み、間接音成分を含んでいないのに対し、コンテンツ音には直接音成分と間接音成分とが含まれている場合がある。このため、例えば、間接音成分除去部１４７Ａはコンテンツ音に含まれる間接音成分を除去して、コンテンツ音の直接音成分のみを出力する。例えば、間接音成分除去部１４７Ａは、コンテンツ音に含まれる間接音成分を特定し、当該特定された間接音成分の音圧レベルを下げることによって、間接音成分を除去する。すなわち、間接音成分除去部１４７Ａは、間接音成分の音圧レベルを零（ほぼ零）まで下げることによって、間接音成分をほぼ完全に除去する。 The performance sound input from the electronic musical instrument 4 or the electric musical instrument 5 contains only a direct sound component and no indirect sound component, whereas the content sound may contain both a direct sound component and an indirect sound component. Therefore, for example, the indirect sound component removing unit 147A removes the indirect sound component contained in the content sound and outputs only the direct sound component of the content sound. For example, the indirect sound component removing unit 147A identifies the indirect sound component contained in the content sound and removes it by lowering the sound pressure level of the identified indirect sound component. That is, the indirect sound component removing unit 147A removes the indirect sound component almost completely by lowering its sound pressure level to zero (or nearly zero).

なお、間接音成分除去部１４７Ａは、間接音成分の音圧レベルをある程度まで下げることによって、間接音成分をある程度まで除去（低減）するようにしてもよい。すなわち、「間接音成分を除去する」には、間接音成分をほぼ完全に除去することだけでなく、間接音成分をある程度まで除去（低減）することも含まれる。ここで、「ある程度」とは、間接音成分が残っていたとしてもユーザに違和感を感じさせないような程度である。 The indirect sound component removing unit 147A may remove (reduce) the indirect sound component to some extent by lowering the sound pressure level of the indirect sound component to some extent. That is, "removing the indirect sound component" includes not only removing the indirect sound component almost completely but also removing (reducing) the indirect sound component to some extent. Here, "to some extent" is a degree that does not make the user feel uncomfortable even if the indirect sound component remains.
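The level-lowering step described above can be illustrated with a minimal Python sketch. It assumes that the content sound has already been separated into a direct and an indirect component by some known method, which the description does not specify. The function name `remove_indirect` and the parameter `residual_gain` are hypothetical and introduced only for illustration.

```python
def remove_indirect(direct, indirect, residual_gain=0.0):
    """Return the content sound with its indirect component attenuated.

    direct, indirect: equal-length lists of samples, assumed to be the
        already-identified direct and indirect components (hypothetical inputs).
    residual_gain: 0.0 lowers the indirect level to zero (almost complete
        removal); a small positive value removes it only "to some extent".
    """
    if len(direct) != len(indirect):
        raise ValueError("direct and indirect must have the same length")
    if not 0.0 <= residual_gain <= 1.0:
        raise ValueError("residual_gain must be between 0.0 and 1.0")
    # Lower the level of the identified indirect component, then recombine.
    return [d + residual_gain * i for d, i in zip(direct, indirect)]
```

With `residual_gain=0.0` only the direct sound component is output. A value such as 0.5 corresponds to the partial reduction case, where some indirect sound remains but is assumed not to bother the user.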

演奏音調整部１４１は演奏音を調整する（Ｓ５２）。ステップＳ５２は第１実施形態又は第２実施形態のステップＳ１０，２０と同様であり、演奏音調整部１４１は第１実施形態又は第２実施形態の演奏音調整部１１０，１１１と同様であるため、ここでは説明を省略する。 The performance sound adjustment unit 141 adjusts the performance sound (S52). Step S52 is the same as steps S10 and S20 of the first or second embodiment, and the performance sound adjustment unit 141 is the same as the performance sound adjustment units 110 and 111 of the first or second embodiment, so the description is omitted here.

なお、図１５では、便宜上、ステップＳ５０，Ｓ５１とステップＳ５２とが順番に実行されるように示されているが、ステップＳ５０，Ｓ５１とステップＳ５２とは並列的に実行される。 In FIG. 15, for convenience, steps S50 and S51 and step S52 are shown to be executed in order, but steps S50 and S51 and step S52 are executed in parallel.

間接音成分が除去されたコンテンツ音（直接音成分）は演奏音とミックスされ（Ｓ５３）、当該ミックス音は、プリプロセッシング部１４２を経て、間接音成分生成部１４３に供給される。プリプロセッシング部１４２は、上記ミックス音に対して、プリプロセッシングを実行する（Ｓ５４）。ステップＳ５４は第１実施形態のステップＳ１１と同様であり、プリプロセッシング部１４２は第１実施形態のプリプロセッシング部１１２と同様であるため、ここでは説明を省略する。 The content sound (direct sound component) from which the indirect sound component has been removed is mixed with the performance sound (S53), and the mixed sound is supplied to the indirect sound component generation unit 143 via the preprocessing unit 142. The preprocessing unit 142 executes preprocessing on the mixed sound (S54). Since step S54 is the same as step S11 of the first embodiment and the preprocessing unit 142 is the same as the preprocessing unit 112 of the first embodiment, the description thereof will be omitted here.

間接音成分生成部１４３は、上記ミックス音（コンテンツ音の直接音成分と演奏音の直接音成分）に対応する擬似的な間接音成分を生成する（Ｓ５５）。ステップＳ５５は第３実施形態のステップＳ３４と基本的に同様であり、間接音成分生成部１４３は第３実施形態の間接音成分生成部１２３と基本的に同様であるため、ここでは説明を省略する。 The indirect sound component generation unit 143 generates a pseudo indirect sound component corresponding to the mixed sound (the direct sound component of the content sound and the direct sound component of the performance sound) (S55). Step S55 is basically the same as step S34 of the third embodiment, and the indirect sound component generation unit 143 is basically the same as the indirect sound component generation unit 123 of the third embodiment, so the description is omitted here.

間接音成分生成部１４３によって生成された間接音成分は、ポストプロセッシング部１４４を経て出力制御部１４５に供給される。ポストプロセッシング部１４４はポストプロセッシングを実行する（Ｓ５６）。ステップＳ５６は第１実施形態のステップＳ１３と同様であり、ポストプロセッシング部１４４は第１実施形態のポストプロセッシング部１１４と同様であるため、ここでは説明を省略する。なお、図１４に示すように、間接音成分が除去されたコンテンツ音（すなわち、コンテンツ音の直接音成分）も出力制御部１４５に供給される。また、演奏音（直接音成分）は経路１４９を介して出力制御部１４５に供給される。経路１４９は第２実施形態の経路１２９と同様である。 The indirect sound component generated by the indirect sound component generation unit 143 is supplied to the output control unit 145 via the post-processing unit 144. The post-processing unit 144 executes post-processing (S56). Since step S56 is the same as step S13 of the first embodiment and the post-processing unit 144 is the same as the post-processing unit 114 of the first embodiment, the description thereof will be omitted here. As shown in FIG. 14, the content sound from which the indirect sound component has been removed (that is, the direct sound component of the content sound) is also supplied to the output control unit 145. Further, the performance sound (direct sound component) is supplied to the output control unit 145 via the path 149. Route 149 is similar to Route 129 of the second embodiment.

出力制御部１４５は、経路１４９を介して供給された演奏音（直接音成分）と、コンテンツ音（直接音成分）と、間接音成分生成部１４３によって生成された間接音成分とをミックスし、当該ミックス音を出力部１４に出力する（Ｓ５７）。出力部１４に出力されたミックス音はスピーカ６によって放音される。 The output control unit 145 mixes the performance sound (direct sound component) supplied via the path 149, the content sound (direct sound component), and the indirect sound component generated by the indirect sound component generation unit 143. The mixed sound is output to the output unit 14 (S57). The mixed sound output to the output unit 14 is emitted by the speaker 6.
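The signal flow of steps S53 to S57 can be sketched in Python as follows. This is only an illustrative sketch under simplifying assumptions: signals are mono lists of samples, pre-processing and post-processing (S54, S56) are omitted, and the pseudo indirect sound is modeled as a few delayed, scaled copies of the input. The function names and the `taps` parameter are hypothetical; the description leaves the actual generation method to the third embodiment.

```python
def mix(*signals):
    """Sum equal-length signals sample by sample."""
    return [sum(samples) for samples in zip(*signals)]


def generate_indirect(dry, taps):
    """Stand-in for the indirect sound component generation unit 143.

    taps: list of (delay_in_samples, gain) pairs, an assumed toy model
        of reflections; not the method used in the patent.
    """
    out = [0.0] * len(dry)
    for delay, gain in taps:
        for n in range(delay, len(dry)):
            out[n] += gain * dry[n - delay]
    return out


def fifth_embodiment_output(content_direct, performance, taps):
    mixed = mix(content_direct, performance)            # S53
    indirect = generate_indirect(mixed, taps)           # S55
    # S57: performance sound (via path 149), content direct sound,
    # and the generated indirect sound are mixed for output.
    return mix(performance, content_direct, indirect)
```

Because the pseudo indirect sound is generated from the mix of both direct sounds, the performance sound and the content sound receive the same amount of indirect sound, which is the point of the fifth embodiment.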

以上に説明した第５実施形態に係る音処理装置１によれば、ユーザは音楽コンテンツの演奏者の一員となってホール等で演奏している気分を楽しむことができる。また、第５実施形態に係る音処理装置１によれば、ユーザの演奏音とコンテンツ音との間接音成分の量を合わせることが可能になり、その結果、ユーザの演奏音とコンテンツ音との一体感をユーザが十分に感じることが可能になる。 With the sound processing device 1 according to the fifth embodiment described above, the user can enjoy the feeling of performing in a hall or the like as one of the performers of the music content. In addition, the sound processing device 1 according to the fifth embodiment makes it possible to match the amount of the indirect sound component between the user's performance sound and the content sound, and as a result the user can fully feel a sense of unity between the performance sound and the content sound.

なお、図１４に示したように、第５実施形態においても、アコースティック楽器の楽器音又は歌唱音がマイク３から入力されるようにしてもよい。ただし、この場合、演奏音に間接音成分が含まれる場合があるため、演奏音調整部１４１では演奏音に含まれる間接音成分を除去するようにしてもよい。 As shown in FIG. 14, also in the fifth embodiment, the musical instrument sound or the singing sound of the acoustic musical instrument may be input from the microphone 3. However, in this case, since the performance sound may include an indirect sound component, the performance sound adjustment unit 141 may remove the indirect sound component included in the performance sound.

［第６実施形態］次に、第６実施形態について説明する。第６実施形態は第５実施形態の変形例である。第６実施形態に係る音処理装置１では、演奏音が入力されている場合にのみコンテンツ音の間接音成分を除去する。 [Sixth Embodiment] Next, the sixth embodiment will be described. The sixth embodiment is a modification of the fifth embodiment. The sound processing device 1 according to the sixth embodiment removes the indirect sound component of the content sound only when the performance sound is input.

図１６は、第６実施形態に係る音処理装置１で実現される機能を示す機能ブロック図であり、図１７は、第６実施形態に係る音処理装置１で実行される処理を示すフロー図である。図１６に示すように、第６実施形態に係る音処理装置１は、演奏音調整部１４１、プリプロセッシング部１４２、間接音成分生成部１４３、ポストプロセッシング部１４４、出力制御部１４５、コンテンツデコード部１４６、コンテンツ音調整部１４７、及び入力検出部１４８を含む。これらの機能ブロックはＣＰＵ１１及び音響信号処理部１５によって実現される。例えば、ＣＰＵ１１がプログラムに従って音響信号処理部１５を制御することによって、上記の機能ブロックが実現される。 FIG. 16 is a functional block diagram showing the functions realized by the sound processing device 1 according to the sixth embodiment, and FIG. 17 is a flow diagram showing the processing executed by the sound processing device 1 according to the sixth embodiment. As shown in FIG. 16, the sound processing device 1 according to the sixth embodiment includes a performance sound adjustment unit 141, a pre-processing unit 142, an indirect sound component generation unit 143, a post-processing unit 144, an output control unit 145, a content decoding unit 146, a content sound adjustment unit 147, and an input detection unit 148. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, the functional blocks are realized by the CPU 11 controlling the acoustic signal processing unit 15 according to a program.

第６実施形態に係る音処理装置１は入力検出部１４８を含み、ステップＳ５１の代わりにステップＳ５１Ａ，５１Ｂを含む点で第５実施形態と異なる。以下、第５実施形態との相違点について主に説明する。 The sound processing device 1 according to the sixth embodiment is different from the fifth embodiment in that it includes an input detection unit 148 and includes steps S51A and 51B instead of step S51. Hereinafter, the differences from the fifth embodiment will be mainly described.

第６実施形態に係る音処理装置１では、入力検出部１４８は、電子楽器４、電気楽器５、又はマイク３から演奏音が入力されていることを検出する。間接音成分除去部１４７Ａは、入力検出部１４８の検出結果に応じて、コンテンツ音に含まれる間接音成分を除去する。具体的には、演奏音が入力されていることが検出されている場合に（Ｓ５１Ａ：Ｙｅｓ）、間接音成分除去部１４７Ａはコンテンツ音から間接音成分を除去する（Ｓ５１Ｂ）。一方、演奏音が入力されていることが検出されていない場合に（Ｓ５１Ａ：Ｎｏ）、間接音成分除去部１４７Ａはコンテンツ音から間接音成分を除去しない。なお、図１７では省略されているが、この場合、ステップＳ５２，Ｓ５３も実行されず、プリプロセッシング部１４２にはコンテンツ音のみが供給され、ステップＳ５７ではコンテンツ音が出力部１４に出力される。 In the sound processing device 1 according to the sixth embodiment, the input detection unit 148 detects that the performance sound is input from the electronic musical instrument 4, the electric musical instrument 5, or the microphone 3. The indirect sound component removing unit 147A removes the indirect sound component contained in the content sound according to the detection result of the input detection unit 148. Specifically, when it is detected that the performance sound is input (S51A: Yes), the indirect sound component removing unit 147A removes the indirect sound component from the content sound (S51B). On the other hand, when it is not detected that the performance sound is input (S51A: No), the indirect sound component removing unit 147A does not remove the indirect sound component from the content sound. Although omitted in FIG. 17, in this case, steps S52 and S53 are not executed, only the content sound is supplied to the preprocessing unit 142, and the content sound is output to the output unit 14 in step S57.
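The branch at step S51A can be sketched in Python as below. This is a hedged illustration: the description does not say how the input detection unit 148 detects an input, so an RMS threshold is assumed here, and `performance_detected`, `adjust_content`, `threshold`, and the `remove_indirect` callable are all hypothetical names.

```python
import math


def performance_detected(block, threshold=1e-3):
    """Assumed stand-in for the input detection unit 148 (RMS threshold)."""
    if not block:
        return False
    rms = math.sqrt(sum(x * x for x in block) / len(block))
    return rms > threshold


def adjust_content(content, performance_block, remove_indirect):
    """Run the indirect sound removal only while a performance sound is input.

    remove_indirect: callable implementing the removal (S51B); it is not
        called at all when no performance sound is detected.
    """
    if performance_detected(performance_block):   # S51A: Yes
        return remove_indirect(content)           # S51B
    return content                                # S51A: No, content unchanged
```

Passing the removal as a callable makes the design choice visible: when no performance sound is detected, the removal processing is skipped entirely rather than run and discarded.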

以上に説明した第６実施形態に係る音処理装置１では、演奏音が入力されている場合に限って、コンテンツ音から間接音成分が除去される。演奏音が入力されていない場合には、演奏音とコンテンツ音とで間接音成分の量を合わせる必要がなく、コンテンツ音から間接音成分を除去する必要がない。この点、第６実施形態に係る音処理装置１によれば、コンテンツ音から間接音成分を除去する必要がない場合には、コンテンツ音から間接音成分を除去する処理が実行されなくなるため、音処理装置１の処理負荷を軽減することが可能になる。 In the sound processing device 1 according to the sixth embodiment described above, the indirect sound component is removed from the content sound only while a performance sound is being input. When no performance sound is input, there is no need to match the amount of the indirect sound component between the performance sound and the content sound, and therefore no need to remove the indirect sound component from the content sound. In this respect, the sound processing device 1 according to the sixth embodiment does not execute the removal processing when it is unnecessary, which makes it possible to reduce the processing load of the sound processing device 1.

［第７実施形態］次に、第７実施形態について説明する。第７実施形態に係る音処理装置１のハードウェア構成は第１実施形態と同様である。また、ユーザの演奏環境は第１実施形態又は第２実施形態と同様である。 [7th Embodiment] Next, the 7th embodiment will be described. The hardware configuration of the sound processing device 1 according to the seventh embodiment is the same as that of the first embodiment. The playing environment of the user is the same as that of the first embodiment or the second embodiment.

第５実施形態や第６実施形態では、コンテンツ音の間接音成分の量を演奏音と合わせるために、コンテンツ音に含まれる間接音成分を除去するのに対し、第７実施形態に係る音処理装置１では、コンテンツ音の間接音成分の量に合わせて、演奏音に間接音成分を付加するようになっている。 In the fifth and sixth embodiments, the indirect sound component contained in the content sound is removed so that the amount of the indirect sound component of the content sound matches that of the performance sound. In contrast, the sound processing device 1 according to the seventh embodiment adds an indirect sound component to the performance sound in accordance with the amount of the indirect sound component of the content sound.

図１８は、第７実施形態に係る音処理装置１で実現される機能を示す機能ブロック図である。図１８に示すように、第７実施形態に係る音処理装置１は、演奏音調整部１５１、プリプロセッシング部１５２、間接音成分生成部１５３、ポストプロセッシング部１５４、出力制御部１５５、コンテンツデコード部１５６、及び間接音成分量解析部１５７を含む。これらの機能ブロックはＣＰＵ１１及び音響信号処理部１５によって実現される。例えば、ＣＰＵ１１がプログラムに従って音響信号処理部１５を制御することによって、上記の機能ブロックが実現される。 FIG. 18 is a functional block diagram showing a function realized by the sound processing device 1 according to the seventh embodiment. As shown in FIG. 18, the sound processing device 1 according to the seventh embodiment has a performance sound adjustment unit 151, a pre-processing unit 152, an indirect sound component generation unit 153, a post-processing unit 154, an output control unit 155, and a content decoding unit. 156 and indirect sound component amount analysis unit 157 are included. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, the above functional block is realized by the CPU 11 controlling the acoustic signal processing unit 15 according to a program.

図１９は、第７実施形態に係る音処理装置１で実行される処理を示すフロー図である。以下、図１９を参照しながら各機能ブロックの機能について説明する。 FIG. 19 is a flow diagram showing the processing executed by the sound processing device 1 according to the seventh embodiment. The function of each functional block is described below with reference to FIG. 19.

まず、コンテンツデコード部１５６は、コンテンツ再生装置２から入力されるマルチチャンネルのコンテンツ音をフォーマットデコードすることによって、ＰＣＭ信号に変換する（Ｓ６０）。ステップＳ６０は第３実施形態のステップＳ３０と基本的に同様であり、コンテンツデコード部１５６は第３実施形態のコンテンツデコード部１２６と基本的に同様であるため、ここでは説明を省略する。 First, the content decoding unit 156 converts the multi-channel content sound input from the content playback device 2 into a PCM signal by format decoding (S60). Since step S60 is basically the same as step S30 of the third embodiment and the content decoding unit 156 is basically the same as the content decoding unit 126 of the third embodiment, the description thereof will be omitted here.

間接音成分量解析部１５７はコンテンツ音に含まれる間接音成分の量を解析する（Ｓ６１）。例えば、間接音成分量解析部１５７はコンテンツ音に含まれる間接音成分の数や大きさ（音圧レベル）を解析する。コンテンツ音に含まれる間接音成分の量を解析する方法としては公知の各種方法を採用することができる。 The indirect sound component amount analysis unit 157 analyzes the amount of the indirect sound component contained in the content sound (S61). For example, the indirect sound component amount analysis unit 157 analyzes the number and magnitude (sound pressure level) of indirect sound components included in the content sound. Various known methods can be adopted as a method for analyzing the amount of indirect sound components contained in the content sound.

演奏音調整部１５１は演奏音を調整する（Ｓ６２）。ステップＳ６２は第１実施形態又は第２実施形態のステップＳ１０，Ｓ２０と基本的に同様であり、演奏音調整部１５１は第１実施形態又は第２実施形態の演奏音調整部１０１，１１１と基本的に同様である。 The performance sound adjustment unit 151 adjusts the performance sound (S62). Step S62 is basically the same as steps S10 and S20 of the first embodiment or the second embodiment, and the performance sound adjusting unit 151 is basically the same as the performance sound adjusting units 101 and 111 of the first embodiment or the second embodiment. Is similar.

ただし、第７実施形態の演奏音調整部１５１は、演奏音とコンテンツ音との特性を合わせるために演奏音を調整する役割も果たす。すなわち、演奏音調整部１５１は間接音成分付加部１５１Ａを含み、ステップＳ６２において、間接音成分付加部１５１Ａは、演奏音に対応する間接音成分を当該演奏音に対して付加する。特に、間接音成分付加部１５１Ａは、演奏音に対して付加する間接音成分の量を、間接音成分量解析部１５７の解析結果に基づいて設定する。すなわち、間接音成分付加部１５１Ａは、演奏音に対して付加する間接音成分の数や大きさを、コンテンツ音に含まれる間接音成分の数や大きさに合わせて設定する。つまり、間接音成分付加部１５１Ａは、演奏音に対して付加する間接音成分の数や大きさを、コンテンツ音に含まれる間接音成分の数や大きさと同程度に設定する。 However, the performance sound adjustment unit 151 of the seventh embodiment also plays a role of adjusting the performance sound in order to match the characteristics of the performance sound and the content sound. That is, the performance sound adjustment unit 151 includes the indirect sound component addition unit 151A, and in step S62, the indirect sound component addition unit 151A adds the indirect sound component corresponding to the performance sound to the performance sound. In particular, the indirect sound component addition unit 151A sets the amount of the indirect sound component added to the performance sound based on the analysis result of the indirect sound component amount analysis unit 157. That is, the indirect sound component addition unit 151A sets the number and size of the indirect sound components added to the performance sound according to the number and size of the indirect sound components included in the content sound. That is, the indirect sound component addition unit 151A sets the number and size of the indirect sound components added to the performance sound to the same degree as the number and size of the indirect sound components included in the content sound.
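How the indirect sound component addition unit 151A might use the analysis result can be sketched in Python. The sketch assumes the analysis result of the indirect sound component amount analysis unit 157 can be summarized as a number of indirect components and a single level. That representation, the `IndirectAnalysis` class, the `add_indirect` function, and the `spacing` parameter are all hypothetical, and evenly spaced delayed copies are only a toy model of indirect sound.

```python
from dataclasses import dataclass


@dataclass
class IndirectAnalysis:
    """Assumed summary of the analysis result for the content sound."""
    count: int     # number of indirect sound components found
    level: float   # their magnitude, as a linear gain


def add_indirect(performance, analysis, spacing=2):
    """Add as many indirect components, at the same level, as the content has.

    spacing: delay between successive components in samples (an assumption).
    """
    out = list(performance)
    for k in range(1, analysis.count + 1):
        delay = k * spacing
        for n in range(delay, len(performance)):
            # Each added component is a delayed copy of the dry performance.
            out[n] += analysis.level * performance[n - delay]
    return out
```

For an impulse `[1.0, 0.0, 0.0, 0.0, 0.0]` and `IndirectAnalysis(count=2, level=0.5)`, the result is `[1.0, 0.0, 0.5, 0.0, 0.5]`: two added components at the analyzed level.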

第７実施形態に係る音処理装置１では、コンテンツ音と、間接音成分付加部１５１Ａによって間接音成分が付加された演奏音とがミックスされ（Ｓ６３）、当該ミックス音がプリプロセッシング部１５２に供給される。そして、当該ミックス音に基づいて、プリプロセッシング部１５２、間接音成分生成部１５３、及びポストプロセッシング部１５４による処理が実行される（Ｓ６４，Ｓ６５，Ｓ６６）。ステップＳ６４〜Ｓ６６は第３実施形態のステップＳ３３〜Ｓ３５と基本的に同様であり、プリプロセッシング部１５２、間接音成分生成部１５３、及びポストプロセッシング部１５４は第３実施形態のプリプロセッシング部１２２、間接音成分生成部１２３、及びポストプロセッシング部１２４と基本的に同様であるため、ここでは説明を省略する。 In the sound processing device 1 according to the seventh embodiment, the content sound and the performance sound to which the indirect sound component has been added by the indirect sound component addition unit 151A are mixed (S63), and the mixed sound is supplied to the pre-processing unit 152. Processing by the pre-processing unit 152, the indirect sound component generation unit 153, and the post-processing unit 154 is then executed on the basis of the mixed sound (S64, S65, S66). Steps S64 to S66 are basically the same as steps S33 to S35 of the third embodiment, and the pre-processing unit 152, the indirect sound component generation unit 153, and the post-processing unit 154 are basically the same as the pre-processing unit 122, the indirect sound component generation unit 123, and the post-processing unit 124 of the third embodiment, so the description is omitted here.

なお、間接音成分付加部１５１Ａによって間接音成分が付加された演奏音は、経路１５９を介して出力制御部１５５に供給される。経路１５９は第２実施形態の経路１１９と同様である。 The performance sound to which the indirect sound component is added by the indirect sound component addition unit 151A is supplied to the output control unit 155 via the path 159. Route 159 is similar to Route 119 of the second embodiment.

出力制御部１５５は、コンテンツ音と、間接音成分付加部１５１Ａによって間接音成分が付加された演奏音と、間接音成分生成部１５３によって生成された間接音成分とをミックスし、当該ミックス音を出力部１４に出力する（Ｓ６７）。出力部１４に出力されたミックス音はスピーカ６によって放音される。 The output control unit 155 mixes the content sound, the performance sound to which the indirect sound component has been added by the indirect sound component addition unit 151A, and the indirect sound component generated by the indirect sound component generation unit 153, and outputs the mixed sound to the output unit 14 (S67). The mixed sound output to the output unit 14 is emitted by the speaker 6.

以上に説明した第７実施形態に係る音処理装置１によれば、ユーザは音楽コンテンツの演奏者の一員となってホール等で演奏している気分を楽しむことができる。また、第７実施形態に係る音処理装置１によれば、ユーザの演奏音とコンテンツ音との間接音成分の量を合わせることが可能になり、その結果、ユーザの演奏音とコンテンツ音との一体感をユーザが十分に感じることが可能になる。 With the sound processing device 1 according to the seventh embodiment described above, the user can enjoy the feeling of performing in a hall or the like as one of the performers of the music content. In addition, the sound processing device 1 according to the seventh embodiment makes it possible to match the amount of the indirect sound component between the user's performance sound and the content sound, and as a result the user can fully feel a sense of unity between the performance sound and the content sound.

なお、図１８に示したように、第７実施形態においても、アコースティック楽器の楽器音又は歌唱音がマイク３から入力されるようにしてもよい。ただし、この場合、マイク３から入力される演奏音に間接音成分が予め含まれる場合があるため、演奏音調整部１５１では、演奏音に含まれる間接音成分を一旦除去した後で、間接音成分付加部１５１Ａによって間接音成分を演奏音に対して付加するようにしてもよい。 As shown in FIG. 18, in the seventh embodiment as well, the instrument sound of an acoustic musical instrument or a singing sound may be input from the microphone 3. In this case, however, the performance sound input from the microphone 3 may already contain an indirect sound component, so the performance sound adjustment unit 151 may first remove the indirect sound component contained in the performance sound and then have the indirect sound component addition unit 151A add an indirect sound component to the performance sound.

［第８実施形態］次に、第８実施形態について説明する。第８実施形態は第７実施形態の変形例である。第８実施形態に係る音処理装置１では、演奏音への間接音の付加の仕方を当該演奏音の種類に応じて変える。 [Eighth Embodiment] Next, the eighth embodiment will be described. The eighth embodiment is a modification of the seventh embodiment. In the sound processing device 1 according to the eighth embodiment, the method of adding the indirect sound to the performance sound is changed according to the type of the performance sound.

図２０は、第８実施形態に係る音処理装置１で実現される機能を示す機能ブロック図であり、図２１は、第８実施形態に係る音処理装置１で実行される処理を示すフロー図である。図２０に示すように、第８実施形態に係る音処理装置１は、演奏音調整部１５１、プリプロセッシング部１５２、間接音成分生成部１５３、ポストプロセッシング部１５４、出力制御部１５５、コンテンツデコード部１５６、間接音成分量解析部１５７、及び演奏音種類特定部１５８を含む。これらの機能ブロックはＣＰＵ１１及び音響信号処理部１５によって実現される。例えば、ＣＰＵ１１がプログラムに従って音響信号処理部１５を制御することによって、上記の機能ブロックが実現される。 FIG. 20 is a functional block diagram showing the functions realized by the sound processing device 1 according to the eighth embodiment, and FIG. 21 is a flow diagram showing the processing executed by the sound processing device 1 according to the eighth embodiment. As shown in FIG. 20, the sound processing device 1 according to the eighth embodiment includes a performance sound adjustment unit 151, a pre-processing unit 152, an indirect sound component generation unit 153, a post-processing unit 154, an output control unit 155, a content decoding unit 156, an indirect sound component amount analysis unit 157, and a performance sound type identification unit 158. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, the functional blocks are realized by the CPU 11 controlling the acoustic signal processing unit 15 according to a program.

第８実施形態に係る音処理装置１は演奏音種類特定部１５８を含み、ステップＳ６２の代わりにステップＳ６２Ａ，Ｓ６２Ｂを含む点で第７実施形態と異なる。以下、第７実施形態との相違点について主に説明する。 The sound processing device 1 according to the eighth embodiment is different from the seventh embodiment in that the performance sound type specifying unit 158 is included and steps S62A and S62B are included instead of step S62. Hereinafter, the differences from the seventh embodiment will be mainly described.

第８実施形態に係る音処理装置１では、演奏音種類特定部１５８は、入力された演奏音の種類を特定する（ステップＳ６２Ａ）。例えば、演奏音種類特定部１５８は、入力された演奏音が楽器音であるか否かを判定する。また、入力された演奏音が楽器音である場合、演奏音種類特定部１５８は楽器音の種類を特定する。すなわち、演奏音種類特定部１５８は、入力された演奏音が複数種類の楽器（例えばギター、バイオリン、又はピアノ等）のうちのいずれの音であるのかを特定する。また例えば、演奏音種類特定部１５８は、入力された演奏音が歌唱音であるか否かを判定する。なお、演奏音の種類を特定する方法としては公知の各種方法を採用することができる。 In the sound processing device 1 according to the eighth embodiment, the performance sound type specifying unit 158 specifies the type of the input performance sound (step S62A). For example, the performance sound type specifying unit 158 determines whether or not the input performance sound is a musical instrument sound. When the input performance sound is a musical instrument sound, the performance sound type specifying unit 158 specifies the type of the musical instrument sound. That is, the performance sound type specifying unit 158 specifies which of the plurality of types of musical instruments (for example, a guitar, a violin, a piano, etc.) the input performance sound is. Further, for example, the performance sound type specifying unit 158 determines whether or not the input performance sound is a singing sound. As a method for specifying the type of performance sound, various known methods can be adopted.

また、第８実施形態の間接音成分付加部１５１Ａは、間接音成分量解析部１５７の解析結果だけでなく、演奏音種類特定部１５８の特定結果にも基づいて、演奏音に対応する間接音成分を当該演奏音に対して付加する（Ｓ６２Ｂ）。すなわち、間接音成分付加部１５１Ａは、間接音成分量解析部１５７の解析結果だけでなく、演奏音種類特定部１５８の特定結果にも基づいて、演奏音に対して付加する間接音成分を設定する。 In addition, the indirect sound component addition unit 151A of the eighth embodiment adds the indirect sound component corresponding to the performance sound to that performance sound on the basis of not only the analysis result of the indirect sound component amount analysis unit 157 but also the identification result of the performance sound type identification unit 158 (S62B). That is, the indirect sound component addition unit 151A sets the indirect sound component to be added to the performance sound on the basis of both the analysis result of the indirect sound component amount analysis unit 157 and the identification result of the performance sound type identification unit 158.

現実の音響空間における演奏音の放射特性は演奏音の種類ごとに異なるため、間接音成分付加部１５１Ａは、演奏音の種類ごとに異なる放射特性を踏まえて、演奏音に対して付加する間接音成分を設定する。 Because the radiation characteristics of a performance sound in a real acoustic space differ depending on the type of performance sound, the indirect sound component addition unit 151A sets the indirect sound component to be added to the performance sound in light of the radiation characteristics of that type of performance sound.

例えば、ギターの楽器音は他の方向に比べて正面方向に放射される傾向があるため、演奏音がギターの楽器音である場合、間接音成分付加部１５１Ａは、正面方向に対応するチャンネルに対して間接音成分（残響成分等）を付加する。または、間接音成分付加部１５１Ａは、正面方向に対応するチャンネルに対して付加する間接音成分の量を、他のチャンネルに対して付加する間接音成分の量よりも大きくする。 For example, the instrument sound of a guitar tends to radiate toward the front more than in other directions. Therefore, when the performance sound is the instrument sound of a guitar, the indirect sound component addition unit 151A adds an indirect sound component (a reverberation component or the like) to the channel corresponding to the front direction. Alternatively, the indirect sound component addition unit 151A makes the amount of the indirect sound component added to the channel corresponding to the front direction larger than the amount added to the other channels.

また例えば、バイオリンの楽器音は他の方向に比べて上方向に放射される傾向があるため、演奏音がバイオリンの楽器音である場合、間接音成分付加部１５１Ａは、上方向に対応するチャンネルに対して間接音成分（残響成分等）を付加する。または、間接音成分付加部１５１Ａは、上方向に対応するチャンネルに対して付加する間接音成分の量を、他のチャンネルに対して付加する間接音成分の量よりも大きくする。 As another example, the instrument sound of a violin tends to radiate upward more than in other directions. Therefore, when the performance sound is the instrument sound of a violin, the indirect sound component addition unit 151A adds an indirect sound component (a reverberation component or the like) to the channel corresponding to the upward direction. Alternatively, the indirect sound component addition unit 151A makes the amount of the indirect sound component added to the channel corresponding to the upward direction larger than the amount added to the other channels.

また例えば、歌唱音は他の方向に比べて正面方向に放射される傾向があるため、演奏音が歌唱音である場合、間接音成分付加部１５１Ａは、正面方向に対応するチャンネルに対して間接音成分（残響成分等）を付加する。または、間接音成分付加部１５１Ａは、正面方向に対応するチャンネルに対して付加する間接音成分の量を、他のチャンネルに対して付加する間接音成分の量よりも多くする。 Further, for example, since the singing sound tends to be radiated in the front direction as compared with other directions, when the playing sound is a singing sound, the indirect sound component addition unit 151A is indirect with respect to the channel corresponding to the front direction. Add sound components (reverberation components, etc.). Alternatively, the indirect sound component addition unit 151A makes the amount of the indirect sound component added to the channel corresponding to the front direction larger than the amount of the indirect sound component added to the other channels.
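The three examples above (guitar, violin, singing) amount to a lookup from the performance sound type to the channel that receives more indirect sound. A minimal Python sketch follows; the channel names, the `emphasis` factor, and the function `indirect_amounts` are hypothetical and chosen only to illustrate the idea.

```python
# Direction emphasized for each performance sound type, per the examples above.
EMPHASIZED_DIRECTION = {
    "guitar": "front",
    "violin": "height",
    "vocal": "front",
}


def indirect_amounts(sound_type, base_amount,
                     channels=("front", "surround", "height"),
                     emphasis=2.0):
    """Return the amount of indirect sound to add per channel.

    base_amount: amount derived from the content sound analysis (S61).
    emphasis: assumed factor by which the emphasized channel exceeds the others.
    Unknown sound types get the same amount on every channel.
    """
    target = EMPHASIZED_DIRECTION.get(sound_type)
    return {ch: base_amount * (emphasis if ch == target else 1.0)
            for ch in channels}
```

Setting the amount for non-emphasized channels to zero instead of `base_amount` would correspond to the first alternative in each example, where the indirect sound component is added only to the channel for the emphasized direction.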

以上に説明した第８実施形態に係る音処理装置１によれば、ユーザの演奏音の放射特性を踏まえて、間接音成分を演奏音に対して付加することが可能になり、より自然な間接音成分を演奏音に対して付加できるようになる。 With the sound processing device 1 according to the eighth embodiment described above, the indirect sound component can be added to the performance sound in light of the radiation characteristics of the user's performance sound, so that a more natural indirect sound component can be added to the performance sound.

［第９実施形態］次に、第９実施形態について説明する。第９実施形態に係る音処理装置１のハードウェア構成は第１実施形態と同様である。また、ユーザの演奏環境は第１実施形態又は第２実施形態と同様である。 [Ninth Embodiment] Next, the ninth embodiment will be described. The hardware configuration of the sound processing device 1 according to the ninth embodiment is the same as that of the first embodiment. The playing environment of the user is the same as that of the first embodiment or the second embodiment.

第６実施形態〜第８実施形態では、演奏音とコンテンツ音とで間接音成分の量を合わせるのに対し、第９実施形態に係る音処理装置１では、演奏音とコンテンツ音とで音色を合わせるようになっている。 In the sixth to eighth embodiments, the amount of the indirect sound component is matched between the performance sound and the content sound. In contrast, the sound processing device 1 according to the ninth embodiment matches the timbre between the performance sound and the content sound.

図２２は、第９実施形態に係る音処理装置１で実現される機能を示す機能ブロック図である。図２２に示すように、第９実施形態に係る音処理装置１は、演奏音調整部１６１、プリプロセッシング部１６２、間接音成分生成部１６３、ポストプロセッシング部１６４、出力制御部１６５、コンテンツデコード部１６６、第１音色解析部１６７、及び第２音色解析部１６８を含む。これらの機能ブロックはＣＰＵ１１及び音響信号処理部１５によって実現される。例えば、ＣＰＵ１１がプログラムに従って音響信号処理部１５を制御することによって、上記の機能ブロックが実現される。 FIG. 22 is a functional block diagram showing a function realized by the sound processing device 1 according to the ninth embodiment. As shown in FIG. 22, the sound processing device 1 according to the ninth embodiment includes a performance sound adjustment unit 161, a preprocessing unit 162, an indirect sound component generation unit 163, a post processing unit 164, an output control unit 165, and a content decoding unit. 166, the first tone color analysis unit 167, and the second tone color analysis unit 168 are included. These functional blocks are realized by the CPU 11 and the acoustic signal processing unit 15. For example, the above functional block is realized by the CPU 11 controlling the acoustic signal processing unit 15 according to a program.

図２３は、第９実施形態に係る音処理装置１で実行される処理を示すフロー図である。以下、図２３を参照しながら各機能ブロックの機能について説明する。 FIG. 23 is a flow chart showing a process executed by the sound processing device 1 according to the ninth embodiment. Hereinafter, the functions of each functional block will be described with reference to FIG. 23.

まず、コンテンツデコード部１６６は、コンテンツ再生装置２から入力されるマルチチャンネルのコンテンツ音をフォーマットデコードすることによって、ＰＣＭ信号に変換する（Ｓ７０）。ステップＳ７０は第３実施形態のステップＳ３０と基本的に同様であり、コンテンツデコード部１６６は第３実施形態のコンテンツデコード部１２６と基本的に同様であるため、ここでは説明を省略する。 First, the content decoding unit 166 converts the multi-channel content sound input from the content playback device 2 into a PCM signal by format decoding (S70). Since step S70 is basically the same as step S30 of the third embodiment and the content decoding unit 166 is basically the same as the content decoding unit 126 of the third embodiment, the description thereof will be omitted here.

第２音色解析部１６８はコンテンツ音の音色を解析する（Ｓ７１）。例えば、コンテンツ音に複数種類の楽器音が含まれる場合に、第２音色解析部１６８は、当該複数種類の楽器音のうちの、演奏音に含まれる種類の楽器音を特定し、当該楽器音の音色を解析する。また例えば、演奏音に歌唱音が含まれる場合に、第２音色解析部１６８はコンテンツ音に含まれる歌唱音の音色を解析する。なお、コンテンツ音に含まれる楽器音又は歌唱音を特定する方法や、楽器音又は歌唱音の音色を解析する方法としては、公知の各種方法を採用することができる。 The second timbre analysis unit 168 analyzes the timbre of the content sound (S71). For example, when the content sound includes a plurality of types of musical instrument sounds, the second timbre analysis unit 168 identifies the type of musical instrument sound included in the performance sound among the plurality of types of musical instrument sounds, and the musical instrument sound. Analyze the tone of. Further, for example, when the performance sound includes a singing sound, the second timbre analysis unit 168 analyzes the timbre of the singing sound included in the content sound. Various known methods can be adopted as a method for identifying the musical instrument sound or the singing sound included in the content sound and a method for analyzing the timbre of the musical instrument sound or the singing sound.

第１音色解析部１６７は演奏音の音色を解析する（Ｓ７２）。第１音色解析部１６７は演奏音に含まれる楽器音又は歌唱音を特定し、当該楽器音又は歌唱音の音色を解析する。演奏音の音色を解析する方法としては公知の各種方法を採用することができる。 The first timbre analysis unit 167 analyzes the timbre of the performance sound (S72). The first timbre analysis unit 167 identifies a musical instrument sound or a singing sound included in the performance sound, and analyzes the timbre of the musical instrument sound or the singing sound. Various known methods can be adopted as a method for analyzing the timbre of the performance sound.

In FIG. 23, steps S70 and S71 and step S72 are shown as being executed in sequence for convenience, but steps S70 and S71 are in fact executed in parallel with step S72.
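The parallel relationship between the content branch (S70, S71) and the performance branch (S72) can be illustrated with a minimal Python sketch. The function names and the placeholder analysis results are hypothetical and are not taken from this specification; a thread pool is only one possible way to run the two branches concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def analyze_content_branch(content_stream):
    """S70 + S71: decode the content sound, then analyze it (placeholders)."""
    pcm = list(content_stream)  # stand-in for format decoding to PCM (S70)
    return {"source": "content", "samples": len(pcm)}  # stand-in for S71


def analyze_performance_branch(performance_pcm):
    """S72: analyze the performance sound (placeholder)."""
    return {"source": "performance", "samples": len(performance_pcm)}


def run_analysis(content_stream, performance_pcm):
    # Submit both branches so that neither waits for the other to finish.
    with ThreadPoolExecutor(max_workers=2) as pool:
        content_future = pool.submit(analyze_content_branch, content_stream)
        performance_future = pool.submit(analyze_performance_branch, performance_pcm)
        # Both results are needed before the comparison in step S73.
        return performance_future.result(), content_future.result()
```

Step S73 compares the two analysis results, so it can begin only after both branches have returned, which is why the sketch waits on both futures.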

The performance sound adjustment unit 161 adjusts the performance sound (S73). Step S73 is basically the same as steps S10 and S20 of the first and second embodiments, and the performance sound adjustment unit 161 is basically the same as the performance sound adjustment units 101 and 111 of the first and second embodiments.

However, the performance sound adjustment unit 161 of the ninth embodiment also serves to adjust the performance sound so that the characteristics of the performance sound and the content sound match. That is, the performance sound adjustment unit 161 includes a timbre adjustment unit 161A, and in step S73 the timbre adjustment unit 161A adjusts the timbre of the performance sound based on a comparison between the analysis result of the first timbre analysis unit 167 and the analysis result of the second timbre analysis unit 168.

For example, suppose the first timbre analysis unit 167 finds that the violin sound (the musical instrument sound of a violin) included in the performance sound has many high-frequency components, while the second timbre analysis unit 168 finds that the violin sound included in the content sound has few. In that case, the timbre adjustment unit 161A reduces the high-frequency components of the violin sound included in the performance sound. In short, when the amount of a specific band component of a musical instrument sound included in the performance sound differs from the amount of that band component of the same type of musical instrument sound included in the content sound, the timbre adjustment unit 161A adjusts the specific band component of the musical instrument sound in the performance sound so that it becomes comparable to that of the content sound.
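The band comparison described above can be sketched as follows. This is only an illustration under stated assumptions: the specification leaves the analysis method to known techniques, so the one-pole band split, the high-to-low energy ratio used as a timbre descriptor, and all function names here are hypothetical choices, not the method of the embodiment.

```python
import math


def split_bands(signal, alpha=0.2):
    """Split a signal into (low, high) bands with a one-pole low-pass filter."""
    low, high, state = [], [], 0.0
    for sample in signal:
        state += alpha * (sample - state)  # low-pass output
        low.append(state)
        high.append(sample - state)  # complementary high band
    return low, high


def energy(band):
    return sum(v * v for v in band)


def high_to_low_ratio(signal, alpha=0.2):
    """Crude timbre descriptor: high-band energy divided by low-band energy."""
    low, high = split_bands(signal, alpha)
    return energy(high) / max(energy(low), 1e-12)


def adjust_performance(performance, content, alpha=0.2):
    """Scale the high band of the performance toward the content's ratio."""
    target = high_to_low_ratio(content, alpha)
    current = high_to_low_ratio(performance, alpha)
    gain = math.sqrt(target / max(current, 1e-12))  # amplitude gain, high band
    low, high = split_bands(performance, alpha)
    adjusted = [l + gain * h for l, h in zip(low, high)]
    return adjusted, gain
```

A gain below 1 corresponds to the violin example, in which the high-frequency components of the performance sound are reduced; a gain above 1 would boost them instead.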

In the sound processing device 1 according to the ninth embodiment, the content sound and the performance sound whose timbre has been adjusted by the timbre adjustment unit 161A are mixed (S74), and the mixed sound is supplied to the pre-processing unit 162. Processing by the pre-processing unit 162, the indirect sound component generation unit 163, and the post-processing unit 164 is then executed based on the mixed sound (S75, S76, S77). Steps S75 to S77 are basically the same as steps S33 to S35 of the third embodiment, and the pre-processing unit 162, the indirect sound component generation unit 163, and the post-processing unit 164 are basically the same as the pre-processing unit 122, the indirect sound component generation unit 123, and the post-processing unit 124 of the third embodiment, so their description is omitted here.

The performance sound whose timbre has been adjusted by the timbre adjustment unit 161A is also supplied to the output control unit 165 via the path 169. The path 169 is the same as the path 119 of the second embodiment.

Like the output control unit 125 of the third embodiment, the output control unit 165 mixes the performance sound supplied via the path 169 (the performance sound whose timbre has been adjusted by the timbre adjustment unit 161A) with the content sound and the indirect sound component supplied from the post-processing unit 164, and outputs the mixed sound to the output unit 14 (S78). The mixed sound output to the output unit 14 is emitted from the speaker 6.
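The signal flow of steps S74 to S78 can be summarized in a short Python sketch. It is a toy model under loud assumptions: the pre-processing, indirect sound component generation, and post-processing of S75 to S77 are collapsed into a single delayed, attenuated echo, and the function names and parameters are hypothetical.

```python
def mix(a, b):
    """Sample-wise sum of two equal-length signals."""
    return [x + y for x, y in zip(a, b)]


def generate_indirect(dry, delay=3, gain=0.5):
    """Toy stand-in for S75-S77: one delayed, attenuated copy (wet signal only)."""
    return [gain * dry[n - delay] if n >= delay else 0.0 for n in range(len(dry))]


def output_mix(adjusted_performance, content, delay=3, gain=0.5):
    # S74: mix the content sound with the timbre-adjusted performance sound.
    pre = mix(adjusted_performance, content)
    # S75-S77 (collapsed): derive the indirect sound component from the mix.
    indirect = generate_indirect(pre, delay, gain)
    # S78: the performance sound (arriving via the separate path), the content
    # sound, and the indirect sound component are mixed for output.
    return [p + c + w for p, c, w in zip(adjusted_performance, content, indirect)]
```

The point of the sketch is that the indirect sound component is generated once from the combined signal, so the performance sound and the content sound share the same simulated reflections.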

With the sound processing device 1 according to the ninth embodiment described above, the user can enjoy the feeling of performing in a hall or the like as one of the performers of the music content. In addition, the sound processing device 1 according to the ninth embodiment makes it possible to match the timbres of the user's performance sound and the content sound, so that the user can fully feel a sense of unity between the two.

Instead of the timbre adjustment unit 161A, which adjusts the timbre of the performance sound, a timbre adjustment unit may be provided that adjusts the timbre of the content sound based on the comparison between the analysis result of the first timbre analysis unit 167 and the analysis result of the second timbre analysis unit 168.

For example, when the first timbre analysis unit 167 finds that the violin sound included in the performance sound has many high-frequency components and the second timbre analysis unit 168 finds that the violin sound included in the content sound has few, this timbre adjustment unit may increase the high-frequency components of the violin sound included in the content sound.

As another example, when the first timbre analysis unit 167 finds that the violin sound (the musical instrument sound of a violin) in the performance sound has many high-frequency components and the second timbre analysis unit 168 finds that a guitar sound in the content sound, which is a musical instrument sound different from the violin sound, also has many high-frequency components, this timbre adjustment unit may reduce the high-frequency components of the guitar sound included in the content sound.

Both the timbre adjustment unit 161A, which adjusts the timbre of the performance sound, and a timbre adjustment unit that adjusts the timbre of the content sound may be provided. For example, when the first timbre analysis unit 167 finds that the violin sound included in the performance sound has many high-frequency components and the second timbre analysis unit 168 finds that the violin sound included in the content sound has few, the high-frequency components of the violin sound included in the performance sound and the high-frequency components of the violin sound included in the content sound may each be adjusted so that the two become comparable.
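When both sounds are adjusted, the correction has to be divided between them somehow. The specification does not say how; the sketch below assumes, purely for illustration, that each sound is described by a high-band energy ratio and that both are moved to the geometric mean of the two ratios. The function name and this splitting rule are hypothetical.

```python
import math


def split_adjustment(performance_ratio, content_ratio):
    """Return (performance_gain, content_gain) for the high band.

    Each gain is an amplitude factor; applying it scales that sound's
    high-band energy ratio to the geometric mean of the two input ratios.
    """
    if performance_ratio <= 0 or content_ratio <= 0:
        raise ValueError("ratios must be positive")
    target = math.sqrt(performance_ratio * content_ratio)
    performance_gain = math.sqrt(target / performance_ratio)
    content_gain = math.sqrt(target / content_ratio)
    return performance_gain, content_gain
```

With a brighter performance sound (a larger ratio), the performance gain falls below 1 and the content gain rises above 1, so each sound is changed less than it would be if only one of them were adjusted.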

[Modifications] The present invention is not limited to the first to ninth embodiments described above.

For example, two or more of the first to ninth embodiments may be combined.

The above description assumes that the sound processing device 1 is an AV receiver, and the functions described above can be realized using an AV receiver. However, the sound processing device 1 may be a device other than an AV receiver, and the functions described above may be realized by such a device. For example, the sound processing device 1 may be built into a speaker. The sound processing device 1 may also be realized by a desktop computer, a laptop computer, a tablet computer, a smartphone, or the like.

[Supplementary Notes] As can be understood from the above description of the embodiments, this specification discloses a variety of technical ideas, including the inventions described below.

A sound processing device according to the present invention includes: input means for accepting input of a user's performance sound; adjusting means for adjusting at least one of the performance sound and a content sound, which is a sound obtained based on content data, so as to match the characteristics of the performance sound and the content sound; generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound; and output control means for outputting, to output means, a sound obtained by mixing the performance sound, at least a direct sound component of the content sound, and the indirect sound component.

In the above invention, the adjusting means may remove the indirect sound component contained in the content sound, and the generation means may generate an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound from which the indirect sound component has been removed by the adjusting means.

In the above invention, the adjusting means may remove the indirect sound component contained in the content sound in response to the input of the performance sound.

The above invention may include analysis means for analyzing the amount of the indirect sound component contained in the content sound. In this case, the adjusting means adds an indirect sound component corresponding to the performance sound to the performance sound and sets the amount of the indirect sound component to be added based on the analysis result of the analysis means, and the generation means generates an indirect sound component corresponding to a sound obtained by mixing the content sound and the performance sound to which the indirect sound component has been added by the adjusting means.

The above invention may include means for identifying the type of musical instrument sound included in the performance sound, and the adjusting means may set the indirect sound component to be added to the performance sound based on the type of musical instrument sound included in the performance sound.

The above invention may include means for determining whether or not the performance sound includes a singing sound, and the adjusting means may set the indirect sound component to be added to the performance sound based on the result of that determination.

The above invention may include first timbre analysis means for analyzing the timbre of a musical instrument sound included in the performance sound and second timbre analysis means for analyzing the timbre of a musical instrument sound included in the content sound, and the adjusting means may adjust the timbre of at least one of the performance sound and the content sound based on a comparison between the analysis result of the first timbre analysis means and the analysis result of the second timbre analysis means.

The above invention may include means for identifying the type of musical instrument sound included in the performance sound, and when the content sound includes a plurality of types of musical instrument sounds, the second timbre analysis means may analyze the timbre of the musical instrument sound, among the plurality of types, whose type is included in the performance sound.

The above invention may include first timbre analysis means for analyzing the timbre of a singing sound included in the performance sound and second timbre analysis means for analyzing the timbre of a singing sound included in the content sound, and the adjusting means may adjust the timbre of at least one of the performance sound and the content sound based on a comparison between the analysis result of the first timbre analysis means and the analysis result of the second timbre analysis means.

1 sound processing device, 2 content playback device, 3 microphone, 4 electronic device, 5 electrical device, 6, 6A, 6B, 6C, 6D, 6E speaker, 11 CPU, 12 memory, 13 input unit, 14 output unit, 15 sound signal processing unit, 16 video signal processing unit, 101, 111, 121, 131, 141, 151, 161 performance sound adjustment unit, 102, 112, 122, 132, 142, 152, 162 pre-processing unit, 103, 113, 123, 133, 143, 153, 163 indirect sound component generation unit, 104, 114, 124, 134, 144, 154, 164 post-processing unit, 105, 115, 125, 135, 145, 155, 165 output control unit, 119, 129, 139, 149, 159, 169 path, 126, 136, 146, 156, 166 content decoding unit, 136A specific component removal unit, 147 content sound adjustment unit, 147A indirect sound component removal unit, 148 input detection unit, 151A indirect sound component addition unit, 157 indirect sound component analysis unit, 158 performance sound type identification unit, 161A timbre adjustment unit, 167 first timbre analysis unit, 168 second timbre analysis unit, U user.

Claims

An input means that accepts the input of the user's performance sound,
An adjusting means for adjusting at least one of the performance sound and the content sound in order to match the characteristics of the performance sound and the content sound which is a sound obtained based on the content data.
A generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound, and
An output control means that outputs a sound obtained by mixing the performance sound, at least a direct sound component of the content sound, and the indirect sound component to the output means,
are included, wherein
the adjusting means removes the indirect sound component contained in the content sound, and
The generation means generates an indirect sound component corresponding to a sound formed by mixing the performance sound and the content sound from which the indirect sound component has been removed by the adjustment means.
A sound processing device characterized by this.

An input means that accepts the input of the user's performance sound,
An adjusting means for adjusting at least one of the performance sound and the content sound in order to match the characteristics of the performance sound and the content sound which is a sound obtained based on the content data.
A generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound, and
An output control means that outputs a sound obtained by mixing the performance sound, at least a direct sound component of the content sound, and the indirect sound component to the output means,
the sound processing device including the above means and
further including an analysis means for analyzing the amount of indirect sound components contained in the content sound, wherein
the adjusting means adds an indirect sound component corresponding to the performance sound to the performance sound, and sets the amount of the indirect sound component added to the performance sound based on the analysis result of the analysis means, and
The generation means generates an indirect sound component corresponding to a sound obtained by mixing the performance sound to which the indirect sound component is added by the adjustment means and the content sound.
A sound processing device characterized by this.

In the sound processing device according to claim 1,
The adjusting means removes the indirect sound component contained in the content sound in response to the input of the performance sound.
A sound processing device characterized by this.

In the sound processing device according to claim 2,
Means for identifying the type of musical instrument sound included in the performance sound is included, and
The adjusting means sets an indirect sound component to be added to the performance sound based on the type of musical instrument sound included in the performance sound.
A sound processing device characterized by this.

In the sound processing device according to claim 2 or 4,
Means for determining whether or not the performance sound includes a singing sound is included, and
The adjusting means sets an indirect sound component to be added to the performance sound based on the determination result of whether or not the performance sound includes the singing sound.
A sound processing device characterized by this.

In the sound processing device according to any one of claims 1 to 5,
A first timbre analysis means for analyzing the timbre of the musical instrument sound included in the performance sound, and
A second timbre analysis means for analyzing the timbre of the musical instrument sound included in the content sound are included, and
The adjusting means adjusts the timbre of at least one of the performance sound and the content sound based on the comparison between the analysis result of the first timbre analysis means and the analysis result of the second timbre analysis means.
A sound processing device characterized by this.

In the sound processing device according to claim 6,
Means for identifying the type of musical instrument sound included in the performance sound is included, and
When the content sound includes a plurality of types of musical instrument sounds, the second timbre analysis means analyzes the timbre of the type of musical instrument sound included in the performance sound among the plurality of types of musical instrument sounds.
A sound processing device characterized by this.

In the sound processing device according to any one of claims 1 to 5,
A first timbre analysis means for analyzing the timbre of the singing sound included in the performance sound, and
A second timbre analysis means for analyzing the timbre of the singing sound included in the content sound are included, and
The adjusting means adjusts the timbre of at least one of the performance sound and the content sound based on the comparison between the analysis result of the first timbre analysis means and the analysis result of the second timbre analysis means.
A sound processing device characterized by this.

An adjustment step of adjusting at least one of the performance sound and the content sound in order to match the characteristics of the user's performance sound and the content sound obtained based on the content data.
A generation step of generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound, and
An output control step that outputs a sound obtained by mixing the performance sound, at least the direct sound component of the content sound, and the indirect sound component to the output means,
are included, wherein
in the adjustment step, the indirect sound component contained in the content sound is removed, and
In the generation step, the indirect sound component corresponding to the sound formed by mixing the performance sound and the content sound from which the indirect sound component has been removed in the adjustment step is generated.
A sound processing method characterized by that.

An adjusting means for adjusting at least one of the performance sound and the content sound in order to match the characteristics of the user's performance sound and the content sound obtained based on the content data.
A generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound, and
An output control means that outputs a sound obtained by mixing the performance sound, at least a direct sound component of the content sound, and the indirect sound component to the output means,
the program causing a computer to function as the above means, wherein
the adjusting means removes the indirect sound component contained in the content sound, and
The generation means generates an indirect sound component corresponding to a sound formed by mixing the performance sound and the content sound from which the indirect sound component has been removed by the adjustment means.
A program characterized by that.

An adjustment step of adjusting at least one of the performance sound and the content sound in order to match the characteristics of the user's performance sound and the content sound obtained based on the content data.
A generation step of generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound, and
An output control step that outputs a sound obtained by mixing the performance sound, at least the direct sound component of the content sound, and the indirect sound component to the output means,
the sound processing method including the above steps and
further including an analysis step of analyzing the amount of indirect sound components contained in the content sound, wherein
in the adjustment step, an indirect sound component corresponding to the performance sound is added to the performance sound, and the amount of the indirect sound component added to the performance sound is set based on the analysis result of the analysis step, and
In the generation step, an indirect sound component corresponding to a sound obtained by mixing the performance sound to which the indirect sound component is added in the adjustment step and the content sound is generated.
A sound processing method characterized by that.

An adjusting means for adjusting at least one of the performance sound and the content sound in order to match the characteristics of the user's performance sound and the content sound obtained based on the content data.
A generation means for generating an indirect sound component corresponding to a sound obtained by mixing the performance sound and the content sound, and
An output control means that outputs a sound obtained by mixing the performance sound, at least a direct sound component of the content sound, and the indirect sound component to the output means,
the program causing a computer to function as the above means and
further as an analysis means for analyzing the amount of indirect sound components contained in the content sound, wherein
the adjusting means adds an indirect sound component corresponding to the performance sound to the performance sound, and sets the amount of the indirect sound component added to the performance sound based on the analysis result of the analysis means, and
The generation means generates an indirect sound component corresponding to a sound obtained by mixing the performance sound to which the indirect sound component is added by the adjustment means and the content sound.
A program characterized by that.