JP2022063777A

JP2022063777A - Performance information prediction device, effective string vibration determination model training device, performance information generation system, performance information prediction method, and effective string vibration determination model training method

Info

Publication number: JP2022063777A
Application number: JP2020172194A
Authority: JP
Inventors: 博毅佐藤; Hirotake Sato
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2020-10-12
Filing date: 2020-10-12
Publication date: 2022-04-22

Abstract

To provide a technique capable of precisely converting the performance of an electronic stringed instrument into a piece of performance information.SOLUTION: A performance information prediction device in an embodiment includes: a pre-processing unit that generates a piece of volume-pitch featured data including volume information and pitch information of each string from string vibration waveform data representing the performance of a stringed instrument; and a performance information prediction unit that predicts the performance information of the performance of the stringed instrument from the volume-pitch featured data using a trained effective string vibration determination model.SELECTED DRAWING: Figure 1

Description

本開示は、演奏情報予測装置、有効弦振動判定モデル訓練装置、演奏情報生成システム、演奏情報予測方法及び有効弦振動判定モデル訓練方法に関する。 The present disclosure relates to a performance information prediction device, an effective string vibration determination model training device, a performance information generation system, a performance information prediction method, and an effective string vibration determination model training method.

ギターのような楽器の弦振動波形を、マグネティックピックアップもしくはピエゾピックアップによって電気信号化し、そのピッチや音量を分析することで、ＭＩＤＩ（ＭｕｓｉｃａｌＩｎｓｔｒｕｍｅｎｔａｌＤｉｇｉｔａｌＩｎｔｅｒｆａｃｅ）メッセージ等のデジタル演奏データに変換するギターコントローラ（もしくはギターシンセサイザー）という電子楽器が存在する。このようなタイプのコントローラは音源を鳴らすためのみの専用ギターコントローラと異なり、通常のギターのシェイプや機能を残しつつ、そこに演奏情報取得用の各弦に独立のピックアップを搭載することでＭＩＤＩ演奏も可能となるという大きなメリットがあり、最も一般的な形態であると言える。 A guitar controller (a guitar controller) that converts the string vibration waveform of an instrument such as a guitar into an electric signal by a magnetic pickup or a piezo pickup, and analyzes the pitch and volume to convert it into digital performance data such as a MIDI (Musical Instrumental Digital Interface) message. Or there is an electronic musical instrument called a guitar synthesizer). Unlike a dedicated guitar controller that only sounds a sound source, this type of controller retains the shape and functions of a normal guitar, but by mounting an independent pickup on each string for acquiring performance information, MIDI performance is performed. It can be said that it is the most common form because it has a great merit that it can be used.

特開平９－６３３９号公報Japanese Unexamined Patent Publication No. 9-6339 特開２０００－１０５５９０号公報Japanese Unexamined Patent Publication No. 2000-105590

このような非鍵盤楽器において、長年解決されていない大きな問題の１つは、演奏者が意図せずに発生させた弦振動も区別なく演奏情報に変換されてしまうことである。例えば、３弦を押さえて演奏している際に、開放されている４弦に手のひらが触れてしまって、わずかに振動してしまったり、１弦をピックで弾いた際に、勢い余って２弦にも接触してしまい、僅かな間、振動が発生してしまうということは、スキルの高い演奏者の演奏でも度々起きることある。必要な弦振動だけ残して不要な弦振動を全てミュートすることができれば理想的ではあるが、演奏によってはほぼ不可能なケースも多々ある。 One of the major problems that has not been solved for many years in such non-keyboard instruments is that string vibrations unintentionally generated by the performer are also converted into performance information without distinction. For example, when playing while holding down the 3rd string, the palm of the hand touches the open 4th string and it vibrates slightly, or when the 1st string is picked, the momentum is 2 The fact that the strings also come into contact with each other and vibrate for a short period of time often occurs even in the performances of highly skilled performers. It would be ideal if all unnecessary string vibrations could be muted, leaving only the necessary string vibrations, but there are many cases where it is almost impossible depending on the performance.

また、ギターコントローラ特有の問題もあり、これらの現象は、通常のギターの生音や電気音による演奏と異なり、ギターコントローラとして使用する際には以下に示すような理由により問題がより深刻になる。 There are also problems peculiar to the guitar controller, and these phenomena become more serious when used as a guitar controller for the following reasons, unlike the performance by the live sound or the electric sound of a normal guitar.

ケース１：ダイナミックレンジの狭い演奏
エレクトリックギターでも意図的に音量に強弱をつけて演奏することもあるが、むしろディストーションサウンドのように演奏音の大小にかかわらず同じ音の大きさで発音することが好まれることも多い。しかし、通常のエレクトリックギターでは全弦の信号がミックスされてディストーションを掛けるために、小さい開放弦の音は振幅の大きい弦の音にかき消されてしまうため、さほど問題にならない。ところがギターコントローラを使った演奏でシンセサイザーなどを鳴らす場合は、遥かに問題が大きくなる。シンセサイザーの発音も強弱を付けないことは珍しくなく、弦ごとに独立した音源をもつギターシンセサイザーを発音させる場合は、僅かな弦振動の開放弦と、普通に演奏した弦の振動に対する発音が同じ音量になってしまうために、不要な弦振動の楽音が意図的に演奏した楽音と同じくらいの音量で再生されてしまい、演奏に大きく支障を来す。 Case 1: Performance with a narrow dynamic range Even with an electric guitar, the volume may be intentionally adjusted to a different volume, but rather, like a distortion sound, it can be pronounced at the same volume regardless of the volume of the performance sound. Often preferred. However, in a normal electric guitar, the signals of all strings are mixed and distorted, so the sound of a small open string is drowned out by the sound of a string with a large amplitude, so this is not a problem. However, when playing with a guitar controller and playing a synthesizer, the problem becomes much bigger. It is not uncommon for synthesizer sounds to be unbalanced, and when playing a guitar synthesizer that has an independent sound source for each string, the sound of the open strings with slight string vibration and the sound of the vibration of the normally played strings are the same. Therefore, the musical sound of unnecessary string vibration is reproduced at the same volume as the music sound that was intentionally played, which greatly hinders the performance.

ケース２：表示や演奏解析
ギターコントローラは音源を鳴らすだけではなく、演奏を譜面表示させたり、コード判別させたり、フレーズをコンピュータに入力して曲データを作成したりと、いろいろな用途に演奏情報を使用する可能性がある。この際、全く不要な音符情報が混入してしまうと、非常に見にくい譜面になったり、全く誤ったコードと解釈されたりと、致命的な不具合になりかねない。これを解決するために記録された演奏情報から不要情報を個々に取り除く必要があり、大きな手間がかかってしまったりする。 Case 2: Display and performance analysis The guitar controller not only plays the sound source, but also displays the performance on the score, discriminates the chords, inputs the phrase into the computer to create song data, and so on. May be used. At this time, if completely unnecessary note information is mixed in, the score may be very difficult to see, or it may be interpreted as a completely wrong chord, which may be a fatal problem. In order to solve this, it is necessary to individually remove unnecessary information from the recorded performance information, which may take a lot of time and effort.

ケース３：モノフォニックモード
ギターの弦は複数本あって和音が演奏できるが、ソロやメロディーの演奏のために、敢えてモノフォニック（単音）で演奏したいケースがある。特にポルタメント効果をかけて別の音程に滑らかに移動するレガートのフレーズを演奏することもある。鍵盤楽器においては同様の理由から和音を演奏することができるポリフォニックの鍵盤楽器で、敢えて単音しか発生しないモノフォニックモードを用意しているものも多い。ギターシンセサイザーでも、そのようなモノフォニックモードに対応した音源を内蔵しているものや、外部にＭＩＤＩ信号を送信することで外部のモノフォニック音源をドライブするケースもある。鍵盤を複数押鍵しても１つのキーのみが有効になるのと同様に、ギターのモノフォニックモードの場合も複数の弦が同時に振動しても有効なのはいずれか一本の弦振動となる。鍵盤の場合は高音優先、低音優先、後着優先などのルールによって１音が決定するのが一般的であるが、ギターコントローラの場合は後着優先のケースが多い。ところが、ギターの場合は鍵盤楽器と比較するとかけ離れた音程のノートが不用意に発生するケースがあるために、単音フレーズの中に不必要なノートが混入してしまうと演奏フレーズが大きく崩れてしまう。つまり別の発音が消音されて、その不用意なノートの発音に置き換わってしまうのである。さらに影響が大きいのは、前述したポルタメント効果を掛けている際の演奏である。例えば、１弦、２弦のハイポジションを使った演奏をしている際に、遠く離れた６弦の開放弦の音が一瞬振動するということはよくあることであるが、この場合には発音中のピッチが全く意図しない低い方向に突然移動してしまい、次に正しいノートを発生してピッチが戻ってくるまでの間に無意味かつ大きなピッチ変化が発生してしまう。 Case 3: Monophonic mode There are multiple strings on a guitar and you can play chords, but there are cases where you want to play monophonically (single note) for solo or melody performance. In particular, he sometimes plays legato phrases that move smoothly to another pitch with a portamento effect. Many keyboard instruments are polyphonic keyboard instruments that can play chords for the same reason, and have a monophonic mode that dares to generate only a single note. Some guitar synthesizers have a built-in sound source that supports such a monophonic mode, and some guitar synthesizers drive an external monophonic sound source by transmitting a MIDI signal to the outside. Just as even if you press multiple keys, only one key is valid, even if multiple strings vibrate at the same time in the monophonic mode of the guitar, only one string vibration is valid. In the case of a keyboard, one note is generally determined by rules such as high-pitched tone priority, low-pitched tone priority, and late-arrival priority, but in the case of a guitar controller, there are many cases where late-arrival priority is given. However, in the case of a guitar, there are cases where notes with pitches that are far from each other are inadvertently generated compared to keyboard instruments, so if unnecessary notes are mixed in a single note phrase, the performance phrase will be greatly destroyed. .. In other words, another pronunciation is muted and replaced by the inadvertent pronunciation of the note. Even more influential is the performance when the portamento effect mentioned above is applied. For example, when playing using the high position of the 1st and 2nd strings, it is common for the sound of the open string of the 6th string, which is far away, to vibrate for a moment. The pitch inside suddenly moves in a completely unintended low direction, and a meaningless and large pitch change occurs between the next correct note and the return of the pitch.

従来、一般的に従来のギターコントローラでは、これらの不必要な弦振動に対して弦の信号レベルを監視して、単純に所定の音量以下の音を無効にするという対策が取られている。 Conventionally, in general, in a conventional guitar controller, a measure is taken to monitor the signal level of a string against these unnecessary string vibrations and simply invalidate a sound below a predetermined volume.

しかしながら、この対策の副作用として、弱いピッキングや音量の小さいタッピングなどによる奏法の楽音も一緒に無視されるということである。発音すべきものが発音されないというのは、深刻な問題であり、高速なフレーズ演奏などに対して大きな障害となっているのが現状である。 However, as a side effect of this measure, the musical tone of the playing style due to weak picking and low volume tapping is also ignored. The fact that what should be pronounced is not pronounced is a serious problem, and the current situation is that it is a major obstacle to high-speed phrase performance.

上記課題を鑑み、本開示の課題は、電子弦楽器の演奏を演奏情報に高精度に変換するための技術を提供することである。 In view of the above problems, an object of the present disclosure is to provide a technique for converting the performance of an electronic stringed instrument into performance information with high accuracy.

上記課題を解決するため、本開示の一態様は、弦楽器演奏を表す弦振動波形データから各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを生成する前処理部と、訓練済み有効弦振動判定モデルを利用して、前記音量ピッチ特徴化データから前記弦楽器演奏の演奏情報を予測する演奏情報予測部と、を有する演奏情報予測装置に関する。 In order to solve the above problems, one aspect of the present disclosure includes a preprocessing unit that generates volume pitch characterization data including volume information and pitch information of each string from string vibration waveform data representing stringed instrument performance, and a trained effective string. The present invention relates to a performance information prediction device including a performance information prediction unit that predicts performance information of the stringed instrument performance from the volume pitch characterization data using a vibration determination model.

本開示によると、電子弦楽器の演奏を演奏情報に高精度に変換することができる。 According to the present disclosure, the performance of an electronic stringed instrument can be converted into performance information with high accuracy.

本開示の一実施例によるギターコントローラを示す概略図である。It is a schematic diagram which shows the guitar controller by one Example of this disclosure. 本開示の一実施例による演奏情報の構成を示す図である。It is a figure which shows the structure of the performance information by one Example of this disclosure. 本開示の一実施例によるＴＡＢ譜を示す図である。It is a figure which shows the tablature by one Example of this disclosure. 本開示の一実施例によるギターコントローラの外観を示す図である。It is a figure which shows the appearance of the guitar controller by one Example of this disclosure. 本開示の一実施例によるギターのハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware composition of the guitar by one Example of this disclosure. 本開示の一実施例による制御装置のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware composition of the control device by one Embodiment of this disclosure. 本開示の一実施例による演奏情報予測装置及び有効弦振動判定モデル訓練装置の機能構成を示すブロック図である。It is a block diagram which shows the functional structure of the performance information prediction apparatus and the effective string vibration determination model training apparatus by one Embodiment of this disclosure. 本開示の一実施例による音量ピッチ特徴化データフレームを示す概略図である。It is a schematic diagram which shows the volume pitch characteristic data frame by one Example of this disclosure. 本開示の一実施例による音量検出及びピッチ検出を示す概略図である。It is a schematic diagram which shows the volume detection and the pitch detection by one Example of this disclosure. 本開示の一実施例による演奏情報生成処理を示す概略図である。It is a schematic diagram which shows the performance information generation processing by one Example of this disclosure. 本開示の一実施例による有効弦振動判定モデルのアーキテクチャを示す図である。It is a figure which shows the architecture of the effective string vibration determination model by one Example of this disclosure. 本開示の他の実施例による有効弦振動判定モデルのアーキテクチャを示す図である。It is a figure which shows the architecture of the effective string vibration determination model by another Example of this disclosure. 本開示の一実施例による演奏情報予測処理を示すフローチャートである。It is a flowchart which shows the performance information prediction processing by one Example of this disclosure. 本開示の一実施例による有効弦振動判定モデル訓練装置の動作を示す概略図である。It is a schematic diagram which shows the operation of the effective string vibration determination model training apparatus by one Example of this disclosure. 本開示の一実施例による有効弦振動判定モデル訓練処理を示すフローチャートである。It is a flowchart which shows the effective string vibration determination model training process by one Example of this disclosure.

以下の実施例では、ギターの演奏によって生成される弦振動波形から演奏情報（例えば、ＭＩＤＩメッセージなど）を生成するギターコントローラが開示される。なお、本開示は、ギターコントローラに限定されず、弦振動波形抽出機能を備えた弦楽器の演奏から演奏情報を生成する他の何れかの演奏情報生成装置に適用されてもよい。
［本開示の概要］
後述される実施例を概略すると、図１に示されるように、本開示の一実施例によるギターコントローラ１０は、ギター５０及び制御装置１００を有する。ギターコントローラ１０は、ニューラルネットワークなどの機械学習モデルとして実現される有効弦振動判定モデルを利用して、ギター５０の演奏によって生成される弦振動波形から演奏情報を生成する。 In the following examples, a guitar controller that generates performance information (for example, a MIDI message) from a string vibration waveform generated by playing a guitar is disclosed. The present disclosure is not limited to the guitar controller, and may be applied to any other performance information generation device that generates performance information from the performance of a stringed instrument having a string vibration waveform extraction function.
[Summary of this disclosure]
To outline the examples described below, as shown in FIG. 1, the guitar controller 10 according to the embodiment of the present disclosure includes a guitar 50 and a control device 100. The guitar controller 10 uses an effective string vibration determination model realized as a machine learning model such as a neural network to generate performance information from the string vibration waveform generated by the performance of the guitar 50.

本開示の一実施例による演奏情報は、図２に示されるように、発音情報、消音情報及びピッチ変更情報の演奏種別を示す。 As shown in FIG. 2, the performance information according to the embodiment of the present disclosure indicates the performance type of the pronunciation information, the muffling information, and the pitch change information.

発音情報は、分類モデルとしての有効弦振動判定モデルによって判別される意図的な撥弦と、エンベロープ検出による撥弦の強さとを示す。発音をＭＩＤＩメッセージにより表現する場合、意図的な撥弦のノート番号及び撥弦の強さはＮｏｔｅＯｎ：０ｘ９ｎ，ｋｋ，ｖｖによって表してもよい。 The pronunciation information indicates the intentional string repellency determined by the effective string vibration determination model as a classification model and the strength of the string repellency by envelope detection. When the pronunciation is expressed by a MIDI message, the note number of the intentional string repellent and the strength of the string repellent may be expressed by Note On: 0x9n, kk, vv.

また、消音情報は、エンベロープ検出によって検出され、０）発音停止及び１）置き換えを表す。消音をＭＩＤＩメッセージにより表現する場合、ＮｏｔｅＯｆｆ：０ｘ８ｎ，ｋｋ，ｖｖによって表してもよい。 Further, the muffling information is detected by the envelope detection and represents 0) sound stop and 1) replacement. When the muffling is expressed by a MIDI message, it may be expressed by Note Off: 0x8n, kk, vv.

また、ピッチ変更情報は、ゼロクロスカウントによって検出される。ピッチ変化をＭＩＤＩメッセージにより表現する場合、ＰｉｔｃｈＢｅｎｄ：０ｘＥｎ，ｌｌ，ｍｍによって表してもよい。 In addition, the pitch change information is detected by zero cross count. When the pitch change is expressed by a MIDI message, it may be expressed by Pitch Bend: 0xEn, ll, mm.

図１に示された実施例では、ギターコントローラ１０は、有効弦振動判定モデルを訓練する訓練モードと、訓練した有効弦振動判定モデルを利用して演奏情報を予測する演奏モードとの２つの動作モードを有し、制御装置１００は、訓練モードにおいて利用される有効弦振動判定モデル訓練装置２００と、演奏モードにおいて利用される演奏情報予測装置３００とを有する。 In the embodiment shown in FIG. 1, the guitar controller 10 has two operations, a training mode for training an effective string vibration determination model and a performance mode for predicting performance information using the trained effective string vibration determination model. The control device 100 has a mode, an effective string vibration determination model training device 200 used in the training mode, and a performance information prediction device 300 used in the performance mode.

まず、訓練モードにおいて、ギターコントローラ１０は、訓練用演奏情報データベース８０から訓練データを取得する。訓練データは、例えば、楽譜データ（例えば、ＴＡＢ譜など）と、当該楽譜データに対応するＭＩＤＩファイルとのペアから構成される。ＴＡＢ譜は、例えば、図３に示されるような周知の記法に従って記述されたものであってもよい。ユーザが取得した訓練用楽譜データの楽譜に従ってギター５０を演奏すると、有効弦振動判定モデル訓練装置２００は、ユーザの演奏に基づきギター５０によって生成された弦振動情報を訓練対象の有効弦振動判定モデルに入力し、有効弦振動判定モデルから出力される演奏情報としてのＭＩＤＩメッセージと訓練用ＭＩＤＩファイルとを比較し、これらの誤差が小さくなるように有効弦振動判定モデルを訓練する。本開示では、弦振動波形データから抽出された音量及びピッチに基づき特徴化された音量ピッチ特徴化データを利用して、有効弦振動判定モデルによって有効と判定された弦振動から演奏情報を取得する。訓練が終了すると、有効弦振動判定モデル訓練装置２００は、訓練した有効弦振動判定モデルを演奏情報予測装置３００に提供する。 First, in the training mode, the guitar controller 10 acquires training data from the training performance information database 80. The training data is composed of, for example, a pair of musical score data (for example, TAB score) and a MIDI file corresponding to the musical score data. The tablature may be, for example, written according to a well-known notation as shown in FIG. When the guitar 50 is played according to the score of the training score data acquired by the user, the effective string vibration determination model training device 200 uses the string vibration information generated by the guitar 50 based on the user's performance as the effective string vibration determination model to be trained. The MIDI message as performance information output from the effective string vibration determination model is compared with the training MIDI file, and the effective string vibration determination model is trained so that these errors are reduced. In the present disclosure, performance information is acquired from the string vibration determined to be valid by the effective string vibration determination model by using the volume pitch characteristic data characterized based on the volume and pitch extracted from the string vibration waveform data. .. When the training is completed, the effective string vibration determination model training device 200 provides the trained effective string vibration determination model to the performance information prediction device 300.

次に、演奏モードでは、ユーザがギター５０を演奏すると、演奏情報予測装置３００は、ユーザの演奏に基づきギター５０によって生成された弦振動情報を訓練済み有効弦振動判定モデルに入力し、ＭＩＤＩメッセージなどの演奏情報を取得する。取得した演奏情報は、例えば、外部の再生装置やコンピュータに送信され、ユーザは、再生装置を介してユーザによる演奏を再生したり、演奏情報をコンピュータ上で利用できる。 Next, in the performance mode, when the user plays the guitar 50, the performance information prediction device 300 inputs the string vibration information generated by the guitar 50 based on the user's performance into the trained effective string vibration determination model, and inputs the MIDI message. Acquire performance information such as. The acquired performance information is transmitted to, for example, an external playback device or a computer, and the user can play back the performance by the user via the playback device or use the performance information on the computer.

これにより、電子弦楽器の演奏を演奏情報に変換する際の不要な弦振動を排除し、有効な弦振動による演奏情報を生成することが可能になる。 This makes it possible to eliminate unnecessary string vibration when converting the performance of an electronic stringed instrument into performance information, and to generate performance information based on effective string vibration.

なお、以下に説明する実施例によるギターコントローラ１０は、有効弦振動判定モデル訓練装置２００を有するが、本開示はこれに限定されず、例えば、有効弦振動判定モデルは、外部のコンピュータやサーバによって訓練され、訓練された有効弦振動判定モデル及び／又は有効弦振動判定モデルの更新情報が外部のコンピュータやサーバから演奏情報予測装置３００に提供されてもよい。
［ハードウェア構成］
次に、図４を参照して、ギターコントローラ１０の物理的構成を説明する。図４は、本開示の一実施例によるギターコントローラ１０の外観を示す図である。 The guitar controller 10 according to the embodiment described below has an effective string vibration determination model training device 200, but the present disclosure is not limited to this. For example, the effective string vibration determination model is based on an external computer or server. Trained and trained effective string vibration determination model and / or update information of the effective string vibration determination model may be provided to the performance information prediction device 300 from an external computer or server.
[Hardware configuration]
Next, the physical configuration of the guitar controller 10 will be described with reference to FIG. FIG. 4 is a diagram showing the appearance of the guitar controller 10 according to the embodiment of the present disclosure.

図４に示されるように、ギターコントローラ１０は、相互接続されたギター５０と制御装置１００とから構成されるセパレートタイプの演奏情報生成システムである。 As shown in FIG. 4, the guitar controller 10 is a separate type performance information generation system including an interconnected guitar 50 and a control device 100.

ギター５０は、通常のエレクトリックギターに、６つの弦の各弦の独立した振動を拾うためのヘクサディバイデッドピックアップ、演奏情報の音量をコントロールするためのＭＩＤＩボリューム、制御装置１００に対してパッチメモリ番号の上下切り替えを行うためのアップダウンスイッチ、を搭載したものである。これらの情報とノーマルなピックアップの出力が、マルチケーブルによって制御装置１００に送信されている。また、電源は制御装置１００からマルチケーブル経由で供給される。本実施例のヘクサディバイデッドピックアップは、ノーマルピックアップと同じマグネティック（磁気）ピックアップである。 The guitar 50 is a normal electric guitar, a hexadivided pickup for picking up the independent vibration of each of the six strings, a MIDI volume for controlling the volume of performance information, and a patch memory for the control device 100. It is equipped with an up / down switch for switching the number up and down. This information and the output of the normal pickup are transmitted to the control device 100 by the multi-cable. Further, the power supply is supplied from the control device 100 via the multi-cable. The hexadivided pickup of this embodiment is the same magnetic pickup as the normal pickup.

一方、制御装置１００は、ギターの弦振動の入力を受け、ＭＩＤＩフォーマットによる演奏情報を生成する。演奏情報の送信先は、限定することなく、音源ユニットやコンピュータ等であってもよい。制御装置１００は、図１に示されるように、各種設定を記憶したパッチメモリのバンク番号とナンバーを切り替えるフットスイッチ、任意の演奏メッセージを割り当てて送信することができるＣＯＮＴＲＯＬスイッチとフットペダルを有する。現在選択されているパッチメモリの番号はＢＡＮＫ／ＮＵＭ画面に表示される。メインの表示デバイスとしてＬＣＤがあるが、画面上にはタッチパネルが装着される。また、データを入力する際のロータリエンコーダもパネル上に装備される。端子として、ギター５０からのマルチケーブルの入力端子ＧＵＩＴＡＲＩＮＰＵＴ、ノーマルピックアップのオーディオ出力端子ＧＵＩＴＡＲＯＵＴ、ＭＩＤＩ演奏信号の出力端子ＭＩＤＩＯＵＴ、ホストコンピュータとの接続端子ＵＳＢｔｏＨＯＳＴ、ＡＣ電源入力端子ＡＣＰＯＷＥＲが備えられる。 On the other hand, the control device 100 receives the input of the string vibration of the guitar and generates the performance information in the MIDI format. The destination of the performance information may be a sound source unit, a computer, or the like without limitation. As shown in FIG. 1, the control device 100 includes a foot switch for switching a bank number and a number of a patch memory storing various settings, a CONTROL switch capable of assigning and transmitting an arbitrary performance message, and a foot pedal. The currently selected patch memory number is displayed on the BANK / NUM screen. There is an LCD as the main display device, but a touch panel is mounted on the screen. In addition, a rotary encoder for inputting data is also installed on the panel. As terminals, the input terminal of the multi-cable from the guitar 50, GUITAR INPUT, the audio output terminal of the normal pickup, GUITAR OUT, the output terminal of the MIDI performance signal, MIDI OUT, the connection terminal with the host computer, USB to HOST, and the AC power input terminal AC POWER. Be prepared.

次に、図５を参照して、本開示の一実施例によるギター５０のハードウェア構成を説明する。図５は、本開示の一実施例によるギター５０のハードウェア構成を示すブロック図である。 Next, with reference to FIG. 5, the hardware configuration of the guitar 50 according to the embodiment of the present disclosure will be described. FIG. 5 is a block diagram showing a hardware configuration of the guitar 50 according to an embodiment of the present disclosure.

図５に示されるように、ギター５０は、ヘクサディバイデッドピックアップのバッファアンプを通した信号、ＭＩＤＩボリュームコントロール、パッチメモリのアップダウンスイッチ、そしてノーマルピックアップの信号がマルチケーブルで制御装置１００に送信される。３つのノーマルピックアップはピックアップセレクターで選択され、トーンコントロール回路と、ボリュームコントロール回路を経てバッファアンプを通過したものが制御装置１００に送信される。 As shown in FIG. 5, in the guitar 50, the signal through the buffer amplifier of the hexadivided pickup, the MIDI volume control, the up / down switch of the patch memory, and the signal of the normal pickup are transmitted to the control device 100 by a multi-cable. Will be done. The three normal pickups are selected by the pickup selector, and those that have passed through the buffer amplifier via the tone control circuit and the volume control circuit are transmitted to the control device 100.

次に、図６を参照して、本開示の一実施例による制御装置１００のハードウェア構成を説明する。図６は、本開示の一実施例による制御装置のハードウェア構成を示すブロック図である。 Next, with reference to FIG. 6, the hardware configuration of the control device 100 according to the embodiment of the present disclosure will be described. FIG. 6 is a block diagram showing a hardware configuration of a control device according to an embodiment of the present disclosure.

図６に示されるように、制御装置１００は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）及びＤＳＰ（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ）から構成され、ＣＰＵは制御装置１００全体の機能及び処理を管理し、ＤＳＰは高速処理が必要な波形解析処理を実行する。ＣＰＵのバスには、ＣＰＵが使用するＲＡＭ、ＦｌａｓｈＲＯＭ、ＬＣＤをコントロールするＬＣＤコントローラ、各種Ｉ／Ｏデバイスと接続されるＩ／Ｏインターフェース、ＤＳＰ、ＵＳＢインターフェース、及びＭＩＤＩインターフェースが接続される。さらに、Ｉ／Ｏインターフェースにはフットスイッチ、ロータリエンコーダ、ＬＣＤタッチパネル、ギター５０のＭＩＤＩボリュームと、制御装置１００のペダルの位置を検出するためのＡ／Ｄコンバータ、パッチメモリの番号表示用のＬＥＤが接続される。１つのＡ／Ｄコンバータしか図示されていないが、マルチプレクサによって入力ソースを時分割で切り替えて値を読み込んでいる。専用のＲＡＭとＦｌａｓｈＲＯＭが接続されているＤＳＰには、ヘクサディバイデッドピックアップの６つの弦の出力を高速にデジタル信号化するための独立したＡ／Ｄコンバータが接続されており高速な解析処理を行うことができる。 As shown in FIG. 6, the control device 100 is composed of a CPU (Central Processing Unit) and a DSP (Digital Signal Processor), the CPU manages the functions and processing of the entire control device 100, and the DSP requires high-speed processing. Perform various waveform analysis processes. The RAM used by the CPU, the Flash ROM, the LCD controller that controls the LCD, the I / O interface connected to various I / O devices, the DSP, the USB interface, and the MIDI interface are connected to the bus of the CPU. Further, the I / O interface includes a foot switch, a rotary encoder, an LCD touch panel, a MIDI volume of the guitar 50, an A / D converter for detecting the position of the pedal of the control device 100, and an LED for displaying the number of the patch memory. Be connected. Although only one A / D converter is shown, the value is read by switching the input source in time division by a multiplexer. An independent A / D converter for converting the output of the six strings of the hex divided pickup into a high-speed digital signal is connected to the DSP to which the dedicated RAM and Flash ROM are connected, and high-speed analysis processing is performed. It can be performed.

しかしながら、ギター５０及び制御装置１００は、上述したハードウェア構成に限定されるものでなく、他の何れか適切なハードウェア構成により実現されてもよい。
［演奏情報予測装置］
次に、図７～１２を参照して、本開示の一実施例による演奏情報予測装置３００を説明する。演奏情報予測装置３００は、有効弦振動判定モデル訓練装置２００によって訓練された有効弦振動判定モデルを利用して、演奏者によるギター５０の演奏から演奏情報（例えば、ＭＩＤＩファイルなど）を予測する。具体的には、演奏情報予測装置３００は、ギター５０からギター演奏を表す弦振動波形データを取得すると、取得した各弦の弦振動波形データフレームに対して音量検出及びピッチ検出を実行し、検出した各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを取得する。そして、演奏情報予測装置３００は、取得した音量ピッチ特徴化データを訓練済み有効弦振動判定モデルに入力し、各弦の撥弦の有効性を判定する。演奏情報予測装置３００は、有効であると判定された各弦の音量及びピッチに基づきそれぞれ消音情報及びピッチ変更情報を生成すると共に、音量に基づき撥弦の強さ（速度）を示すベロシティー情報を生成し、発音情報に付加する。このようにして、演奏情報予測装置３００は、各時刻の音量ピッチ特徴化データから発音情報、消音情報及び／又はピッチ変更情報を含む各時刻の演奏情報（例えば、ＭＩＤＩメッセージなど）を生成し、外部機器（例えば、再生装置、コンピュータ等）に送信する。例えば、これらの演奏情報生成処理は、ＣＰＵによって実現されてもよい。 However, the guitar 50 and the control device 100 are not limited to the hardware configuration described above, and may be realized by any other appropriate hardware configuration.
[Performance information prediction device]
Next, the performance information prediction device 300 according to the embodiment of the present disclosure will be described with reference to FIGS. 7 to 12. The performance information prediction device 300 predicts performance information (for example, a MIBI file) from the performance of the guitar 50 by the performer by using the effective string vibration determination model trained by the effective string vibration determination model training device 200. Specifically, when the performance information prediction device 300 acquires string vibration waveform data representing a guitar performance from the guitar 50, it executes volume detection and pitch detection for the acquired string vibration waveform data frame of each string and detects them. The volume pitch featured data including the volume information and pitch information of each string is acquired. Then, the performance information prediction device 300 inputs the acquired volume pitch characteristic data into the trained effective string vibration determination model, and determines the effectiveness of the string repellent of each string. The performance information prediction device 300 generates muffling information and pitch change information based on the volume and pitch of each string determined to be effective, and velocity information indicating the strength (speed) of the string repellent based on the volume. Is generated and added to the pronunciation information. In this way, the performance information prediction device 300 generates performance information (for example, a LIMITED message) at each time including sounding information, mute information and / or pitch change information from the volume pitch featured data at each time. Send to an external device (eg, playback device, computer, etc.). For example, these performance information generation processes may be realized by the CPU.

図７Ａは、本開示の一実施例による演奏情報予測装置３００の機能構成を示すブロック図である。 FIG. 7A is a block diagram showing a functional configuration of the performance information prediction device 300 according to the embodiment of the present disclosure.

図７Ａに示されるように、演奏情報予測装置３００は、前処理部３１０及び演奏情報予測部３２０を有する。 As shown in FIG. 7A, the performance information prediction device 300 has a preprocessing unit 310 and a performance information prediction unit 320.

前処理部２１０は、弦楽器演奏を表す弦振動波形データから各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを生成する。具体的には、演奏者によってギター５０が演奏されると、ギター５０は、時間と各弦の振幅とを示す弦振動波形データを取得し、演奏情報予測装置３００に送信する。すなわち、ギター５０は６弦からなるため、６種類の弦振動波形データが生成される。例えば、弦振動波形データは、弦振動の解析に十分な２０ＫＨｚ（５０μｓｅｃ間隔）のサンプリングレート及びデータワード長１６ビットにより取得されうる。 The preprocessing unit 210 generates volume pitch characteristic data including volume information and pitch information of each string from string vibration waveform data representing stringed instrument performance. Specifically, when the guitar 50 is played by the performer, the guitar 50 acquires string vibration waveform data indicating the time and the amplitude of each string and transmits it to the performance information prediction device 300. That is, since the guitar 50 is composed of 6 strings, 6 types of string vibration waveform data are generated. For example, string vibration waveform data can be acquired with a sampling rate of 20 KHz (50 μsec intervals) and a data word length of 16 bits, which is sufficient for string vibration analysis.

前処理部３１０は、各弦の弦振動波形データに対して音量検出及びピッチ検出を実行し、図８に示されるような弦毎の音量エンベロープ及びピッチエンベロープを取得する。例えば、音量検出について、前処理部３１０は、図９（ａ）に示されるように、弦振動波形データを絶対値化し、ローパスフィルタリングを実行し、フィルタリングされた波形の振幅レベルを当該基準時刻における音量として決定してもよい。当該基準時刻に対して訓練済み有効弦振動判定モデルが有効な撥弦を検出しなかった場合、あるいは、検出した音量が消音状態と認められる所定の閾値以下であった場合、前処理部３１０は、発音がなかったと判断し、演奏情報として消音情報を出力する。そうでない場合、前処理部３１０は、発音があったと判断し、検出した音量を当該発音のベロシティー値とし、有効弦振動判定モデルから出力されたノート番号と共に当該ベロシティー値を発音情報に含める。 The preprocessing unit 310 executes volume detection and pitch detection on the string vibration waveform data of each string, and acquires the volume envelope and pitch envelope for each string as shown in FIG. For example, for volume detection, the preprocessing unit 310 converts the string vibration waveform data into an absolute value, performs low-pass filtering, and sets the amplitude level of the filtered waveform at the reference time, as shown in FIG. 9A. It may be determined as the volume. If the trained effective string vibration determination model does not detect a valid string repellent for the reference time, or if the detected volume is below a predetermined threshold value that is recognized as a mute state, the preprocessing unit 310 , Judges that there was no pronunciation, and outputs mute information as performance information. If not, the preprocessing unit 310 determines that there is a pronunciation, sets the detected volume as the velocity value of the pronunciation, and includes the velocity value together with the note number output from the effective string vibration determination model in the pronunciation information. ..

また、ピッチ検出について、前処理部３２０は、図９（ｂ）に示されるように、弦振動波形データに対してハイパスフィルタリング及びローパスフィルタリングを実行し、ゼロクロス点を計数し、ピッチ情報を生成する。そして、前処理部３２０は、直近のピッチ情報又は発音情報と差異があった場合、ピッチ変更があったと判断し、ピッチ変更情報を出力する。 Further, regarding pitch detection, as shown in FIG. 9B, the preprocessing unit 320 executes high-pass filtering and low-pass filtering on the string vibration waveform data, counts zero cross points, and generates pitch information. .. Then, when there is a difference from the latest pitch information or pronunciation information, the preprocessing unit 320 determines that the pitch has been changed and outputs the pitch change information.

なお、前処理部３１０は、全ての弦に対して上述した処理を並列に実行し、図８に示されるように、所定の処理周期ｓで弦振動を検出し、時間軸に関して重複する弦振動波形データフレームを生成する。図８に示されるように、前処理部３１０は、各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データフレームを生成する。 The preprocessing unit 310 executes the above-mentioned processing for all strings in parallel, detects string vibration in a predetermined processing cycle s as shown in FIG. 8, and overlaps string vibrations with respect to the time axis. Generate a waveform data frame. As shown in FIG. 8, the preprocessing unit 310 generates a volume pitch characteristic data frame including volume information and pitch information of each string.

演奏情報予測部３２０は、訓練済み有効弦振動判定モデルを利用して、音量ピッチ特徴化データから弦楽器演奏の演奏情報を予測する。具体的には、演奏情報予測部３２０は、前処理部３１０によって生成された音量ピッチ特徴化データフレームを訓練済み有効弦振動判定モデルに入力し、各弦の有効又は無効を判定する。例えば、訓練済み有効弦振動判定モデルは、基準時刻の音量ピッチ特徴化データフレーム、基準時刻直前のｐ個の音量ピッチ特徴化データフレーム及び基準時刻直後のｎ個の音量ピッチ特徴化データフレームを入力として受け取り、基準時刻の各弦の有効又は無効を示す有効性情報を判定する。ここで、所定数ｐ，ｎは同一又は異なる所定値であってもよい。例えば、所定数ｐ，ｎは、演奏者によるギター５０の撥弦と、演奏情報予測装置３００における演奏情報の出力とのタイムラグが演奏者によって認知できない程度の値に設定されることが好ましい。このように一定の時間範囲の音量ピッチ特徴化データフレームを利用することによって、フレーム間の前後関係を考慮して各弦が意図された撥弦であるか判断することができると共に、撥弦の時間変化を判断することが可能になる。 The performance information prediction unit 320 predicts the performance information of the stringed instrument performance from the volume pitch characterization data by using the trained effective string vibration determination model. Specifically, the performance information prediction unit 320 inputs the volume pitch featured data frame generated by the preprocessing unit 310 into the trained effective string vibration determination model, and determines whether each string is valid or invalid. For example, the trained effective string vibration determination model inputs a volume pitch featured data frame at a reference time, p volume pitch featured data frames immediately before the reference time, and n volume pitch featured data frames immediately after the reference time. To determine the validity information indicating the validity or invalidity of each string at the reference time. Here, the predetermined numbers p and n may be the same or different predetermined values. For example, it is preferable that the predetermined numbers p and n are set to a value such that the time lag between the string repelling of the guitar 50 by the performer and the output of the performance information in the performance information prediction device 300 cannot be recognized by the performer. By using the volume pitch featured data frame in a certain time range in this way, it is possible to determine whether each string is the intended string repellent in consideration of the anteroposterior relationship between the frames, and it is possible to determine whether or not each string is the intended string repellent. It becomes possible to judge the time change.

このようにして、各弦の有効性を判定すると、演奏情報予測部３２０は、図１０に示されるように、演奏情報予測部３２０は、基準時刻における弦振動が有効であると判定された弦については、当該弦について検出された音量を当該弦の音量として決定し、基準時刻における弦振動が無効であると判定された弦については、当該弦について検出された音量をゼロに設定する。そして、演奏情報予測部３２０は、有効な弦振動のみ発音情報を構成し、検出されたピッチによってピッチ変更情報を構成することによって、演奏情報を生成する。 When the effectiveness of each string is determined in this way, as shown in FIG. 10, the performance information prediction unit 320 determines that the string vibration at the reference time is effective in the performance information prediction unit 320. For, the volume detected for the string is determined as the volume of the string, and for the string determined that the string vibration at the reference time is invalid, the volume detected for the string is set to zero. Then, the performance information prediction unit 320 generates performance information by configuring sound information only for valid string vibrations and configuring pitch change information according to the detected pitch.

なお、基準時刻において発音がなかった場合、すなわち、基準時刻が消音状態であった場合、有効弦振動判定モデルは、全ての弦について無効を出力するように訓練されてもよい。 If there is no sound at the reference time, that is, if the reference time is in the mute state, the effective string vibration determination model may be trained to output invalidity for all strings.

一実施例では、有効弦振動判定モデルは、ニューラルネットワークによって実現されてもよい。例えば、有効弦振動判定モデルは、図１１に示されるようなネットワークアーキテクチャを有するニューラルネットワークであってもよい。この場合、演奏情報予測部３２０は、ニューラルネットワークの入力層に基準時刻の音量ピッチ特徴化データフレームと、基準時刻前後の（ｐ＋ｎ）個の音量ピッチ特徴化データフレームとを入力し、中間層における演算を介し出力層から各弦の有効又は無効を示す値を取得する。 In one embodiment, the effective string vibration determination model may be realized by a neural network. For example, the effective string vibration determination model may be a neural network having a network architecture as shown in FIG. In this case, the performance information prediction unit 320 inputs the volume pitch featured data frame of the reference time and the (p + n) volume pitch featured data frames before and after the reference time into the input layer of the neural network, and in the intermediate layer. The value indicating the validity or invalidity of each string is obtained from the output layer through the calculation.

また、他の実施例では、有効弦振動判定モデルは、図１２に示されるような再帰型ニューラルネットワークによって実現されてもよい。この場合、演奏情報予測部３２０は、上述したｐ，ｎによる時間範囲より広い時間範囲の音量ピッチ特徴化データフレームを利用してもよく、例えば、基準時刻ｔの音量ピッチ特徴化データフレーム、基準時刻直前のｂ個（ｂ＞ｐ）の音量ピッチ特徴化データフレーム、及び基準時刻直後のｆ個（ｆ＞ｎ）の音量ピッチ特徴化データフレームを再帰型ニューラルネットワークの入力層Ｘ_ｔ－ｂ，・・・Ｘ_ｔ－１，Ｘ_ｔ，Ｘ_ｔ＋１，・・・，Ｘ_ｔ＋ｆに入力し、中間層における演算を介し出力層から各弦の有効又は無効を示す値を取得してもよい。再帰型ニューラルネットワークは、時系列データの処理に適しており、一定の時間範囲の音量ピッチ特徴化データフレームから各弦の有効性を高精度に予測することができると考えられる。
［演奏情報予測処理］
次に、図１３を参照して、本開示の一実施例による演奏情報予測処理を説明する。当該演奏情報予測処理は、上述した演奏情報予測装置３００によって実現され、例えば、プロセッサがプログラム又は命令を実行することによって実現されてもよい。図１３は、本開示の一実施例による演奏情報予測処理を示すフローチャートである。 Further, in another embodiment, the effective string vibration determination model may be realized by a recurrent neural network as shown in FIG. In this case, the performance information prediction unit 320 may use the volume pitch featured data frame having a time range wider than the time range of p and n described above. For example, the volume pitch featured data frame at the reference time t, the reference. The b (b> p) volume pitch featured data frames immediately before the time and the f (f> n) volume pitch featured data frames immediately after the reference time are input to the input layer _Xt-b of the recursive neural network. ... It may be input to X _t-1 , X _t , X _{t + 1} , ..., X _{t + f} , and a value indicating the validity or invalidity of each string may be obtained from the output layer through an operation in the intermediate layer. Recurrent neural networks are suitable for processing time-series data, and it is considered that the effectiveness of each string can be predicted with high accuracy from the volume pitch featured data frame in a certain time range.
[Performance information prediction processing]
Next, with reference to FIG. 13, the performance information prediction process according to the embodiment of the present disclosure will be described. The performance information prediction process is realized by the performance information prediction device 300 described above, and may be realized, for example, by the processor executing a program or an instruction. FIG. 13 is a flowchart showing a performance information prediction process according to an embodiment of the present disclosure.

図１３に示されるように、ステップＳ１０１において、演奏情報予測装置３００は、弦番号ｓを０に初期化する。ギター５０は６弦から構成されるため、弦番号ｓは０～５の値をとる。 As shown in FIG. 13, in step S101, the performance information prediction device 300 initializes the string number s to 0. Since the guitar 50 is composed of 6 strings, the string number s takes a value of 0 to 5.

ステップＳ１０２において、演奏情報予測装置３００は、弦振動波形データから弦ｓの現在の音量ｌ［ｓ］を検出する。例えば、演奏情報予測装置３００は、弦振動波形データに対して絶対値化及びローパスフィルタリングを適用し、音量ｌ［ｓ］を検出してもよい。 In step S102, the performance information prediction device 300 detects the current volume l [s] of the strings s from the string vibration waveform data. For example, the performance information prediction device 300 may apply absolute value conversion and low-pass filtering to the string vibration waveform data to detect the volume l [s].

ステップＳ１０３において、演奏情報予測装置３００は、弦振動波形から弦ｓの現在のピッチｐ［ｓ］を検出する。例えば、演奏情報予測装置３００は、弦振動波形データに対してハイパスフィルタリング及びローパスフィルタリングを適用し、ピッチｐ［ｓ］を検出してもよい。 In step S103, the performance information prediction device 300 detects the current pitch p [s] of the strings s from the string vibration waveform. For example, the performance information prediction device 300 may apply high-pass filtering and low-pass filtering to the string vibration waveform data to detect the pitch p [s].

ステップＳ１０４において、演奏情報予測装置３００は、弦番号ｓを１だけインクリメントする。 In step S104, the performance information prediction device 300 increments the string number s by 1.

ステップＳ１０５において、演奏情報予測装置３００は、全ての弦について音量及びピッチを検出したか判断する。 In step S105, the performance information prediction device 300 determines whether or not the volume and pitch have been detected for all the strings.

ステップＳ１０６において、演奏情報予測装置３００は、６つの弦のそれぞれの現在の音量ｌ［０］～ｌ［５］及びピッチｐ［０］～ｐ［５］を格納する。 In step S106, the performance information predictor 300 stores the current volumes l [0] to l [5] and pitches p [0] to p [5] of each of the six strings.

ステップＳ１０７において、演奏情報予測装置３００は、基準時刻の音量ピッチ特徴化データフレーム、基準時刻直前のｐ個の音量ピッチ特徴化データフレーム及び基準時刻直後のｎ個の音量ピッチ特徴化データフレームを訓練済み有効弦振動判定モデルに入力する。 In step S107, the performance information prediction device 300 trains the volume pitch featured data frame at the reference time, p volume pitch featured data frames immediately before the reference time, and n volume pitch featured data frames immediately after the reference time. Input to the completed effective string vibration judgment model.

ステップＳ１０８において、演奏情報予測装置３００は、各弦に対する判定結果をそれぞれａ［０］～ａ［５］に格納する。 In step S108, the performance information prediction device 300 stores the determination results for each string in a [0] to a [5], respectively.

ステップＳ１０９において、演奏情報予測装置３００は、デジタル信号プロセッサ（ＤＳＰ）に各弦の判定結果を示す弦振動有効性情報ａ［０］～ａ［５］をわたし、レベル信号を制御する。すなわち、ＤＳＰは、判定結果が有効弦振動である弦については弦振動のレベルを維持し、判定結果が有効弦振動でない弦については弦振動のレベルを無音に設定してもよい。 In step S109, the performance information prediction device 300 gives the digital signal processor (DSP) string vibration effectiveness information a [0] to a [5] indicating the determination result of each string, and controls the level signal. That is, the DSP may maintain the string vibration level for strings whose determination result is effective string vibration, and may set the string vibration level to silence for strings whose determination result is not effective string vibration.

ステップＳ１１０において、演奏情報予測装置３００は、弦番号ｓを０にリセットする。 In step S110, the performance information prediction device 300 resets the string number s to 0.

ステップＳ１１１において、演奏情報予測装置３００は、ＤＳＰの出力する音量情報をｌ［ｓ］に格納する。 In step S111, the performance information prediction device 300 stores the volume information output by the DSP in l [s].

ステップＳ１１２において、演奏情報予測装置３００は、ＤＳＰの出力するピッチ情報をｐ［ｓ］に格納する。 In step S112, the performance information prediction device 300 stores the pitch information output by the DSP in p [s].

ステップＳ１１３において、演奏情報予測装置３００は、弦ｓが発音中であるか判断する。弦ｓが発音中でない場合（Ｓ１１３：Ｎｏ）、演奏情報予測装置３００は、ステップＳ１１７に移行する。他方、弦ｓが発音中である場合（Ｓ１１３：Ｙｅｓ）、演奏情報予測装置３００は、ステップＳ１１４において、弦ｓの弦振動が無効なものであるか、すなわち、弦ｓの弦振動有効性情報がａ［ｓ］＝０であるか判断する。 In step S113, the performance information prediction device 300 determines whether the string s is being pronounced. When the string s is not being pronounced (S113: No), the performance information prediction device 300 proceeds to step S117. On the other hand, when the string s is being pronounced (S113: Yes), the performance information predictor 300 determines in step S114 whether the string vibration of the string s is invalid, that is, the string vibration effectiveness information of the string s. Is determined to be a [s] = 0.

弦ｓの弦振動が無効である場合（Ｓ１１４：Ｙｅｓ）、演奏情報予測装置３００は、ステップＳ１１６に移行する。他方、弦ｓの弦振動が有効である場合（Ｓ１１４：Ｎｏ）、演奏情報予測装置３００は、ステップＳ１１５において、音量ｌ［ｓ］が所定の発音最低音量未満であるか判断する。 When the string vibration of the string s is invalid (S114: Yes), the performance information prediction device 300 proceeds to step S116. On the other hand, when the string vibration of the string s is effective (S114: No), the performance information prediction device 300 determines in step S115 whether the volume l [s] is less than the predetermined minimum sounding volume.

ステップＳ１１５において、演奏情報予測装置３００は、音量ｌ［ｓ］が所定の発音レベルより大きいか判断する。音量ｌ［ｓ］が所定の発音レベルより大きい場合（Ｓ１１５：Ｙｅｓ）、演奏情報予測装置３００は、ステップＳ１１６において、弦ｓの現在の発音に対する消音情報を発生させる。他方、音量ｌ［ｓ］が所定の発音レベル以下である場合（Ｓ１１５：Ｎｏ）、演奏情報予測装置３００は、ステップＳ１１７に移行する。 In step S115, the performance information prediction device 300 determines whether the volume l [s] is higher than the predetermined pronunciation level. When the volume l [s] is larger than the predetermined pronunciation level (S115: Yes), the performance information prediction device 300 generates muffling information for the current pronunciation of the string s in step S116. On the other hand, when the volume l [s] is equal to or lower than the predetermined pronunciation level (S115: No), the performance information prediction device 300 proceeds to step S117.

ステップＳ１１７において、演奏情報予測装置３００は、ｌ［ｓ］が発音レベルより大きいか判断する。ｌ［ｓ］が発音レベルより大きい場合（Ｓ１１７：Ｙｅｓ）、演奏情報予測装置３００は、ステップＳ１１８に移行する。他方、ｌ［ｓ］が発音レベル以下である場合（Ｓ１１７：Ｎｏ）、演奏情報予測装置３００は、ステップＳ１２２に移行する。 In step S117, the performance information prediction device 300 determines whether l [s] is larger than the pronunciation level. When l [s] is larger than the pronunciation level (S117: Yes), the performance information prediction device 300 proceeds to step S118. On the other hand, when l [s] is equal to or lower than the pronunciation level (S117: No), the performance information prediction device 300 proceeds to step S122.

ステップＳ１１８において、演奏情報予測装置３００は、弦ｓの弦振動有効性情報がａ［ｓ］＝０であるか判断する。弦ｓの弦振動有効性情報がａ［ｓ］＝０である場合（Ｓ１１８：Ｙｅｓ）、演奏情報予測装置３００は、ステップＳ１２２に移行する。他方、弦ｓの弦振動有効性情報がａ［ｓ］＝０でない場合（Ｓ１１８：Ｎｏ）、演奏情報予測装置３００は、ステップＳ１１９において、ｐ［ｓ］に基づきノート番号ｋを計算する。 In step S118, the performance information prediction device 300 determines whether the string vibration effectiveness information of the string s is a [s] = 0. When the string vibration effectiveness information of the string s is a [s] = 0 (S118: Yes), the performance information prediction device 300 proceeds to step S122. On the other hand, when the string vibration effectiveness information of the string s is not a [s] = 0 (S118: No), the performance information prediction device 300 calculates the note number k based on p [s] in step S119.

ステップＳ１２０において、演奏情報予測装置３００は、ｌ［ｓ］に基づきベロシティー情報ｖを計算する。 In step S120, the performance information prediction device 300 calculates the velocity information v based on l [s].

ステップＳ１２１において、演奏情報予測装置３００は、ノート番号ｋ及びベロシティー情報ｖの発音情報を生成する。 In step S121, the performance information prediction device 300 generates pronunciation information of the note number k and the velocity information v.

ステップＳ１２２において、演奏情報予測装置３００は、ピッチｐ［ｓ］が前回の弦ｓのピッチＰ０［ｓ］に一致するか判断する。一致する場合（Ｓ１２２：Ｙｅｓ）、演奏情報予測装置３００は、ステップＳ１２５に移行する。他方、一致しない場合（Ｓ１２２：Ｎｏ）、演奏情報予測装置３００は、差分を求めてピッチベンド情報を生成する。 In step S122, the performance information prediction device 300 determines whether the pitch p [s] matches the pitch P0 [s] of the previous string s. If they match (S122: Yes), the performance information prediction device 300 proceeds to step S125. On the other hand, if they do not match (S122: No), the performance information prediction device 300 obtains the difference and generates pitch bend information.

ステップＳ１２４において、演奏情報予測装置３００は、Ｐ０［ｓ］をｐによって更新する。 In step S124, the performance information prediction device 300 updates P0 [s] with p.

ステップＳ１２５において、演奏情報予測装置３００は、弦番号ｓを１だけインクリメントする。 In step S125, the performance information prediction device 300 increments the string number s by 1.

ステップＳ１２６において、演奏情報予測装置３００は、全ての弦が処理されたか判断し、全ての弦が処理されていない場合（Ｓ１２６：Ｙｅｓ）、ステップＳ１１１に戻り、全ての弦が処理されている場合（Ｓ１２６：Ｎｏ）、当該処理を終了する。
［有効弦振動判定モデル訓練装置］
次に、図１４及び７Ｂを参照して、本開示の一実施例による有効弦振動判定モデル訓練装置２００を説明する。図１４は、本開示の一実施例による有効弦振動判定モデル訓練装置２００の動作を示す概略図である。 In step S126, the performance information prediction device 300 determines whether all the strings have been processed, and if all the strings have not been processed (S126: Yes), returns to step S111 and all the strings have been processed. (S126: No), the process is terminated.
[Effective string vibration judgment model training device]
Next, the effective string vibration determination model training device 200 according to the embodiment of the present disclosure will be described with reference to FIGS. 14 and 7B. FIG. 14 is a schematic view showing the operation of the effective string vibration determination model training device 200 according to the embodiment of the present disclosure.

図１４に示されるように、有効弦振動判定モデル訓練装置２００は、訓練データを格納する訓練用演奏情報データベース８０に格納されている訓練データを利用して有効弦振動判定モデルを訓練する。具体的には、訓練データは、訓練用楽譜データと当該楽譜データに対応する訓練用演奏情報とのペアから構成され、訓練用楽譜データに基づき表示された楽譜（例えば、ＴＡＢ譜など）を演奏者に表示し、演奏者は、メトロノームによるテンポ制御の下でギター５０を演奏する。当該演奏を表す弦振動波形データは、有効弦振動判定モデル訓練装置２００に提供され、有効弦振動判定モデル訓練装置２００は、取得した各弦の弦振動波形データに対して音量検出及びピッチ検出を実行し、各弦の音量及びピッチから構成される音量ピッチ特徴化データフレームを生成する。そして、有効弦振動判定モデル訓練装置２００は、基準時及び基準時前後の音量ピッチ特徴化データフレームを訓練対象の有効弦振動判定モデルに入力する。そして、有効弦振動判定モデル訓練装置２００は、有効弦振動判定モデルからの出力と訓練用演奏情報の発音情報（例えば、発音情報及び消音情報など）とを比較し、その誤差に応じて有効弦振動判定モデルのパラメータを更新する。有効弦振動判定モデル訓練装置２００は、所定の終了条件が充足されるまで上述した処理を繰り返し、有効弦振動判定モデルからの出力が訓練用演奏情報の発音情報に近づくように有効弦振動判定モデルを最適化する。 As shown in FIG. 14, the effective string vibration determination model training device 200 trains the effective string vibration determination model using the training data stored in the training performance information database 80 that stores the training data. Specifically, the training data is composed of a pair of the training score data and the training performance information corresponding to the score data, and plays the score displayed based on the training score data (for example, TAB score). The performer plays the guitar 50 under tempo control by the metronome. The string vibration waveform data representing the performance is provided to the effective string vibration determination model training device 200, and the effective string vibration determination model training device 200 performs volume detection and pitch detection on the acquired string vibration waveform data of each string. Executes to generate a volume pitch featured data frame consisting of the volume and pitch of each string. Then, the effective string vibration determination model training device 200 inputs the volume pitch characteristic data frames at the reference time and before and after the reference time into the effective string vibration determination model to be trained. Then, the effective string vibration determination model training device 200 compares the output from the effective string vibration determination model with the pronunciation information (for example, pronunciation information and muffling information) of the training performance information, and the effective string according to the error. Update the parameters of the vibration judgment model. The effective string vibration determination model training device 200 repeats the above-mentioned processing until a predetermined end condition is satisfied, and the effective string vibration determination model is such that the output from the effective string vibration determination model approaches the pronunciation information of the training performance information. Optimize.

図７Ｂは、本開示の一実施例による有効弦振動判定モデル訓練装置２００の機能構成を示すブロック図である。 FIG. 7B is a block diagram showing a functional configuration of an effective string vibration determination model training device 200 according to an embodiment of the present disclosure.

図７Ｂに示されるように、有効弦振動判定モデル訓練装置２００は、前処理部２１０及び有効弦振動判定モデル訓練部２２０を有する。 As shown in FIG. 7B, the effective string vibration determination model training device 200 has a preprocessing unit 210 and an effective string vibration determination model training unit 220.

前処理部２１０は、訓練用演奏情報に従って演奏された弦楽器演奏を表す弦振動波形データから各弦の音量及びピッチから構成される音量ピッチ特徴化データを生成する。具体的には、演奏者によってギター５０が演奏されると、ギター５０は、時間と各弦の振幅とを示す弦振動波形データを取得し、有効弦振動判定モデル訓練装置２００に送信する。すなわち、ギター５０は６弦からなるため、６種類の弦振動波形データが生成される。 The preprocessing unit 210 generates volume pitch characterization data composed of the volume and pitch of each string from the string vibration waveform data representing the performance of the stringed instrument played according to the training performance information. Specifically, when the guitar 50 is played by the performer, the guitar 50 acquires the string vibration waveform data indicating the time and the amplitude of each string and transmits it to the effective string vibration determination model training device 200. That is, since the guitar 50 is composed of 6 strings, 6 types of string vibration waveform data are generated.

前処理部２１０は、各弦の弦振動波形データに対して音量検出及びピッチ検出を実行し、検出した各弦の音量及びピッチから構成される音量ピッチ特徴化データを取得する。例えば、前処理部２１０は、弦振動波形データから時間軸に関して重複する窓幅ｗの弦振動波形フレームを抽出し、サンプリング毎に音量検出及びピッチ検出を実行し、各弦振動波形フレームを音量ピッチ特徴化データフレームに変換してもよい。前処理部２１０は、音量ピッチ特徴化データを生成し、有効弦振動判定モデル訓練部２２０に提供する。 The preprocessing unit 210 executes volume detection and pitch detection for the string vibration waveform data of each string, and acquires volume pitch characteristic data composed of the detected volume and pitch of each string. For example, the preprocessing unit 210 extracts a string vibration waveform frame having a window width w that overlaps with respect to the time axis from the string vibration waveform data, executes volume detection and pitch detection for each sampling, and sets each string vibration waveform frame to the volume pitch. It may be converted into a characterized data frame. The preprocessing unit 210 generates volume pitch characterization data and provides it to the effective string vibration determination model training unit 220.

有効弦振動判定モデル訓練部２２０は、訓練用演奏情報を利用して、基準時刻の音量ピッチ特徴化データフレームと、当該基準時刻の音量ピッチ特徴化データフレームの前後の音量ピッチ特徴化データフレームとから弦楽器演奏における各弦の有効性を判定する有効弦振動判定モデルを訓練する。ここで、訓練対象の有効弦振動判定モデルは、予測対象の基準時刻の演奏情報を予測する際、当該基準時刻の音量ピッチ特徴化データフレームだけでなく、当該基準時刻の前後の時刻の音量ピッチ特徴化データフレームを入力として取得し、基準時刻における各弦の有効性情報を出力する。例えば、有効弦振動判定モデル訓練部２２０は、基準時刻の音量ピッチ特徴化データフレームと、基準時刻直前のｐ個の音量ピッチ特徴化データフレームと、基準時刻直後のｎ個の音量ピッチ特徴化データフレームとを有効弦振動判定モデルに入力してもよい。ここで、所定数ｐ，ｎは同一又は異なる所定値であってもよい。例えば、所定数ｐ，ｎは、演奏者によるギター５０の撥弦と、演奏情報予測装置３００における演奏情報の出力とのタイムラグが演奏者によって認知できない程度の値に設定されることが好ましい。 The effective string vibration determination model training unit 220 uses the performance information for training to obtain a volume pitch featured data frame at a reference time and a volume pitch featured data frame before and after the volume pitch featured data frame at the reference time. To train an effective string vibration judgment model that judges the effectiveness of each string in playing a stringed instrument. Here, when predicting the performance information of the reference time of the prediction target, the effective string vibration determination model of the training target not only the volume pitch characterized data frame of the reference time but also the volume pitch of the time before and after the reference time. It takes a characterized data frame as an input and outputs the validity information of each string at the reference time. For example, the effective string vibration determination model training unit 220 has a volume pitch featured data frame at the reference time, p volume pitch featured data frames immediately before the reference time, and n volume pitch featured data immediately after the reference time. The frame may be input to the effective string vibration determination model. Here, the predetermined numbers p and n may be the same or different predetermined values. For example, it is preferable that the predetermined numbers p and n are set to a value such that the time lag between the string repelling of the guitar 50 by the performer and the output of the performance information in the performance information prediction device 300 cannot be recognized by the performer.

このように一定の時間範囲の音量ピッチ特徴化データフレームを利用することによって、フレーム間の前後関係を考慮して新たな撥弦が発生したか判断することができると共に、撥弦の時間変化を判断することが可能になる。 By using the volume pitch featured data frame in a certain time range in this way, it is possible to determine whether a new string repelling has occurred in consideration of the anteroposterior relationship between the frames, and to change the time of the string repelling. It becomes possible to judge.

また、訓練対象の有効弦振動判定モデルは、事前訓練された機械学習モデルであってもよく、有効弦振動判定モデル訓練部２２０は、上述した訓練処理によって、事前訓練された有効弦振動判定モデルをファインチューニングするようにしてもよい。これにより、初期状態の機械学習モデルから有効弦振動判定モデルを訓練するのと比較して、少ない訓練データにより高精度な有効弦振動判定モデルを構築することが可能になる。 Further, the effective string vibration determination model to be trained may be a pre-trained machine learning model, and the effective string vibration determination model training unit 220 is a pre-trained effective string vibration determination model by the above-mentioned training process. May be fine-tuned. This makes it possible to construct a highly accurate effective string vibration determination model with less training data than training an effective string vibration determination model from a machine learning model in the initial state.

有効弦振動判定モデルから各弦の有効性情報を取得すると、有効弦振動判定モデル訓練部２２０は、取得した有効性情報に基づき導出される発音情報と、訓練用演奏情報の発音情報とを比較し、これらが一致するように有効弦振動判定モデルのパラメータを更新する。例えば、有効弦振動判定モデルがニューラルネットワークにより実現される場合、有効弦振動判定モデル訓練部２２０は、周知の誤差逆伝播法に従って比較結果に応じてニューラルネットワークのパラメータを更新してもよい。なお、訓練用演奏情報の発音情報が無いにも関わらず発音情報が出力された場合、当該発音情報は不要弦振動と見做す。 When the effectiveness information of each string is acquired from the effective string vibration determination model, the effective string vibration determination model training unit 220 compares the pronunciation information derived based on the acquired effectiveness information with the pronunciation information of the training performance information. Then, update the parameters of the effective string vibration determination model so that they match. For example, when the effective string vibration determination model is realized by the neural network, the effective string vibration determination model training unit 220 may update the parameters of the neural network according to the comparison result according to a well-known error back propagation method. If the pronunciation information is output even though there is no pronunciation information of the training performance information, the pronunciation information is regarded as unnecessary string vibration.

有効弦振動判定モデル訓練部２２０は、所定の終了条件が充足されるまで、上述した処理を繰り返し、有効弦振動判定モデルを訓練し、所定の終了条件が充足されると、当該時点における有効弦振動判定モデルを訓練済み有効弦振動判定モデルとして演奏情報予測装置３００にわたす。ここで、所定の終了条件は、準備された全ての訓練データを処理したことなどであってもよい。
［有効弦振動判定モデル訓練処理］
次に、図１５を参照して、本開示の一実施例による有効弦振動判定モデル訓練処理を説明する。当該有効弦振動判定モデル訓練処理は、上述した有効弦振動判定モデル訓練装置２００によって実現され、例えば、プロセッサがプログラム又は命令を実行することによって実現されてもよい。図１５は、本開示の一実施例による有効弦振動判定モデル訓練処理を示すフローチャートである。 The effective string vibration determination model training unit 220 repeats the above-mentioned processing until the predetermined end condition is satisfied, trains the effective string vibration determination model, and when the predetermined end condition is satisfied, the effective string at that time point. The vibration determination model is passed to the performance information prediction device 300 as a trained effective string vibration determination model. Here, the predetermined termination condition may be that all the prepared training data have been processed.
[Effective string vibration judgment model training process]
Next, with reference to FIG. 15, the effective string vibration determination model training process according to the embodiment of the present disclosure will be described. The effective string vibration determination model training process is realized by the above-mentioned effective string vibration determination model training device 200, and may be realized, for example, by executing a program or an instruction by a processor. FIG. 15 is a flowchart showing an effective string vibration determination model training process according to an embodiment of the present disclosure.

図１５に示されるように、ステップＳ２０１において、有効弦振動判定モデル訓練装置２００は、訓練用演奏情報データベース８０から訓練用演奏情報を選択する。具体的には、有効弦振動判定モデル訓練装置２００は、ランダム、順次、ユーザ選択によって訓練用演奏情報を自動選択してもよい。 As shown in FIG. 15, in step S201, the effective string vibration determination model training device 200 selects training performance information from the training performance information database 80. Specifically, the effective string vibration determination model training device 200 may automatically select training performance information randomly, sequentially, or by user selection.

ステップＳ２０２において、有効弦振動判定モデル訓練装置２００は、演奏情報をＴＡＢ譜の表示情報に変換する。 In step S202, the effective string vibration determination model training device 200 converts the performance information into the display information of the TAB staff.

ステップＳ２０３において、有効弦振動判定モデル訓練装置２００は、ＴＡＢ譜を制御装置１００のＬＣＤなどに表示する。 In step S203, the effective string vibration determination model training device 200 displays the TAB score on the LCD of the control device 100 or the like.

ステップＳ２０４において、有効弦振動判定モデル訓練装置２００は、演奏情報のテンポに合わせてＭＩＤＩプレーヤーをスタートする。 In step S204, the effective string vibration determination model training device 200 starts the MIDI player according to the tempo of the performance information.

ステップＳ２０５において、有効弦振動判定モデル訓練装置２００は、テンポに合わせてメトロノームをスタートする。 In step S205, the effective string vibration determination model training device 200 starts the metronome in accordance with the tempo.

ステップＳ２０６において、ＭＩＤＩプレーヤーは、演奏情報を再生する。 In step S206, the MIDI player reproduces the performance information.

ステップＳ２０７において、メトロノームは、演奏情報を再生する。これにより、演奏者の演奏を取得するための準備が整い、演奏者は演奏を開始する。 In step S207, the metronome reproduces the performance information. As a result, the performer is ready to acquire the performance, and the performer starts playing.

ステップＳ２０８において、有効弦振動判定モデル訓練装置２００は、ＭＩＤＩプレーヤーから発生したｓチャネルの発音情報又は消音情報をメモリｐに格納する。 In step S208, the effective string vibration determination model training device 200 stores the pronunciation information or the muffling information of the s channel generated from the MIDI player in the memory p.

ステップＳ２０９において、有効弦振動判定モデル訓練装置２００は、演奏者による演奏を表す弦振動波形から音量ピッチ特徴化データフレームを生成し、基準時刻における各弦の有効又は無効を示す有効性情報を訓練対象の有効弦振動判定モデルに入力し、判定された有効性情報に基づき発音情報又は消音情報を生成し、メモリｏに格納する。 In step S209, the effective string vibration determination model training device 200 generates a volume pitch characteristic data frame from the string vibration waveform representing the performance by the performer, and trains the validity information indicating the validity or invalidity of each string at the reference time. It is input to the target effective string vibration determination model, and the sounding information or the muffling information is generated based on the determined effectiveness information and stored in the memory o.

ステップＳ２１０において、有効弦振動判定モデル訓練装置２００は、メモリｐの発音情報又は消音情報と、メモリｏの有効弦振動判定モデルの発音情報又は消音情報とを比較する。 In step S210, the effective string vibration determination model training device 200 compares the pronunciation information or muffling information of the memory p with the pronunciation information or muffling information of the effective string vibration determination model of the memory o.

ステップＳ２１１において、有効弦振動判定モデル訓練装置２００は、メモリｐの発音情報又は消音情報とメモリｏの発音情報又は消音情報との間に差分があるか判断する。 In step S211th, the effective string vibration determination model training device 200 determines whether there is a difference between the pronunciation information or muffling information of the memory p and the pronunciation information or muffling information of the memory o.

有意な差分があった場合（Ｓ２１１：Ｙｅｓ）、有効弦振動判定モデル訓練装置２００は、ステップＳ２１２において、当該差分から有効弦振動判定モデルを更新するための最適化情報を有効弦振動判定モデルに適用し、ステップＳ２１３に移行する。他方、有意な差分がなかった場合（Ｓ２１１：Ｎｏ）、有効弦振動判定モデル訓練装置２００は、有効弦振動判定モデルを更新することなく、ステップＳ２１３に移行する。 When there is a significant difference (S211: Yes), the effective string vibration determination model training device 200 uses the optimization information for updating the effective string vibration determination model from the difference in the effective string vibration determination model in step S212. It is applied, and the process proceeds to step S213. On the other hand, when there is no significant difference (S211: No), the effective string vibration determination model training device 200 proceeds to step S213 without updating the effective string vibration determination model.

ステップＳ２１３において、有効弦振動判定モデル訓練装置２００は、演奏が終了したか判断し、終了していない場合（Ｓ２１３：Ｎｏ）、ステップＳ２０６に戻る。 In step S213, the effective string vibration determination model training device 200 determines whether or not the performance has ended, and if not (S213: No), returns to step S206.

ステップＳ２１４において、有効弦振動判定モデル訓練装置２００は、メトロノーム及びＭＩＤＩプレーヤーを停止する。 In step S214, the effective string vibration determination model training device 200 stops the metronome and the MIDI player.

ステップＳ２１５において、有効弦振動判定モデル訓練装置２００は、ユーザなどによる終了操作があったか判断し、終了操作がない場合（Ｓ２１５：Ｎｏ）、ステップＳ２０１に戻り、次の演奏情報を選択し、終了操作があった場合（Ｓ２１５：Ｙｅｓ）、当該処理を終了する。 In step S215, the effective string vibration determination model training device 200 determines whether or not there has been an end operation by a user or the like, and if there is no end operation (S215: No), returns to step S201, selects the next performance information, and ends the operation. If there is (S215: Yes), the process is terminated.

なお、上述した実施例では、ギター５０などの弦楽器の弦振動波形データから演奏情報を予測する有効弦振動判定モデルを訓練し、訓練した有効弦振動判定モデルを利用して演奏情報を予測する演奏情報予測システムを説明したが、本開示はこれに限定されず、管楽器に適用されてもよい。すなわち、本開示は、管楽器の空気振動波形データから演奏情報を予測する有効弦振動判定モデルを訓練し、訓練した有効弦振動判定モデルを利用して演奏情報を予測する演奏情報予測システムに適用されてもよい。 In the above-described embodiment, an effective string vibration determination model that predicts performance information from string vibration waveform data of a string instrument such as a guitar 50 is trained, and a performance that predicts performance information using the trained effective string vibration determination model. Although the information prediction system has been described, the present disclosure is not limited to this, and may be applied to wind instruments. That is, the present disclosure is applied to a performance information prediction system that trains an effective string vibration determination model that predicts performance information from air vibration waveform data of a wind instrument and predicts performance information using the trained effective string vibration determination model. You may.

以上、本発明の実施例について詳述したが、本発明は上述した特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 Although the embodiments of the present invention have been described in detail above, the present invention is not limited to the above-mentioned specific embodiments, and various modifications are made within the scope of the gist of the present invention described in the claims.・ Can be changed.

以下に、本願出願の当初の特許請求の範囲に記載された発明を付記する。
［付記］
本開示の一態様では、
弦楽器演奏を表す弦振動波形データから各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを生成する前処理部と、
訓練済み有効弦振動判定モデルを利用して、前記音量ピッチ特徴化データから前記弦楽器演奏の演奏情報を予測する演奏情報予測部と、
を有する演奏情報予測装置が提供される。 The inventions described in the original claims of the present application are described below.
[Additional Notes]
In one aspect of the disclosure,
A preprocessing unit that generates volume pitch characterization data including volume information and pitch information for each string from string vibration waveform data representing stringed instrument performance, and a preprocessing unit.
A performance information prediction unit that predicts the performance information of the stringed instrument performance from the volume pitch characterization data using the trained effective string vibration determination model.
A performance information prediction device having the above is provided.

一実施例では、前記演奏情報は、発音情報、消音情報及びピッチ変更情報から構成されてもよい。 In one embodiment, the performance information may be composed of pronunciation information, muffling information, and pitch change information.

一実施例では、前記訓練済み有効弦振動判定モデルは、各弦の振動が有効又は無効であるかを出力してもよい。 In one embodiment, the trained effective string vibration determination model may output whether the vibration of each string is valid or invalid.

一実施例では、前記訓練済み有効弦振動判定モデルは、ニューラルネットワークにより実現されてもよい。 In one embodiment, the trained effective string vibration determination model may be realized by a neural network.

一実施例では、前記演奏情報は、ＭＩＤＩプロトコルに従って記述されてもよい。 In one embodiment, the performance information may be described according to the MIDI protocol.

本開示の他の態様では、
訓練用演奏情報に従って演奏された弦楽器演奏を表す弦振動波形データから各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを生成する前処理部と、
前記訓練用演奏情報を利用して、前記音量ピッチ特徴化データから各弦の振動の有効性を判定する有効弦振動判定モデルを訓練する有効弦振動判定モデル訓練部と、
を有する有効弦振動判定モデル訓練装置が提供される。 In another aspect of the present disclosure,
A preprocessing unit that generates volume pitch characterization data including volume information and pitch information for each string from string vibration waveform data representing string instrument performance performed according to training performance information, and a preprocessing unit.
An effective string vibration determination model training unit that trains an effective string vibration determination model that determines the effectiveness of vibration of each string from the volume pitch characteristic data using the training performance information.
An effective string vibration determination model training device having the above is provided.

本開示の他の態様では、
電子弦楽器と、
上述した演奏情報予測装置と、
上述した有効弦振動判定モデル訓練装置と、
を有する演奏情報生成システムが提供される。 In another aspect of the present disclosure,
With electronic stringed instruments
With the performance information prediction device described above,
With the above-mentioned effective string vibration judgment model training device,
A performance information generation system having the above is provided.

本開示の他の態様では、
１つ以上のプロセッサが、弦楽器演奏を表す弦振動波形データから各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを生成するステップと、
前記１つ以上のプロセッサが、訓練済み有効弦振動判定モデルを利用して、前記音量ピッチ特徴化データから前記弦楽器演奏の演奏情報を予測するステップと、
を有する演奏情報予測方法が提供される。 In another aspect of the present disclosure,
A step in which one or more processors generate volume pitch featured data, including volume information and pitch information for each string, from string vibration waveform data representing string instrument performance.
A step in which the one or more processors predict the performance information of the stringed instrument performance from the volume pitch characterization data using the trained effective string vibration determination model.
A performance information prediction method is provided.

本開示の他の態様では、
１つ以上のプロセッサが、訓練用演奏情報に従って演奏された弦楽器演奏を表す弦振動波形データから各弦の音量情報及びピッチ情報を含む音量ピッチ特徴化データを生成するステップと、
前記１つ以上のプロセッサが、前記訓練用演奏情報を利用して、前記音量ピッチ特徴化データから各弦の振動の有効性を判定する有効弦振動判定モデルを訓練するステップと、
を有する有効弦振動判定モデル訓練方法が提供される。 In another aspect of the present disclosure,
A step in which one or more processors generate volume pitch characteristic data including volume information and pitch information of each string from string vibration waveform data representing a stringed instrument performance played according to training performance information.
A step in which the one or more processors use the training performance information to train an effective string vibration determination model that determines the effectiveness of vibration of each string from the volume pitch characteristic data.
An effective string vibration determination model training method having the above is provided.

１０ギターコントローラ
５０ギター
１００制御装置
２００有効弦振動判定モデル訓練装置
２１０前処理部
２２０有効弦振動判定モデル訓練部
３００演奏情報予測装置
３１０前処理部
３２０演奏情報予測部 10 Guitar controller 50 Guitar 100 Control device 200 Effective string vibration judgment model training device 210 Pre-processing unit 220 Effective string vibration judgment model training unit 300 Performance information prediction device 310 Pre-processing unit 320 Performance information prediction unit

Claims

A preprocessing unit that generates volume pitch characterization data including volume information and pitch information for each string from string vibration waveform data representing stringed instrument performance, and a preprocessing unit.
A performance information prediction unit that predicts the performance information of the stringed instrument performance from the volume pitch characterization data using the trained effective string vibration determination model.
Performance information prediction device.

The performance information prediction device according to claim 1, wherein the performance information is composed of pronunciation information, mute information, and pitch change information.

The performance information prediction device according to claim 1 or 2, wherein the trained effective string vibration determination model outputs whether the vibration of each string is valid or invalid.

The performance information prediction device according to any one of claims 1 to 3, wherein the trained effective string vibration determination model is realized by a neural network.

The performance information prediction device according to any one of claims 1 to 4, wherein the performance information is described according to the MIDI protocol.

A preprocessing unit that generates volume pitch characterization data including volume information and pitch information for each string from string vibration waveform data representing string instrument performance performed according to training performance information, and a preprocessing unit.
An effective string vibration determination model training unit that trains an effective string vibration determination model that determines the effectiveness of vibration of each string from the volume pitch characteristic data using the training performance information.
Effective string vibration determination model training device with.

With electronic stringed instruments
The performance information prediction device according to any one of claims 1 to 5.
The effective string vibration determination model training device according to claim 6 and
Performance information generation system with.

A step in which one or more processors generate volume pitch featured data, including volume information and pitch information for each string, from string vibration waveform data representing string instrument performance.
A step in which the one or more processors predict the performance information of the stringed instrument performance from the volume pitch characterization data using the trained effective string vibration determination model.
Performance information prediction method having.

A step in which one or more processors generate volume pitch characteristic data including volume information and pitch information of each string from string vibration waveform data representing a stringed instrument performance played according to training performance information.
A step in which the one or more processors use the training performance information to train an effective string vibration determination model that determines the effectiveness of vibration of each string from the volume pitch characteristic data.
Effective string vibration determination model training method with.