JPWO2019023256A5

Info

Publication number: JPWO2019023256A5
Application number: JP2020504152A
Authority: JP
Publication date: 2022-10-12

Description

In some embodiments, a method of providing repetitive exercise therapy comprises: providing a repository of audio content; selecting audio content for delivery to a patient; performing an analysis of the selected audio content, wherein the analysis identifies high-level and low-level characteristics of the audio content and calculates a tempo of the audio content; performing an entrainment analysis of the audio content, wherein the entrainment analysis assigns a suitability score to factors including at least one of average tempo, tempo variance, tempo perception, time signature, rhythm pattern variance, detection of rhythmic parts in a plurality of sections throughout the audio content, and the positions of the first and last beats in the audio content; and generating entrainment auxiliary cues for the audio content, wherein the auxiliary cues include sounds added to the audio content, the added sounds including at least one of a single percussive sound played on the quarter notes of the audio content, a percussive sound played on the beats of the audio content and their subdivisions, a drum pattern synchronized to the audio content, and a voice counting the beats of the audio content.

The equation below determines entrainment suitability (ES); the values obtained from the analysis range from 0 to 1. A score of 0.9 to 1 is excellent, 0.7 to 0.9 is usable, 0.5 to 0.7 requires pre-enhancement, and below 0.5 is unusable. This equation, or a variation of it, is used to classify different pieces of music. Time signature and average tempo are each expressed as a binary 0 or 1, depending on whether the value falls within the defined boundaries. The weights y1, y2, y3, ..., yX sum to 1 and can be varied depending on other contextual information. The other variables range from 0 to 1, with 1 being the best case and 0 the worst. The equation is as follows:
Entrainment suitability = (time signature) × (average tempo) × (y1 × beat intensity + y2 × beat time confidence + y3 × rhythm stability + y4 × tempo perception + y5 × rhythm ubiquity + y6 × effective playing time)
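The equation above can be illustrated with a minimal Python sketch. The function names, the factor values, the equal weights, and the treatment of band boundaries (for example, whether exactly 0.9 counts as excellent) are assumptions made for illustration and are not specified in the description.

```python
def entrainment_suitability(time_signature_ok, average_tempo_ok, factors, weights):
    """Return the ES score: two binary gates times a weighted sum of factors."""
    if set(factors) != set(weights):
        raise ValueError("factors and weights must use the same names")
    if abs(sum(weights.values()) - 1.0) > 1e-9:
        raise ValueError("the weights y1..yX must sum to 1")
    if any(not 0.0 <= value <= 1.0 for value in factors.values()):
        raise ValueError("each factor must lie in the range 0 to 1")
    weighted_sum = sum(weights[name] * factors[name] for name in weights)
    # Each gate is 1 when the value is within its defined boundaries, else 0.
    return int(bool(time_signature_ok)) * int(bool(average_tempo_ok)) * weighted_sum


def classify(score):
    """Map an ES score to the bands in the text (boundary handling is assumed)."""
    if score >= 0.9:
        return "excellent"
    if score >= 0.7:
        return "usable"
    if score >= 0.5:
        return "requires pre-enhancement"
    return "unusable"


# Hypothetical factor values and equal weights, for illustration only.
FACTORS = {
    "beat_intensity": 0.9,
    "beat_time_confidence": 1.0,
    "rhythm_stability": 0.8,
    "tempo_perception": 1.0,
    "rhythm_ubiquity": 0.2,
    "effective_playing_time": 1.0,
}
WEIGHTS = {name: 1.0 / len(FACTORS) for name in FACTORS}

score = entrainment_suitability(True, True, FACTORS, WEIGHTS)
print(round(score, 3), classify(score))
```

Because the two binary terms multiply the weighted sum, a song with an unacceptable time signature or average tempo scores 0 regardless of its other factors.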

Aspects of the entrainment suitability equation are further defined in FIG. 4, which illustrates entrainment suitability.
Average tempo
The average tempo of a song is measured in beats per minute (BPM). In addition to being an important ES factor, average tempo is a useful criterion for selecting music for a RAS session. Although the system can time-stretch a song arbitrarily, the effect becomes more noticeable the further the song is stretched from its native tempo, and the best results are obtained within 20% of the native tempo. Therefore, when selecting music for a RAS session, the native tempo should ideally be within 20% of the session cadence range.
A song with an average tempo of 60 to 130 BPM (the typical entrainment range) has a score of 1.0. The score decreases logarithmically over the 20 BPM outside either end of this range, so that 40 BPM and 150 BPM are assigned a score of 0.0.
Enhancement strategy: The music can be time-shifted by a constant factor to bring the average BPM into the entrainment range or to match the user's target entrainment cadence.
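The tempo scoring rule can be sketched as follows. The description says only that the score falls off logarithmically to 0.0 at 40 and 150 BPM; the exact curve used here (linear interpolation in log-BPM) is an assumption, as are the function and parameter names.

```python
import math


def average_tempo_score(bpm, low=60.0, high=130.0, margin=20.0):
    """Score 1.0 inside [low, high], falling to 0.0 at `margin` BPM outside."""
    if bpm <= 0:
        raise ValueError("bpm must be positive")
    if low <= bpm <= high:
        return 1.0
    if bpm < low:
        edge, zero_point = low, low - margin
    else:
        edge, zero_point = high, high + margin
    # Assumed falloff: position of bpm between zero_point and edge on a log scale.
    fraction = (math.log(bpm) - math.log(zero_point)) / (
        math.log(edge) - math.log(zero_point)
    )
    return max(0.0, min(1.0, fraction))


def within_stretch_limit(native_bpm, target_bpm, limit=0.20):
    """True if the target tempo is within 20% of the song's native tempo."""
    return abs(target_bpm - native_bpm) / native_bpm <= limit


for bpm in (40, 50, 60, 130, 140, 150):
    print(bpm, round(average_tempo_score(bpm), 3))
```

The second helper reflects the selection guideline that time-stretching works best within 20% of the native tempo.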
Beat intensity
As shown in FIG. 5, beat intensity is the RMSE at the detected beat times (the median over the song), scaled linearly to the range 0.0 to 1.0. The more prominently the loudness of the beat is perceived, the better suited it is as a RAS stimulus; a prominent beat often indicates that the beat is played by percussion parts. A value of 1 is the strongest and 0 the weakest.
Michael Jackson's "Billie Jean" is an example of the highest beat intensity, as is evident from the energy in its percussive spectrogram (which shows the percussive components of the signal as instants with energy spread vertically across multiple frequency bins).
Enhancement strategy: Beat intensity enhancement strategies are discussed in detail in Section 3. They include adding musical cues at the beat times.
Beat time confidence
The beat time confidence score is returned by the beat tracking stage of the music analysis, based on the level of agreement between the beats derived from each set of ODF detections. A higher score indicates better suitability because it means that several approaches detected a similar prominent rhythmic pulse, which tends to indicate that the song has unambiguous rhythmic and timing characteristics.
Beat time confidence scores map to ES score values as follows: 0.0 to 1.5 is considered low confidence and is assigned a score of 0; 1.5 to 3.5 indicates good confidence and is assigned a score of 0.5; 3.5 to 5.3 indicates high confidence and is assigned a score of 1.0.
Enhancement strategy: The confidence score can improve as a side effect of (re-)analysis and of beat tracking improvements such as ODF weighting and pre-processing steps.
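The mapping from beat tracker confidence to an ES score value is a simple step function, sketched below. The description does not say which band the shared boundary values 1.5 and 3.5 belong to; assigning them to the higher band is an assumption, as is the function name.

```python
def beat_confidence_to_es(confidence):
    """Map a beat time confidence (0.0 to 5.3) to an ES score value."""
    if not 0.0 <= confidence <= 5.3:
        raise ValueError("confidence must lie in the range 0.0 to 5.3")
    if confidence < 1.5:
        return 0.0  # low confidence
    if confidence < 3.5:
        return 0.5  # good confidence
    return 1.0      # high confidence


for value in (0.8, 2.0, 4.1):
    print(value, beat_confidence_to_es(value))
```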
Time signature
This is the average time signature of the song (a summary characteristic). For tasks that are binary in nature, duple or quadruple meter (e.g., 2/4, 4/4, 6/8) is suitable. The score is 1 if the song has an allowed time signature and 0 otherwise.
Enhancement strategy: Not applicable. The time signature is integral to the musical composition; if it is unclear, the song should not be used.
Tempo perception agreement
This is the level of agreement on the estimated tempo, as determined from observed user entrainment data. A common problem in tempo detection is its inherent subjectivity. A well-known issue is the "octave error," in which some listeners perceive the beat at half or double the rate of other listeners. The tempo calculated by the system should match the tempo perceived by human listeners.
The value is either 0 or 1: 1 if the tempos agree, and 0 for half-time and/or double-time. Because this factor depends largely on user-observed data, it is most likely to be used in, and factored into, the re-analysis of a song.
Enhancement strategy: The accuracy of this detection is expected to improve with user-observed data.
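One way to illustrate an octave error check is to compare the system-calculated tempo with the tempo implied by observed user data. In this sketch the 4% tolerance, the function names, and the choice to score any non-matching ratio as 0 are assumptions; the description defines only the matching case (1) and the half-time/double-time case (0).

```python
def classify_tempo_ratio(system_bpm, observed_bpm, tolerance=0.04):
    """Label the relation between the observed tempo and the system tempo."""
    if system_bpm <= 0 or observed_bpm <= 0:
        raise ValueError("tempos must be positive")
    ratio = observed_bpm / system_bpm
    for label, target in (("match", 1.0), ("half-time", 0.5), ("double-time", 2.0)):
        if abs(ratio - target) <= tolerance * target:
            return label
    return "other"


def tempo_perception_score(system_bpm, observed_bpm, tolerance=0.04):
    """Return 1 if the tempos agree, otherwise 0."""
    label = classify_tempo_ratio(system_bpm, observed_bpm, tolerance)
    return 1 if label == "match" else 0


print(classify_tempo_ratio(120, 60), tempo_perception_score(120, 60))
print(classify_tempo_ratio(120, 119), tempo_perception_score(120, 119))
```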
Rhythm ubiquity
This is the percentage of the song's duration that contains prominent rhythmic elements. The presence of rhythmic parts is good for entrainment, because these are effectively the RAS stimulus. Sections where the rhythmic parts drop out of the song can disrupt the flow and make beat times harder to detect (lowering the beat time confidence score). One approach to measuring ubiquity in a song is to detect the presence of percussive elements in the percussive spectrogram (see FIGS. 6-8).
The score ranges from 0.0 (0% rhythm ubiquity) to 1.0 (100% rhythm ubiquity).
Enhancement strategy: Cues can be added to sections that have known, confident beat times but low beat intensity, thereby increasing the overall ubiquity of rhythmic parts.
Examples:
As mentioned above, "Uptown Funk" contains constant percussive parts from beginning to end and therefore has a high rhythm ubiquity score of 1.0. Of particular interest are the large broadband spikes in the percussive spectrogram. Even in the intro section (0.00-0.16), where the spikes are smaller, the percussive part is clearly discernible.
As shown in FIG. 9, an example of a song with low rhythm ubiquity is Björk's "Mutual Core." The song has two distinct sections containing rhythmic parts, but these make up only 60 seconds (20%) of the 306-second song duration, giving a low rhythm ubiquity score of 0.2.
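The "Mutual Core" figures can be reproduced with a short sketch that sums the duration of the rhythmic sections and divides by the song duration. The section boundaries below are hypothetical; the description gives only the totals (60 seconds of rhythmic material in a 306-second song).

```python
def rhythm_ubiquity(rhythmic_sections, song_duration):
    """Fraction of the song covered by rhythmic sections, given (start, end) pairs."""
    if song_duration <= 0:
        raise ValueError("song_duration must be positive")
    covered = 0.0
    last_end = 0.0
    for start, end in sorted(rhythmic_sections):
        start = max(start, last_end)    # do not count overlapping time twice
        end = min(end, song_duration)   # clip to the end of the song
        if end > start:
            covered += end - start
            last_end = end
    return covered / song_duration


# Hypothetical boundaries for two 30-second rhythmic sections in a 306-second song.
sections = [(96.0, 126.0), (210.0, 240.0)]
print(round(rhythm_ubiquity(sections, 306.0), 1))
```

Here 60 / 306 is approximately 0.196, which rounds to the score of 0.2 given in the text.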
Effective playing time
The usable playing time, after removing unsuitable, unaddressable sections, should be at least 60 seconds long. This requirement prevents the use of edge-case short songs (Tom Waits's "Let Me Down Up On It," which is only 0.53 seconds long) and ensures that sufficient length remains if structural modifications are applied.
If the usable song duration is greater than or equal to the minimum threshold of 60 seconds, the score is 1.0; otherwise, the score is 0.0.
Enhancement strategy: Not applicable. If the audio signal is not long enough to be used, another song should be used.
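The threshold rule can be sketched as follows, assuming that the removed sections are given as non-overlapping (start, end) pairs in seconds. The function name and input format are illustrative assumptions.

```python
def effective_playing_time_score(song_duration, removed_sections, minimum=60.0):
    """Return 1.0 if the usable duration reaches the minimum, otherwise 0.0."""
    removed = sum(
        max(0.0, min(end, song_duration) - max(start, 0.0))
        for start, end in removed_sections  # assumed not to overlap
    )
    usable = song_duration - removed
    return 1.0 if usable >= minimum else 0.0


print(effective_playing_time_score(306.0, [(0.0, 30.0)]))
print(effective_playing_time_score(75.0, [(0.0, 20.0)]))
```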
Rhythm stability
Rhythm stability is a composite score (0.0 to 1.0) that indicates the amount of variance in the rhythmic/metric aspects of a song as a whole, taking into account tempo drift, tempo modulation, time signature changes, and rhythm pattern variance.
Rhythm stability values range from 0 to 1, with 1 being the best and 0 the worst. High rhythm stability indicates little fluctuation and therefore content that is well suited to RAS sessions. The equation below uses weights x1, x2, x3, ..., xZ, which sum to 1, each multiplied by the corresponding rhythm stability factor A1, A2, A3, ..., AZ, each of which is a number between 0 and 1.
Rhythm stability = x1 × A1 + x2 × A2 + x3 × A3 + ... + xZ × AZ
Enhancement strategy: Tempo drift can be reduced through audio quantization. Problematic sections of a song can be skipped so that only suitable sections are used.
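As a worked illustration of the weighted sum, the sketch below combines four factor scores (A1 to A4) with weights that sum to 1. The factor values and the equal weights are hypothetical.

```python
def rhythm_stability(factors, weights):
    """Weighted sum x1*A1 + x2*A2 + ... + xZ*AZ of rhythm stability factors."""
    if len(factors) != len(weights):
        raise ValueError("factors and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("the weights x1..xZ must sum to 1")
    if any(not 0.0 <= a <= 1.0 for a in factors):
        raise ValueError("each factor must lie in the range 0 to 1")
    return sum(x * a for x, a in zip(weights, factors))


# Hypothetical scores for A1 (tempo drift), A2 (tempo modulation),
# A3 (time signature change), and A4 (rhythm pattern variance).
factors = [0.9, 0.5, 1.0, 0.8]
weights = [0.25, 0.25, 0.25, 0.25]
print(round(rhythm_stability(factors, weights), 3))
```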
Rhythm stability factors
1. Tempo drift - A1
Obtained by subtracting from 1.0 the percentage of beat time differences, measured against the allowable band of perceptible variation around the median beat time difference: a song with 100% variance has a score of 0 (1.0 - 1.0), and a song with 0% variance has a score of 1.0 (1.0 - 0.0).
Some tempo variation is normal in any human musical performance, especially when the recording was not made with a click track or computer-sequenced accompaniment (e.g., a drum machine or digital audio workstation). Wide variation results in a low tempo stability score. Moby's "Thousand" is an extreme example of high tempo variance: its tempo varies constantly throughout the song, peaking at 1,000 BPM.
The following are examples of gradual tempo changes that can occur in a song, as shown in FIGS. 8-9:
● Ritardando: gradually slowing down
● Accelerando: gradually speeding up
● Rubato: the performer speeds up and slows down expressively (the tempo can change according to the musical phrasing)
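A tempo drift score of this kind can be sketched in Python under one possible reading of the definition: the "variance" is taken to be the fraction of inter-beat intervals that fall outside a tolerance band around the median interval. That reading, the 4% band, and the function name are all assumptions; the description does not give the width of the perceptible variation band.

```python
from statistics import median


def tempo_drift_score(beat_times, tolerance=0.04):
    """Return 1.0 minus the fraction of beat intervals outside the tolerance band."""
    if len(beat_times) < 3:
        raise ValueError("at least three beat times are needed")
    deltas = [later - earlier for earlier, later in zip(beat_times, beat_times[1:])]
    mid = median(deltas)
    outside = sum(1 for delta in deltas if abs(delta - mid) > tolerance * mid)
    return 1.0 - outside / len(deltas)


steady = [0.0, 0.5, 1.0, 1.5, 2.0]          # constant 120 BPM
slowing = [0.0, 0.5, 1.0, 1.5, 2.25, 3.0]   # ritardando in the last two beats
print(tempo_drift_score(steady), tempo_drift_score(slowing))
```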
2. Tempo modulation - A2
Tempo modulation occurs when the tempo of a song suddenly speeds up or slows down by more than 5% from the original tempo and the new tempo is sustained. Tempo changes in the range of 5% to 25% are considered addressable by time shifting. Changes of 0% to 5% are assigned a score of 1. From 5% to 25%, the score decreases linearly, and changes of 25% or more are assigned a score of 0.
One type of tempo modulation is "metric modulation," in which the tempo and/or meter changes by recontextualizing the current beat, or a grouping of beat subdivisions, as a different pulse value. An example can be heard in Arcade Fire's "Here Comes the Night Time," where the tempo changes abruptly at 4:36 from approximately 95 BPM to approximately 145 BPM, with a 3/16-note grouping at 95 BPM becoming the new quarter note at 145 BPM (a 1.5x tempo increase).
An example of tempo modulation unrelated to the metric pulse is shown in FIG. 10, a tempogram of "Band on the Run" by Paul McCartney & Wings. At 2:14, the tempo changes abruptly from 81 BPM to 127 BPM, a 57% increase. The line represents the local tempo value. In this case, a structural modification can be made so that a portion of the song, in the time region either before or after the tempo change, can be used in a session (see "Structural modification" in Section 3 below).
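The piecewise scoring rule for tempo modulation can be written directly from the thresholds in the text. The function name is illustrative; the "Band on the Run" tempos are those given above.

```python
def tempo_modulation_score(original_bpm, new_bpm):
    """Score 1 for changes up to 5%, 0 at 25% or more, linear in between."""
    if original_bpm <= 0 or new_bpm <= 0:
        raise ValueError("tempos must be positive")
    change = abs(new_bpm - original_bpm) / original_bpm
    if change <= 0.05:
        return 1.0
    if change >= 0.25:
        return 0.0
    return 1.0 - (change - 0.05) / 0.20


# "Band on the Run": 81 BPM to 127 BPM is a change of about 57%, so the score is 0.
change_percent = abs(127 - 81) / 81 * 100
print(round(change_percent), tempo_modulation_score(81, 127))
```

A change of 15%, halfway through the linear region, would score approximately 0.5 under this rule.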
3. Time signature change - A3
A time signature change occurs when a song shifts from one time signature to another partway through, for any duration. Assuming a song begins in 4/4 time, a single bar containing an odd number of beats, such as a bar of 3/4, can reverse the left/right synchronization of binary movement with the phase of the music (assuming that the musical phrasing is aligned with the bar structure). This kind of shift is a binary disqualifier and is assigned a score of 0. If there is no time signature change, the score is 1.
The Beatles' "Happiness Is a Warm Gun" is an example of a problematic time signature change: the song begins in 4/4 time but later shifts to alternating bars of 9/8 and 10/8 time.
4. Rhythm pattern variance - A4
Rhythm pattern variance is a measure of the similarity of adjacent patterns in a song and can be obtained using techniques such as detrended fluctuation analysis (DFA) or autocorrelation of inter-onset intervals. A song with highly homogeneous rhythmic patterns has higher rhythm stability.
A song with perfect homogeneity (100%) has a value of 1, and a song with no homogeneity (0%) has a value of 0. Note that a value of 0 is not realistic in practice, because random homogeneity is often greater than 30%.

Claims

1. A method of providing repetitive exercise therapy, the method being implemented on a computer system comprising a processor configured by computer-executable code to perform the method, the method comprising:
accessing, using the processor, one or more pieces of audio content;
selecting, using the processor, a piece of audio content to deliver to a patient;
performing, by the processor, an analysis of the piece of audio content, wherein the analysis identifies audio characteristics of the piece of audio content and includes extracting rhythmic and structural characteristics of the piece of audio content;
performing, by the processor, an entrainment suitability analysis of the piece of audio content;
generating, by the processor, entrainment auxiliary cues for the piece of audio content based on the entrainment suitability analysis, the auxiliary cues including sounds to be added to the piece of audio content;
applying, by the processor, the auxiliary cues to the piece of audio content for output of the auxiliary cues in synchrony with the piece of audio content;
evaluating, by the processor, the effect of the piece of audio content and the applied auxiliary cues on entrainment of the patient, based on biometric data of the patient obtained using a sensor while the piece of audio content and the applied auxiliary cues are played;
continuing to play the piece of audio content if the piece of audio content and the applied auxiliary cues are determined to be effective for entrainment of the patient; and
if the piece of audio content and the applied auxiliary cues are determined to be ineffective for entrainment of the patient, performing the entrainment suitability analysis and repeating the evaluating of the effect.

2. The method of claim 1, further comprising updating a database of audio content to incorporate feedback from the evaluating step.

3. The method of claim 1, further comprising providing a range to a beat tracking algorithm, wherein the beat tracking algorithm performs the analysis based on the range.

4. The method of claim 3, wherein the piece of audio content comprises music and the range is the minimum and maximum tempo averages of a musical genre.

5. The method of claim 1, wherein analyzing the piece of audio content comprises applying an onset detection function (ODF) that extracts rhythmic features of the piece of audio content.

6. The method of claim 5, wherein the ODF transforms the time domain of the audio signal to the time frequency domain.

7. The method of claim 1, further comprising making changes to the piece of audio content, wherein at least one change comprises adjusting the tempo of the piece of audio content.

8. The method of claim 1, wherein the piece of audio content is streamed to the patient.

9. A method of providing repetitive exercise therapy, comprising:
accessing, using a processor, one or more pieces of audio content;
selecting, using the processor, a piece of audio content to deliver to a patient;
performing, by the processor, an analysis of the piece of audio content, wherein the analysis includes identifying audio characteristics of the piece of audio content and determining a tempo of the piece of audio content;
providing the piece of audio content to the patient for playback during repetitive exercise therapy;
evaluating, by the processor, the effect of the piece of audio content on entrainment of the patient, based on biometric data of the patient obtained using a sensor while the piece of audio content is played to the patient; and
if the piece of audio content is not effective for the patient:
performing, by the processor, an entrainment suitability analysis of the piece of audio content, the entrainment suitability analysis comprising assigning suitability scores to a plurality of audio characteristics;
generating, by the processor, entrainment auxiliary cues for the piece of audio content based on the entrainment suitability analysis, the auxiliary cues including sounds to be added to the piece of audio content;
applying, by the processor, the auxiliary cues to the piece of audio content for output of the auxiliary cues in synchrony with playback of the piece of audio content; and
evaluating, by the processor, the effect of the piece of audio content and the applied auxiliary cues on entrainment of the patient, based on biometric data of the patient obtained using a sensor while the piece of audio content and the applied auxiliary cues are played.

10. The method of claim 9, wherein the entrainment suitability analysis calculates an entrainment suitability score as a function of at least one of a plurality of audio characteristics determined from the piece of audio content, the plurality of audio characteristics consisting of average tempo, beat strength, beat time confidence, rhythm stability, time signature, tempo perception confidence, and effective playing time.

11. The method of claim 9, wherein generating auxiliary cues for entrainment comprises generating a single beat to be played on each beat of the piece of audio content.

12. The method of claim 9, wherein the entrainment auxiliary cue is output to one ear of the patient.

13. The method of claim 9, wherein the entrainment auxiliary cues are added to sections of the piece of audio content that have low beat strength.

14. The method of claim 9, further comprising modifying the piece of audio content, the modification comprising adjusting the tempo of the piece of audio content.

15. The method of claim 14, wherein modifying the piece of audio content comprises providing drum enhancements to a drum track of the piece of audio content.

16. The method of claim 14, wherein modifying the piece of audio content comprises modifying the structure of the piece of audio content.

17. The method of claim 14, wherein modifying the piece of audio content comprises altering the tempo to lengthen the piece of audio content.

18. A method of providing repetitive exercise therapy, comprising:
accessing, using a processor, one or more pieces of audio content;
selecting, using the processor, a piece of audio content to deliver to a patient;
performing, by the processor, an analysis of the piece of audio content, wherein the analysis includes identifying audio characteristics of the piece of audio content, including a tempo of the piece of audio content;
performing, by the processor, an entrainment suitability analysis of the piece of audio content, the entrainment suitability analysis comprising assigning scores to one or more audio characteristics including at least one of:
average tempo,
beat strength,
tempo variance,
tempo perception,
time signature,
rhythm pattern variance,
song playing time,
rhythm ubiquity, and
the positions of the first beat and the last beat in the piece of audio content; and
generating, by the processor, entrainment auxiliary cues based on the entrainment suitability analysis, the auxiliary cues including sounds to be added to the piece of audio content, the sounds including at least one of:
a single percussive sound played on the beat of the piece of audio content,
a percussive sound played on the beats of the piece of audio content and their subdivisions,
a drum pattern synchronized to the piece of audio content, and
a voice counting the beats of the piece of audio content.

19. The method of claim 18, further comprising determining an entrainment suitability score for the piece of audio content based on a correlation between the patient's cadence and a tempo of the piece of audio content, wherein the patient's cadence is acquired by a sensor while the piece of audio content is played to the patient.

20. The method of claim 19, wherein the entrainment suitability score is determined before and after adding entrainment ancillary cues to the piece of audio content.