WO1999034345A1 - Procede et appareil d'entrainement des capacites auditives - Google Patents

Procede et appareil d'entrainement des capacites auditives Download PDF

Info

Publication number
WO1999034345A1
WO1999034345A1 PCT/US1998/027849 US9827849W WO9934345A1 WO 1999034345 A1 WO1999034345 A1 WO 1999034345A1 US 9827849 W US9827849 W US 9827849W WO 9934345 A1 WO9934345 A1 WO 9934345A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
auditory
pattern
auditory pattern
response
Prior art date
Application number
PCT/US1998/027849
Other languages
English (en)
Inventor
Gal A. Cohen
Anton Krukowski
Charles Boatwright
Original Assignee
Cohen Gal A
Anton Krukowski
Charles Boatwright
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cohen Gal A, Anton Krukowski, Charles Boatwright filed Critical Cohen Gal A
Priority to AU20988/99A priority Critical patent/AU2098899A/en
Publication of WO1999034345A1 publication Critical patent/WO1999034345A1/fr

Links

Classifications

    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/04Speaking
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B15/00Teaching music
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B15/00Teaching music
    • G09B15/02Boards or like means for providing an indication of notes
    • G09B15/04Boards or like means for providing an indication of notes with sound emitters

Definitions

  • the present invention relates to a method and apparatus for training auditory skills. More particularly, the present invention relates to a method and apparatus for training music and language skills.
  • Training auditory skills in a music context includes training the ability to identify and match notes and intervals in a complex acoustic environment. This training is often performed using simple trial and error methods, where a note or interval is played and the student is asked to identify. This technique does not use adaptive difficulty.
  • Training auditory skills in a language context includes training the ability to discriminate between the elements of speech which make up words. This training has been performed by stretching speech in the time domain, in other words slowing down the speech. However, sounds sound fundamentally different when they are synchronous than when they are separate. Many individuals have good timing discrimination but still have bad accents.
  • Training auditory skills in a language context also includes training the ability to reproduce words without an accent. This training is often performed using simple playback and repetition methods, where an unaccented word or phrase is played and the user attempts to repeat the word or phrase as just played. However, there is no feedback and no adaptive difficulty.
  • the present invention is directed towards a method for generating an auditory pattern for training auditory skills. This includes receiving an auditory pattern having at least one subpattern; receiving at least one response from a user, the response representing a user perception of the auditory pattern; and modifying a frequency of at least one element based on at least one response from the user.
  • FIGURES 1A-1C show examples of non-ideal and ideal training tasks and shortcomings of non-ideal training tasks.
  • FIGURE 1C shows one embodiment of the present method for an ideal training task.
  • FIGURES 2A-2B show one embodiment of a method for adjusting task difficulty.
  • FIGURE 2C shows a flow process diagram of one embodiment for training auditory skills of the present invention.
  • FIGURE 3 A shows one embodiment of an apparatus capable of generating auditory patterns for use with the method of the present invention.
  • FIGURES 3B-3D show a method for generating auditory patterns suitable for use with the method of the present invention.
  • FIGURE 3E shows one embodiment of an apparatus capable of generating auditory patterns for use with the method of the present invention.
  • FIGURE 4A shows one embodiment of a method of the present invention for generating complex auditory patterns from basic auditory patterns.
  • FIGURE 4B shows one embodiment of a method of the present invention for generating auditory patterns.
  • FIGURES 5A-5H show a graphical representations of the auditory patterns described in the examples.
  • the present invention describes an ear training method and associated set of devices, designed to improve the ability to identify and match harmonic elements in a complicated auditory environment.
  • the present invention seeks to improve the ability to discern absolute and relative pitch, to identify the pitch intervals between two or more notes, and to improve the memory of a previously heard pitch or interval that is masked by a distracting sound. Further uses of this invention include training the ability to sing or play an instrument in tune and/or in harmony with a concurrent musical background. This invention also has applicability towards general language skill training, including reducing spoken accents and teaching foreign languages or dialects.
  • the ear training method of the present invention involves a series of tasks, in which a user is asked to categorize an auditory pattern or element thereof, or to categorize the difference between two auditory patterns. Task difficulty is increased until user can consistently perform categorization, and then decreased.
  • the auditory patterns presented to the user are derived from an initial set of two auditory patterns.
  • a continuum is defined based on the difference of frequency characteristics of 2 patterns. Derived auditory patterns are generated each trial such that they lie on this continuum, and such that the difference between them is not less than a certain value. This minimum difference of frequency characteristics is increased until the user can consistently discriminate between the two derived patterns, and then decreased.
  • Other embodiments of the method of the present invention employed in this patent to modulate task difficulty include frequency shifting, frequency modulation, and amplitude modulation of elements of auditory patterns.
  • the number of elements per auditory category, and number of categories as well as difference between categories may vary from trial to trial to increase or decrease task difficulty.
  • This training methodology seeks to train sharper tuning of auditory classification by continuous adaptation of task difficulty to maintain optimal learning.
  • Music and language training both have in common the underlying goal of teaching the user to differentiate between categories of sound patterns.
  • a typical training method presents auditory patterns, and asks the user to differentiate between, or to categorize them. Through enough repetition, it is hoped that the user will eventually improve their discrimination between different patterns.
  • FIGURE 1 A is shown "good" results of a typical approach, for the case where the user is being trained to discriminate between the notes of a musical scale. In this case, individual notes are repeated over and over again, until eventually the user can differentiate between them.
  • this approach is non-optimal for several reasons.
  • auditory processing centers in the human brain which are involved in this type of auditory discrimination task.
  • networks of neurons can be trained, or tuned, to recognize certain auditory patterns, and to reject other auditory patterns.
  • These auditory patterns can be thought to lie on a continuum.
  • the continuum is one-dimensional, and can be described by the underlying frequency of a given auditory pattern.
  • the different neural networks which are involved in this recognition task will exhibit tuning to certain bandwidths of frequency.
  • the ideal goal of this task should be to tune the receptive fields as narrowly as possible, and to center the receptive fields directly at the desired frequency, as shown in the second column.
  • the more usual result, shown in the first column, is to differentiate the receptive fields to the point where they include the desired note, but where they are broad and ill- centered.
  • An expected result of this non-ideal result would be that the user could still sing significantly off-key when trying to match a given note, because his singing would still fall into the correct receptive field, and would thus sound correct.
  • FIGURE IB A second problem with standard training protocols is shown in FIGURE IB.
  • a very simple training task is being performed: the user is being trained to differentiate between a "C” and a "D".
  • the current state of their receptive field tuning is such that a single broad receptive field includes both these notes.
  • "D" when “D” is played to the user, it sounds just like a "C", because the user can not discriminate between these two sounds.
  • music and language learning is characterized by "plateaus," in which the user seems to be stuck.
  • the approach of the present invention to this problem, shown in the second column, would be to modify the discrimination task, such that the two patterns fall into two different receptive fields.
  • the frequency of the "D" is increased (signified by "D+"), to the point where it lies outside the receptive field of the "C.”
  • the frequency of the "D” can be brought back to its normative frequency, as the receptive fields of the user are trained, and adapt and sharpen.
  • This increase of the difference between two auditory patterns, and then the subsequent reduction of the difference is one aspect of the present invention.
  • the change in the difference between the two patterns can be made in very small gradations.
  • the ability to create microtonal gradations, with respect to the frequency characteristics of the patterns, is an advantage of this present invention.
  • an approach that has met with some success is to "stretch" the timing of fast elements of speech in the time domain so that they can be parsed by the user.
  • This approach is limited in two respects. Firstly, sounds sound fundamentally different when they are synchronous, then when they are separate. As an example, many people, who have good timing discrimination, have bad accents. Thus, micro-tuning along a continuum in the frequency domain, is required.
  • a more ideal approach, which is utilized in the present invention would be to adaptively create categories for discrimination through combining accented and un- accented speech. This approach would train the user to discriminate along a spectrum of sounds which includes those which might be encountered in normal experience.
  • the present invention includes provisions to increase the differentiation between two auditory patterns by stretching the difference between the frequency, rather than the timing, characteristics of the two auditory patterns, until the user can correctly categorize the difference between the two patterns. The difference is then reduced.
  • These axes include frequency modulation and amplitude modulation.
  • This invention includes provisions for increasing the frequency modulation or amplitude modulation differences between elements of a pattern, until a user can differentiate between them, and then reducing this difference.
  • the present invention describes an ear training method and associated set of devices, designed to improve the ability to identify and match harmonic elements in a complicated auditory environment.
  • This invention is designed to improve the ability to discern absolute and relative pitch, to identify the pitch intervals between a plurality of notes, and to improve the memory of a previously heard pitch or interval that is masked by a distracting sound. Further uses of this invention include training the ability to sing or play an instrument in tune and/or in harmony with a concurrent musical background.
  • This invention also has applicability towards general language skill training, including reducing spoken accents and teaching foreign languages or dialects.
  • the ear training method described involves a series of tasks, in which a user is asked to categorize an auditory pattern or element thereof, or to categorize the difference between a plurality of auditory patterns or a plurality pattern elements.
  • Task difficulty is increased until user can consistently perform categori3zation, and then decreased.
  • the gradations in task difficulty can be of any granularity.
  • the auditory patterns presented to the user are derived from an initial set of two auditory patterns.
  • a continuum is defined based on the difference of frequency characteristics of 2 patterns. Derived auditory patterns are generated each trial such that they lie on this continuum, and such that the difference between them is not less than a certain value. This minimum difference of frequency characteristics is increased until the user can consistently discriminate between the two derived patterns, and then decreased.
  • inventions employed in the present invention to modulate task difficulty include frequency shifting, frequency modulation, and amplitude modulation of elements of auditory patterns.
  • the number of elements per auditory category, and number of categories as well as difference between categories can vary from trial to trial to increase or decrease task difficulty.
  • This training methodology seeks to train sharper tuning of auditory classification by continuous adaptation of task difficulty to maintain optimal learning. Again, the approach is to increase the difference between a plurality of patterns or pattern characteristics until the user can consistently discriminate between them, and then to reduce the difference.
  • a hardware implementation, and a software emulation of the hardware implementation, are described.
  • a series of tasks, which could be included as part of the device, are described below. Note that different versions of the basic device may be built by including different combinations of the tasks below, as well as other tasks.
  • FIGURE IA shows a flow process diagram of one embodiment of a method for auditory training 100 of the present invention.
  • the method includes: (1) receiving an auditory pattern having at least one element (block 110); (2) receiving at least one response from a user, the response representing a user perception of the auditory pattern (block 120); and (3) modifying a characteristic of at least one element based on at least one response from the user (block 130).
  • Receiving an auditory pattern having at least one element (block 110). Typically, this is an auditory pattern which has been presented to a user.
  • An auditory pattern may be sampled, synthesized, or recorded, or a combination.
  • An auditory pattern may be made up of other auditory patterns or subpatterns. Each subpattern includes at least one element.
  • Elements include musical notes and voicings. Elements may also include components of speech, such as consonants, vowels, words, and phonemes.
  • Each element has at least one characteristic. Characteristics may include a frequency, an amplitude, a frequency modulation, a frequency interval, an amplitude modulation, an accented or non-normative pronunciation, a difference between two characteristics, a difference between two elements, and a difference between two auditory patterns.
  • the response from the user may be an identification, comparison, or matching of the auditory pattern.
  • the response may be an identification or matching of absolute or relative pitches and/or frequency intervals of the auditory pattern.
  • the response is not limited to the frequency domain.
  • the response may be in the form of a delayed-hold, two-alternative forced choice, or multiple alternative forced choice.
  • the response may also be a pressed button or a vocal response.
  • the response may include an identification of frequencies and intervals, and identification of absolute pitch (perfect pitch training), and identification of absolute interval, a comparison of two intervals, and identification of an isolated note with a note in an interval, and a comparison of a note with a note in an interval.
  • the intervals may be concurrent or staggered, one note in an interval may be a stack, notes or elements of stacks may have differing envelopes, amplitude and/or frequency modulation, and notes in an interval may be of differing amplitude.
  • the response may include: sharp, flat, same, different, sharper than, flatter than; musical notes, including A, B, C, D, E, F, and G musical intervals including first, second, third, fourth, etc. more, less, more accented than, less accented than, no input the set of words, phonemes or elements a vocal response
  • Modifying a frequency of at least one element based on at least one response from the user (block 130).
  • Overall task difficulty may be modulated from trial to trial based on the performance of the user.
  • One way to do this is to use a staircase procedure.
  • the staircase procedure the task increases in difficulty if the user responds correctly n trials in a row. Traditionally, n is equal to 3, although this may vary. Also, the task gets easier if the user responds incorrectly m trials in a row. Traditionally, m is picked to equal 1. In this case, each time the user responds incorrectly to a trial, the task gets easier.
  • the staircase procedure enforces that the task is in a regime where learning will take place, and dynamically adjusts the difficulty of the task to maintain it in the current optimal learning range.
  • the trials may be structured using strategies in which the user must make at least one choice, in each trial.
  • One such procedure is the multi-alternative forced-choice procedure, in which the user chooses between several possible defined answers.
  • Another such procedure is the delayed-hold procedure, in which the user must keep pressing a button until the stimulus changes. Note that e may be changed from trial to trial by steps which can be microtonal - not limited to the half steps which characterize western music scales. If the response is wrong five times in a row, then the task may be temporarily halted, and the user given remedial instructions on how to perform the task.
  • the degree of pitch or interval discrimination which is required to complete a task correctly varies from trial to trial.
  • music training may be provided by teaching identification of individual notes as well as intervals between notes.
  • the auditory patterns are not limited to the musical scale, and microtonal differences may also be presented, as when a note is played sharp or flat.
  • the method of the present invention may be used with pitch differences of any size.
  • the modified auditory pattern is presented to the user again and the process repeated.
  • FIGURES 1 A-1C show examples of non-ideal and ideal training tasks and shortcomings of non-ideal training tasks.
  • FIGURE 1 A shows typical training music training tasks where the goal is to identify musical notes on a scale. While the notes lie on a frequency continuum, the non-ideal task does not seek to train sharp differentiation on the frequency scale. Rather, broad categorization of discrimination on the frequency scale is the goal.
  • FIGURE IB shows an example of a non-ideal implementation, where the user is asked to differentiate between the notes C and D. These notes are repeated, however, since the user cannot differentiate between them, no progress is made in this task.
  • FIGURE IC shows an implementation of the present invention of the problem shown in FIGURE IB. When the user is presented with two patterns which he cannot discriminate the difference between, the difference is increased until the user can differentiate the difference between the patterns. This difference can be reduced in time.
  • FIGURES 2 A shows one example of a method for changing task difficulty.
  • FIGURE 2B shows another embodiment of a method for changing task difficulty. If the task is too easy, based on a staircase, the method may increase or not change the number of patterns, characteristics, or elements of the auditory patterns. In addition, differences between the patterns may be narrowed or not changed. If the task is at the correct level of difficulty, there is no change in the number of patterns, characteristics, and elements, and no change in difference between two patterns presented from task to task.
  • FIGURE 2C shows a flow process diagram of one embodiment for training auditory skills of the present invention.
  • An auditory pattern is presented to the user.
  • the user returns an input which represents a categorization of a pattern, an element, and/or a characteristic, or a difference between two or more patterns, elements, or characteristics.
  • the correctness of user response is analyzed and task difficulty is adjusted, for instance, with a staircase.
  • Parameters for generating auditory patterns in the next task are set as a function of whether task difficulty needs to be adjusted. Based on the parameters, the next auditory pattern is generated and presented to the user.
  • FIGURE 3 A shows one embodiment of an apparatus capable of generating auditory patterns for use with the method of the present invention.
  • a sinusoidal tone generator can present one or more sinusoids which spectral content is described.
  • Various frequency bands can be shifted independently relatively to one another or modulated by a frequency modulation. Amplitude of these bands can then be modulated in amplitude. Other filters are also possible. This tone may be combined with a sampled or stored complex tone which is also described spectrally, modulated in frequency and amplitude, and possibly by other filters. These are then combined to form an element of music or speech. Elements can also be combined to form an auditory pattern.
  • FIGURES 3B shows the frequency characteristics of two patterns, PI being considered a normative pattern, P2 being considered a non-normative pattern. P2 is different from PI because an element of P2 has been shifted in frequency relative to
  • FIGURES 3C and 3D show two possible ways, shown in FIGURES 3C and 3D.
  • FIGURE 3C a linear addition is shown of the frequency characteristics of PI and P2.
  • FIGURE 3D a morphing combination of PI and P2 is shown, creating a new pattern which has a morphed element.
  • FIGURE 3E shows one embodiment of an apparatus capable of generating auditory patterns for use with the method of the present invention.
  • FIGURE 4A show one method for deriving patterns from a continuum defined by two initial patterns.
  • the two initial patterns may be a normative pattern and a non-normative pattern of language, although any two auditory patterns may be used as inputs.
  • the normative pattern is the word "think"
  • the non-normative pattern is a word which sounds like "sink”.
  • the normative and non- normative patterns are decomposed into frequency versus time elements.
  • a correspondence is assigned between elements of the normative and non-normative pattern. This correspondence may include a one-to-one correspondence, a one-to- many correspondence, and a one-to-none or null element correspondence.
  • FIGURE 4B shows one embodiment of a method of the present invention for generating auditory patterns. Differences in the patterns along the continuum, defined in this case by a set of difference in weights termed epsilon.
  • Two patterns are derived from the continuum satisfying the requirement that the absolute value of the difference in weights is greater than or equal to epsilon. These patterns are presented to the user. The user responds with an answer categorizing their perception of the difference between the patterns. Based on a history of answers, for instance using a staircase, epsilon is modified to increase, decrease, or maintain task difficulty. For example, epsilon may be increased until the user can correctly categorize the difference between the patterns. Then epsilon can be decreased to increase task difficulty.
  • FIGURE 4C shows methods for deriving a non-normative pattern of FIGURE 4A.
  • a normative pattern may be presented to the user, and the user's pronunciation of the user pattern may be recorded. The user's pronunciation can be used as the non- normative word.
  • Another case involves presenting the non-normative pattern to the user, and the user responds with the user's pronunciation of the non-normative pattern.
  • the accent can be extracted from the user's pronunciation of the normative pattern. For instance, a set of frequency differences may be defined, representing the difference between the user's pronuncation of the normative pattern and the initial normative pattern. This accent may be applied to the normative word to derive a non- normative word. Another possibility is to use prerecorded normative and non- normative pairs. Yet another possibility is to synthesize or derive normative and non- normative pairs.
  • EXAMPLE 1 The user is presented with a set of two pure tones of different frequencies and the same or variable duration and amplitude.
  • FIGURE 5A shows a graphic representation of this auditory pattern. The user is asked to select the tone with the higher frequency.
  • Tone 1 and Tone 2 Receive an auditory pattern having at least one subpattern (block 110).
  • a first set of two pure tones, Tone 1 and Tone 2 is played one after another.
  • Tone 1 has a single frequency f lake and amplitude a, and duration d
  • Tone 2 has a single frequency f 2 , and amplitude a, and duration d 2 .
  • a time interval ⁇ t may be inserted between the two tones.
  • the absolute value of the difference between f, and f 2 ( ⁇ f) may be equal to zero, or may be equal to or greater than an amount termed e.
  • Tone 3 Modify a frequency of at least one element based on at least one response from the user (block 130).
  • Tone 3 has a single frequency f 3 , and amplitude a 3 and duration d 3 .
  • Tone 4 has a single frequency f 4 , and amplitude a 4 and duration d 4 .
  • Tone 3 may be the same as Tone 1.
  • Tone 4 is selected based on whether the last response and other previous responses were correct.
  • a staircase scheme may be used to adjust e and modulate task difficulty. For example, e is increased for wrong answers to make the task easier, and decreased for correct answers to make the task more difficult.
  • EXAMPLE 2 The user is presented with a single tone or a series of tones played in sequence.
  • FIGURE 5B shows a graphical representation of this auditory pattern.
  • the user is asked to identify and match a single tone or a series of tones.
  • Receive an auditory pattern having at least one subpattern (block 110).
  • the auditory pattern includes a single tone f, or a series of tones f detox f 2 , . . . f n played in sequence.
  • the user may identify the tone or series of tones by the absolute pitch of each tone.
  • the user may identify the tone by frequency, e.g. 660 Hz, or by name, e.g. G.
  • the user may identify the tone or series of tones through a user interface such as a button.
  • a user interface such as a button.
  • only one button would be available to the user.
  • one button corresponding to a tone of 660 Hz is available to the user.
  • the user activates the button if the user perceives the tone to have a frequency of 660 Hz.
  • multiple buttons would be available to the user. For example, three buttons corresponding to 660, 1000, and 1400 Hz are available to the user.
  • the user After the user hears a tone, the user activates the corresponding button if the user perceives the tone to have a frequency of 660, 1000, or 1400 Hz.
  • a second tone or series of tones is selected.
  • ⁇ f between the buttons may be reduced, in other words the number of possible responses available to the user is increased.
  • a staircase scheme may be used to adjust e and modulate task difficulty. For example, e is increased for wrong answers to make the task easier, and decreased for correct answers to make the task more difficult. Three right answers in a row lead to a decrease of e.
  • a variation of this task presents the user with a tone of a single pitch, and asks the user has to select one of three or more buttons corresponding to one or more specific frequencies or notes and a sharp and flat buttons. For example, the user may respond that the tone is middle C, or sharp or flat of middle C by an amount equal to or greater than e, where e is modulated by a staircase scheme.
  • EXAMPLE 3 The user is presented with a set of two tones of different frequencies and the same or variable duration and amplitude.
  • FIGURE 5C shows a graphic representation of this auditory pattern. The user is asked to identify the frequency interval between the tones.
  • Receive an auditory pattern having at least one subpattern (block 110).
  • a first set of two tones, Tone 1 and Tone 2 is played simultaneously or in a staggered fashion.
  • the user indicates the frequency interval between the two tones. For example, the user may respond that the frequency interval was a fifth.
  • a second set of two tones, Tone 3 and Tone 4 is selected.
  • a staircase scheme may be used to modulate task difficulty. The user must press a button to indicate which tones they heard. In a simple task, the user would only have one button to press - the interval is always a fifth. In a more difficult task, more buttons would be added, so now the user must choose between flat second, fourth, and sharp sixth if there are three buttons. Also, to increase difficulty, ⁇ f between the buttons may be reduced.
  • a variation of this example is a task where the user hears one interval, and has to select one of three buttons - a given interval, or a sharp or flat button.
  • the interval could be a fourth, or sharp or flat of a fourth by an amount equal to or greater than e, where e is modulated by the staircase.
  • e is modulated by the staircase.
  • Another variation of this example is to use a delayed hold procedure - the user presses a button to start the trial. A series of intervals are then heard. The user must keep the button down, until an interval different from the others is heard. The user must react within a certain time window.
  • An interval may start at a different root note, each time it is played.
  • the staircase may also be used to modulate the duration, and amplitude, of each tone, as well as the ⁇ t between tones.
  • FIGURE 5D shows a graphical representation of this auditory pattern.
  • the user must indicate whether the second interval was the same as, sharper than, or flatter than, the first interval.
  • a staircase scheme may be used to modulate task difficulty.
  • the durations, amplitudes, and amplitude and frequency modulation of a given tone may be modulated.
  • the time between the two intervals may be modulated.
  • the range of the root notes, as well as the difference in interval between the two intervals may be modulated. Note that having roots which are close to the same frequency for the two intervals may make the task easier, rather than harder, in some cases, and the staircase should account for this.
  • One variation of this example is where the two intervals overlap in time, but have a different envelope, or voicing, which the user can utilize to differentiate between them.
  • the staircase can be used to modulate the temporal overlap, and the similarity between the two voicings.
  • This task asks the user to identify intervals.
  • FIGURE 5E shows a graphical representation of this auditory pattern.
  • the user must indicate the interval.
  • the staircase is used to modulate task difficulty. In the easiest task, all intervals would be the same, and the user would only have one button to choose. The task gets harder as more choices are added. The interval steps between the buttons can be reduced, to increase difficulty. Note that buttons don't need to be restricted to "pure" intervals like fourths or sixths - you could have flat seconds, for instance, as one of your choices.
  • the durations, amplitudes, and amplitude and frequency modulation of a given tone which is an element of an interval may be changed from trial to trial.
  • the time between the two intervals may be modulated.
  • the range of the root notes, as well as the difference in interval between the two intervals may be modulated. Note that having roots which are close to the same frequency for the two intervals may make the task easier, rather than harder, in some cases, and the staircase should account for this.
  • One variation of this example is where the user presses a button, or other controller, indicating which interval they would like to hear. They then have to decide if the interval that they actually heard was the same, or different from the interval they asked for.
  • EXAMPLE 6 This task asks the user to identify one or more elements of an interval.
  • the auditory pattern includes an interval or chord, followed or preceded by an individual tone. If the chord only consists of two tones, then the user will have two buttons available to press, corresponding to the two elements of the chord.
  • the user must indicate which element of the chord the individual tone matches. 3. Modify a frequency of at least one element based on at least one response from the user (block 130).
  • a staircase scheme may be used to modulate task difficulty.
  • the interval steps between the elements can be changed, as can the number of elements in the chord.
  • the durations, amplitudes, and amplitude and frequency modulation of a given tone which is an element of an interval may be changed from trial to trial.
  • the temporal separation between the two intervals may be modulated.
  • the range of the root notes, as well as the difference in interval between the two intervals may be modulated. Note that having roots which are close to the same frequency for the two intervals may make the task easier, rather than harder, in some cases, and the staircase should account for this.
  • EXAMPLE 7 This task asks the user to identify an element in a complex acoustic environment.
  • FIGURE 5F shows a graphical representation of this auditory pattern.
  • the user is asked whether the individual tone is the same as, or sharper, or flatter than one of the elements of the chord.
  • a staircase may be used to modulate task difficulty.
  • the interval steps between the elements can be changed, as can the number of elements in the chord.
  • that difference termed epsilon, may be modulated by the staircase.
  • the durations, amplitudes, and amplitude and frequency modulation of a given tone which is an element of an interval may be changed from trial to trial.
  • the time between the two intervals may be modulated.
  • the range of the root notes, as well as the difference in interval between the two intervals may be modulated. Note that having roots which are close to the same frequency for the two intervals may make the task easier, rather than harder, in some cases, and the staircase should account for this.
  • EXAMPLE 8 This task asks the user to identify a voicing in a complex acoustic environment.
  • Receive an auditory pattern having at least one subpattern (block 110).
  • An interval, chord, or set of complex acoustic sounds is played, followed by, preceded by, or including, an individual tone or complex sound.
  • Each element of the interval, chord, or set of sounds has a different voicing, and these elements can be synchronous, or staggered. This difference in voicing can arise from amplitude and/or frequency modulation, or other envelopes.
  • Each element may have different AM and/or FM modulation and/or envelope in each frequency band.
  • FIGURE 5G shows a graphical representation of this auditory pattern.
  • Element 1 has frequency f territory and amplitude modulation pattern a
  • Element 2 has frequency f 2 , and amplitude modulation a 2
  • the individual tone has frequency f territory but amplitude modulation a ⁇
  • the user must press a button, indicating whether the individual tone matched the amplitude modulation pattern of element 1 or element 2.
  • a staircase scheme may be used to modulate task difficulty.
  • the interval steps between the elements can be changed, as can the number of elements in the chord.
  • the durations, amplitudes, envelopes, and amplitude and frequency modulation of a given tone which is an element of an interval may be changed from trial to trial.
  • the time between the two intervals may be modulated.
  • the range of the root notes, as well as the difference in interval between the two intervals may be modulated. Note that having roots which are close to the same frequency for the two intervals may make the task easier, rather than harder, in some cases, and the staircase should account for this.
  • EXAMPLE 9 This task asks the user to identify language spoken without accent. It is possible, when learning a foreign language, or a new dialect of a non-foreign language, that the student mis-hears the phrasing that they are supposed to learn, because their auditory processing isn't tuned to pick up these new phrasings.
  • Receive an auditory pattern having at least one subpattern (block 110).
  • the auditory pattern includes versions of a word or word fragment, one of which has an undesired accent.
  • FIGURE 5H shows a graphical representation of this auditory pattern.
  • Receive at least one response from a user, the response representing a user perception of the auditory pattern (block 120). The user must pick which word had the accent, or alternately, which word was correct.
  • Modify a frequency of at least one element based on at least one response from the user (block 130).
  • a staircase is used to modulate the amount of accent which is present in the word with an accent. This could also be used to teach the difference between two nearly similar-sounding, but different, unaccented words or word fragments.
  • Several possible methods could be used to vary the amount of accent or distortion of the accented word. a.
  • the spectrogram (frequency vs time pattern) of the accented word may be a weighted average of a fully accented word and an unaccented word.
  • the accented and normative words' spectrograms could be reduced to a set of bands limited to specific frequency and time windows, with specific envelopes. In some cases, a band for the accented word will have a corresponding band in the normative word.
  • these two corresponding bands may be transformed from one another. If we take a case where the normative band is exactly the same as the accented band, but is shifted in frequency by 100 Hz, then in the previous algorithm, a word which was the average of the two words would have two bands, each of half normal amplitude, with one being 100 Hz shifted from the other. Instead, in this algorithm, your "half accented" word would only have one band, shifted by 50 Hz from the normative band, and this band would be full amplitude. For unmatching bands, algorithm 1 could be used, c. Amplify those bands which are different between the accented and normative words. d. "Stretch" in the frequency domain the difference between the normative and the accented word. e.
  • e is difference between accented and normative word/phoneme, then one can define the "anti-accented word" as normative word/phoneme - e.
  • User must pick if they heard normative, accented, or anti-accented word, e can change as function of the staircase.
  • f The user is asked to read a list of words. These recorded words form the accented set of words.
  • This task may involve the presentation of a digital display of the spelling of a given word.
  • Complex tones may be stored and/or filtered samples of a violin playing a note, or a sample of a human voice singing that note.
  • the tones could be a baritone and a tenor voice.
  • each tone could really consist of a concurrent stack of individual frequencies, in which each frequency band might have a differing envelope.
  • the staircase could be used to change the difference between the two voices. Envelopes may be amplitude-modulated and/or frequency-modulated. The staircase could be used to control the amount of envelope modulation on the tones.
  • Masks are a set of distracting background tones. These tones may be concurrent, in a stack, and/or they may be staggered in time.
  • a mask may have an amplitude-modulated or frequency-modulated envelope. To change the difficulty of the task, the amplitude, duration, or envelope of the mask may be varied from trial to trial. For instance, the staircase may be used to modulate these parameters.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

Cette invention présente un procédé d'entraînement de l'ouïe et un jeu associé de dispositifs, conçus pour améliorer la capacité d'une personne à identifier et à faire correspondre des éléments harmoniques dans un environnement sonore compliqué. Cette invention vise à améliorer la capacité d'une personne à discerner des hauteurs de sons absolues et relatives, afin d'identifier les intervalles de hauteurs de sons entre deux notes ou davantage, et afin d'améliorer la mémoire d'une hauteur de sons ou d'un intervalle de sons préalablement entendu, qui est masqué par un son perturbateur. D'autres utilisations de cette invention ont pour but d'entraîner la capacité d'une personne à chanter ou à jouer d'un instrument en accord et/ou en harmonie avec un fond musical simultané. Cette invention peut également s'appliquer à l'apprentissage général du langage, par exemple pour l'atténuation des accents dans le langage parlé et pour l'apprentissage des langues étrangères ou de dialectes.
PCT/US1998/027849 1997-12-30 1998-12-30 Procede et appareil d'entrainement des capacites auditives WO1999034345A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU20988/99A AU2098899A (en) 1997-12-30 1998-12-30 Method and apparatus for training auditory skills

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US6897897P 1997-12-30 1997-12-30
US60/068,978 1997-12-30

Publications (1)

Publication Number Publication Date
WO1999034345A1 true WO1999034345A1 (fr) 1999-07-08

Family

ID=22085921

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/027849 WO1999034345A1 (fr) 1997-12-30 1998-12-30 Procede et appareil d'entrainement des capacites auditives

Country Status (2)

Country Link
AU (1) AU2098899A (fr)
WO (1) WO1999034345A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2371913A (en) * 2001-02-01 2002-08-07 Univ Oxford Frequency discrimination training, e.g. for phonemes or tones.
WO2003094149A1 (fr) * 2002-04-29 2003-11-13 Mindweavers Ltd Generation de la parole synthetique

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993017408A1 (fr) * 1992-02-20 1993-09-02 Bertrand Perroud Procede et appareil pour former l'oreille
FR2695750A1 (fr) * 1992-09-17 1994-03-18 Lefevre Frank Dispositif de traitement d'un signal sonore et appareil comportant un tel dispositif.
WO1996018184A1 (fr) * 1994-12-08 1996-06-13 The Regents Of The University Of California Procede et dispositif d'amelioration de la reconnaissance de la parole chez des individus atteints de troubles de la parole
US5563358A (en) * 1991-12-06 1996-10-08 Zimmerman; Thomas G. Music training apparatus
US5585583A (en) * 1993-10-14 1996-12-17 Maestromedia, Inc. Interactive musical instrument instruction system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5563358A (en) * 1991-12-06 1996-10-08 Zimmerman; Thomas G. Music training apparatus
WO1993017408A1 (fr) * 1992-02-20 1993-09-02 Bertrand Perroud Procede et appareil pour former l'oreille
FR2695750A1 (fr) * 1992-09-17 1994-03-18 Lefevre Frank Dispositif de traitement d'un signal sonore et appareil comportant un tel dispositif.
US5585583A (en) * 1993-10-14 1996-12-17 Maestromedia, Inc. Interactive musical instrument instruction system
WO1996018184A1 (fr) * 1994-12-08 1996-06-13 The Regents Of The University Of California Procede et dispositif d'amelioration de la reconnaissance de la parole chez des individus atteints de troubles de la parole

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2371913A (en) * 2001-02-01 2002-08-07 Univ Oxford Frequency discrimination training, e.g. for phonemes or tones.
WO2003094149A1 (fr) * 2002-04-29 2003-11-13 Mindweavers Ltd Generation de la parole synthetique

Also Published As

Publication number Publication date
AU2098899A (en) 1999-07-19

Similar Documents

Publication Publication Date Title
Culling et al. Perceptual and computational separation of simultaneous vowels: Cues arising from low‐frequency beating
US8210851B2 (en) Method for modulating listener attention toward synthetic formant transition cues in speech stimuli for training
Siegel et al. Absolute identification of notes and intervals by musicians
Kain et al. Improving the intelligibility of dysarthric speech
US8057234B2 (en) Foreign language learning apparatus
US20030229497A1 (en) Speech recognition method
US7562018B2 (en) Speech synthesis method and speech synthesizer
KR980700637A (ko) 언어 장애자의 언어 인식 강화를 위한 방법 및 장치(method and device for enhancing the recognition of speechamong speech-impai red individuals)
US9147393B1 (en) Syllable based speech processing method
Turcott et al. Efficient evaluation of coding strategies for transcutaneous language communication
Higashikawa et al. Acoustical-perceptual correlates of" whisper pitch" in synthetically generated vowels
Niebuhr et al. Perception and cognitive processing of tonal alignment in German
Iverson et al. Vowel recognition via cochlear implants and noise vocoders: Effects of formant movement and duration
Miller et al. SPATS: Speech perception assessment and training system
Vurma et al. The perception of'forward'and'backward placement'of the singing voice
WO1999034345A1 (fr) Procede et appareil d'entrainement des capacites auditives
US20070111173A1 (en) Method for modulating listener attention toward synthetic formant transition cues in speech stimuli for training
Divenyi Is pitch a learned attribute of sounds? Two points in support of Terhardt’s pitch theory
Do et al. Vietnamese Text-To-Speech system with precise tone generation
Sjerps et al. Extrinsic normalization for vocal tracts depends on the signal, not on attention
JP7060857B2 (ja) 語学習得装置及び語学習得プログラム
Sinha et al. Employing Deep Learning Model to Evaluate Speech Information in Acoustic Simulations of Auditory Implants
Munson et al. The influence of lexical factors on vowel distinctiveness: Effects of jaw positioning
Gertner et al. Music for children with hearing loss
Cychosz et al. How to vocode: Using channel vocoders for cochlear-implant research

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
NENP Non-entry into the national phase

Ref country code: KR

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase