EP0183712A1 - Sound responsive toy - Google Patents

Sound responsive toy

Info

Publication number
EP0183712A1
EP0183712A1 EP19850901745 EP85901745A EP0183712A1 EP 0183712 A1 EP0183712 A1 EP 0183712A1 EP 19850901745 EP19850901745 EP 19850901745 EP 85901745 A EP85901745 A EP 85901745A EP 0183712 A1 EP0183712 A1 EP 0183712A1
Authority
EP
European Patent Office
Prior art keywords
sound
sounds
data
electrical energy
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP19850901745
Other languages
German (de)
French (fr)
Inventor
Joseph Truchsess
Michael R. Newsome
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
R Dakin & Co
Original Assignee
R Dakin & Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by R Dakin & Co filed Critical R Dakin & Co
Publication of EP0183712A1 publication Critical patent/EP0183712A1/en
Withdrawn legal-status Critical Current

Links

Classifications

    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63HTOYS, e.g. TOPS, DOLLS, HOOPS OR BUILDING BLOCKS
    • A63H30/00Remote-control arrangements specially adapted for toys, e.g. for toy vehicles
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems

Definitions

  • the present invention relates to an electronic sound-responsive toy capable of distinguishing between several types of input sounds and generating a variety of speech-like sounds in response thereto which relate in a perceptible way to the input sound.
  • Sound-generating toys have been available for many years, some based on mechanical sound generators, some, more recent designs, employing electronic means to produce the sounds.
  • Generation of complex sounds such as human speech or animal noises by electronic means is well-known.
  • it is too costly and complex for implementation in toys using such prior art apparatus.
  • Sound responsive toys are also commonly known, most of them accepting specific input sounds as "commands", after receipt of which a specific action is taken.
  • An example of this is the model car which upon sensing a loud click or handclap starts to move forward, stopping or reversing or turning upon receipt of further clicks or claps.
  • the apparatus of the present invention comprises a toy which combines the functionalities of the sound-generating and sound-responsive toys mentioned above, resulting in a "conversational" mode of stimulus-response interaction with the user.
  • the invention relates to a sound responsive toy which produces vocal sounds in response to input sounds in a "conversational" manner.
  • the apparatus receives sound signals by means of a loudspeaker and amplifier.
  • the apparatus Upon sensing a sound stimulus which meets certain frequency and amplitude requirements, the apparatus, by means of logic circuitry, analyzes the received sound with regard to its spectral content and its duration.
  • the apparatus then waits for cessation of the sound, whereupon a response appropriate to the type of stimulus is chosen according to predetermined rules and generated by means of conditioning circuitry controlled by the logic circuitry and specialized controlling circuits.
  • the resulting signals are amplified and converted to sound energy via the same loudspeaker which is used to convert input sounds to electrical signals.
  • the apparatus is contained in a plush toy e.g. a parrot. Squeezing the toy energizes the apparatus which responds with a "wolf whistle". Conversational speech causes the apparatus to respond in parrot-like squawks when the speaker pauses. A sharp clap of the hands also causes a "wolf whistle” response. A whistle causes the apparatus to play one of three songs depending upon the previous number of whistles received during the energized period. An automatic shut-off feature is also included which removes power if a sound stimulus is not received within a aredetermined period.
  • Fig. 1 is a block diagram of the complete system.
  • Fig. 2 is a flow diagram showing the operational cycle of the device.
  • Fig. 3 depicts the operation of a piecewise-constant .parameter control mechanism of the prior art.
  • Fig. 4 depicts the operation of the pie ⁇ ewise-linear parameter control mechanism of the invention.
  • Fig. 5 is a detailed schematic diagram of one of the piecewise-linear controllers of the invention.
  • Fig. 6 is a schematic representation of the piecewise-linear pitch generator.
  • Fig. 7 is a detailed schematic diagram of the output driver circuit of the invention.
  • Fig. 8 is a schematic diagram of a formant filter used in the apparatus of the invention.
  • the device is first actuated by closure of a specially adapted squeeze switch 17 which is the subject of co-pending application serial No filed and assigned to the assignee of this application.
  • the closing of switch 17 causes power from the battery 19 to be supplied to the circuit through power control circuit 18.
  • the logic unit 5 causes control circuit 18 via line 20 to hold power on so that power to the circuit is maintained after release of switch 17. This operation is referred to hereinafter as the Wake Up Phase 42.
  • the logic unit 5 also controls the other phases of operation of the invention shown diagramatically in Fig. 2. In accordance with one phase of operation the logic unit 5 controls the Response Generation Phase 43 which generates an audio response via Piecewise Linear controllers 8, 9 formant filters 13, 14, mixer 15, output driver 16 and speaker 1.
  • the logic unit 5 controls the Acquisition Phase 40 which tests the audio input received via speaker 1 and preamplifier 2.
  • actuation of Switch 17 causes the apparatus to go from the Sleep Phase 82 into the power holding. Wake Up Phase 42 described hereinabove.
  • a predetermined initial audio response is generated.
  • a sound characteristic of an animal is generated e.g. a squawking Parrot sound.
  • the specifics of the unique response generation circuit of the apparatus is described more fully hereinbelow.
  • the logic unit 5 activates the acquisition phase 40. -6-
  • the unit Upon entering the Acquisition Phase 40 the unit is ready and waiting for input stimuli.
  • sounds impinging on the loudspeaker 1 are amplified by preamplifier 2.
  • the preamplifier gain may be set to one of two values at any time by the logic unit 5 via line 4.
  • the logic unit is thus able to compensate for low or high levels of ambient noise.
  • the output of the preamplifier consists of a rectangular pulse waveform on line 3 whenever the input signal from Speaker 1 exceeds a specific amplitude.
  • the logic unit 5 responds only when the number of input transitions (zero crossings) exceeds a predetermined threshold during a specific predetermined period of time. Thus the apparatus only responds to signals whose average frequency exceeds a certain value.
  • the Acquisition Phase 40 is shown. During this period the input signal is tested for existence of the predetermined minimum frequency at 46a. If this frequency is not present, a waiting period 46b is entered. If the minimum frequency is not detected during this period the logic unit 5 increases the sensitivity of preamplifier 2 thereby adapting to a low ambient noise environment. After increasing the sensitivity another waiting period 46c is initiated, if no stimulus is received after approximately four minutes in the specific embodiment, the logic unit 5 will remove power to the entire apparatus via the power control circuit 18 and the Sleep Phase 82 will be entered, thus conserving power if the device is inadvertently left on» Sleep Phase may also be entered at any time by a second closure of the switch 17.
  • the logic unit 5 begins simultaneously to both time the input signal's duration and count its -7-
  • zero-crossings 49 Zero-crossings are counted for a pre-determined amount of time, then while continuing to time the input's duration,.a period waiting for the input signal to stay below the minimum average frequency requirement for a pre-determined amount of time is entered. When the wait period condition is satisfied, the Response Selection Phase 41 of the operation cycle begins. If the input frequency remains above the minimum average frequency requirement for longer than a preset period of time the logic unit will reduce the sensitivity of the preamplifier thereby adapting to high ambient noise environments. The number of input transitions counted and the duration of the input stimulus are also used by the logic unit 5 in the selection of the response to be produced.
  • the Response Selection Phase 41 consists of selection of the response to be produced and the number of times the response is to be repeated. This selection is based on the duration of the stimulus and the number of transitions which were counted during the Acquisition Phase 40.
  • stimuli are categorized as one of three types: a handclap, speech, or a whistle. Parameters of the input preamplifier and aquisition logic have been selected such that stimuli will be sorted correctly into these three categories. For a given category of input stimulus the system chooses between a range of possible responses. These responses may be selected at random, correlated to input frequency and duration, or varying proportions of weight may be given to any of these criteria.
  • control unit 5 is a COP 422 microprocessor manufactured by the National Semiconductor Corporation on which executes the stored program listed in Appendicies I & II.
  • a clap input produces a wolf whistle; a whistle input produces one of those singing responses cycled in the order of Bali Hai, Yellow Bird and Sailor's Hornpipe; and speech, produces responsive squawks.
  • the program steps for accomplishing this selection appear in Appendix I at pages 7 and 12. -
  • Piecewise-Linear controllers are used to greatly reduce the amount of data required to faithfully reproduce the selected audio responses.
  • a time-varying parameter 70 of a process such as speech synthesis For purposes of this discussion the function from starting point 83 to ending point 84 will be referred to as a "sound sequence". A prior art piecewise-constant representation of this function might look like the stepped function 71. On the time scale shown, 40 steps (data elements) are required to model the function at a 25 s sample rate, which would normally be regarded as a low sample rate producing poor fidelity. Referring now to Fig. 4 the same function is modeled using a piecewise-linear controller of the invention. As can be seen a significant data reduction is achieved.
  • Inflection points 73 - 76 are connected by segments 78 - 81, producing a smooth function 72 which requires as data only the initial value and the slopes at the four inflection points. By precisely controlling the segment slopes and durations, the inflection point ordinates are readily reached.
  • the logic unit 5 enters the Response Generation Phase 43 and initializes several parameters of the sound sequence to be generated.
  • the initial pitch, the pitch slopes for the.first segment, the formant frequencies, and the formant slopes for the first segment are set to their sta ting values.
  • a "spontaneous" quality is imparted to the responses by randomizing the initial pitch and formant positions so that more variation is perceived in the vocal qualities of the generated sounds. This feature gives the responses to speech input in particular a more conversational quality. Also initialized is the duration of the first segment, at the end of which the desired inflection point is reached and the slope and duration parameters are updated for the next segment.
  • the piecewise-linear segments which comprise the parameter control functions are executed by the controllers described below.
  • control is transferred from the sound-generating routine to the higher-level "executive" control level routine of logic unit 5 Appendix I page 15.
  • the executive level routine determines the number of repetitions of the sound sequence that remain to be executed. Further levels of control could be implemented by those skilled in the art whereby a "macro sequence" consisting of several sound sequences are executed as a single sequence. Applied to speech generation, this amounts to word construction from phonemes. Adding another level permits phrase or sentence generation from words.
  • the piecewise-linear filter control circuit of the specific embodiment is illustrated in Fig. 5.
  • This circuit is used for P/L CTRLLRS 8 and 9 of Fig. 1.
  • An operational amplifier 35 is configured as an integrator which performs continuous integration of any current flowing through node 38. A fixed current flow into or out of this node will result in a linearly increasing or decreasing voltage at the circuit's output 36. The output increase or decrease is constrained to within the output voltage range of the operational amplifier 35. If no current flows through node 38 the integrator's output voltage will remain constant. If the voltage at node 38 is maintained at that of reference node 37 while the integrator is functioning within its linear range (i.e.
  • a predictable current flow through node 38 and the resultant rate of change of the output voltage at 36 is produced by connecting a resistor of known value between node 38 and a known reference voltage.
  • the reference voltage is applied at 22 and binary-weighted resistors 26 - 28 sum binary-weighted currents into node 38.
  • the direction of current flow is determined by the polarity of the voltage at 22 with respect to the voltage at 37, which is made to be at the midpoint of the range of the voltage swing at 22.
  • 22 is regarded as the most significant bit of a digital word, it will be seen that it acts as a sign bit, determining the direction of current flow through node 38 and the direction of the output voltage slope.
  • the remaining less-significant bits of the digital control word comprise lines 23 - 25 and are used to control electronic switches 29 - 31 which enable or disable current flow through binary-weighted resistors 26 - 28.
  • the result is a 4-bit sign-magnitude integrating digital-to-analog converter, the output voltage rate of change as opposed to instantaneous value of which is determined by the digital control word at inputs 22 - 25.
  • the output voltage at 36 is in turn used to control the resonant frequency of formant filters 13, 14.
  • Switch 33 is used to discharge the integrating capacitor 34 under control of the logic unit 5 via line 39 thereby pre-setting the output voltage to a known reference value (i.e. reference voltage 37) at the commencement of each sound generating sequence.
  • a resistor 32 limits the discharge rate of the capacitor. Its value and the amount 5 of time allowed for capacitor discharge are chosen such that the capacitor is not completely discharged, so that the initial integrator voltage (and subsequent voltage outputs during production of the sound sequence) are in part dependent on the output voltage of the integrator
  • the AC impendance of diode 93 is controlled.
  • Diode 93 in turn controls the center frequency of the filter.
  • the input signal to be filtered (the excitation pitch of the specific embodiment) is applied at 96 through resistor 95.
  • a DC reference voltage 0 is applied at 87 for DC biasing of the amplifier 85.
  • the filtered output signal appears at output node 86.
  • Fig. 1 the output of two identical formant filters 13, 14 with center frequencies one octave apart are input to mixer 15.
  • the pitch of the excitation waveform is specified in piecewise-linear terms, with the accompanying data reduction benefits.
  • the pitch generation is performed by the logic unit 5 under program control.
  • Fig. 6 schematically depicts a piecewise-linear pitch generator which is functionally equivalent to the pitch generator of the invention.
  • the excitation pitch is produced by a controllable oscillator 50.
  • This device emits a rectangular pulse train at its output 51 the period of which is linearly proportional to the value presented to a control input at 52.
  • the control input signal is taken from a time-delay element 56.
  • the control input is initialized to give the desired initial period of the first segment of .the sound sequence to be executed. When released from this initial condition, the period of the audio output will continuously increase, decrease, or remain constant, dependent on the value of the slope control 54 which is applied to one input of a multiplier 55.
  • the output of multiplier 55 is the product of the values present at its two inputs 53 and 54, namely the period control signal and the pitch slope control signal.
  • the output pitch will remain constant if the slope has value of exactly one. Greater slope values will result in continuously decreasing pitch as the period grows larger, and slope values less than one yield increasing pitch.
  • the time delay element 56 also controls the rate of change of the output pitch by determining the rate at which the period control value is updated by the multiplier product.
  • Use of the multiplier allows the pitch slope parameter to be expressed in units of octaves per second, i.e. the frequency changes exponentially as a function of time. This is a very useful characteristic in all audio work, be it music or speech-o iented.
  • the piecewise-linear pitch generator affords the same data-reduction benefits as its formant-controller counterpart.
  • a single data element, the pitch slope is sufficient to program a continuous (for all intents and purposes) glissando of pitches for an indefinite amount of time. This may be contrasted with the piecewise-constant approach which requires ever-increasing amount of data in proportion to the length of sound to be produced.
  • the piecewise-linear tone generator of the specific embodiment is implemented in the control program of Appendix I and II, but could be implemented in a variety of ways known to those skilled " in the art, including discrete digital logic elements or analog circuits (voltage-controlled oscillators, analog multipliers, etc.) while retaining its essential functions.
  • the specific embodiment of the invention uses for both its audio input and output a single dynamic loudspeaker 1.
  • the circuit 16 used to drive the loudspeaker during the Response Generation Phase 43 must have low output impedance in order to drive the speaker to sufficient sound output levels yet must not present such a low impedance to the speaker during the Acquisition Phase 40 that input sensitivity is degraded.
  • a solution to this problem is illustrated in Fig. 7.
  • An inverting operational amplifier 59 drives a pair of complementary emitter-followers 61, 62. Negative DC feedback is supplied to the op-amp by a resistor-capacitor network 64 - 66 where any AC feedback signal is filtered out by capacitor 66.
  • the feedback network allows the circuit to rest in an equilibrium state with the op-amp output 60 voltage equal to its input voltage 58. Due to resistor 63 the same voltage will appear at the common emitters of the transistors 69. Since there is no voltage difference between the base and emitter of either transistor, they both present a high impedance to the loudspeaker. Note that there is no AC feedback path around the op-amp and it therefore has a very high voltage gain for any input signal. As soon as one is applied through input capacitor 57 the op-amp output voltage at 60 will slew rapidly either above or below the quiescent voltage depending on the slope of the input signal.
  • the appendices are listings of the program stored in the memory of the microprocessor of the specific embodiment. Briefly the listing of Appendix I is of program statements performing the following functions.
  • Page 1 Assigns symbol names to RAM (data) locations Pages 2-3 Assign values to symbolic names used in the program Page 4 Initializes system and jumps to sound generating routine Page 5 Waits for average input frequency to exceed threshold for valid input
  • Page 6 Measures input frequency, times duration of input stimulus, and waits for input frequency to remain below threshold for time specified by variable EXITTIME (pg. 3) Page 7, 12 choose sound to be produced
  • Pages 17-19 Comprise a timekeeping routine which is called from various parts of the program. It uses the microprocessor's built-in timer to increment several independent counters. When a counter overflows, a corresponding flag bit is set which indicates to the calling routine that the time interval has elapsed
  • Page 21 Generates the audio tone by executing the LEI instructions at lines 734 and 737 at a rate determined by the contents of the DELAY register.
  • the PIPELINE routine called in lines 722 and 735 performs the multiplication operation necessary for generating the sloping pitch segments described in the theory of operation.
  • the multiplication if performed in its entirety in one subroutine call would take too much time to be performed concurrently with the tone generation, therefore it is "pipelined", i.e. a portion of the operation is performed each time the routine is called, with the product being made available before it is required for the updating of the DELAY register.
  • Pages 23 - 31 Comprise the pipelined multiply routine described above.
  • PIPE13 and PIPE14 on pg. 31 are used to look up data from the sound data area at the beginning of each piecewise-linear parameter control segment.
  • the pitch slope is looked up and stored in its RAM location and the formant frequency controllers' slopes are looked up and output via the "L" I/O Port.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrophonic Musical Instruments (AREA)

Abstract

Jouet électronique produisant, après avoir été stimulé par un signal sonore d'entrée, une réponse sonore caractérisée par des qualités vocales ou analogues à la parole. Des réponses sonores spécifiques sont mises en relation avec des signaux sonores d'entrée spécifiques de sorte que l'appareil semble choisir ses réponses de manière intelligente. Les signaux sonores sont introduits et reproduits par l'intermédiaire d'un unique haut-parleur (1). Un circuit logique (5) teste le signal d'entrée, sélectionne la réaction spécifique et commande les circuits produisant le son. Un nouveau type de circuits et de procédés de production de sons analogues à la parole présente une commande de la cadence de fréquence des formants (13, 14), ce qui permet d'obtenir une réduction sensible de la quantité de donnés nécessaires pour représenter les sons en mémoire, ainsi que de la complexité des circuits servant à reproduire les sons représentés.Electronic toy producing, after being stimulated by an input sound signal, a sound response characterized by vocal or speech-like qualities. Specific audio responses are matched with specific input audio signals so that the device appears to intelligently choose its responses. The sound signals are introduced and reproduced via a single loudspeaker (1). A logic circuit (5) tests the input signal, selects the specific reaction and controls the circuits producing the sound. A new type of circuits and methods for producing sounds analogous to speech presents a control of the frequency cadence of the formants (13, 14), which makes it possible to obtain a significant reduction in the quantity of data necessary to represent the sounds in memory, as well as the complexity of the circuits used to reproduce the sounds represented.

Description

SOUND RESPONSIVE TOY " " "
The present invention relates to an electronic sound-responsive toy capable of distinguishing between several types of input sounds and generating a variety of speech-like sounds in response thereto which relate in a perceptible way to the input sound.
Sound-generating toys have been available for many years, some based on mechanical sound generators, some, more recent designs, employing electronic means to produce the sounds. Generation of complex sounds such as human speech or animal noises by electronic means is well-known. However, it is too costly and complex for implementation in toys using such prior art apparatus.
Sound responsive toys are also commonly known, most of them accepting specific input sounds as "commands", after receipt of which a specific action is taken. An example of this is the model car which upon sensing a loud click or handclap starts to move forward, stopping or reversing or turning upon receipt of further clicks or claps. The apparatus of the present invention comprises a toy which combines the functionalities of the sound-generating and sound-responsive toys mentioned above, resulting in a "conversational" mode of stimulus-response interaction with the user. Further more, novel use has been made in the apparatus of a technique known in music synthesis as "Piecewise-Linear" control (see Bernstein and Cooper, "The Piecewise-Linear Technique of Electronic Music Synthesis", Journal of the Audio Engineering Society, Jul /August 1976 Vol. 24 No. 6, pp. 446-454. To reduce both the quantity of data and circuit complexity required for faithful reproduction of speech and speech-like sounds, thereby making possible the production of low-cost electronic toys capable of generating high-quality vocal sounds. Traditional control schemes for speech synthesizer parameters e ply digital-to-analog converters or their equivalent which are able to specifiy a paramenter to any desired point within its range. There is a one-to-one correspondence between the digital codes used to specify a parameter and the resultant parameter value. This type of control mechanism is referred to as "Piecewise Constant". Using this approach it is necessary to specify explicitly each frequency desired at the instant in time at which it is desird. This scheme requires a great deal of data to produce smooth transitions from one value to another, since a great many intermediate points must be specified during the period of transition (refer to Fig. 3) . The "Piecewise Linear" process presented here circumvents this problem by specifying not the absolute formant frequencies desired, but rather the rate at which the frequencies are to change and the direction of their motion. The amount of data required is thus reduced since new data is required only when the slope of a parameter changes, not when its value changes (refer to Fig. 4) . Of note is the lack of steps in the transition from one value to another despite the low data rate; the transition is smooth, resulting in a natural-sounding synthesis.
SUMMARY OF THE INVENTION The invention relates to a sound responsive toy which produces vocal sounds in response to input sounds in a "conversational" manner. The apparatus receives sound signals by means of a loudspeaker and amplifier. Upon sensing a sound stimulus which meets certain frequency and amplitude requirements, the apparatus, by means of logic circuitry, analyzes the received sound with regard to its spectral content and its duration. The apparatus then waits for cessation of the sound, whereupon a response appropriate to the type of stimulus is chosen according to predetermined rules and generated by means of conditioning circuitry controlled by the logic circuitry and specialized controlling circuits. The resulting signals are amplified and converted to sound energy via the same loudspeaker which is used to convert input sounds to electrical signals. In the specific embodiment the apparatus is contained in a plush toy e.g. a parrot. Squeezing the toy energizes the apparatus which responds with a "wolf whistle". Conversational speech causes the apparatus to respond in parrot-like squawks when the speaker pauses. A sharp clap of the hands also causes a "wolf whistle" response. A whistle causes the apparatus to play one of three songs depending upon the previous number of whistles received during the energized period. An automatic shut-off feature is also included which removes power if a sound stimulus is not received within a aredetermined period. A BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a block diagram of the complete system.
Fig. 2 is a flow diagram showing the operational cycle of the device.
Fig. 3 depicts the operation of a piecewise-constant .parameter control mechanism of the prior art.
Fig. 4 depicts the operation of the pieσewise-linear parameter control mechanism of the invention.
Fig. 5 is a detailed schematic diagram of one of the piecewise-linear controllers of the invention.
Fig. 6 is a schematic representation of the piecewise-linear pitch generator.
Fig. 7 is a detailed schematic diagram of the output driver circuit of the invention.
Fig. 8 is a schematic diagram of a formant filter used in the apparatus of the invention.
DETAILED DESCRIPTION OF INVENTION
Referring to Fig. 1 the device is first actuated by closure of a specially adapted squeeze switch 17 which is the subject of co-pending application serial No filed and assigned to the assignee of this application. The closing of switch 17 causes power from the battery 19 to be supplied to the circuit through power control circuit 18. After power is applied, the logic unit 5 causes control circuit 18 via line 20 to hold power on so that power to the circuit is maintained after release of switch 17. This operation is referred to hereinafter as the Wake Up Phase 42. The logic unit 5 also controls the other phases of operation of the invention shown diagramatically in Fig. 2. In accordance with one phase of operation the logic unit 5 controls the Response Generation Phase 43 which generates an audio response via Piecewise Linear controllers 8, 9 formant filters 13, 14, mixer 15, output driver 16 and speaker 1. In accordance with a second phase the logic unit 5 controls the Acquisition Phase 40 which tests the audio input received via speaker 1 and preamplifier 2. Referring now to Fig. 2 actuation of Switch 17 causes the apparatus to go from the Sleep Phase 82 into the power holding. Wake Up Phase 42 described hereinabove. Upon completion of that phase Response Generator phase 43 is entered and a predetermined initial audio response is generated. In the specific embodiment described herein a sound characteristic of an animal is generated e.g. a squawking Parrot sound. The specifics of the unique response generation circuit of the apparatus is described more fully hereinbelow. After completing the initial response the logic unit 5 activates the acquisition phase 40. -6-
Upon entering the Acquisition Phase 40 the unit is ready and waiting for input stimuli. Referring again to Fig. 1, sounds impinging on the loudspeaker 1 are amplified by preamplifier 2. As is discussed hereinbelow, the preamplifier gain may be set to one of two values at any time by the logic unit 5 via line 4. The logic unit is thus able to compensate for low or high levels of ambient noise. The output of the preamplifier consists of a rectangular pulse waveform on line 3 whenever the input signal from Speaker 1 exceeds a specific amplitude. To enhance the noise-rejection ability of the system, the logic unit 5 responds only when the number of input transitions (zero crossings) exceeds a predetermined threshold during a specific predetermined period of time. Thus the apparatus only responds to signals whose average frequency exceeds a certain value.
Referring again to Fig. 2 and entry period 46 of ' the Acquisition Phase 40 is shown. During this period the input signal is tested for existence of the predetermined minimum frequency at 46a. If this frequency is not present, a waiting period 46b is entered. If the minimum frequency is not detected during this period the logic unit 5 increases the sensitivity of preamplifier 2 thereby adapting to a low ambient noise environment. After increasing the sensitivity another waiting period 46c is initiated, if no stimulus is received after approximately four minutes in the specific embodiment, the logic unit 5 will remove power to the entire apparatus via the power control circuit 18 and the Sleep Phase 82 will be entered, thus conserving power if the device is inadvertently left on» Sleep Phase may also be entered at any time by a second closure of the switch 17.
Once the requirements for input signal validity have been met, the logic unit 5 begins simultaneously to both time the input signal's duration and count its -7-
zero-crossings 49. Zero-crossings are counted for a pre-determined amount of time, then while continuing to time the input's duration,.a period waiting for the input signal to stay below the minimum average frequency requirement for a pre-determined amount of time is entered. When the wait period condition is satisfied, the Response Selection Phase 41 of the operation cycle begins. If the input frequency remains above the minimum average frequency requirement for longer than a preset period of time the logic unit will reduce the sensitivity of the preamplifier thereby adapting to high ambient noise environments. The number of input transitions counted and the duration of the input stimulus are also used by the logic unit 5 in the selection of the response to be produced.
The Response Selection Phase 41 consists of selection of the response to be produced and the number of times the response is to be repeated. This selection is based on the duration of the stimulus and the number of transitions which were counted during the Acquisition Phase 40. In the specific embodiment stimuli are categorized as one of three types: a handclap, speech, or a whistle. Parameters of the input preamplifier and aquisition logic have been selected such that stimuli will be sorted correctly into these three categories. For a given category of input stimulus the system chooses between a range of possible responses. These responses may be selected at random, correlated to input frequency and duration, or varying proportions of weight may be given to any of these criteria.
In the specific embodiment, the control unit 5 is a COP 422 microprocessor manufactured by the National Semiconductor Corporation on which executes the stored program listed in Appendicies I & II. In accordance with that program a clap input produces a wolf whistle; a whistle input produces one of those singing responses cycled in the order of Bali Hai, Yellow Bird and Sailor's Hornpipe; and speech, produces responsive squawks. The program steps for accomplishing this selection appear in Appendix I at pages 7 and 12. -
In accordance with a unique feature of the invention Piecewise-Linear controllers are used to greatly reduce the amount of data required to faithfully reproduce the selected audio responses.
Referring to Fig. 3 there is shown a time-varying parameter 70 of a process such as speech synthesis. For purposes of this discussion the function from starting point 83 to ending point 84 will be referred to as a "sound sequence". A prior art piecewise-constant representation of this function might look like the stepped function 71. On the time scale shown, 40 steps (data elements) are required to model the function at a 25 s sample rate, which would normally be regarded as a low sample rate producing poor fidelity. Referring now to Fig. 4 the same function is modeled using a piecewise-linear controller of the invention. As can be seen a significant data reduction is achieved. Inflection points 73 - 76 are connected by segments 78 - 81, producing a smooth function 72 which requires as data only the initial value and the slopes at the four inflection points. By precisely controlling the segment slopes and durations, the inflection point ordinates are readily reached. After the response and number of repetitions of that response have been chosen in the Response Selection Phase, 41, the logic unit 5 enters the Response Generation Phase 43 and initializes several parameters of the sound sequence to be generated. The initial pitch, the pitch slopes for the.first segment, the formant frequencies, and the formant slopes for the first segment are set to their sta ting values. A "spontaneous" quality is imparted to the responses by randomizing the initial pitch and formant positions so that more variation is perceived in the vocal qualities of the generated sounds. This feature gives the responses to speech input in particular a more conversational quality. Also initialized is the duration of the first segment, at the end of which the desired inflection point is reached and the slope and duration parameters are updated for the next segment.
The piecewise-linear segments which comprise the parameter control functions are executed by the controllers described below. Upon completion of the execution of the final segment of the sound sequence, control is transferred from the sound-generating routine to the higher-level "executive" control level routine of logic unit 5 Appendix I page 15. In the specific embodiment,. the executive level routine determines the number of repetitions of the sound sequence that remain to be executed. Further levels of control could be implemented by those skilled in the art whereby a "macro sequence" consisting of several sound sequences are executed as a single sequence. Applied to speech generation, this amounts to word construction from phonemes. Adding another level permits phrase or sentence generation from words.
Once all repetitions and/or lower control levels have been completed and the desired response has been generated by the Response Generation Phase 43 the system returns to the Acquisition Phase 40.
The piecewise-linear filter control circuit of the specific embodiment is illustrated in Fig. 5. This circuit is used for P/L CTRLLRS 8 and 9 of Fig. 1. An operational amplifier 35 is configured as an integrator which performs continuous integration of any current flowing through node 38. A fixed current flow into or out of this node will result in a linearly increasing or decreasing voltage at the circuit's output 36. The output increase or decrease is constrained to within the output voltage range of the operational amplifier 35. If no current flows through node 38 the integrator's output voltage will remain constant. If the voltage at node 38 is maintained at that of reference node 37 while the integrator is functioning within its linear range (i.e. not at either of its output voltage limits) ; a predictable current flow through node 38 and the resultant rate of change of the output voltage at 36 is produced by connecting a resistor of known value between node 38 and a known reference voltage. The reference voltage is applied at 22 and binary-weighted resistors 26 - 28 sum binary-weighted currents into node 38. The direction of current flow is determined by the polarity of the voltage at 22 with respect to the voltage at 37, which is made to be at the midpoint of the range of the voltage swing at 22. Hence if 22 is regarded as the most significant bit of a digital word, it will be seen that it acts as a sign bit, determining the direction of current flow through node 38 and the direction of the output voltage slope. The remaining less-significant bits of the digital control word comprise lines 23 - 25 and are used to control electronic switches 29 - 31 which enable or disable current flow through binary-weighted resistors 26 - 28.
The result is a 4-bit sign-magnitude integrating digital-to-analog converter, the output voltage rate of change as opposed to instantaneous value of which is determined by the digital control word at inputs 22 - 25. The output voltage at 36 is in turn used to control the resonant frequency of formant filters 13, 14. Switch 33 is used to discharge the integrating capacitor 34 under control of the logic unit 5 via line 39 thereby pre-setting the output voltage to a known reference value (i.e. reference voltage 37) at the commencement of each sound generating sequence. A resistor 32 limits the discharge rate of the capacitor. Its value and the amount 5 of time allowed for capacitor discharge are chosen such that the capacitor is not completely discharged, so that the initial integrator voltage (and subsequent voltage outputs during production of the sound sequence) are in part dependent on the output voltage of the integrator
10 prior to the discharge interval. This results in varying vocal qualities being produced each time a sound sequence is generated, even though the formant control data remain . constant. This effect is further enhanced by randomly varying the discharge interval itself within a range of
15 values. In the specific embodiment this function is performed by the logic unit 5 under control of the program of Appendices I and II. The formant filters, 13, 14 have been implemented as shown in Fig. 8.an operational amplifier .85 is configured as a multiple-feedback resonant
20 filter, the center frequency of which (and other parameters) is determined by the values of resistors 89 and 95, capacitors 90 and 91, and the AC impedance of semiconductor diode 93. By varying the voltage at control node 94 and thereby varying the current through resistor
25 92 and diode 93, the AC impendance of diode 93 is controlled. Diode 93 in turn controls the center frequency of the filter. The input signal to be filtered (the excitation pitch of the specific embodiment) is applied at 96 through resistor 95. A DC reference voltage 0 is applied at 87 for DC biasing of the amplifier 85. The filtered output signal appears at output node 86. In Fig. 1 the output of two identical formant filters 13, 14 with center frequencies one octave apart are input to mixer 15. In like manner to that employed in control of the 5 formant filters, the pitch of the excitation waveform is specified in piecewise-linear terms, with the accompanying data reduction benefits. The pitch generation is performed by the logic unit 5 under program control. It should be noted that although the specific embodiment of the formant filters employs analog filters controlled by digital-to-analog converters,' the data reduction benefits of the piecewise-linear control method would still be realized were the filters to be implemented in some other way, e.g. digital filters whose resonant frequencies could be controlled directly by a digital word.
Fig. 6 schematically depicts a piecewise-linear pitch generator which is functionally equivalent to the pitch generator of the invention. The excitation pitch is produced by a controllable oscillator 50. This device emits a rectangular pulse train at its output 51 the period of which is linearly proportional to the value presented to a control input at 52. The control input signal is taken from a time-delay element 56. At the beginnning of each sound sequence the control input is initialized to give the desired initial period of the first segment of .the sound sequence to be executed. When released from this initial condition, the period of the audio output will continuously increase, decrease, or remain constant, dependent on the value of the slope control 54 which is applied to one input of a multiplier 55. The output of multiplier 55 is the product of the values present at its two inputs 53 and 54, namely the period control signal and the pitch slope control signal. The output pitch will remain constant if the slope has value of exactly one. Greater slope values will result in continuously decreasing pitch as the period grows larger, and slope values less than one yield increasing pitch. It can be seen that the time delay element 56 also controls the rate of change of the output pitch by determining the rate at which the period control value is updated by the multiplier product. Use of the multiplier allows the pitch slope parameter to be expressed in units of octaves per second, i.e. the frequency changes exponentially as a function of time. This is a very useful characteristic in all audio work, be it music or speech-o iented.
The piecewise-linear pitch generator affords the same data-reduction benefits as its formant-controller counterpart. A single data element, the pitch slope, is sufficient to program a continuous (for all intents and purposes) glissando of pitches for an indefinite amount of time. This may be contrasted with the piecewise-constant approach which requires ever-increasing amount of data in proportion to the length of sound to be produced. The piecewise-linear tone generator of the specific embodiment is implemented in the control program of Appendix I and II, but could be implemented in a variety of ways known to those skilled "in the art, including discrete digital logic elements or analog circuits (voltage-controlled oscillators, analog multipliers, etc.) while retaining its essential functions.
The specific embodiment of the invention uses for both its audio input and output a single dynamic loudspeaker 1. To make this possible the circuit 16 used to drive the loudspeaker during the Response Generation Phase 43 must have low output impedance in order to drive the speaker to sufficient sound output levels yet must not present such a low impedance to the speaker during the Acquisition Phase 40 that input sensitivity is degraded. A solution to this problem is illustrated in Fig. 7. An inverting operational amplifier 59 drives a pair of complementary emitter-followers 61, 62. Negative DC feedback is supplied to the op-amp by a resistor-capacitor network 64 - 66 where any AC feedback signal is filtered out by capacitor 66. If no signal is applied to the input 68, the feedback network allows the circuit to rest in an equilibrium state with the op-amp output 60 voltage equal to its input voltage 58. Due to resistor 63 the same voltage will appear at the common emitters of the transistors 69. Since there is no voltage difference between the base and emitter of either transistor, they both present a high impedance to the loudspeaker. Note that there is no AC feedback path around the op-amp and it therefore has a very high voltage gain for any input signal. As soon as one is applied through input capacitor 57 the op-amp output voltage at 60 will slew rapidly either above or below the quiescent voltage depending on the slope of the input signal. When the voltage difference between the base and emitter of either transistor exceeds approximately .7 Volts that transistor will begin to conduct current and function as normal emitter follower, argumenting the drive capability of the OP amp. In this state there will exist a low impedance path for AC feedback through the conducting transistor and the resistor at 63, and the op-amp will operate in a closed-loop fashion as long as the output voltage swing at the op-amp 60 exceeds the forward voltage drops necessary to turn on the transistors. Thus this circuit can be seen to exhibit a high output impedance when no excitation signal is being generated (Acquition Phase) and a low output impedance for driving the loudspeaker in Response Generation Phase when the mixed formant signals are applied to the circuit's input at 68.
The appendices are listings of the program stored in the memory of the microprocessor of the specific embodiment. Briefly the listing of Appendix I is of program statements performing the following functions.
Page 1 Assigns symbol names to RAM (data) locations Pages 2-3 Assign values to symbolic names used in the program Page 4 Initializes system and jumps to sound generating routine Page 5 Waits for average input frequency to exceed threshold for valid input
Page 6 Measures input frequency, times duration of input stimulus, and waits for input frequency to remain below threshold for time specified by variable EXITTIME (pg. 3) Page 7, 12 Choose sound to be produced
Page 15 Makes control decisions after completion of a sound sequence Pages 17-19 Comprise a timekeeping routine which is called from various parts of the program. It uses the microprocessor's built-in timer to increment several independent counters. When a counter overflows, a corresponding flag bit is set which indicates to the calling routine that the time interval has elapsed
Page 21 Generates the audio tone by executing the LEI instructions at lines 734 and 737 at a rate determined by the contents of the DELAY register. The PIPELINE routine called in lines 722 and 735 performs the multiplication operation necessary for generating the sloping pitch segments described in the theory of operation. The multiplication if performed in its entirety in one subroutine call would take too much time to be performed concurrently with the tone generation, therefore it is "pipelined", i.e. a portion of the operation is performed each time the routine is called, with the product being made available before it is required for the updating of the DELAY register. Pages 23 - 31 Comprise the pipelined multiply routine described above. The PIPE12 section on pg. 30 also monitors flags returned by the timekeeping routine and in turn sets flags which control exit from the sound 10Op. PIPE13 and PIPE14 on pg. 31 are used to look up data from the sound data area at the beginning of each piecewise-linear parameter control segment. The pitch slope is looked up and stored in its RAM location and the formant frequency controllers' slopes are looked up and output via the "L" I/O Port. By including these functions in the pipelined routine, they may be performed while the sound loop is executing without disturbing its timing so that no audible
"clicks" or inconsistencies are produced at parameter inflection points when new data is retrieved. Note that the "error" on pg. 23 is not in fact a true error but a use of an idiosyncrasy of the microprocessor to gain more memory space for the pipelined routine. The JID instruction in line 772 will access memory locations in the next block of program space. The same technique is used in line 1115 with an LQID instruction. The MACROS file of Appendix II lists several macro subroutines used by the program of Appendix I.

Claims

We ciaim:
1. A sound responsive toy having an enclosure containing sound detecting and generating apparatus comprising: a) first acoustic transducer means for detecting and converting sound to electrical energy; b) first amplifier means connected to said first acoustic transducer means for amplifying said electrical energy; c) means for supplying electrical energy; d) second acoustic transducer means for converting electrical energy to sound; e) second amplifier means connected to said second acoustic transducer means for amplifying said electrical energy; f) formant filter means for conditioning electrical signals to produce selected sounds, connected to means for inputting said signals to said amplifier means; g) piecewise linear control means for controlling selection of sounds to be produced connected to said formant filter means; h) data storage means for storing frequency data, segment length data and slope value data; i) control logic means connected to, said data storage means, said formant filter means and said piecewise linear control means, for accessing predetermined data in response to predetermined detected j) switching means connected to said means for supplying electrical energy and said control logic means for applying electrical energy to said control logic means whereby said switching means energizes said control logic means producing a variety of predetermined sounds. sounds and for controlling said formant filter means and said piecewise linear control means in accordance with said accessed data to produce predetermined sounds from said second acoustic transducer means; and
2. The apparatus of claim 1 wherein said first amplifier means is adjustable and said control logic means adjusts the sensitivity of said just amplifier means in response to the level of detected sound.
3. The apparatus of claim 1 wherein said first and second acoustic transducers comprise a single dynamic speaker and said second amplifier means having a low impedence when during said speaker and a high impedance at all other times thereby presenting minimal load when said speaker is used as a sound detector.
4. The apparatus of claim 1 wherein said data storage means contains data for producing characteristic animal sounds wherby the apparatus responds to detected sounds in a voice having characteristics associated with the toy animal in which it is enclose.
EP19850901745 1984-03-13 1985-03-13 Sound responsive toy Withdrawn EP0183712A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US58919584A 1984-03-13 1984-03-13
US589195 1984-03-13

Publications (1)

Publication Number Publication Date
EP0183712A1 true EP0183712A1 (en) 1986-06-11

Family

ID=24357018

Family Applications (1)

Application Number Title Priority Date Filing Date
EP19850901745 Withdrawn EP0183712A1 (en) 1984-03-13 1985-03-13 Sound responsive toy

Country Status (3)

Country Link
EP (1) EP0183712A1 (en)
AU (1) AU4110485A (en)
WO (1) WO1985004275A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4630301B1 (en) * 1985-06-04 1999-09-07 Well Made Toy Mfg Co Voice activated echo generator
GB9011819D0 (en) * 1990-05-25 1990-07-18 Lamb Rodney Talking parrot toy
WO2002095728A1 (en) * 2001-05-18 2002-11-28 Tae-Soo Yoon Portable voice/letter processing apparatus
GB0221166D0 (en) * 2002-09-12 2002-10-23 Genie Toys Plc Sound-responsive toy
CN112929766B (en) * 2019-12-05 2024-04-26 刘兆净 Intelligent sound box and control method thereof

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4163120A (en) * 1978-04-06 1979-07-31 Bell Telephone Laboratories, Incorporated Voice synthesizer
JPS56133678A (en) * 1980-03-25 1981-10-19 Sharp Corp Timepiece device
US4385355A (en) * 1981-03-30 1983-05-24 United Technologies Corporation Automatic outer loop centering of aircraft inner loop
US4388495A (en) * 1981-05-01 1983-06-14 Interstate Electronics Corporation Speech recognition microcomputer
US4475228A (en) * 1981-11-27 1984-10-02 Bally Manufacturing Corporation Programmable sound circuit for electronic games

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO8504275A1 *

Also Published As

Publication number Publication date
AU4110485A (en) 1985-10-11
WO1985004275A1 (en) 1985-09-26

Similar Documents

Publication Publication Date Title
US4624012A (en) Method and apparatus for converting voice characteristics of synthesized speech
CA1068948A (en) Electronic tone-generating system
US4151368A (en) Music synthesizer with breath-sensing modulator
US5890115A (en) Speech synthesizer utilizing wavetable synthesis
US4130730A (en) Voice synthesizer
CA1070018A (en) Voice synthesizer
GB1584836A (en) Voice synthesizer
US20210005176A1 (en) Sound processing method, sound processing apparatus, and recording medium
US3836717A (en) Speech synthesizer responsive to a digital command input
CN111009230B (en) Sound signal generating device, sound signal generating method, and recording medium
US4301328A (en) Voice synthesizer
US5321794A (en) Voice synthesizing apparatus and method and apparatus and method used as part of a voice synthesizing apparatus and method
US4470150A (en) Voice synthesizer with automatic pitch and speech rate modulation
EP0183712A1 (en) Sound responsive toy
US4175465A (en) Circuit for simulating string bass sound
US4440058A (en) Digital tone generation system with slot weighting of fixed width window functions
US4805220A (en) Conversionless digital speech production
US3974729A (en) Automatic rhythm playing apparatus
US4173915A (en) Programmable dynamic filter
US4236437A (en) Organ brass pulse keyer
US4641563A (en) Electronic musical instrument
US3511932A (en) Self-oscillating vocal tract excitation source
JPS5927916B2 (en) voice controlled electronic musical instrument
US4723467A (en) Automatic rhythm performing apparatus
US4233875A (en) Electronic musical instrument with automatic trill performance function

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE FR GB LI LU NL SE

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 19860214

RIN1 Information on inventor provided before grant (corrected)

Inventor name: NEWSOME, MICHAEL, R.

Inventor name: TRUCHSESS, JOSEPH