EP2235720A1 - Method for instantaneous peak level management and speech clarity enhancement - Google Patents

Method for instantaneous peak level management and speech clarity enhancement

Info

Publication number
EP2235720A1
EP2235720A1 EP09706215A EP09706215A EP2235720A1 EP 2235720 A1 EP2235720 A1 EP 2235720A1 EP 09706215 A EP09706215 A EP 09706215A EP 09706215 A EP09706215 A EP 09706215A EP 2235720 A1 EP2235720 A1 EP 2235720A1
Authority
EP
European Patent Office
Prior art keywords
wave form
rate
clipping
amplitude change
form amplitude
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP09706215A
Other languages
German (de)
French (fr)
Other versions
EP2235720A4 (en
Inventor
Desmond Arthur Smith
H. Christopher Schweitzer
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Able Planet Inc
Smith Desmond Arthur
Original Assignee
Able Planet Inc
Smith Desmond Arthur
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Able Planet Inc, Smith Desmond Arthur filed Critical Able Planet Inc
Publication of EP2235720A1 publication Critical patent/EP2235720A1/en
Publication of EP2235720A4 publication Critical patent/EP2235720A4/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Definitions

  • the present invention relates to audio signal processing generally. More particularly, the present invention is related to an improved system and method for instantaneous audio signal peak dynamic adjustment for improving the audibility of consonants while simultaneously preserving the sound quality of vowels, and for eliminating potentially damaging acoustic impulse transients to benefit hearing preservation.
  • U.S. Patent No. 4,208,548 and 5,168,526 also issued to Orban more specifically propose methods for controlling clipping in analog voltage amplification systems but also employ by high frequency filter methods to remove undesired distortion. It should be noted that high frequency filtering does not remove low frequency inter-modulation distortion components in complex signals. The present invention has several distinguishable properties of detection, and does not require filter techniques to remove perceptual distortions.
  • U.S. Patent No.5, 815,532 issued to Bhattacharya, et al. discloses a method for processing radio broadcast signals in which carrier frequencies can be
  • the processing method of the present invention overcomes these and other problems not solved by the prior art by abandoning the commonly used feedback loop and providing an innovative method of controlled peak clipping and signal detection.
  • This method introduces precisely calculated amplification of soft and medium sounds to the benefit of auditory detail perception and especially, speech understanding. It simultaneously reduces on an instantaneous basis, short duration high level impulse spikes. This effectively attenuates stress on the crucial hair cilia of the cochlea, thus providing a valuable hearing conservation benefit to the listener.
  • the combination of high level outputs and extended listening time for entertainment, telecommunication, and other electronic audio devices is well understood to cause permanent sensori-neural hearing impairment.
  • Figure 1 is a flow diagram of the processing stages, of the present invention.
  • Figure 2 is a graphic representation of the acoustic pattern of an example of a recorded passage of music illustrating that the average energy distribution lies at 10 dB below the peak energy values (32% of peak);
  • Figure 3 is an enlarged view of the acoustic pattern Figure 2 illustrating that the contribution to total power by excursions over 10 dB is less than half the power contributed by remaining signals;
  • Figure 4 is illustrates a post peak-excision of 10 dB of the peak power from the waveform of Figure 2;
  • Figure 5 illustrates the signal of Figures 2-4 amplified after clipping (or 'overdriven' by 10 dB);
  • Figure 6 illustrates the classical temporal integration pattern for human listeners showing the steep fall off of detection ability as a function of duration; Loudness does not fully integrate until signal duration reaches approximately 100 milliseconds;
  • Figure 7 illustrates the averaged spectrum of a single sentence speech sample without the processing of the present invention.
  • the low frequencies are
  • Figure 8 illustrates the speech sentence shown in Figure 7 following processing by the present invention showing that the averaged spectrum is flattened without the undesired consequence of biasing the frequency response by filtering the low frequency region;
  • Figure 9. a illustrates the acoustic waveform of a female speaker's utterance of the word, "Intuition";
  • Figure 9.b illustrates the wave form of Figure 9. a following processing by the present invention showing that soft consonants have been intensified rendering an audible clarity improvement;
  • Figure 10. a illustrates the acoustic waveform of a male speaker's utterance of a sentence simultaneously over-laid with a series of sharp, high intensity impulse. After processing by the present invention (Fig. 10.b) the impulses spikes are clearly removed. Simultaneously, soft speech has been intensified to the advantage of greater clarity.
  • Figure 10.b illustrates the waveform of Figure 10. a. following processing by the present invention showing the removal of the impulse spikes accompanied by soft speech intensification and audible sound clarity improvement.
  • the present invention exploits the psychoacoustic property of temporal integration in the human auditory system. This is a crucial aspect of the method. It is known that loudness of signals is integrated within a time window of approximately 100 milliseconds. Hence, shorter duration impulse spikes sound considerably softer and are often imperceptible. An illustration of this is shown in Figures 3 and 4. In that example, a particular dynamic amplitude pattern of a music passage is illustrated, by way of example, with 10 dB reduction of the amplitude peaks removed by the present invention with a net consequential loudness reduction of only 0.2 dB due to the psycho-acoustically determined temporal integration.
  • FIG. 5 the audio signal of Figures 3 and 4 is shown amplified after clipping or "overdriven” by 10 dB.
  • the average levels of long duration signals are increased, which results in increased loudness for soft and midlevel sounds, the net effect of which is to enhance the detail and clarity of the signal.
  • High level impulses that are extremely fast, i.e., less than 2 msec, are instantaneously adjusted downward by the third stage shown in Figure 1 which applies controlled clipping with no time delay.
  • the extreme brevity of these signals renders the distortion associated with the clipping to generally imperceptible levels due to the temporal integration roll off illustrated in Figure 6 and as explained previously.
  • Speech clarity in audio systems and especially noisy input environments is often compromised by the greater intensity of low frequency, higher energy vowels which tend to mask the higher frequency, lower intensity consonants.
  • Traditional approaches often apply filter techniques to attenuate the low frequency noise and voice components. In some cases the approach is to bias the spectrum in favor of the high frequencies. Both have the effect of creating an undesirable tinny sound and a negative perceptual effect on voice quality.
  • the present invention avoids this problem by boosting all soft and mid level sounds without filtering or frequency biasing.
  • the range of the applied gain value is between approximately 1dB and 4OdB.
  • a train of pulses impulses (or peaks in a continuous sinusoidal or complex signal) is treated as a Long Term signal. Because the attack and release is an exponential function the recovery on termination of a vowel in speech is relatively fast - which permits almost full amplification of consonants or other low level sounds, e.g., in music.

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Measurement Of Current Or Voltage (AREA)

Abstract

A method for raising the soft and mid-level amplitude of sounds for greater clarity and perceptual benefit, while simultaneously removing the high level amplitude peaks without delay and providing protection for the auditory sense organ. The method does not require a feedback mechanism for the accomplishment of this treatment and exploits the psychoacoustic phenomenon of temporal integration which reduces the audibility of short duration signals, including distortions associated with peak clipping. The human auditory system requires greater time to integrate signal energy for audibility than provided by brief duration waveform peaks.

Description

METHOD FOR INSTANTANEOUS PEAK LEVEL MANAGEMENT AND SPEECH CLARITY ENHANCEMENT
PRIORITY TO RELATED APPLICATIONS [0001] This application claims the benefit of U.S. Utility Application No. 12/361508 filed January 28, 2009, which claims the benefit of U.S. Provisional Application No. 61/024858 filed January 30, 2008, the entire contents of each of which are incorporated herein by reference.
FIELD OF THE INVENTION
[0002] The present invention relates to audio signal processing generally. More particularly, the present invention is related to an improved system and method for instantaneous audio signal peak dynamic adjustment for improving the audibility of consonants while simultaneously preserving the sound quality of vowels, and for eliminating potentially damaging acoustic impulse transients to benefit hearing preservation.
BACKGROUND OF THE INVENTION
[0003] The science and art of Signal Processing, in some cases enabled by digital control methods, has enabled the development of a wide range of signal alteration methods, including steep and flexible filtering, dynamic range compression, pitch transformations and various noise reduction schemes. Particularly in the area of dynamic range compression of signal amplitude, most prior art approaches require a feedback loop in which some detection threshold and voltage control mechanism is used to reduce outputs in excess of a defined output level. These approaches, by necessity, introduce some time constant or time delay
2449188 01 1 Docket No ABL004 127597 for the adjustments to take place, usually tens of milliseconds in duration. Perceptual disturbances often result from such delay times. Furthermore, brief transient peaks may pass through during the adaptive process, which may potentially damage the inner ear hair cells. Impulse noise damage is often more likely to occur than auditory damage resulting from longer duration noises, largely due to the fact that the integration time required for loudness experience in the human auditory system is on the order of 100 to 200 milliseconds. Stated differently, physically damaging intensity levels may not be perceived or experienced by a listener psycho-acoustically in such a way as to encourage listener withdrawal.
[0004] Signal processing designs intended to reduce excessively high peak intensities and/or control dynamic levels are disclosed in U.S. Patent No. 4,249,042 issued to Orban, which requires frequency band separation and the use of a gain control feedback loop. Although that method uses a clipping technique for overshoot protection, it will be shown that the present invention has important and innovative differences over the '042 disclosure with regard to the use of clipping.
U.S. Patent No. 4,208,548 and 5,168,526 also issued to Orban more specifically propose methods for controlling clipping in analog voltage amplification systems but also employ by high frequency filter methods to remove undesired distortion. It should be noted that high frequency filtering does not remove low frequency inter-modulation distortion components in complex signals. The present invention has several distinguishable properties of detection, and does not require filter techniques to remove perceptual distortions.
[0006] U.S. Patent No.5, 815,532 issued to Bhattacharya, et al. discloses a method for processing radio broadcast signals in which carrier frequencies can be
2449188 01 2 Docket No ABL004 127597 subdivided with control sidebands. More recently, Ishimitsu, et al. in U.S. Patent No. 5,255,325 describe yet another method of automatic gain control with a time constant table for adjusting the delays resulting from the feedback loop. Similarly, U.S. Patent No. 6,757,396 issued to Allred clearly introduces delays related to the feedback loop design. On the other hand, U.S. Patent Number 7,233,200 issued to Yamada discloses methodology which makes estimates for the appropriate recovery time constant based on detection of the signal level of the input signal in units of a period of the input signal. However, the method disclosed by Yamada is intended for recording purposes and is not appropriate for real time applications. Notably, the system and method of the present invention is suitable for both recorded and live audio processing.
[0007] The processing method of the present invention overcomes these and other problems not solved by the prior art by abandoning the commonly used feedback loop and providing an innovative method of controlled peak clipping and signal detection. This method introduces precisely calculated amplification of soft and medium sounds to the benefit of auditory detail perception and especially, speech understanding. It simultaneously reduces on an instantaneous basis, short duration high level impulse spikes. This effectively attenuates stress on the crucial hair cilia of the cochlea, thus providing a valuable hearing conservation benefit to the listener. The combination of high level outputs and extended listening time for entertainment, telecommunication, and other electronic audio devices, is well understood to cause permanent sensori-neural hearing impairment. By reducing exposures to many thousands of impulse peaks that occur over the course of even just a few hours of audio signal transmissions, a clear protective and prophylactic
2449188 01 O Docket No ABL004 127597 benefit is expected from the present invention's system and method of manipulating the processed audio signal.
BRIEF DESCRIPTION OF THE DRAWINGS
[0008] Figure 1. is a flow diagram of the processing stages, of the present invention;
[0009] Figure 2. is a graphic representation of the acoustic pattern of an example of a recorded passage of music illustrating that the average energy distribution lies at 10 dB below the peak energy values (32% of peak);
[0010] Figure 3 is an enlarged view of the acoustic pattern Figure 2 illustrating that the contribution to total power by excursions over 10 dB is less than half the power contributed by remaining signals;
[0011] Figure 4 is illustrates a post peak-excision of 10 dB of the peak power from the waveform of Figure 2;
[0012] Figure 5 illustrates the signal of Figures 2-4 amplified after clipping (or 'overdriven' by 10 dB);
[0013] Figure 6 illustrates the classical temporal integration pattern for human listeners showing the steep fall off of detection ability as a function of duration; Loudness does not fully integrate until signal duration reaches approximately 100 milliseconds;
[0014] Figure 7 illustrates the averaged spectrum of a single sentence speech sample without the processing of the present invention. The low frequencies are
2449188 01 4 Docket No ABL004 127597 naturally greater in intensity, which makes perception of the higher frequency consonants more difficult;
[0015] Figure 8 illustrates the speech sentence shown in Figure 7 following processing by the present invention showing that the averaged spectrum is flattened without the undesired consequence of biasing the frequency response by filtering the low frequency region;
[0016] Figure 9. a illustrates the acoustic waveform of a female speaker's utterance of the word, "Intuition";
[0017] Figure 9.b illustrates the wave form of Figure 9. a following processing by the present invention showing that soft consonants have been intensified rendering an audible clarity improvement;
[0018] Figure 10. a illustrates the acoustic waveform of a male speaker's utterance of a sentence simultaneously over-laid with a series of sharp, high intensity impulse. After processing by the present invention (Fig. 10.b) the impulses spikes are clearly removed. Simultaneously, soft speech has been intensified to the advantage of greater clarity.
[0019] Figure 10.b illustrates the waveform of Figure 10. a. following processing by the present invention showing the removal of the impulse spikes accompanied by soft speech intensification and audible sound clarity improvement.
DESCRIPTION OF THE INVENTION
[0020] It should be noted that the present description is by way of instructional examples and the concepts presented herein are not limited to use or application with any single audio processing device. Hence, while the details of the processing
2449188 01 5 Docket No ABL004 127597 innovation described herein are for the convenience of illustration and explanation, with respect to exemplary embodiments, the principles disclosed may be applied to other types and applications of audio electronic signal transmission. They can be implemented in both digital and analog constructions. !f in analog, the skillful selection of RC time constants can be used to enable the unique detection and treatment stages of the invention described in the next paragraph; whereas, in digital form, it is a matter of programming the appropriate parameters.
[0021] Referring now to Figure 1 , dynamically changing signals, such as those of a recorded music passage as shown in Figure 2 or a human speech pattern as shown in Figure 7, are examined and treated within three separate time analysis windows depending upon the rate of amplitude change. A distortion free fast detector applies a 2 millisecond (msec.) attack and release to brief impulses or generally quick changes in amplitude; by way of example, amplitude changes that occur in the range of approximately 2 msec, to approximately 2 seconds. A rapid decrease in amplitude triggers the fast release element. Hence, both the attack and the release are dependent upon the rate of input amplitude change.
[0022] Slower changing signal amplitudes, such as rhythmic vocal patterns, are managed by a 2000 msec. (2 second) attack and release time. This time period covers several spoken words and enables the general level of the voice to be identified. Essentially this component of the method maintains a continuous surveillance on the incoming level of a speech signal in order to best maintain clarity and naturalness in the signal's output and reduces the speed of the clipping step when the rate of input signal amplitude change is greater than approximately 2 seconds.
2449188.01 6 Docket No. ABL004 127597 [0023] The present invention exploits the psychoacoustic property of temporal integration in the human auditory system. This is a crucial aspect of the method. It is known that loudness of signals is integrated within a time window of approximately 100 milliseconds. Hence, shorter duration impulse spikes sound considerably softer and are often imperceptible. An illustration of this is shown in Figures 3 and 4. In that example, a particular dynamic amplitude pattern of a music passage is illustrated, by way of example, with 10 dB reduction of the amplitude peaks removed by the present invention with a net consequential loudness reduction of only 0.2 dB due to the psycho-acoustically determined temporal integration. Since the total period in which the brief transients occur is only about 10 msec, or 1/10th of the 100 msec, loudness integration window, the peak levels will contribute no more than 1/20th of the total power in the 100 msec, auditory integration window. This will result in a loudness increase of 10 (log (1+1/20)) or only 0.2 dB. Hence, it can be seen that the instantaneous limiting of peak power does not significantly affect loudness; however, the potentially damaging spikes have been removed. Prior art assumptions on the audibility of clipping induced distortions are predicated on conventional measurement methods which greatly elongate and often 'freeze' for visual analysis signals that are factually very brief. This common incorrect portrayal of the perceptual consequences of brief signal distortions, such as harmonics resulting from clipping, directly relates to the unique features of the present method.
[0024] Referring now to Figure 5, the audio signal of Figures 3 and 4 is shown amplified after clipping or "overdriven" by 10 dB. The average levels of long duration signals are increased, which results in increased loudness for soft and midlevel sounds, the net effect of which is to enhance the detail and clarity of the signal.
2449188 01 / Docket No ABL004 127597 [0025] High level impulses that are extremely fast, i.e., less than 2 msec, are instantaneously adjusted downward by the third stage shown in Figure 1 which applies controlled clipping with no time delay. The extreme brevity of these signals renders the distortion associated with the clipping to generally imperceptible levels due to the temporal integration roll off illustrated in Figure 6 and as explained previously.
[0026] Speech clarity in audio systems and especially noisy input environments is often compromised by the greater intensity of low frequency, higher energy vowels which tend to mask the higher frequency, lower intensity consonants. Traditional approaches often apply filter techniques to attenuate the low frequency noise and voice components. In some cases the approach is to bias the spectrum in favor of the high frequencies. Both have the effect of creating an undesirable tinny sound and a negative perceptual effect on voice quality. The present invention avoids this problem by boosting all soft and mid level sounds without filtering or frequency biasing. The range of the applied gain value is between approximately 1dB and 4OdB. As soft speech sounds pass through the system, a flattening of the spectrum is accomplished, leaving the vowels and vocal properties undisturbed, but a clear increase in the intensity and perceptibility of the softer, voiceless consonants. This is illustrated quite clearly in Figures 7 and 8. Additionally, Figure 9 shows the sequential waveforms of a female speaker uttering the multi-syllabic word "intuition." It is clear that the soft consonants, such as the "T" and "SH" are intensified in the processed sample using the present invention. It is important to note that the processing did not alter the basic vocal properties while instantaneously producing clarity enhancements.
2449188 01 8 Docket No ABL004 127597 [0027] Sudden sharp transient acoustical spikes are both annoying and potentially damaging to the delicate hair cell structures of the inner ear. The present invention instantaneously removes such impulses (Figure 10) without delay or added distortion typically associated with existing approaches.
[0028] A train of pulses impulses (or peaks in a continuous sinusoidal or complex signal) is treated as a Long Term signal. Because the attack and release is an exponential function the recovery on termination of a vowel in speech is relatively fast - which permits almost full amplification of consonants or other low level sounds, e.g., in music.
[0029] Changes may be made in the above methods, devices and structures without departing from the scope hereof. It should thus be noted that the matter contained in the above description and/or shown in the accompanying drawings should be interpreted as illustrative and not in a limiting sense. The following claims are intended to cover all generic and specific features described herein, as well as all statements of the scope of the present method, device and structure, which, as a matter of language, might be said to fall there between.
2449188 01 ϋ Docket No ABL004 127597

Claims

CLAIMSWHAT IS CLAIMED IS:
1. A method for improving the clarity of acoustic speech signals, comprising:
continuously measuring the average level of an input signal;
applying at least one gain value to the speech signal by a predetermined factor; and
simultaneously clipping the peak values of the input speech signal by a precalculated amount, whereby the soft high frequency unvoiced spoken components are perceptuallly enhanced.
2. The method of claim 1 further including continuously measuring the input signal wave form amplitude and the rate of wave form amplitude change.
3. The method of claim 2 including adjusting the speed of the clipping step in response to the measured rate of wave form amplitude change.
4. The method of claim 3 wherein the clipping step is performed instantaneously when the rate of wave form amplitude change is less than 2.0 milliseconds.
5. The method of daim 3 wherein the speed of the clipping step is reduced when the rate of wave form amplitude change is greater than 2.0 milliseconds.
6. The method of claim 5 wherein the speed of the clipping step is further reduced when the rate of wave form amplitude change is greater than 2.0 seconds.
7. The method of claim 1 wherein the range of applied gain value is between approximately 1 dB and approximately 40 dB.
8. The method of claim 1 wherein the input signal comprises a broadband signal.
9. The method of claim 1 wherein the input signal comprises multiple frequency band segmented signals.
2449188 01 1 0 Docket No ABL004.127597
EP09706215A 2008-01-30 2009-01-29 Method for instantaneous peak level management and speech clarity enhancement Ceased EP2235720A4 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US2485808P 2008-01-30 2008-01-30
US12/361,508 US20090192793A1 (en) 2008-01-30 2009-01-28 Method for instantaneous peak level management and speech clarity enhancement
PCT/US2009/032449 WO2009097437A1 (en) 2008-01-30 2009-01-29 Method for instantaneous peak level management and speech clarity enhancement

Publications (2)

Publication Number Publication Date
EP2235720A1 true EP2235720A1 (en) 2010-10-06
EP2235720A4 EP2235720A4 (en) 2012-01-25

Family

ID=40900108

Family Applications (1)

Application Number Title Priority Date Filing Date
EP09706215A Ceased EP2235720A4 (en) 2008-01-30 2009-01-29 Method for instantaneous peak level management and speech clarity enhancement

Country Status (8)

Country Link
US (1) US20090192793A1 (en)
EP (1) EP2235720A4 (en)
JP (1) JP5345638B2 (en)
CN (1) CN102144257A (en)
AU (1) AU2009209090B2 (en)
CA (1) CA2718968A1 (en)
NZ (1) NZ587052A (en)
WO (1) WO2009097437A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BRPI0913987A2 (en) * 2008-06-30 2015-10-20 Able Planet Inc signal processing method and system
EP2518723A4 (en) * 2009-12-21 2012-11-28 Fujitsu Ltd Voice control device and voice control method
RU2568281C2 (en) * 2013-05-31 2015-11-20 Александр Юрьевич Бредихин Method for compensating for hearing loss in telephone system and in mobile telephone apparatus
CN109979475A (en) 2017-12-26 2019-07-05 深圳Tcl新技术有限公司 Solve method, system and the storage medium of echo cancellor failure

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4208548A (en) * 1977-07-19 1980-06-17 Orban Associates, Inc. Apparatus and method for peak-limiting audio frequency signals
US4249042A (en) * 1979-08-06 1981-02-03 Orban Associates, Inc. Multiband cross-coupled compressor with overshoot protection circuit
US4928311A (en) * 1986-01-03 1990-05-22 Trompler Lyle D Noise limiting circuit for earmuffs
ATE79495T1 (en) * 1986-04-03 1992-08-15 Motorola Inc FM RECEIVER WITH NOISE REDUCTION WHEN RECEIVING SIGNALS WITH ''RALEIGH'' FADER.
JPS63203097A (en) * 1987-02-18 1988-08-22 Nippon Telegr & Teleph Corp <Ntt> Video conference system
US4926144A (en) * 1988-09-29 1990-05-15 General Electric Company Multi-function modulation and center frequency control port for voltage controlled oscillator
US5168526A (en) * 1990-10-29 1992-12-01 Akg Acoustics, Inc. Distortion-cancellation circuit for audio peak limiting
JP3295443B2 (en) * 1991-10-09 2002-06-24 パイオニア株式会社 Signal processing circuit in audio equipment
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
JPH07104788A (en) * 1993-10-06 1995-04-21 Technol Res Assoc Of Medical & Welfare Apparatus Voice emphasis processor
US5448646A (en) * 1993-11-01 1995-09-05 Unex Corporation Headset interface assembly
JPH08161704A (en) * 1994-12-07 1996-06-21 Pioneer Electron Corp Automatic bias control method and apparatus
US5631968A (en) * 1995-06-06 1997-05-20 Analog Devices, Inc. Signal conditioning circuit for compressing audio signals
US5862238A (en) * 1995-09-11 1999-01-19 Starkey Laboratories, Inc. Hearing aid having input and output gain compression circuits
US5815532A (en) * 1996-05-01 1998-09-29 Glenayre Electronics, Inc. Method and apparatus for peak-to-average ratio control in an amplitude modulation paging transmitter
US5737434A (en) * 1996-08-26 1998-04-07 Orban, Inc. Multi-band audio compressor with look-ahead clipper
KR100213073B1 (en) * 1996-11-09 1999-08-02 윤종용 Frequency response compensation apparatus of audio signal in playback mode
JPH10163775A (en) * 1996-12-02 1998-06-19 Eiden Kk Limiting amplifier
US6610917B2 (en) * 1998-05-15 2003-08-26 Lester F. Ludwig Activity indication, external source, and processing loop provisions for driven vibrating-element environments
US6757396B1 (en) * 1998-11-16 2004-06-29 Texas Instruments Incorporated Digital audio dynamic range compressor and method
US7027981B2 (en) * 1999-11-29 2006-04-11 Bizjak Karl M System output control method and apparatus
GB2359177A (en) * 2000-02-08 2001-08-15 Nokia Corp Orientation sensitive display and selection mechanism
US6731768B1 (en) * 2000-07-26 2004-05-04 Etymotic Research, Inc. Hearing aid having switched release automatic gain control
EP1405424A1 (en) * 2001-06-28 2004-04-07 Koninklijke Philips Electronics N.V. Narrowband speech signal transmission system with perceptual low-frequency enhancement
FR2831961B1 (en) * 2001-11-07 2004-07-23 Inst Francais Du Petrole METHOD FOR PROCESSING SEISMIC DATA OF WELLS IN ABSOLUTE PRESERVED AMPLITUDE
US6741844B2 (en) * 2001-11-27 2004-05-25 Motorola, Inc. Receiver for audio enhancement and method therefor
EP1599992B1 (en) * 2003-02-27 2010-01-13 Telefonaktiebolaget L M Ericsson (Publ) Audibility enhancement
JP4048499B2 (en) * 2004-02-27 2008-02-20 ソニー株式会社 AGC circuit and AGC circuit gain control method
US7391875B2 (en) * 2004-06-21 2008-06-24 Waves Audio Ltd. Peak-limiting mixer for multiple audio tracks
US20060206320A1 (en) * 2005-03-14 2006-09-14 Li Qi P Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
No further relevant documents disclosed *
See also references of WO2009097437A1 *

Also Published As

Publication number Publication date
CA2718968A1 (en) 2009-08-06
NZ587052A (en) 2013-04-26
AU2009209090A1 (en) 2009-08-06
CN102144257A (en) 2011-08-03
AU2009209090B2 (en) 2013-05-02
US20090192793A1 (en) 2009-07-30
WO2009097437A1 (en) 2009-08-06
JP5345638B2 (en) 2013-11-20
JP2011511964A (en) 2011-04-14
EP2235720A4 (en) 2012-01-25

Similar Documents

Publication Publication Date Title
Zorila et al. Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression
EP2011234B1 (en) Audio gain control using specific-loudness-based auditory event detection
US7343022B2 (en) Spectral enhancement using digital frequency warping
US5274711A (en) Apparatus and method for modifying a speech waveform to compensate for recruitment of loudness
Yoo et al. Speech signal modification to increase intelligibility in noisy environments
Marzinzik Noise reduction schemes for digital hearing aids and their use for the hearing impaired
AU2009209090B2 (en) Method for instantaneous peak level management and speech clarity enhancement
US10176824B2 (en) Method and system for consonant-vowel ratio modification for improving speech perception
Lin et al. Subband noise estimation for speech enhancement using a perceptual Wiener filter
Krause et al. Evaluating the role of spectral and envelope characteristics in the intelligibility advantage of clear speech
EP3595172B1 (en) Systems and methods for processing an audio signal for replay on an audio device
Graupe et al. Blind adaptive filtering of speech from noise of unknown spectrum using a virtual feedback configuration
Brouckxon et al. Time and frequency dependent amplification for speech intelligibility enhancement in noisy environments
US10149070B2 (en) Normalizing signal energy for speech in fluctuating noise
JP5005614B2 (en) Adaptive dynamic range optimized sound processor
Mauler et al. Improved reproduction of stops in noise reduction systems with adaptive windows and nonstationarity detection
EP2394271B1 (en) Method for separating signal paths and use for improving speech using electric larynx
JP3596580B2 (en) Audio signal processing circuit
Tejero-Calado et al. Combination compression and linear gain processing for digital hearing aids
Nishigaki et al. Influence of auditory feedback on uttering vowel speech in noisy environment
Zorila et al. Effectiveness of Near-End Speech Enhancement Under Equal-Loudness and Equal-Level Constraints.
Maher et al. Audio signal enhancement
Alam et al. WIENER DENOISING BASED ON PERCEPTUAL FREQUENCY WEIGHTING AND NOISE SPECTRUM SHAPING
Udrea et al. An Improved Multi-band Speech Enhancement Method for Colored Noise Estimation and Reduction
Jenssen Noise reduction in hearing aids

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20100830

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA RS

DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20111229

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 21/02 20060101AFI20111222BHEP

17Q First examination report despatched

Effective date: 20121109

REG Reference to a national code

Ref country code: DE

Ref legal event code: R003

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20130711