WO2002025886A1 - Digital signal processing techniques for improving audio clarity and intelligibility - Google Patents
- Publication number
- WO2002025886A1 (PCT/US2001/029552)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- signal components
- blocks
- readable medium
- computer readable
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
Definitions
- the present invention relates generally to digital signal processing, and more specifically to the processing of digital audio signals in a variety of contexts.
- Radio stations, concerts, speeches and lectures are all delivered over the web in streaming form.
- Encoders such as those offered by Microsoft and Real Audio reside on servers that deliver the audio stream at multiple bit rates over various types of connections (modem, T1, DSL, ISDN, etc.) to a listener's computer.
- the streamed data is decoded by a player, e.g., RealPlayer software, that understands the particular encoding format.
- cable and satellite television systems deliver streaming video and audio to set top boxes in users' homes which decode and playback the encoded content.
- Audio files may also be downloaded over the Internet for storage and later playback using any of a variety of mechanisms including, for example, the listener's computer or any of a variety of available portable playback devices.
- undesirable artifacts are generated which interfere with the goal of faithfully reproducing a relatively high bandwidth signal (i.e., the original audio) using a low bandwidth technique (i.e., the low bit rate codec).
- Such artifacts may be dealt with, at least in part, by appropriate processing of the analog or digital audio signals at their source (e.g., by the digital audio broadcaster). This is typically accomplished using a variety of techniques involving expensive hardware, software techniques with a high computational overhead, or both. Unfortunately, these costly techniques only deal with half of the equation.
- the digital signal processors of the present invention may be configured to effect processing of the digital audio in a manner which enhances the listener's experience and imposes an acceptable level of computational overhead.
- the present invention provides methods and apparatus for effecting multi-band processing of an original sampled signal.
- the original sampled signal is separated into a plurality of signal components each corresponding to one of a plurality of frequency bands.
- the dynamic range associated with each one of the plurality of signal components is independently and dynamically controlled.
- At least one signal level associated with the plurality of signal components is modified.
- Figs. 1a and 1b show a simplified block diagram of a signal processor designed according to a specific embodiment of the present invention.
- Fig. 2 is a simplified block diagram of various stages of a multi-band crossover for use with various specific embodiments of the present invention.
- Fig. 3 is a flowchart illustrating operation of a crossover stage in the multi-band crossover of Fig. 2.
- Fig. 4 is a flowchart illustrating operation of an automatic gain control processing block according to a specific embodiment of the invention.
- Fig. 5 is a flowchart illustrating operation of a nonlinear automatic gain control processing block according to a specific embodiment of the invention.
- Fig. 6 is a block diagram illustrating the playing of audio files over a network according to a specific embodiment of the present invention.
- Fig. 7 is a block diagram illustrating the decoding of audio files according to a specific embodiment of the invention.
- Fig. 8 is a block diagram illustrating the playing of audio files over a network according to another specific embodiment of the present invention.
- Figs. 9a and 9b show a simplified block diagram of a signal processor designed according to another specific embodiment of the present invention.
- Figs. 10a and 10b show a simplified block diagram of a signal processor designed according to yet another specific embodiment of the present invention.
- Fig. 11 is a simplified block diagram of a signal processor designed according to a further specific embodiment of the present invention.
- Figs. 12a and 12b are block diagrams illustrating the transmission and receiving sides of a digital audio broadcasting system according to a specific embodiment of the invention.
- Fig. 13 is a block diagram illustrating a satellite television system according to a specific embodiment of the present invention.
- Fig. 14 is a block diagram of a home entertainment system designed according to a specific embodiment of the invention.
- Fig. 15 shows a 3-band signal processor designed according to another specific embodiment which may be employed in voice or telephony applications.
- signal processor 30 for processing audio signals according to a specific embodiment of the present invention
- signal processor 30 is implemented entirely in software and may be incorporated, for example, within a server distributing digital audio files or streaming audio, or within any of a variety of other devices including, for example, digital radio transmitters and receivers, standard PCs, cell phones, personal digital assistants (PDAs), wireless application devices, portable playback devices, set top boxes, etc.
- the input block 32 in Fig. 1a receives audio signals from an audio source (not shown).
- the input block 32 converts the audio signals into pulse code modulated (PCM) samples according to any of a wide variety of well known digital encoding schemes. Subsequently, at the frequency shaping block 34, the very low frequency components of the PCM samples, which may otherwise degrade the audio quality of the samples, are eliminated.
- block 34 is a high pass filter (e.g., 5 Hz) which removes the DC offset.
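The DC-removal step can be illustrated with a minimal sketch. It assumes a first-order (one-pole) high-pass filter and a 44.1 kHz sample rate; the source specifies neither the filter order nor the sample rate, and the name `dc_block` is hypothetical.

```python
import math

def dc_block(samples, sample_rate=44100.0, cutoff_hz=5.0):
    """First-order high-pass filter: y[n] = a * (y[n-1] + x[n] - x[n-1]).

    Assumption: a one-pole RC-style filter; the patent only says that
    block 34 is a high pass filter (e.g., 5 Hz) that removes DC offset.
    """
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)   # equivalent RC time constant
    dt = 1.0 / sample_rate                   # sample period
    a = rc / (rc + dt)                       # filter coefficient, just below 1.0
    out = []
    prev_x = 0.0
    prev_y = 0.0
    for x in samples:
        y = a * (prev_y + x - prev_x)
        out.append(y)
        prev_x, prev_y = x, y
    return out
```

Feeding the filter a constant (pure DC) input produces an output that decays toward zero, which is the behaviour the text attributes to block 34.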
- the audio samples are separated into two partially overlapping frequency bands.
- all of the crossover blocks in processor 30 have a relatively shallow characteristic so that each band blends nicely with adjacent bands.
- Each frequency band is subsequently processed at non-linear automatic gain control (AGC) loop blocks 38 and 40 which, according to a specific embodiment, have less aggressive attack and release times than subsequent AGCs and are primarily for putting the signal level into the "sweet spot" of the subsequent multi-band crossover block 44.
- the volume of the input sample is either increased or decreased for the purpose of equalizing the amplitude of the input samples in each of the frequency bands.
- the gain factor is variable for different input samples as described in more detail below.
- the distinguishing factor between a non-linear AGC and an AGC is that the gain factor varies according to a nonlinear mathematical function in the non-linear AGC.
- the output of each of the non-linear AGCs 38 and 40 is the product of the input sample and the gain factor.
- AGCs 38 and 40 operate in a manner similar to that described below with reference to AGC 48 in processing block 60 of Fig. 1b.
- the outputs of the two non-linear AGCs are mixed at the mixer block 42 so that in the resulting output all the frequencies are represented.
- the bands may include, for example, sub-bass, mid-bass, mid-range, presence, and treble.
- Multi-band crossover 44 behaves very similarly to 2-band crossover 36 except that the former has more frequency bands.
- each frequency band may be equalized separately and independently from the other frequency bands. Independent processing of each frequency band is desirable where there is a combination of high-pitch, low-pitch and medium-pitch instruments playing simultaneously.
- a high-pitch sound such as the crash of a cymbal that is louder than any other instrument for a fraction of a second
- a single band AGC would reduce the amplitude of the entire sample including the low and medium frequency components present in the sample that may have originated from a vocalist or a bass. The result is a degradation of audio quality and introduction of undesirable artifacts into the music.
- a one band AGC would allow the component of frequency with the highest volume to control the entire sample, a phenomenon referred to as spectral gain intermodulation.
- each frequency band is independently processed by processing blocks 60, 62, and 64.
- Processing block 60 is dedicated to processing band 1 with components possessing the lowest frequency.
- Drive block 46 is a user programmable gain adjustment which uniformly exaggerates the signal component as it goes into AGC 48 which works to reduce changes in the gain. For every Nth sample that doesn't overshoot its threshold, AGC 48 incrementally increases the gain. Likewise, for every Nth sample which does overshoot the threshold, AGC 48 incrementally decreases the gain.
- Drive block 50 is another user programmable gain adjustment which precedes negative attack time limiter (NATL) 52.
- Drive block 50 works in concert with inverse drive block 54 to adjust the effective range of operation of NATL 52.
- AGC 48 may not react quickly enough and some overshooting samples would otherwise go untreated, resulting in a sharp overshoot at the beginning of the transient.
- NATL 52 looks at future samples and limits the gain of the current sample to avoid the distortion associated with such sharp overshoots. In practical terms, the lower the threshold is set, the more "dense" the sound becomes.
- samples are stored in a delay buffer so that the future samples may be used in equalizing the volume.
- a small block of earlier samples is extracted from the beginning of the buffer and the future block of samples is appended to the end of the buffer.
- the future sample is multiplied by the gain factor. If the resulting data has an amplitude greater than a threshold value (a user-fixed parameter) the gain factor is reduced to a value equal to the threshold value divided by the amplitude of the future sample.
- a counter referred to as the release counter is subsequently set equal to the length of the delay buffer.
- the resulting data are then passed through a low-pass filter so as to smooth out any abrupt changes in the gain that will have resulted from multiplication by the future sample.
- NATL 52 ensures that the transition from the present sample to the future sample is achieved in a smooth and inaudible fashion, and removes peaks on the audio signal that waste bandwidth.
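The look-ahead limiting described above can be sketched as follows. This is one possible reading, not the patented implementation: the one-pole gain smoother, the parameter values, and the name `natl` are assumptions. Because the gain is smoothed, the limit is approximate rather than exact.

```python
from collections import deque

def natl(samples, threshold=0.8, lookahead=32, smooth=0.2, release=1.001):
    """Look-ahead ("negative attack time") limiter sketch.

    The newest ("future") sample sets a target gain; the sample actually
    emitted is the one that entered the delay buffer `lookahead` steps ago.
    """
    delay = deque([0.0] * lookahead)   # delay buffer, pre-filled with silence
    target = 1.0                       # gain requested by the future samples
    gain = 1.0                         # smoothed gain applied to the output
    hold = 0                           # release counter
    out = []
    for future in samples:
        if abs(future) * target > threshold:
            # Reduce the gain to threshold / |future sample|.
            target = threshold / abs(future)
            hold = lookahead           # release counter = delay buffer length
        elif hold > 0:
            hold -= 1                  # keep the reduced gain until the peak exits
        else:
            target = min(1.0, target * release)
        gain += smooth * (target - gain)   # low-pass filter on the gain
        delay.append(future)
        current = delay.popleft()
        out.append(current * gain)
    return out
```

With these settings the gain has already fallen by the time a loud transient leaves the delay buffer, so the peak is held close to the threshold instead of overshooting it.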
- processing block 60 may include a soft clip block 56 which corresponds to a nonlinear function which essentially rounds off the waveform, creating harmonics which, in turn, create the effect that there is more bass than there is in the input signal. That is, within an output signal excursion which is less than the peak-to-peak excursion of the input signal from drive block 54 there is substantially more acoustic energy.
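A short sketch can make the soft clip effect concrete. The patent does not name the nonlinear function; a normalised hyperbolic tangent is assumed here purely for illustration, and `soft_clip`, `rms`, and `sine` are hypothetical helper names.

```python
import math

def soft_clip(x, drive=2.0):
    """Smoothly saturating curve, normalised so that soft_clip(1.0) == 1.0.

    Assumption: tanh stands in for the unspecified nonlinear function.
    """
    return math.tanh(drive * x) / math.tanh(drive)

def rms(values):
    """Root-mean-square level, used as a rough proxy for acoustic energy."""
    return math.sqrt(sum(v * v for v in values) / len(values))

def sine(n=1000, cycles=5):
    """n samples of a unit-peak sine wave spanning `cycles` whole cycles."""
    return [math.sin(2.0 * math.pi * cycles * i / n) for i in range(n)]
```

Applying `soft_clip` to a unit-peak sine keeps the peak excursion at or below 1.0 while raising the RMS level, which matches the description of more acoustic energy within the same or smaller excursion.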
- the level mixer block 58 is another gain control wherein the sample is multiplied by a constant gain factor that may be preset by the user. Remixing of the signal components in the different frequency bands is performed at the mixer block 66.
- Another user programmable gain control 68 for general loudness is followed by a final NATL 70 which limits the total peak of the combined bands in the same way as discussed above with reference to NATL 52.
- the limiting function performed by NATL 70 is desirable, for example, where constructive interference between peaks in different bands causes peaks which need to be dealt with.
- Fig. 2 shows the four stages of a 5-band crossover block 80 which may be employed as a specific embodiment of multi-band crossover 44 of Fig. 1a.
- Crossover block 80 represents a series of linear operations to separate signals into overlapping frequency bands.
- a computation is performed resulting in a high pass output as shown in the loop 90. More specifically, at each stage corresponding to a particular frequency band only the output from the previous stage, referred to as the high pass output, is read.
- An averaging process is then performed wherein the weighted sum of the previous stage's output and the new sample is computed.
- the output of the averaging process is referred to as the low-pass output in Figs. 2 and 3.
- there are n-1 low-pass outputs corresponding to the n frequency bands.
- the difference between the input sample and the low pass output is denoted as the high pass output which forms the input to the next stage of the multi-band crossover.
- Fig. 2 shows four stages corresponding to the 1st, 2nd, 3rd, and 4th stages of the multi-band crossover labeled 82-88, respectively.
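The crossover stages described above can be sketched as a cascade of running weighted averages. The sketch assumes that each stage's "weighted sum" mixes the stage's own previous low-pass output with the new sample; the weights in `coeffs` and the name `make_crossover` are hypothetical, not values from the patent.

```python
def make_crossover(coeffs):
    """Build an n-band splitter from n-1 cascaded stages.

    coeffs: one averaging weight per stage (0 < k < 1), lowest band first.
    Returns a function mapping one input sample to a list of n band samples.
    """
    state = [0.0] * len(coeffs)   # previous low-pass output of each stage

    def split(sample):
        bands = []
        x = sample
        for i, k in enumerate(coeffs):
            # Averaging process: weighted sum of the previous output and the new sample.
            state[i] = (1.0 - k) * state[i] + k * x
            bands.append(state[i])   # low-pass output of this stage
            x = x - state[i]         # high-pass output, fed to the next stage
        bands.append(x)              # what remains is the highest band
        return bands

    return split
```

Because each high-pass output is simply the stage input minus its low-pass output, the bands always sum back to the original sample, which is consistent with bands that overlap and blend with their neighbours.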
- Fig. 4 shows a flowchart illustrating operation of a specific embodiment of an AGC loop 98 which may be employed, for example, to implement AGC 48 of Fig. 1b.
- AGC loop 98 applies a gain factor to each sample it receives. Initially a gain factor is assumed; thereafter, for each sample, as indicated at 92, the gain factor is increased slightly through multiplication by a number slightly greater than 1.0, referred to herein as the release rate parameter. In this way, the gain factor increases with every sample. Every input sample is multiplied by the gain factor thus obtained, as indicated at 94. At 96 it is determined whether the amplitude of the sample with the gain factor applied exceeds a preset threshold value. If the threshold value is exceeded, the gain factor is reduced slightly through multiplication by a number slightly less than 1.0, referred to herein as the attack rate parameter. Otherwise the gain factor remains unaltered, and the process repeats by reading a new input sample.
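The loop of Fig. 4 can be sketched in a few lines. The sketch assumes a release rate slightly above 1.0 and an attack rate slightly below 1.0; the specific values, the threshold, and the name `agc_loop` are illustrative assumptions.

```python
def agc_loop(samples, threshold=0.5, release=1.0001, attack=0.99, gain=1.0):
    """Feedback AGC sketch following steps 92, 94, and 96.

    Returns the processed samples and the final gain factor.
    """
    out = []
    for x in samples:
        gain *= release              # 92: raise the gain slightly on every sample
        y = x * gain                 # 94: apply the gain to the input sample
        if abs(y) > threshold:       # 96: compare against the preset threshold
            gain *= attack           # overshoot: lower the gain slightly
        out.append(y)
    return out, gain
```

For a steady loud input the gain falls until the output hovers just around the threshold; for a quiet input the gain creeps upward at the release rate.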
- Fig. 5 shows a flowchart illustrating operation of a specific embodiment of a special AGC loop 100 which may be employed, for example, to implement AGC 38 of Fig. 1b.
- the non-linear AGC loop 100 applies a gain factor to each sample it receives.
- the gain factor is increased for every sample by multiplying the gain factor by a number slightly greater than 1.0, i.e., the release rate parameter.
- a trial multiplication is performed by multiplying each input sample with the gain factor. If the amplitude of the resulting signal is greater than a preset threshold value, the gain factor is reduced slightly by multiplication with a number slightly less than 1.0, i.e., the attack rate parameter.
- the gain factor is then modified according to a nonlinear function.
- the new gain factor is obtained by dividing the old gain factor by two and adding a fixed value to the outcome, thereby obtaining a nonlinear variation in the gain factor.
- the final output of the nonlinear AGC loop 100 is obtained by multiplying each input sample by the modified gain factor. Thereafter, the process is repeated for the incoming new input samples.
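A hedged sketch of the non-linear AGC loop follows. It assumes that the reshaped gain (old gain divided by two plus a fixed value) is applied only to the output and is not fed back into the loop state, since the text does not say which; the offset of 0.5, the rates, and the name `nonlinear_agc` are also assumptions.

```python
def nonlinear_agc(samples, threshold=0.5, release=1.0001, attack=0.999,
                  offset=0.5, gain=1.0):
    """Non-linear AGC sketch: ordinary AGC loop plus a reshaped output gain.

    Returns the processed samples and the final (unshaped) loop gain.
    """
    out = []
    for x in samples:
        gain *= release                   # release: raise the gain every sample
        if abs(x * gain) > threshold:     # trial multiplication
            gain *= attack                # attack: lower the gain on overshoot
        shaped = gain / 2.0 + offset      # halve the gain and add a fixed value
        out.append(x * shaped)            # final output uses the modified gain
    return out, gain
```

With an offset of 0.5 the shaped gain stays halfway between the loop gain and unity, so this stage corrects level more gently than the later per-band AGCs, in line with its role of steering the signal toward the "sweet spot".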
- Various embodiments of the present invention are implemented entirely in software.
- a Pentium processor within a standard PC is programmed in assembly language to perform the generalized signal processing depicted in Figs. 1a and 1b, resulting in considerable reduction in both expense and complexity.
- the present invention is implemented in real-time, making it particularly desirable for use in the transmission of audio signals over any digital network such as the Internet.
- Fig. 6 depicts one application of the present invention wherein audio files are played over a digital network with dynamic processing optimization.
- Fig. 6 shows a communication system 120 comprising an audio server 106, a digital network 110, a PC 114 and speakers 118.
- Audio server 106 is coupled to the digital network 110 through transmission line 108, which may be a T1 line.
- Digital network 110 is coupled to the PC 114 through the transmission line 112 and the PC 114 is coupled to the speakers 118 through the line 116.
- Within the audio server 106, which may be a PC or several connected PCs, are several blocks for the processing of audio signals.
- the audio files 122 stored on a disk may be encoded using any of a variety of encoding algorithms such as, for example, the MP3 encoding scheme.
- the audio files are played at 124 using decoding software, e.g., Winamp, and are subsequently converted to PCM samples.
- the PCM samples are then processed by the signal processing software 126, embodiments of which are described herein, e.g., the processor of Figs. 1a and 1b.
- the output of the signal processing software 126 is encoded again using any desired encoding algorithm, e.g., MP3, and is transmitted through the line 108, across the digital network 110, and through the line 112 to the PC 114.
- the samples are decoded and converted into audio signals which are then fed to the speakers 118 through the line 116.
- Fig. 7 shows another generalized application of the present invention wherein a user is playing audio files stored in a digital audio playback device 130.
- Speaker 134 is coupled to playback device 130 through the line 132.
- Playback device 130 may comprise, for example, any of a wide variety of consumer electronic devices which would benefit from the signal processing innovations of the present invention such as a personal computer, any component of a home entertainment system, a handheld communication device, a portable CD or MP3 player, etc.
- playback device 130 might be part of an audio system located inside a user's car, the dynamic processing capabilities of the invention being employed to improve the quality of sound in the presence of the background noise typical in such an environment.
- Audio files 136 encoded using any of a variety of encoding techniques, are decoded by decoding software 138 (e.g., Winamp) and are converted to PCM samples.
- the PCM samples are processed by signal processing software 140 designed according to any of the various embodiments of the present invention.
- signal processing software 140 may employ a greater or fewer number of frequency bands and processing blocks than various ones of the embodiments described herein. That is, for different applications, a greater or lesser amount of processing resources are available to effect the signal processing techniques of the present invention. For example, the available number of processing cycles in a small portable playback device such as an MP3 player may be limited. By contrast, such limitations may not exist for an audio server such as server 106 of Fig. 6.
- the output of signal processing software 140 is finally converted to audio signals at conversion block 142 (which, in a PC, may be a sound card) which drives speakers 134 via line 132.
- Fig. 8 shows yet another application of the present invention wherein the signal processing techniques described herein are employed at the receiving end of a network communication system.
- a communication system 170 including an audio server 150, a digital network 154, a PC 158, and speakers 162.
- the audio server 150 is coupled to the digital network 154 through the transmission line 152
- the digital network 154 is coupled to the PC 158 through the transmission line 156
- the PC 158 is linked to the speakers 162 through the line 160.
- the audio server 150 in this case may or may not include signal processing software designed according to any of the embodiments of the present invention.
- Encoded PCM samples are transmitted from the audio server 150 through the transmission line 152, across the digital network 154 and through the transmission line 156 to the PC 158. Inside the PC 158, the PCM samples are decoded at 164 using the appropriate decoding software. The decoded PCM samples are processed by signal processing software 166. The output of the signal processing software 166 is converted into audio signals by the sound card driver 168 which drives speakers 162 via line 160.
- the AGC and NATL blocks used in the various embodiments of the present invention are quite similar with the differences being largely due to the adjustment of time constants, i.e., the attack and release times, for different implementations and for different effects within the same implementation. That is, a particular desired sound might affect the attack and release times specified for specific blocks.
- available processing resources might affect the number of bands and/or blocks per band in a particular implementation, e.g., a small cycle budget in an MP3 player vs. a large cycle budget in a music file server.
- undesirable audible artifacts are generated.
- the present invention processes the audio samples such that these anticipated artifacts become less noticeable to the human ear.
- the signal processing of the present invention allows a low bit rate encoder to be used to encode an audio stream without suffering overly much from the undesirable artifacts created by trying to faithfully reproduce a high bandwidth signal (the original audio) with a low bandwidth system (the low bit rate codec).
- the signal processing of the present invention may have other desirable effects such as, for example, the improvement of clarity in the presence of background noise and cut-to-cut evenness.
- a generalized topology of the present invention includes three different kinds of blocks: AGCs (including NATLs), drive blocks (e.g., drive blocks 46, 50 and 54 of Fig. 1b), and filter blocks (e.g., crossovers 36 and 44 of Fig. 1a).
- Signal processing networks combining these three elements in any of a wide variety of ways are considered within the scope of the invention.
- filter or crossover blocks typically are employed to perform a series of linear operations to separate signals into overlapping frequency bands.
- the AGC blocks of the present invention examine the recent history and/or immediate future of the signal and use this information to adjust a gain factor such that the signal is kept within a range of peak excursion.
- Different implementations of such blocks in various embodiments differ as to how much of the signal is used to make these adjustments, and how fast or how often the adjustments are made.
- the range of signals desired to be maintained at the output e.g., use of a threshold to act or not act in, for example, a NATL.
- a further nonlinear function may be applied to the gain value before applying it to the current sample. Finally, the gain value may also be calculated with reference to the input signal level.
- Both feed forward and feed back AGC topologies may be employed according to various embodiments of the invention.
- There are two fundamental types of AGCs employed by the various embodiments of the invention: 1) the limiter type (e.g., NATL 52 of Fig. 1b), and 2) the dynamic range control type (e.g., AGC 48 of Fig. 1b).
- Drive blocks are simply preset level controls for putting samples in the sweet spot for subsequent processing block(s). Putting the processing block(s) between a drive block and an inverse drive block allows the processing block(s) to operate within its normal range while moving the effective range relative to the audio signal.
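The drive and inverse drive pairing can be sketched as a wrapper around any processing function. The hard limiter used as the wrapped block is only a stand-in chosen for clarity, and the names `with_drive` and `hard_limit` are hypothetical.

```python
def hard_limit(x, ceiling=1.0):
    """Stand-in processing block with a fixed operating range of +/- ceiling."""
    return max(-ceiling, min(ceiling, x))

def with_drive(process, drive):
    """Place `process` between a drive block and its inverse drive block."""
    def wrapped(x):
        boosted = x * drive          # drive block: preset level control
        processed = process(boosted) # block operates within its normal range
        return processed / drive     # inverse drive block: undo the level change
    return wrapped
```

With a drive of 2.0, a limiter whose own ceiling is 1.0 behaves, relative to the audio signal, as if its ceiling were 0.5, while signals below that level pass through unchanged. This is the sense in which the pair moves the effective range without altering the block itself.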
- the efficiency with which the fundamental blocks of the signal processors of the present invention operate relates in part to the use of low-precision integer arithmetic to implement the blocks' functions.
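As an illustration of what integer arithmetic for a gain stage might look like, the sketch below applies a fixed-point gain to 16-bit PCM samples. The Q15 format, the 16-bit sample width, and the function names are assumptions; the patent does not state which integer formats are used.

```python
Q = 15              # assumed number of fractional bits in the gain (Q15)
ONE = 1 << Q        # fixed-point representation of a gain of 1.0

def to_q15(gain):
    """Convert a floating-point gain to a fixed-point integer."""
    return int(round(gain * ONE))

def apply_gain_q15(sample, gain_q15):
    """Multiply a 16-bit PCM sample by a fixed-point gain using integers only."""
    y = (sample * gain_q15) >> Q           # integer multiply, then rescale
    return max(-32768, min(32767, y))      # saturate to the 16-bit range
```

Each gain application costs one integer multiply, one shift, and a clamp, which suggests why such a formulation suits devices with a small cycle budget.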
- separation of the work of the AGC and the NATL into two independent stages also contributes to efficiency and sound quality.
- Figs. 9a and 9b show a 5-band signal processor 900 designed according to a specific embodiment of the present invention. It should be noted that the processing blocks of processor 900 operate in a similar manner to the corresponding blocks of processor 30 described above with reference to Figs. 1a and 1b. It should also be understood that processor 900 may be employed for a wide variety of applications, particularly those applications which have sufficient processing overhead to accommodate the associated computational load presented by this configuration.
- the received digital audio samples are high pass filtered in filter block 902 to suppress the DC component and other unnecessary signal components below 5 Hz.
- the filtered samples are then pre-processed in one of four parallel paths referred to herein as the "transparent," "dual brick wall," "wideband," and "brick wall" paths, respectively.
- the "transparent" path divides the audio into two bands (bass and master) and processes them individually (with the bass band coupled to the master band). This can be thought of as a standard mode having negligible effect.
- the "dual brick wall" path is the same as the "transparent" path except that it is more audible in its gain changes.
- the "wideband" path processes the full-range audio with only one AGC. This provides slight spectral gain intermodulation which, in some embodiments, is exploited by certain presets (e.g., rock presets).
- the "brick wall" path is like the "wideband" path but provides considerable spectral gain intermodulation which, according to various embodiments, may be exploited by certain presets (e.g., so-called club or house presets).
- the pre-processed audio is then divided into five frequency bands using 2-way crossover blocks 952-955 having cutoff frequencies of 80 Hz, 200 Hz, 2 kHz, and 8 kHz, respectively. This may be accomplished, for example, as described above with reference to the multi-band crossover of Fig. 3.
- the samples in each of Bands 1-5 are then subjected to further processing as follows.
- Noisegate blocks 961-965 remove components of the audio signal that are below a certain level of amplitude.
- Delay blocks 956-960 are used by noisegate blocks 961-965 for look-ahead/negative attack time.
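The interaction between a delay block and a noisegate can be sketched as below. This is a hedged illustration only: a hard on/off gate, the `threshold` and `lookahead` parameters, and the function name are assumptions, since the text does not specify the gate's transfer curve. The point shown is that delaying the audio lets the gate open before a loud sample arrives (a "negative" attack time).

```python
from collections import deque

def lookahead_noisegate(samples, threshold=0.01, lookahead=4):
    """Gate the delayed signal using a decision made on samples that are still to come."""
    delay = deque([0.0] * lookahead)
    out = []
    for x in samples:
        delay.append(x)
        # The window holds the delayed sample plus the `lookahead` samples after it.
        gate_open = any(abs(s) >= threshold for s in delay)
        delayed = delay.popleft()
        out.append(delayed if gate_open else 0.0)
    return out

quiet = 0.001
signal = [quiet] * 6 + [0.5] + [quiet] * 6
gated = lookahead_noisegate(signal, threshold=0.01, lookahead=2)
```

In this example the gate opens two samples before the loud sample reaches the output, so the onset is not chopped off, and it closes again immediately afterwards.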
- Drive blocks 966-970 represent user programmable gain adjustments which uniformly exaggerate the received signal component as it goes into the following AGC block (i.e., 971-975) which works to reduce changes in the gain.
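- for every sample which does not overshoot the threshold, each of AGC blocks 971-975 incrementally increases its gain.
- likewise, for every sample which does overshoot the threshold, each of AGC blocks 971-975 incrementally decreases the gain.
- the release function of AGC blocks 971-975 is given by:
- gain = gain + (gain * release)
- and the attack function is given by:
- gain = gain - (gain * attack)
- where release and attack represent the release and attack time constants, respectively.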
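The release and attack updates can be exercised with a small per-sample loop. This is a sketch under stated assumptions: the threshold test is applied to the gained sample, the gain is capped by a hypothetical `max_gain`, and all parameter values are illustrative rather than taken from the patent.

```python
def agc_step(gain, sample, threshold, attack, release):
    """Apply one multiplicative gain update for a single sample."""
    if abs(sample * gain) > threshold:
        return gain - gain * attack    # overshoot: back the gain off
    return gain + gain * release       # no overshoot: let the gain recover

def run_agc(samples, threshold=0.5, attack=0.01, release=0.0001,
            gain=1.0, max_gain=4.0):
    """Run the AGC over a buffer; max_gain is an assumed safety cap."""
    out = []
    for x in samples:
        gain = min(agc_step(gain, x, threshold, attack, release), max_gain)
        out.append(x * gain)
    return out, gain

# A sustained loud input is pulled down until it sits near the threshold.
levelled, final_gain = run_agc([1.0] * 2000)
```

Because both updates scale with the current gain, the changes are proportional (constant in decibel terms per sample) rather than fixed linear steps.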
- Drive blocks 976-980 are another set of user programmable gain adjustments which precede negative attack time limiters (NATLs) 981-985.
- AGCs 971-975 may not react quickly enough and some overshooting samples would otherwise go untreated, resulting in a sharp overshoot at the beginning of the transient.
- NATLs 981-985 look at future samples and limit the gain of the current sample to avoid the distortion associated with such sharp overshoots. The lower the threshold is set, the more "dense" the sound becomes.
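The look-ahead behaviour of a negative attack time limiter can be sketched as follows. This is an illustrative approximation, not the patent's limiter: it works on a whole buffer, uses the peak of a short future window to choose the gain, and has no release smoothing. The names `natl`, `threshold`, and `lookahead` are assumptions.

```python
def natl(samples, threshold=0.8, lookahead=8):
    """Limit each sample using the peak of the current and upcoming samples."""
    out = []
    for i in range(len(samples)):
        window_peak = max(abs(s) for s in samples[i:i + lookahead + 1])
        if window_peak <= threshold:
            gain = 1.0
        else:
            gain = threshold / window_peak  # gain drops before the peak arrives
        out.append(samples[i] * gain)
    return out

signal = [0.1] * 10 + [1.6] + [0.1] * 10
limited = natl(signal, threshold=0.8, lookahead=4)
```

The gain is already reduced four samples before the transient, so the transient itself lands exactly at the threshold instead of overshooting it.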
- Each of drive blocks 986-990 is the inverse of the corresponding one of drive blocks 976-980.
- Each of drive blocks 976-980 works in concert with the corresponding one of inverse drive blocks 986-990 to adjust the effective range of operation of the corresponding one of NATLs 981-985.
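The effect of wrapping a limiter between a drive gain and its inverse can be shown with a few lines. In this sketch a plain hard limit stands in for the NATL, and the names `limit` and `driven_limit` are hypothetical. The point is only that the pair shifts the limiter's effective threshold to threshold / drive while leaving sub-threshold samples at their original level.

```python
def limit(x, threshold):
    """Hard limit used here as a simple stand-in for the NATL."""
    return max(-threshold, min(threshold, x))

def driven_limit(x, drive, threshold=1.0):
    """Drive into the limiter, then undo the drive on the way out."""
    return limit(x * drive, threshold) / drive

loud = driven_limit(0.9, drive=2.0)    # limited at the effective threshold 1.0 / 2.0
quiet = driven_limit(0.3, drive=2.0)   # passes through unchanged
```

Raising the drive therefore makes the limiter engage on quieter material without changing the overall level of samples that stay below the effective threshold.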
- drive block 986 feeds soft clip block 991 which corresponds to a nonlinear function which essentially rounds off the waveform, creating harmonics which create the perception that there is more bass than there is, i.e., within the same peak-to-peak excursion of the input signal there is a lot more acoustic energy in the output because of the harmonics.
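The claim that a soft clipper packs more energy into the same peak-to-peak excursion can be checked numerically. The text does not give the nonlinear function, so the normalized `tanh` curve below is an assumption, as are the `drive` parameter and the helper names.

```python
import math

def soft_clip(x, drive=3.0):
    """Rounded clipping curve that maps -1..1 onto -1..1 (assumed shape)."""
    return math.tanh(drive * x) / math.tanh(drive)

def rms(samples):
    """Root-mean-square level of a buffer."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

sine = [math.sin(2.0 * math.pi * n / 64.0) for n in range(64)]
clipped = [soft_clip(s) for s in sine]

peak_in, peak_out = max(abs(s) for s in sine), max(abs(s) for s in clipped)
rms_in, rms_out = rms(sine), rms(clipped)
```

The peak stays at 1.0 while the RMS level rises, because the rounded waveform spends more time near its extremes; the added energy appears as harmonics of the input tone.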
- NATL 993 limits the total peak of the combined bands; for example, constructive interference between peaks in different bands may produce peaks which need to be dealt with.
- Clip block 994 removes any remaining overshoots from the signal.
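The need for a final stage after the bands are recombined can be seen in a tiny example. This sketch assumes a simple weighted sum for the mix and a hard clip for the last stage; the function names, per-band gains, and ceiling are illustrative, and the intermediate NATL is omitted for brevity.

```python
def mix(bands, gains):
    """Sum the bands sample by sample, each with its own gain."""
    length = len(bands[0])
    return [sum(g * band[i] for g, band in zip(gains, bands)) for i in range(length)]

def hard_clip(samples, ceiling=1.0):
    """Remove any remaining overshoot by clamping to the ceiling."""
    return [max(-ceiling, min(ceiling, s)) for s in samples]

# Two bands that are each within range can still add up to an overshoot.
band_a = [0.7, -0.2]
band_b = [0.7, 0.1]
mixed = mix([band_a, band_b], gains=[1.0, 1.0])
final = hard_clip(mixed)
```

Here both bands peak at 0.7, yet their peaks coincide and the sum reaches 1.4, which is the kind of constructive interference the final limiter and clip stages are there to catch.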
- Figs. 10a and 10b show another 5-band signal processor 1000 designed according to yet another embodiment of the invention.
- This embodiment of the invention has an advantage with respect to processor 900 of Figs. 9a and 9b in that it represents a lower load on the system's overall processing resources, i.e., it has a lower cycle budget, due to a few simplifications.
- the processing blocks of processor 1000 operate in a similar manner to the corresponding blocks of processors 30 and 900 described above. Indeed, as can be seen in Fig. 10a, the input samples are pre-processed in one of four parallel paths in much the same way (with the exception of the band-pass filters) as described above with reference to Fig. 9a.
- the preprocessed audio is then divided into five frequency bands using two three-way crossover blocks 1052 and 1054, each having cutoff frequency pairs of 80 and 400 Hz, and 2 and 8 kHz, respectively (instead of the four crossovers 952-955 in Fig. 9b).
- crossover blocks 1052 and 1054 include independent user programmable gain controls which eliminate the need for the subsequent drive blocks in other embodiments.
- the samples in each of Bands 1-5 are then subjected to further processing as follows.
- for every sample which does not overshoot the threshold, each of AGC blocks 1070-1074 incrementally increases its gain. Likewise, for every sample which does overshoot the threshold, each of AGC blocks 1070-1074 incrementally decreases the gain.
- the release function of AGC blocks 1070-1074 is given by:
- gain = gain + (gain / (2^release))
- and the attack function is given by:
- gain = gain - (gain / (2^attack))
- where release and attack represent the release and attack time constants, respectively.
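Dividing by a power of two can be done with a bit shift on integer hardware, which may be one of the simplifications behind this embodiment's lower cycle budget (that link is an inference, not something the text states). The sketch below assumes a Q16 fixed-point gain and integer samples; the format, names, and values are illustrative.

```python
ONE = 1 << 16  # unity gain in Q16 fixed point (an assumed format)

def agc_step_shift(gain, sample, threshold, attack, release):
    """One AGC update where gain / 2**k is computed as gain >> k."""
    level = (abs(sample) * gain) >> 16       # gained sample magnitude
    if level > threshold:
        return gain - (gain >> attack)       # gain - gain / 2**attack
    return gain + (gain >> release)          # gain + gain / 2**release

after_loud = agc_step_shift(ONE, sample=20000, threshold=10000, attack=4, release=10)
after_quiet = agc_step_shift(ONE, sample=100, threshold=10000, attack=4, release=10)
```

In this form, larger values of `attack` or `release` mean smaller steps and therefore slower gain changes, with no multiplication needed for the update itself.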
- AGCs 1070-1074 may not react quickly enough and some overshooting samples would otherwise go untreated, resulting in a sharp overshoot at the beginning of the transient. To deal with this,
- NATLs 1080-1084 look at future samples and limit the gain of the current sample to avoid the distortion associated with such sharp overshoots.
- soft clip block 1090 corresponds to a nonlinear function which essentially rounds off the waveform, creating harmonics which create the perception that there is more bass than there is, i.e., within the same peak-to-peak excursion of the input signal there is a lot more acoustic energy in the output because of the harmonics.
- NATL 1092 limits the total peak of the combined bands; for example, constructive interference between peaks in different bands may produce peaks which need to be dealt with.
- Clip block 1093 removes any remaining overshoots from the signal.
- Fig. 11 shows a 4-band signal processor 1100 designed according to still another embodiment of the invention.
- This embodiment of the invention presents an even lower load on processing resources than the previously described embodiments due to additional simplification.
- this embodiment is particularly amenable to applications in which a fairly sophisticated level of signal processing is desired, but which have a paucity of processing resources, e.g., portable digital audio players such as
- the processing blocks of processor 1100 operate in a similar manner to the corresponding blocks of processors 30, 900, and 1000 described above.
- crossover blocks 1152 and 1154 include independent user programmable gain controls which eliminate the need for the subsequent drive blocks in other embodiments.
- for every sample which does not overshoot the threshold, each of AGC blocks 1170-1173 incrementally increases its gain.
- likewise, for every sample which does overshoot the threshold, each of AGC blocks 1170-1173 incrementally decreases the gain.
- the release function of AGC blocks 1170-1173 is given by:
- gain = gain + (gain / (2^release))
- and the attack function is given by:
- gain = gain - (gain / (2^attack))
- where release and attack represent the release and attack time constants, respectively.
- Figs. 12a and 12b are simplified block diagrams of a digital audio broadcasting
- Radio station 1200 receives the program audio signal which may be an analog signal which is subsequently converted to a digital signal by A/D converter 1202 or an AES/EBU digital signal, one of which is then encoded using the station's codec 1204.
- the resulting AES digital audio signal is then provided to IBOC exciter 1206 which uses it to modulate a broadcast RF signal.
- the output AES digital signal is also provided to a signal processor 1208 designed according to the present invention.
- processor 1208 comprises processor 900 of Figs. 9a and 9b.
- Processor 1208 is configured by the digital broadcaster via control interface 1210 to effect a variety of goals including, for example, providing the station's "signature" sound.
- the resulting audio signal may be monitored by the broadcaster's personnel via an off air monitor 1212 which receives both a processed AES/EBU digital signal and a two-channel processed audio signal provided by D/A converter 1214. In this way, the broadcaster's desired sound can be achieved.
- processor 1208 does not process the digital audio prior to transmission. Instead, low speed digital data representing the desired processor configuration are provided to exciter 1206 for transmission on the RF signal along with the digital audio. These data may then be employed by the listener's system to configure a corresponding signal processor on the receiver side to process the digital audio signal in accordance with the broadcaster's programmed scheme.
- the configuration data set may include any of the parameters for any of the processor blocks, and may be less or more inclusive according to the broadcaster's design.
- DAB receiver-side system 1250 includes a DAB receiver 1252 and a compact disc (CD) player 1254, each of which may be controlled by the user via control circuitry 1256 which may include, for example, a remote control (not shown). As shown in the figure, the user may select between receiver 1252 and CD player 1254 as the audio source. If the user selects DAB receiver 1252, both the PCM audio data and the low speed processor configuration data sent by station 1200 are provided to signal processor 1258 which, according to a specific embodiment, comprises processor 900 of Figs. 9a and 9b. It will, however, be understood that any of a wide variety of implementations may be used. Processor 1258 is configured according to the received low speed data and processes the digital audio data accordingly.
- the listener may customize the configuration of processor 1258, augmenting or completely overriding the broadcaster's default configuration using control interface 1260 which, according to the embodiment shown, is also operable to control the system's volume, balance, and fader functions represented by block 1262.
- Processor 1258 provides the processed digital audio samples to D/A converter
- the listening experience provided by the digital broadcasting system can be customized to conform to each listening environment and according to each listener's preference, while retaining some level of control for the baseline experience in the hands of the broadcaster. That is, according to various embodiments, the user is given the option of selecting the predefined default processing configuration provided by the digital broadcaster, altering that configuration in some way, or completely overriding it.
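The three options open to the listener (use the broadcaster's default, alter it, or override it) amount to a small merge rule over the configuration data. The sketch below is purely illustrative: the dictionary representation, the parameter names such as `band1_drive`, and the `override_all` flag are assumptions, since the text does not define a format for the low speed configuration data.

```python
def effective_config(broadcaster_default, listener_overrides=None, override_all=False):
    """Combine the broadcaster's default with any listener changes."""
    overrides = dict(listener_overrides or {})
    if override_all:
        return overrides                     # listener replaces the default entirely
    merged = dict(broadcaster_default)       # start from the broadcaster's scheme
    merged.update(overrides)                 # then apply the listener's alterations
    return merged

default = {"band1_drive": 1.5, "natl_threshold": 0.8}
unchanged = effective_config(default)
altered = effective_config(default, {"band1_drive": 2.0})
replaced = effective_config(default, {"natl_threshold": 0.6}, override_all=True)
```

Keeping the broadcaster's values as the base of the merge is what preserves the baseline experience whenever the listener changes only some parameters.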
- the integration of these capabilities into the listener's system is made possible, at least in part, by the fact that the processing techniques of the present invention may be implemented with a very small impact on the processing resources already available in most such systems.
- satellite system 1300 employs a variety of disparate sources for the content it transmits to customers. This typically results in an uneven loudness across different channels and even for different content on a single channel which is undesirable from the end user's perspective.
- This may of course be dealt with by integrating the processing techniques of the present invention into the satellite system's headend equipment.
- this only addresses part of the problem. It still does not allow for customization of the individual user's listening experience. Therefore, according to the embodiment of the present invention, the processing techniques of the present invention are integrated into the user's equipment in much the same way as in the digital broadcasting system to provide the desired signal processing capabilities.
- different types of content are provided to the headend's satellite uplink 1308 which may or may not include some level of signal processing capability either according to the present invention or some other technique.
- the content is transmitted to satellite 1310 which then transmits the content to a user's antenna 1312 for decoding by a set top box 1314 and presentation on television 1316.
- a signal processor designed according to the present invention e.g., processor 1100 of Fig. 11
- set top box 1314 may be configured according to configuration data transmitted along with the content by the satellite provider in a manner similar to that described above with reference to Figs. 12a and 12b.
- a default configuration may be provided in the set top box itself.
- the user can either alter or override the default processor configuration using, for example, a menu driven interface which is accessed via television 1316 and an associated remote control (not shown). It will be understood, of course, that the preceding discussion applies equally well to a cable television system.
- a signal processor designed according to the invention is provided in the television set itself.
- any system which includes audio derived from disparate sources may benefit from the signal processing and normalization capabilities of the present invention.
- a home entertainment system 1400 may include multiple sources of audio signals such as a CD player 1402, an FM radio receiver 1404, and an MP3 player 1406. These audio signals may be received by a receiver 1408 which amplifies them using power amp 1410 which drives speakers 1412.
- receiver 1408 includes a signal processor 1414 designed according to the present invention which may be configured to eliminate the unevenness resulting from the differences between the audio sources, and which allows the user to customize the listening experience according to his preferences.
- this idea may be further generalized to encompass the integration of a signal processor designed according to the invention into any electronic device or system which employs audio.
- This may include the types of devices discussed above, e.g., televisions, CD and MP3 players, car stereos, radios, etc. It may also include recording devices such as video and tape recorders, Mini Disc recorders, etc.
- the techniques of the invention may also be applied to any type of telephony or voice communication system whether over conventional telephone lines, the Internet, or in the wireless environment.
- An example of a multi-band processor for voice applications will now be described with reference to Fig. 15.
- Fig. 15 shows a 3-band signal processor 1500 which may be employed, for example, in voice or telephony applications.
- the input audio is pre-processed by AGC 1501.
- the pre-processed audio is then divided into three frequency bands using 2-way crossover blocks 1502 and 1504 having cutoff frequencies of 1000 Hz and 2000 Hz, respectively. This may be accomplished, for example, as described above with reference to the multi-band crossover of Fig. 3.
- the samples in each of Bands 1-3 are then subjected to further processing as follows.
- Noisegate blocks 1512-1516 remove components of the audio signal that are below a certain level of amplitude.
- Delay blocks 1518-1522 are used by noisegate blocks 1512-1516 for look-ahead/negative attack time.
- Drive blocks 1518-1522 represent user programmable gain adjustments which uniformly exaggerate the received signal component as it goes into the following AGC block (i.e., 1524-1528) which works to reduce changes in the gain.
- each of AGC blocks 1524-1528 incrementally increases its gain.
- each of AGC blocks 1524-1528 incrementally decreases the gain.
- the release function of AGC blocks 1524-1528 may correspond to any of the functions described above.
- Drive blocks 1530-1534 are another set of user programmable gain adjustments which precede negative attack time limiters (NATLs) 1536-1540.
- AGCs 1524-1528 may not react quickly enough and some overshooting samples would otherwise go untreated, resulting in a sharp overshoot at the beginning of the transient.
- NATLs 1536-1540 look at future samples and limit the gain of the current sample to avoid the distortion associated with such sharp overshoots. The lower the threshold is set, the more "dense" the sound becomes.
- Each of drive blocks 1542-1546 is the inverse of the corresponding one of drive blocks 1530-1534, each of which works in concert with the corresponding one of inverse drive blocks to adjust the effective range of operation of the corresponding one of NATLs.
- Mixer block 1548 which has independently controllable gain for each band is followed by a final NATL 1550 which limits the total peak of the combined bands, e.g., constructive interference between peaks in different bands may cause peaks which need to be dealt with.
- NATL 1550 is followed by Clip block 1552 which removes any remaining overshoots from the signal.
- the manner in which the signal processing techniques of the present invention facilitate the bandwidth reduction of an audio encoding scheme such as MP3 encoding relates to yet another set of embodiments.
- the benefits of the invention may be realized even without real-time application of the associated signal processing techniques to the digital audio. That is, any sequence of digital audio samples may be processed using a signal processor designed according to the present invention to generate audio files to be stored for playback at a later time. For example, a provider of MP3 files to be downloaded over the Internet is not in a position to provide the same real-time processing as a provider of streaming audio. Nevertheless, the benefits of the present invention may be enjoyed by the provider and the user of such downloaded files even if the user does not have the signal processing capabilities of the present invention.
- the provider of the MP3 files can apply the signal processing techniques of any of the embodiments of the present invention to any MP3 files, and then store the processed MP3 files for serving to users over the Internet.
- the files may then be downloaded and played using any of the available decoders/players, and the listening experience will be very much the same as if the processing techniques of the invention were being applied in real time.
- the preprocessing can be for any of the desired effects described above with reference to the various embodiments of the invention such as, for example, mitigating the undesirable artifacts of a low bit rate codec or providing a "signature" sound for the provider of the audio files.
- Another example of a situation in which the benefits of the present invention may be enjoyed without the real-time processing of the audio samples is the production and distribution of recording media, e.g., compact discs, having audio files stored therein which have been preprocessed according to the present invention. That is, the manufacturer or distributor of audio CDs can preprocess the audio to be distributed on a CD for any of the purposes described above, e.g., providing a default sound for a particular type of music.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2002528975A JP2004509378A (en) | 2000-12-20 | 2001-09-19 | Digital signal processing techniques to improve audio clarity and intelligibility |
EP01973315A EP1325601A4 (en) | 2000-12-20 | 2001-09-19 | Digital signal processing techniques for improving audio clarity and intelligibility |
AU2001292908A AU2001292908A1 (en) | 2000-09-22 | 2001-09-19 | Digital signal processing techniques for improving audio clarity and intelligibility |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/669,069 | 2000-09-22 | ||
US09/669,069 US6940987B2 (en) | 1999-12-31 | 2000-12-20 | Techniques for improving audio clarity and intelligibility at reduced bit rates over a digital network |
US28993601P | 2001-05-09 | 2001-05-09 | |
US60/289,936 | 2001-05-09 | ||
US29368401P | 2001-05-25 | 2001-05-25 | |
US60/293,684 | 2001-05-25 | ||
US09/927,578 | 2001-08-06 | ||
US09/927,578 US20020075965A1 (en) | 2000-12-20 | 2001-08-06 | Digital signal processing techniques for improving audio clarity and intelligibility |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2002025886A1 (en) | 2002-03-28 |
WO2002025886A8 (en) | 2002-08-01 |
Family
ID=27501517
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/029552 WO2002025886A1 (en) | 2000-09-22 | 2001-09-19 | Digital signal processing techniques for improving audio clarity and intelligibility |
Country Status (5)
Country | Link |
---|---|
US (1) | US20020075965A1 (en) |
EP (1) | EP1325601A4 (en) |
JP (1) | JP2004509378A (en) |
AU (1) | AU2001292908A1 (en) |
WO (1) | WO2002025886A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8676361B2 (en) | 2002-06-05 | 2014-03-18 | Synopsys, Inc. | Acoustical virtual reality engine and advanced techniques for enhancing delivered sound |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030043972A1 (en) * | 2001-08-29 | 2003-03-06 | Burnham Robert J. | Wireless entertainment system for a vehicle |
KR100966415B1 (en) * | 2002-05-09 | 2010-06-28 | 넷스트림스 엘엘씨 | Audio network distribution system |
US9137035B2 (en) * | 2002-05-09 | 2015-09-15 | Netstreams Llc | Legacy converter and controller for an audio video distribution system |
US20040019520A1 (en) * | 2002-07-24 | 2004-01-29 | Guglielmucci Luis Felipe | Business model for the sale of recorded media through the Internet and other distribution channels adapted to the acoustic print and/or replay system set up of the customer |
US20040019527A1 (en) * | 2002-07-24 | 2004-01-29 | Guglielmucci Luis Felipe | System for the sale of recorded media through the internet adapted to the acoustic print and replay system set up of the customer |
US7903825B1 (en) | 2006-03-03 | 2011-03-08 | Cirrus Logic, Inc. | Personal audio playback device having gain control responsive to environmental sounds |
US20100303046A1 (en) * | 2009-05-27 | 2010-12-02 | Netstreams, Llc | Wireless video and audio network distribution system |
US9215527B1 (en) | 2009-12-14 | 2015-12-15 | Cirrus Logic, Inc. | Multi-band integrated speech separating microphone array processor with adaptive beamforming |
GB2563687B (en) * | 2017-06-19 | 2019-11-20 | Cirrus Logic Int Semiconductor Ltd | Audio test mode |
US10911013B2 (en) | 2018-07-05 | 2021-02-02 | Comcast Cable Communications, Llc | Dynamic audio normalization process |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5179730A (en) * | 1990-03-23 | 1993-01-12 | Rockwell International Corporation | Selectivity system for a direct conversion receiver |
US5625871A (en) * | 1994-09-30 | 1997-04-29 | Lucent Technologies Inc. | Cellular communications system with multicarrier signal processing |
WO1998056210A1 (en) * | 1997-06-06 | 1998-12-10 | Audiologic Hearing Systems, L.P. | Continuous frequency dynamic range audio compressor |
US6061405A (en) * | 1997-12-15 | 2000-05-09 | Motorola, Inc. | Time domain source matched multicarrier quadrature amplitude modulation (QAM) method and apparatus |
Family Cites Families (51)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3894195A (en) * | 1974-06-12 | 1975-07-08 | Karl D Kryter | Method of and apparatus for aiding hearing and the like |
US4243840A (en) * | 1978-12-22 | 1981-01-06 | Teledyne Industries, Inc. | Loudspeaker system |
US4249042A (en) * | 1979-08-06 | 1981-02-03 | Orban Associates, Inc. | Multiband cross-coupled compressor with overshoot protection circuit |
US4396806B2 (en) * | 1980-10-20 | 1998-06-02 | A & L Ventures I | Hearing aid amplifier |
US4412100A (en) * | 1981-09-21 | 1983-10-25 | Orban Associates, Inc. | Multiband signal processor |
ATE14361T1 (en) * | 1981-10-20 | 1985-08-15 | Craigwell Ind Ltd | HEARING AID DEVICES. |
US4720864A (en) * | 1982-05-04 | 1988-01-19 | Sanyo Electric Co., Ltd. | Speech recognition apparatus |
US4803732A (en) * | 1983-10-25 | 1989-02-07 | Dillon Harvey A | Hearing aid amplification method and apparatus |
US4704728A (en) * | 1984-12-31 | 1987-11-03 | Peter Scheiber | Signal re-distribution, decoding and processing in accordance with amplitude, phase, and other characteristics |
US4641361A (en) * | 1985-04-10 | 1987-02-03 | Harris Corporation | Multi-band automatic gain control apparatus |
US5177604A (en) * | 1986-05-14 | 1993-01-05 | Radio Telcom & Technology, Inc. | Interactive television and data transmission system |
US4901307A (en) * | 1986-10-17 | 1990-02-13 | Qualcomm, Inc. | Spread spectrum multiple access communication system using satellite or terrestrial repeaters |
US4829572A (en) * | 1987-11-05 | 1989-05-09 | Andrew Ho Chung | Speech recognition system |
US4852175A (en) * | 1988-02-03 | 1989-07-25 | Siemens Hearing Instr Inc | Hearing aid signal-processing system |
US5303306A (en) * | 1989-06-06 | 1994-04-12 | Audioscience, Inc. | Hearing aid with programmable remote and method of deriving settings for configuring the hearing aid |
US5305307A (en) * | 1991-01-04 | 1994-04-19 | Picturetel Corporation | Adaptive acoustic echo canceller having means for reducing or eliminating echo in a plurality of signal bandwidths |
US5263019A (en) * | 1991-01-04 | 1993-11-16 | Picturetel Corporation | Method and apparatus for estimating the level of acoustic feedback between a loudspeaker and microphone |
US5130665A (en) * | 1991-02-14 | 1992-07-14 | Walden Richard L | Audio volume level control |
US5278912A (en) * | 1991-06-28 | 1994-01-11 | Resound Corporation | Multiband programmable compression system |
US5365583A (en) * | 1992-07-02 | 1994-11-15 | Polycom, Inc. | Method for fail-safe operation in a speaker phone system |
US5473666A (en) * | 1992-09-11 | 1995-12-05 | Reliance Comm/Tec Corporation | Method and apparatus for digitally controlling gain in a talking path |
US5579404A (en) * | 1993-02-16 | 1996-11-26 | Dolby Laboratories Licensing Corporation | Digital audio limiter |
EP0967592B1 (en) * | 1993-06-23 | 2007-01-24 | Noise Cancellation Technologies, Inc. | Variable gain active noise cancellation system with improved residual noise sensing |
JP3626492B2 (en) * | 1993-07-07 | 2005-03-09 | ポリコム・インコーポレイテッド | Reduce background noise to improve conversation quality |
US5664021A (en) * | 1993-10-05 | 1997-09-02 | Picturetel Corporation | Microphone system for teleconferencing system |
US5485515A (en) * | 1993-12-29 | 1996-01-16 | At&T Corp. | Background noise compensation in a telephone network |
US5771301A (en) * | 1994-09-15 | 1998-06-23 | John D. Winslett | Sound leveling system using output slope control |
US5724340A (en) * | 1995-02-02 | 1998-03-03 | Unisys Corporation | Apparatus and method for amplitude tracking |
EP1146479A3 (en) * | 1995-03-29 | 2004-08-18 | Fuji Photo Film Co., Ltd. | Image processing method and apparatus |
US5915235A (en) * | 1995-04-28 | 1999-06-22 | Dejaco; Andrew P. | Adaptive equalizer preprocessor for mobile telephone speech coder to modify nonideal frequency response of acoustic transducer |
AU7118696A (en) * | 1995-10-10 | 1997-04-30 | Audiologic, Inc. | Digital signal processing hearing aid with processing strategy selection |
US6434246B1 (en) * | 1995-10-10 | 2002-08-13 | Gn Resound As | Apparatus and methods for combining audio compression and feedback cancellation in a hearing aid |
US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
US5708722A (en) * | 1996-01-16 | 1998-01-13 | Lucent Technologies Inc. | Microphone expansion for background noise reduction |
US5778082A (en) * | 1996-06-14 | 1998-07-07 | Picturetel Corporation | Method and apparatus for localization of an acoustic source |
US5737434A (en) * | 1996-08-26 | 1998-04-07 | Orban, Inc. | Multi-band audio compressor with look-ahead clipper |
US5832444A (en) * | 1996-09-10 | 1998-11-03 | Schmidt; Jon C. | Apparatus for dynamic range compression of an audio signal |
US6044162A (en) * | 1996-12-20 | 2000-03-28 | Sonic Innovations, Inc. | Digital hearing aid using differential signal representations |
US6038435A (en) * | 1997-12-24 | 2000-03-14 | Nortel Networks Corporation | Variable step-size AGC |
US6282176B1 (en) * | 1998-03-20 | 2001-08-28 | Cirrus Logic, Inc. | Full-duplex speakerphone circuit including a supplementary echo suppressor |
US6212273B1 (en) * | 1998-03-20 | 2001-04-03 | Crystal Semiconductor Corporation | Full-duplex speakerphone circuit including a control interface |
US6351731B1 (en) * | 1998-08-21 | 2002-02-26 | Polycom, Inc. | Adaptive filter featuring spectral gain smoothing and variable noise multiplier for noise reduction, and method therefor |
US6285767B1 (en) * | 1998-09-04 | 2001-09-04 | Srs Labs, Inc. | Low-frequency audio enhancement system |
EP1172020B1 (en) * | 1999-02-05 | 2006-09-06 | Hearworks Pty Ltd. | Adaptive dynamic range optimisation sound processor |
US6324509B1 (en) * | 1999-02-08 | 2001-11-27 | Qualcomm Incorporated | Method and apparatus for accurate endpointing of speech in the presence of noise |
US6381570B2 (en) * | 1999-02-12 | 2002-04-30 | Telogy Networks, Inc. | Adaptive two-threshold method for discriminating noise from speech in a communication signal |
AU4904801A (en) * | 1999-12-31 | 2001-07-16 | Octiv, Inc. | Techniques for improving audio clarity and intelligibility at reduced bit rates over a digital network |
US6418303B1 (en) * | 2000-02-29 | 2002-07-09 | Motorola, Inc. | Fast attack automatic gain control (AGC) loop and methodology for narrow band receivers |
US6532358B1 (en) * | 2000-08-03 | 2003-03-11 | Tektronix, Inc. | Overload distortion protection for a wideband receiver |
CN1470147A (en) * | 2000-08-07 | 2004-01-21 | | Method and apparatus for filtering & compressing sound signals |
US6721411B2 (en) * | 2001-04-30 | 2004-04-13 | Voyant Technologies, Inc. | Audio conference platform with dynamic speech detection threshold |
2001
- 2001-08-06 US US09/927,578 patent/US20020075965A1/en not_active Abandoned
- 2001-09-19 AU AU2001292908A patent/AU2001292908A1/en not_active Abandoned
- 2001-09-19 JP JP2002528975A patent/JP2004509378A/en active Pending
- 2001-09-19 WO PCT/US2001/029552 patent/WO2002025886A1/en active Application Filing
- 2001-09-19 EP EP01973315A patent/EP1325601A4/en not_active Withdrawn
Non-Patent Citations (1)
Title |
---|
See also references of EP1325601A4 * |
Also Published As
Publication number | Publication date |
---|---|
AU2001292908A1 (en) | 2002-04-02 |
EP1325601A1 (en) | 2003-07-09 |
WO2002025886A8 (en) | 2002-08-01 |
EP1325601A4 (en) | 2005-11-09 |
JP2004509378A (en) | 2004-03-25 |
US20020075965A1 (en) | 2002-06-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20030023429A1 (en) | Digital signal processing techniques for improving audio clarity and intelligibility | |
US9837086B2 (en) | Encoded audio extended metadata-based dynamic range control | |
US9093968B2 (en) | Sound reproducing apparatus, sound reproducing method, and recording medium | |
US8892450B2 (en) | Signal clipping protection using pre-existing audio gain metadata | |
JP5129888B2 (en) | Transcoding method, transcoding system, and set top box | |
CN110853660B (en) | Decoder device for decoding a bitstream to generate an audio output signal from the bitstream | |
CN100481722C (en) | System and method for enhancing delivered sound in acoustical virtual reality | |
EP4290888A2 (en) | Encoded audio metadata-based equalization | |
US20080080722A1 (en) | Loudness controller with remote and local control | |
US6940987B2 (en) | Techniques for improving audio clarity and intelligibility at reduced bit rates over a digital network | |
US9780753B2 (en) | Adaptive equalization for an ultrasonic audio system | |
US20020075965A1 (en) | Digital signal processing techniques for improving audio clarity and intelligibility | |
KR101571197B1 (en) | Method for multi-channel processing in a multi-channel sound system | |
US20020064285A1 (en) | System and method for processing an audio signal prior to encoding | |
Liu et al. | Overview of wireless microphones—Part I: System and technologies | |
McMillen | A consumer adjustable dynamic range control system | |
Orban | Transmission Audio Processing | |
CN101615959A (en) | Apparatus and method for matching the playback spectra of two audio sources | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase | Ref country code: JP; Ref document number: 2002 528975; Kind code of ref document: A; Format of ref document f/p: F |
AK | Designated states | Kind code of ref document: A1; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101) | |
AK | Designated states | Kind code of ref document: C1; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: C1; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
WWE | WIPO information: entry into national phase | Ref document number: 2001973315; Country of ref document: EP |
WWP | WIPO information: published in national office | Ref document number: 2001973315; Country of ref document: EP |
REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |