WO2001013515A1 - Filtre numérique - Google Patents

Filtre numérique Download PDF

Info

Publication number
WO2001013515A1
WO2001013515A1 PCT/US2000/021912 US0021912W WO0113515A1 WO 2001013515 A1 WO2001013515 A1 WO 2001013515A1 US 0021912 W US0021912 W US 0021912W WO 0113515 A1 WO0113515 A1 WO 0113515A1
Authority
WO
WIPO (PCT)
Prior art keywords
coefficients
frequency
time
response
digital
Prior art date
Application number
PCT/US2000/021912
Other languages
English (en)
Inventor
Frank A. Scarpino
Arturo H. Garcia
Original Assignee
True Dimensional Sound, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by True Dimensional Sound, Inc. filed Critical True Dimensional Sound, Inc.
Priority to AU66310/00A priority Critical patent/AU6631000A/en
Publication of WO2001013515A1 publication Critical patent/WO2001013515A1/fr
Priority to US09/887,606 priority patent/US20050259833A1/en

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03HIMPEDANCE NETWORKS, e.g. RESONANT CIRCUITS; RESONATORS
    • H03H17/00Networks using digital techniques
    • H03H17/02Frequency selective networks
    • H03H17/06Non-recursive filters

Definitions

  • a digital recording medium e.g., a compact disc
  • the creation of the digital recording typically involves conversion of the live sound into an audio signal in analog form.
  • the amplitude of the audio signal is then sampled at regularly spaced intervals and digitized for storage.
  • On playback the sound system generates an output signal at the sampling frequency.
  • the sampling frequency conforms to industry standards, most commonly 44,100 Hz. That is only slightly more than twice the highest frequency which may be heard by the unaided human ear and is therefore near the theoretical lower limit for full bandwidth recording (i.e., the Nyquist critical frequency). Tonal components required for high quality sound reproduction can be easily lost or severely limited by compromises and constraints placed upon the recording process.
  • a digital device for simply and inexpensively enhancing digital sound samples from an optical compact disk (CD) or other digital information storage medium in such a way that the quality of audible sound, produced from the sound samples, more closely approaches that of the sound heard live in an acoustically designed environment.
  • the present invention enhances digital sound in this way by providing a solution that utilizes an amplitude-shaping function that compensates for two of the most dominant human hearing phenomena that determine the quality of perceived sound.
  • Human hearing is typically most sensitive (sound appears to be the loudest) at some frequencies (i.e., sensitive frequencies) and less sensitive (sound appears to be quieter) at other frequencies (i.e., insensitive frequencies). This pitch-loudness phenomenon appears to be an "inner" ear effect. In the other human hearing phenomenon, the outer ear tends to mask or suppress tones of lesser intensity when presented together with tones of greater intensity. This suppression phenomenon or masking effect appears to be more pronounced at higher frequencies.
  • the present invention compensates for the pitch-loudness effect by non uniformly distorting the digital sound so that the resulting audible sound is perceived as being louder at the insensitive frequencies.
  • This enhancing distortion can be accomplished by non-uniformly amplifying the digital sound so as to selectively amplify such insensitive frequencies.
  • This enhancing distortion can also be accomplished by uniformly amplifying the digital sound and then selectively reducing the amplitude of the sensitive frequencies non-uniformly.
  • the present invention creates an impulse response that "unmasks" the typically masked tones by stretching the tone in the time domain (i.e., increasing the time smear or dwell time) long enough to be heard but not so long as to be "blurred".
  • the human ear is typically sensitive to sounds as short in duration as about 0.1 milliseconds. Therefore, the present invention typically stretches impulses to at least about 0.1 milliseconds in duration. How much the tone needs to be time smeared to be perceived and how much the tone can be time smeared before becoming perceptively blurred will likely vary from listener to listener.
  • the present invention provides an amplitude shaping function that overcomes both the pitch-loudness phenomenon and the suppression phenomenon, preferably, by providing a linear circuit that possesses two apparently contradictory impulse response characteristics.
  • the impulse response characteristics sought are: (1) a relatively long lasting impulse response and (2) an impulse response with a wide-band characteristic.
  • Conventional thinking teaches that these two characteristics are contradictory in real filters and, hence, unobtainable.
  • High frequency characteristics are revealed by the zero crossings of the impulse response of a linear system. The first zero crossing of the ideal impulse response appears at l A f c where f c is the cutoff frequency of the linear system. In other words, the zero crossings occur at relatively large time intervals from the impulse and relatively large time intervals from each other.
  • a theoretically ideal low pass filter with a bandwidth from 0 Hz to 20 kHz will have zero crossings in its impulse response at about 50 microseconds. The two zero crossings occur within 100 microseconds, which is 0.1 milliseconds. Furthermore, the sha ⁇ cutoff characteristic of such an ideal filter extends the impulse response in time with many oscillations.
  • the digital devices of the present invention can have an oscillating impulse response.
  • an exemplary digital device of the present invention with an oscillatory impulse response, can have zero crossings at approximately 30 and 60 micro-seconds and a total duration of over 150 micro-seconds.
  • the present invention can offer a broadband digital filter that exhibits the characteristics of both high frequency content and long duration (i.e., time smear or dwell time). Even the most staccato audio (e.g., musical) passage has a finite duration (theoretically, an impulse signal has zero duration). Therefore, with the present invention, the actual response to even the subtlest sound can be made of a duration significant enough to be effectively heard but will not be stretched so long as to be shrouded and blurred.
  • the present digital device can be classified in the special class of filters known as minimum-phase filters.
  • the frequency response of a minimum-phase filter may be defined in terms of two components, the magnitude (amplitude) response and the phase response.
  • the magnitude response and phase response of the filter are uniquely related to each other, in that if one is given, the other component can be computed uniquely.
  • a digital filter comprising a series of digitized time coefficients stored in a memory.
  • the time coefficients are mapped to a like number of frequency coefficients.
  • the frequency coefficients are spaced at frequency intervals, have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency to produce periodicity of a time response for the digital filter.
  • the time coefficients may be even or odd in number. Preferably, they are odd integers greater than or equal to 5, preferably 7, more preferably 9 and most preferably 11.
  • the time coefficients are defined by inverse discrete Fourier transforms of the frequency coefficients.
  • a portion of the frequency coefficients having frequencies within a free band are selected so as to achieve a generally constant oscillation frequency across a center band which is broader than the free band.
  • the frequency coefficients are spaced at equal frequency intervals.
  • a method of making a digital filter comprises the step of establishing a plurality of frequency response coefficients separated at frequency intervals.
  • the frequency response coefficients are graphically characterized by a base point, a series of principal points and a series of mirror points and have either zero phase angles or linearly spaced phase angles.
  • the base point has a frequency of zero.
  • a portion of the principal points are situated at frequencies encompassing at least the range of human hearing, or falling within the range of human hearing, and a portion of the mirror points have frequencies and amplitudes which mirror the portion of the principal points when viewed relative to a mid frequency higher than the frequencies of the principal points.
  • the method further comprises the steps of performing inverse discrete Fourier transformations to map the frequency response coefficients into corresponding time response coefficients, and storing the time response coefficients in a digital memory.
  • the portion of the principal points are situated at predetermined frequencies at a minimum within the range of human hearing and have amplitudes that roughly are inversely corresponding to human hearing sensitivity at the predetermined frequencies.
  • the frequency response coefficients can be established at uniformly spaced frequency intervals or non-uniformly spaced frequency intervals.
  • the establishing and performing steps preferably comprise the following steps: selecting a plurality of first frequency response coefficients separated at uniformly spaced frequency intervals, wherein the first frequency response coefficients have either zero phase angles or linearly spaced phase angles and each first frequency response coefficient further has an amplitude and a frequency; arranging the plurality of first frequency response coefficients in order from lowest frequency to highest frequency to define a list of first frequency response coefficients; performing inverse discrete Fourier transformations to map the plurality of first frequency response coefficients into corresponding first time response coefficients; discarding a pair of the first time response coefficients which have equal magnitudes and are positioned adjacent to one another in the list, with remaining time response coefficients defining second time response coefficients; assessing the effect on a frequency response of the digital filter after discarding the pair of the first time response coefficients; repeating the performing, discarding and assessing steps until a pair of discarded time response coefficients cause a significant change in the frequency response of the digital filter; and adding to remaining time response coefficients the pair of discarded time response coefficients
  • the added and remaining time response coefficients comprise final time response coefficients.
  • the method of making a digital filter preferably further comprises the steps of: multiplying each of the final time response coefficients by an integer conversion number to define converted final time response coefficients, wherein the conversion number is sufficiently large to permit discarding any remaining fractional portion without losing substantial final time response coefficient accuracy; and discarding from each of the converted final time response coefficients any remaining fractional portion.
  • the conversion number is preferably selected as a power of two.
  • the assessing step preferably comprises the steps of: generating a first frequency response curve from the first frequency response coefficients; performing discrete Fourier transformations to map the second time response coefficients into corresponding second frequency response coefficients; generating a second frequency response curve from the second frequency response coefficients; and comparing the first and second frequency response curves to determine if the second frequency response curve is substantially different from the first frequency response curve.
  • Each of the frequency response coefficients has an amplitude and a frequency.
  • the range of human hearing of interest is within a band of frequencies having a low end and a high end.
  • the human hearing range of interest may be the full range (typically from 20 Hz to 20kHz) or a portion thereof.
  • Many modern audio systems process audio signals of differing bandwidths. For example, (1) analog telephone signals consist of spectra covering about 300 Hz to about 3000 Hz, (2) Voice over Internet Protocol (VoIP) is limited to a range of up to about 8 kHz and (3) streaming audio is subject to other bandwidth restrictions.
  • VoIP Voice over Internet Protocol
  • a portion of or all of the frequency response coefficients, having frequencies between a reference or intermediate frequency and the high end increase in amplitude as per increasing frequencies from the reference frequency toward the high end and up to a significant amplitude peak at a peak high frequency so as to form a high frequency peak on the frequency response curve. It may also be preferable for a portion of or all of the frequency response coefficients, having frequencies between a reference frequency and the low end, to increase in amplitude as per decreasing frequencies from the reference frequency toward the low end and up to a significant amplitude peak at a peak low frequency. Preferably, a portion of or all of the frequency response coefficients, that form the frequency response curve, exhibit both of these frequency response characteristics.
  • a significant amplitude peak it is meant that the amplitude peak is at an amplitude or a frequency or both that is sufficient to result in a harmonic enhancement to the digital sound that is perceptible by the normal human ear, when converted into audible sound.
  • the frequency response curve exhibits up to a total of only two or three, significant amplitude peaks between the low end and the high end.
  • the frequency response curve of the present invention can include any number of insignificant amplitude peaks. Insignificant amplitude peaks are those that do not add any harmonic enhancement to the digital sound that is perceptible by the normal human ear, when converted into audible sound.
  • Insignificant amplitude peaks can include, for example, peaks that are at frequencies that fall on the edge or outside the range of normal human hearing, and peaks having small amplitudes that do not produce a perceptible change from the enhancement caused by the significant peak or peaks.
  • Such low amplitude insignificant peaks may be in the form of ripples in the frequency response curve (e.g., the portion of the curve that forms a significant amplitude peak).
  • the human ear is typically able to perceive changes in amplitude of about three decibels or more.
  • the frequency response coefficients having frequencies between the reference frequency and the high end, increase in amplitude up to a significant amplitude peak at a peak high frequency and decrease in amplitude as per increasing frequencies toward the high end above the peak high frequency.
  • the frequency response coefficients having frequencies between the reference frequency and the low end, increase in amplitude up to a significant amplitude peak at a peak low frequency and decrease in amplitude as per decreasing frequencies toward the low end below the peak low frequency.
  • the frequency response coefficients having frequencies between the reference frequency and the high end, increase in amplitude up to a significant amplitude peak at a peak high frequency, decrease in amplitude as per increasing frequencies down to a significant amplitude trough at a trough high frequency and increase in amplitude as per increasing frequencies toward the high end.
  • the reference frequency may fall within a range of from about 501 Hz to about 8018 Hz.
  • the peak high frequency may fall within a range of from about 2004 Hz to about 20045 Hz.
  • the amplification of the frequency response coefficient at the peak high frequency may be from about 1.3 times to about 6.0 times the amplification of the frequency response coefficient at the reference frequency.
  • the peak low frequency may fall within a range of from about 0 Hz to about 2004 Hz.
  • the amplification of the frequency response coefficient at the peak low frequency may be from about 1.1 times to about 3.0 times the amplification of the frequency response coefficient at the reference frequency.
  • a method for enhancing a series of digital audio samples comprising the steps of: receiving the series of digital audio samples, and generating a driving signal by convolving the series of samples in real time with a series of stored time coefficients.
  • the time coefficients are mapped to a like number of frequency coefficients.
  • the frequency coefficients are spaced at frequency intervals, have either zero phase angles or linearly spaced phase angles, and have amplitudes which are mirrored about a mid frequency.
  • the method for enhancing a series of digital audio samples may further comprise the step of generating an analog audio signal from the driving signal.
  • the time coefficients are preferably integer time coefficients.
  • the step of generating a driving signal comprises the step of: repeatedly solving the following equation for Y:
  • a 0 through A N .* are the stored time coefficients;
  • X(n) is the most recent sample received;
  • X(n-l) through X(n-[N-1]) correspond to N-l samples received prior to sample X(n);
  • n is the running index of the time coefficients being computed;
  • N is equal to the number of terms in the equation to the right side of the equal sign; and wherein calculated values of Y define the driving signal.
  • the step of generating a driving signal may further comprise the steps of dividing the values of Y by a number previously used to convert initial real number time coefficients to integer time coefficients; and discarding any remaining fractional portion of the divided values of Y.
  • the receiving step may comprise the step of reading the series of digital samples from a digital recording medium or from a compressed file.
  • the receiving step may also comprise the step of downloading audio sample streams from the Internet.
  • an apparatus for enhancing a series of digital audio samples.
  • the apparatus comprises a device for receiving the series of digital samples and a digital filter comprising a series of stored time response coefficients.
  • the time response coefficients are mapped to a like number of frequency response coefficients.
  • the frequency response coefficients are spaced at frequency intervals, have phase angles of zero and have amplitudes which are mirrored about a mid frequency.
  • the apparatus further comprises a microprocessor for generating a driving signal by convolving the sound samples in real time against the time coefficients.
  • the receiving device comprises a digital signal reader.
  • the microprocessor convolves the sound samples in real time against the time response coefficients by repeatedly solving the following equation for Y:
  • a 0 through A N _ are the stored time coefficients; X(n) is the most recent sample received; X(n-l) through X(n-[N-1]) correspond to N-l samples received prior to sample X(n); n is the running index of the time coefficients being computed; N is equal to the number of terms in the equation to the right side of the equal sign; and wherein calculated values of Y define the driving signal.
  • the microprocessor may further divide the values of Y by a number previously used to convert real number time coefficients to integer time coefficients and discards any remaining fractional portion of the divided values of Y.
  • the apparatus may further comprise a converting device responsive to the driving signal for generating an analog audio signal from the driving signal.
  • a filter package having two or more parallel digital filters.
  • a first filter is provided which comprises a series of digitized first time coefficients stored in a first memory. The time coefficients are mapped to a like number of first frequency coefficients. The first frequency coefficients have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency.
  • a second digital filter is provided and comprises a series of digitized second time coefficients at least one of which has a value which is different from each of the first time coefficients. The second time coefficients are stored in a second memory and mapped to a like number of second frequency coefficients. The second frequency coefficients have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency.
  • the number of the first time coefficients is equal to or greater than 5 and the number of the second time coefficients is equal to or greater than 5.
  • the first memory and the second memory may comprise the same memory component.
  • an apparatus for enhancing a series of digital audio samples.
  • the apparatus comprises a device for receiving the series of digital samples and a filter package.
  • the filter package has a first digital filter comprising a series of digitized first time response coefficients stored in a first memory. The time coefficients are mapped to a like number of first frequency coefficients. The first frequency coefficients have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency.
  • the filter package also includes a second digital filter comprising a series of digitized second time response coefficients at least one of which has a value which is different from each of the first time coefficients.
  • the second time coefficients are stored in a second memory and mapped to a like number of second frequency coefficients.
  • the second frequency coefficients have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency.
  • the apparatus further includes a microprocessor for generating a driving signal by convolving the sound samples in real time against either the first time response coefficients or the second time response coefficients.
  • the apparatus may further comprise an input device coupled to the microprocessor for selecting one of the first filter and the second filter.
  • the microprocessor generates the driving signal by convolving the sound samples in real time against the first time response coefficients when the first filter is selected and the second time response coefficients when the second filter is selected.
  • the apparatus may also comprise a converting device responsive to the driving signal for generating an analog audio signal from the driving signal.
  • Fig. 1 is a block diagram of one embodiment of an audio playback system according to the invention.
  • Fig. 2 is a schematic illustration of a frequency response curve for a digital filter in a first embodiment.
  • Fig 3 is a schematic illustration of a frequency response curve for a digital filter in a second embodiment.
  • Fig. 4 is a schematic illustration of a frequency response curve for a digital filter in a third embodiment.
  • Fig. 5 is a schematic illustration of a frequency response curve for a digital filter in a fourth embodiment.
  • Fig. 6 is a schematic illustration of a frequency response curve for a digital filter in a fifth embodiment.
  • Fig. 7 is a schematic illustration of a frequency response curve for a digital filter in a sixth embodiment.
  • Fig. 1 illustrates a personal computer 10, organized in accordance with the present invention.
  • personal computer 10 has a microprocessor 20, a CD reader 30 and a sound card 40.
  • digital filter 50 which preferably comprises an array of storage locations 52 within a random access memory. Further portions of the memory (not illustrated) may be devoted to other uses.
  • a series of N time response values, A 0 through A N .*, are tabulated in a written program for downloading into storage locations 52. These response values, hereinafter referred to as time coefficients, are established by a technique described below.
  • a system clock causes CD reader 30 to read a series of digitized sound samples, at a predetermined sampling frequency such as 44,100 Hz.
  • sampling frequencies such as 44,100 Hz.
  • sampling rates of 22,050 Hz and 11,025 Hz are used in many PC applications.
  • Some audio CD's are now produced using other sampling rates.
  • Digital video disks are produced at sampling rates of 64,000 Hz.
  • the telephone industry samples at a rate of 8000 Hz.
  • the apparatus and methods set out herein are applicable to any sampling rate.
  • Each sample is designated X(n) at the time of sampling and is stored temporarily in a designated storage location 22.
  • the sound samples are later shifted successively through a series of N-l other storage locations 22 for generation of time shifted samples X(n-l) through X(n-[N-1]).
  • Storage locations 22, may be thought of as being equivalent to stages of a shift register within microprocessor 20 and are so illustrated in Fig. 1.
  • microprocessor 20 may not have sufficient memory for this pu ⁇ ose, and therefore the computer program may cause the sound samples to be shifted through a designated portion of the random access memory.
  • Microprocessor 20 multiplies each A n by a corresponding
  • microprocessor 20 performs N shift operations, N multiplications and N summations for each sound sample read by CD reader 30.
  • the calculated sums are supplied to sound card 40 at 44,100 Hz and define a driving signal. Operations within CD reader 30 and sound card 40 are effected by conventional calls to the operating system. Shifting of samples, multiplication by time response coefficients and summation is a matter of routine programming and need not be further described.
  • a 0 through A N . are the stored time coefficients;
  • X(n) is the most recent sample received;
  • X(n- 1 ) through X(n-[N- 1 ]) correspond to N- 1 samples received prior to sample X(n);
  • n is the running index of the time coefficients being computed;
  • N is equal to the number of terms in the equation to the right side of the equal sign; and wherein calculated values of Y define the driving signal.
  • the value of N is at least 5, preferably 7, more preferably 9, and most preferably 11. It is also contemplated that the value of N may comprise an odd integer greater than 11. It is further contemplated that the value of N may comprise an even integer greater than or equal to 6, such as 8, 10 or 12. However, it is preferred that N comprise an odd integer.
  • the convolution process is carried out in real time using appropriate time response coefficients. These time response coefficients are calculated off-line and are mapped to a like number of frequency coefficients established as discussed below. That procedure generates N frequency coefficients H 0 -H N .,, each having a frequency and an amplitude, which are mapped into N time response coefficients Ao-A N ., by use of the discrete inverse Fourier transformation:
  • N may have a value of 19, which would call for 19 time response coefficients, all derived from a set of 19 frequency coefficients, regularly spaced at intervals F s in accordance with the time- frequency uncertainty principle, a relationship stated in the following equation:
  • the A n and the H k are all complex numbers, having a magnitude and a phase.
  • the H k have a phase angle of 0, and the A n have a phase angle of either 0 or ll.
  • Values of II are accommodated in the filter design by assigning a negative value to the amplitude of the time coefficient.
  • the 19 frequency domain points define a frequency response curve 58 extending from 0 Hz to 41,778 Hz as illustrated in Fig. 2. They include a base point 60, principal points 61 - 69 and mirror points 71 - 79. The specified numeric amplitudes of the various frequency domain points 60-69 and 71-79 do not have units.
  • Frequency domain points 60 - 69 and 71 - 79 are set at regular intervals of 2321 Hz with mirror points 71 - 79 having amplitudes equal to principal points 61 - 69 respectively. It will be observed that the principal points and the mirror points are mirrored about a mid frequency of 22,050 Hz, half the 44,100 Hz sampling frequency. This produces a periodic response at the sampling frequency. That periodicity effectively extends the frequency response curve 58 to 44,100 Hz and sets a virtual point (not illustrated) at 44,100 Hz having a magnitude equal to that of base point 60. This virtual frequency domain point is mapped to a virtual time response point (also not illustrated) equal to, and synchronous with, A 0 . Since 19 is an odd number there is no frequency domain point at the mirroring mid frequency.
  • frequency response coefficients 62-69 having frequencies between a reference frequency of 4642 Hz and a high end frequency of 20,000 Hz increase in amplitude as per increasing frequencies from the reference frequency up to a significant amplitude peak at a peak high frequency of 16,247 Hz and decrease in amplitude as per increasing frequencies down to a significant amplitude trough at a trough high frequency of 18,568 Hz.
  • the amplification of the frequency response coefficient at the peak high frequency is about 2.1 times the amplification of the frequency response coefficient at the reference frequency.
  • the frequency response coefficient 69 which is positioned between the reference frequency and a low end frequency of 20 Hz has an amplitude which is slightly greater than that of the frequency response coefficient 68 located at the reference frequency.
  • the required filter response is undefined outside the range of human hearing. That includes the region below about 20 Hz, a free band region between 20,000 Hz and 24, 100 Hz, and an end region between 24, 100 Hz and 44, 100 Hz.
  • the periodic nature of the filtering process requires some definition for all frequencies between zero and the sampling frequency. Therefore, frequency response points or taps were selected at amplitudes which would tend to lend smoothness to the response function in the undefined regions.
  • point 60 having a frequency of 0 Hz was selected so as to have an amplitude of 2.5, which is equal to the amplitude for point 69, which has a frequency of 2321 Hz.
  • the amplitudes of points 61 and 71 which fall within the free band region, were selected so as to achieve a generally constant oscillation frequency across a center band extending from about 11,605 Hz to about 32,494 Hz, i.e., the portions of the curve defined between points 65,62; 62,72; and 72,75 have approximately the same width.
  • Points 72-79 are mirror images of points 62-69.
  • a frequency response curve according to the present invention can be affected by manipulating various frequency response points or taps. Also included are some of the characteristics that can occur when manipulating specific taps located in each region of the frequency response curve noted.
  • manipulation of one or more of the taps at frequencies lower than the reference frequency can affect the low frequency response or base region of the frequency response curve (e.g., about 0 Hz to about 2004 Hz).
  • the base of the resulting sound can be made to sound more boomy and thick (i.e., the base sound is over emphasized and it loses some of its detail and clarity).
  • Raising the amplitude of one or more of the base region taps also seems to affect the high frequency response or treble region (above the reference frequency) so that the treble portion sounds thicker and not quite as brilliant (i.e., it loses detail and clarity). That is, affecting the base region in this manner can suppress the treble region in a manner similar to the high frequency masking effect caused by using noise suppression technology like that made by DolbyTM.
  • the amplitude of one or more of the taps in the base region are lowered, the base of the resulting sound can be made to sound much thinner, which can result in the treble region sounding brighter or brittle (i.e., more apparent or over emphasized), depending upon how the taps are adjusted.
  • manipulation of one or more of the taps within the range of the reference frequency can affect the reference frequency response of the frequency response curve.
  • Raising the amplitude of one or more of the taps in this region can produce sound appearing to have more air and greater separation between the sound sources, e.g., instruments (i.e., different sound sources are more easily distinguished). This greater separation can make the overall sound appear more like it was live and not recorded. Lowering the amplitude of one or more of the taps in this region can make the resulting signal sound dull and lifeless.
  • Manipulation of one or more of the taps at frequencies higher than the reference frequency, but within the frequency band "of interest" can affect the high frequency response or treble region of the frequency response curve (e.g., about 2004 Hz to about 20045 Hz).
  • the amplitude of the taps in the treble region By raising the amplitude of the taps in the treble region, more brilliance can be added to the resulting sound. In this way, a greater amount of detail can be offered, especially in audio application like, for example, a movie soundtrack.
  • the resulting sound can be made to sound flat and muted or muddy.
  • Manipulation of one or more of the taps out beyond the frequency band "of interest" can affect the dwell time of the signal, especially when the frequencies of the taps are at the edge or above the band of normal human hearing (e.g., about 20 Hz to about 20,000 Hz).
  • the center region of the frequency response curve of Fig. 3 contains taps 101 and 111. By raising the amplitude of the taps in this region, the dwell time can be reduced and the resulting enhancement can sound more like a mere amplitude shift, rather than overall harmonic enhancement. Increasing the amplitude of these taps can also increase the rate of clipping or overdriving of a signal.
  • the dwell time can be increased and the overall enhancement and stability of the audio signal improved, with a reduction in the risk of the signal being clipped.
  • the dwell time can be dramatically affected by changes in the amplitude of the center region taps, regardless of the number of the center region taps.
  • the rate of transition between the frequency band of interest and the center region can be controlled by adjusting the amplitude of one or more of the center region taps.
  • Dwell time is defined here as the time interval over which the impulse response of the filter has significant amplitude. For a filter with a fixed number of taps (i.e., a finite impulse response filter), the actual time duration of the impulse response cannot be modified.
  • a digital filter in accordance with a second embodiment of the present invention was created as follows.
  • a set of first frequency response coefficients separated at uniformly spaced frequency intervals were selected.
  • the first frequency response coefficients had zero phase angles.
  • the frequency response coefficients set out in Table 1 above were used as the first frequency response coefficients.
  • the plurality of first frequency response coefficients were mapped into corresponding first time response coefficients, which are also set out in Table 1.
  • the number of time coefficients should be minimized. So as to achieve that end, a pair of the first time response coefficients having equal magnitudes and being positioned adjacent to one another in Table 1 were removed. The discarded pair of time coefficients each had a magnitude of .0714.
  • the remaining time coefficients defined second time coefficients.
  • the one first time response coefficient not having a zero frequency and not having a mate of equal magnitude is also discarded.
  • a second frequency response curve was then generated using the second frequency response coefficients.
  • the first and second frequency response curves were compared to determine if the second frequency response curve was substantially different from the first frequency response curve. If not, then another pair of time response coefficients, i.e., a pair of the second time response coefficients, having equal magnitudes and being positioned adjacent to one another were removed. This process of removing and assessing continued until a pair of discarded time response coefficients caused a significant change in the perceived enhancement caused by the frequency response of the digital filter.
  • a significant change in the perceived enhancement caused by the frequency response of the digital filter corresponds to a substantial difference between an initial and a subsequent frequency response curve. When a significant change occurred, this last pair of discarded time response coefficients were added back to the time response coefficients. These remaining time response coefficients defined final time response coefficients.
  • the time response coefficients comprise integer numbers.
  • they were multiplied by an integer conversion number sufficiently large to permit any remaining fractional portion to be discarded without losing substantial final time response coefficient accuracy.
  • a substantial loss in final time response coefficient accuracy occurs when the resulting frequency response of the digital filter produces an enhancement perceptibly different from the desired enhancement.
  • the integer conversion number be selected as a power of two so that shifting may be used in place of division when subsequent renormalization occurs for calculated values of Y.
  • the final time coefficients (set out in brackets in Table 2 below) were multiplied by 2 13 (8192). Any remaining fractional portions of the converted time coefficients were discarded.
  • the integer final time coefficients are set out in Table 2 below. They may be tabulated in a written program for downloading into storage locations 52. This filter has 11 time response coefficients.
  • the microprocessor 20 Prior to the microprocessor 20 supplying the calculated values of Y to the sound card 40, the microprocessor 20 must renormalize those values. This involves dividing each value of Y by the integer conversion number. Alternatively, if the integer conversion number is selected as a power of two, the microprocessor 20 can effect renormalization by right shifting an appropriate number of bit positions, 13 in the illustrated embodiment.
  • Curve 80 has a base point 100. It also has principal points 101 - 105, which are mirrored by points 111- 115.
  • frequency response coefficients 102-105 having frequencies between a reference frequency of 4009 Hz and a high end frequency of 20,000 Hz increase in amplitude as per increasing frequencies from the reference frequency toward the high end.
  • a filter in accordance with a third embodiment of the present invention was created using the same design procedures undertaken to construct the filter of the second embodiment.
  • the third filter's final frequency and time response coefficients are set out in Table 3 below.
  • the final time response coefficients both before and after being multiplied by an integer conversion number equal to 8192 are set out in Table 3.
  • a frequency response curve 90 plotted using the final frequency response coefficients is shown in Fig. 4.
  • the curve has a base point 120, principal points 121-125 and mirror points 131-135.
  • frequency response coefficients 122-125 having frequencies between a reference frequency of 4009 Hz and a high end frequency of 20,000 Hz increase in amplitude as per increasing frequencies from the reference frequency toward the high end.
  • the procedure may be extended to other values of N.
  • Final frequency and time response coefficients for filters of fourth, fifth and sixth embodiments of the present invention are set out in Tables 4-6 below.
  • the final time response coefficients both before and after being multiplied by an integer conversion number equal to 8192 are set out in Tables 4-6.
  • Nine frequency and time response coefficients are provided for the filter set out in Table 4; seven frequency and time response coefficients are provided for the filter set out in Table 5; and five frequency and time response coefficients are provided for the filter set out in Table 6.
  • a frequency response curve 140 for the fourth filter is shown in Fig. 5
  • a frequency response curve 150 for the fifth filter is shown in Fig. 6
  • a frequency response curve 160 for the six filter is shown in Fig. 7.
  • the fourth and fifth filters provided an enhancement which is comparable to that provided by the first, second and third filters set out in Tables 1-3. While the enhancement of the sixth filter was not equal to that of the fourth and fifth filters, a noticeable enhancement did occur.
  • Filters constructed in accordance with the present invention may have a reference frequency which falls within the range of from about 501 Hz to about 8018 Hz; a peak high frequency which falls within the range of from about 1002 Hz to about 20045 Hz; a trough high frequency which can fall at any frequency after the peak high frequency; and a peak low frequency which falls within the range of from about 0 Hz to about 2004 Hz.
  • the amplification of the frequency response coefficient at the peak high frequency may be about 1.3 times to about 6.0 times the amplification of the frequency response coefficient at the reference frequency.
  • the amplification of the frequency response coefficient at the peak low frequency may be about 1.1 times to about 3.0 times the amplification of the frequency response coefficient at the reference frequency.
  • filters of the present invention may be used with a microprocessor or like device coupled to a digital signal reader (e.g., a real audio editor, CD or DVD player, etc.) which reads digital signals.
  • the source of such digital signals can be digital samples stored on a digital recording medium as well as digital signals from, for example, audio streams (i.e., compressed packets of digital information) received, for example, over the Internet; MP3, liquid audio, a2b, and other compressed files received over the Internet and stored on any conventional memory device; etc.
  • audio streams i.e., compressed packets of digital information
  • the audio signals may be processed directly.
  • compressed data is read, the data must be uncompressed prior to processing.
  • a microprocessor may store the driving signal in any conventional memory device capable of storing digital data, transfer the driving signal over the Internet, the airways (via for example radio or television signals, microwaves, etc.), a network, cable TV lines, the telephone lines, etc. to another digital signal reader, or digital signal storing device.
  • the filtering according to the present invention can be performed on a digital signal before it is compressed or after it is uncompressed, it has been found preferable for digital audio signals to be filtered according to the present invention before being compressed.
  • digital audio signals When comparing a recorded compressed audio format that was enhanced by the present filtering technique prior to the compression process, to one enhanced after compression, a significant difference in overall sound quality has resulted.
  • an audio format has been enhanced according to the preset invention before or after compression, the enhanced file always sounds better than the same recording without the present enhancement.
  • Audio signal compression works by applying a mathematical transform (e.g., Discrete Cosine Transform or DCT) to the audio signal, resulting in a set of DCT frequency coefficients (i.e., "coefficients"). These coefficients are used to recreate the original audio file, as needed.
  • the compression process typically includes a quantizing step in which the subtlest characteristics of the digital audio file (often high frequency coefficients) are discarded, since they usually turn out to be small and are believed not necessary to substantially recreate the original audio file. This discarding of coefficients means that the audio file will not be able to be completely reconstructed from the remaining coefficients.
  • audio signal compression techniques e.g., MP3 are often referred to as "Lossy" transformations.
  • the frequency characteristics of the original file are altered so as to emphasize the characteristics of the sound to which the ear is less sensitive.
  • the corresponding coefficients of the audio file are less likely to be evaluated by the DCT as being small and, therefore, subject to quantizing. If the coefficients are not considered small, they are less likely to be discarded. It is the most dominant signal characteristics (i.e., typically those located around the reference frequency) that are the more likely to be discarded. The result is a more enhanced sound after decompression (i.e., the resulting audio signal sounds more like or better than the pre-compressed sound).
  • Frequency response coefficients for a filter of a seventh embodiment of the present invention are set out in Table 6A below.
  • the frequency response coefficients have linearly spaced phase angles. This is in contrast to the zero phase angles of the time response coefficients set out in Tables 1-6 above.
  • the frequency response coefficient may be mapped into corresponding time response coefficients by use of the discrete inverse Fourier transformation set out above.
  • a filter package may comprise a first filter having a series of digitized first time coefficients stored in the memory component.
  • the first time coefficients are mapped to a like number of first frequency coefficients.
  • the first frequency coefficients have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency.
  • the filter package further includes a second digital filter comprising a series of digitized second time coefficients at least one of which has a value which is different from each of the first time coefficients.
  • the second time coefficients are stored in a second memory and mapped to a like number of second frequency coefficients.
  • the second frequency coefficients have either zero phase angles or linearly spaced phase angles and have amplitudes which are mirrored about a mid frequency.
  • An input device such as a keyboard is coupled to a microprocessor for allowing the user to select one of the first and second filters for use at any given time for enhancement of digital audio samples.
  • the microprocessor generates a driving signal by convolving the sound samples in real time against the first time response coefficients when the first filter is selected and the second time response coefficients when the second filter is selected.
  • the apparatus may also comprise a converting device responsive to the driving signal for generating an analog audio signal from the driving signal.
  • the filter package may be provided to a prospective customer who intends to purchase a hardware version of a digital filter. Each filter of the software filter package is then tested by the customer. Once a desired filter is selected, the filter manufacturer designs a hardware version of the selected filter and provides it to the customer for testing. Accordingly, the digital filter package allows a filter manufacturer to provide two or more filters to a prospective customer for testing without having to design and build two or more separate hardware filters.
  • the time coefficients set out in Table 1 may be multiplied by an integer conversion number.
  • the digital filters disclosed above are finite impulse response (FIR) filters.
  • FIR finite impulse response
  • the procedure for designing an infinite impulse response (IIR) filter implementation can be very similar to the above described FIR procedure. Assuming that a desired frequency response has been specified at various frequencies, an IIR filter can be developed using the same technique described above for the FIR filter implementation.
  • IIR filter coefficient computation technique such as that found in Matlab (a software system for filter design and digital signal processing) choose an IIR filter order and input the desired filter order and frequency response.
  • the Matlab function will return the IIR filter coefficients that meet the frequency response and filter order requirements, within some tolerance that is a function of the chosen filter order.
  • the IIR filter has the form (first-order example), as follows:
  • the FIR includes terms from the input, i.e., x(k).
  • the filter coefficients are prepared for real-time implementation as in the FIR case. Each coefficient is multiplied by a large number and the decimal portion of the result is eliminated. In this way the coefficients are made integers, which can make the real-time implementation faster for most computer systems. Because improper scaling could lead to instability, care should be taken when performing this operation with an IIR filter due to the feedback term.
  • the IIR filter has the advantage of potentially requiring fewer coefficients and therefore a more efficient real-time implementation. This means that an IIR filter may take less time to compute an output sample than an equivalent FIR filter.
  • a possible disadvantage of the IIR filter is that it typically does not possess linear phase. If the filter design requires a linear phase, as is preferred for the frequency response described above, then an FIR filter is normally the only choice.
  • the IIR filter may have approximately linear phase and may only approximate the magnitude response, depending on the order of the IIR filter specified in the design. It is possible to obtain linear phase from an IIR filter by running the data through the IIR filter twice (both forward and backward). However, this doubles the order of the filter.
  • the follow equation is a 3rd order IIR filter approximation of the frequency response disclosed above.
  • y(k) 0.0355x(k) - 0.0709x(k-l) + 0.1509x(k-2) + 0.0533x(k-3) - 0.2944y(k-l) - 0.0399y(k-2) + 0.1033y(k-3)
  • the phase response for the 3rd order IIR filter is nonlinear, especially at the higher frequencies, although it appears to be very linear up to about 15 kHz. Hence, at higher frequencies the nonlinear phase may also contribute to noticeable differences in reproduction quality.
  • Phase means phase of the time coefficient. All Frequency phases are zero. 40a
  • Phase means phase of the time coefficient. All Frequency phases are zero.
  • Phase means phase of the time coefficient. All Frequency phases are zero.
  • Phase means phase of the time coefficient. All Frequency phases are zero.
  • Phase means phase of the time coefficient. All Frequency phases are zero.
  • Phase means phase of the time coefficient. All Frequency phases are zero. 53
  • Phase means phase of the time coefficient. All Frequency phases are zero.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)

Abstract

L'invention concerne un filtre numérique permettant une lecture de qualité élevée d'un signal audio enregistré. Ce filtre est conçu dans le domaine des fréquences par établissement d'un point de base, et d'une série de points principaux à des intervalles de fréquence réguliers. Lesdits points principaux sont manipulés par symétrie à l'aide d'une série de points symétriques de fréquence plus élevée, agencés symétriquement autour d'une fréquence moyenne, au niveau de la moitié de la fréquence d'échantillonnage du signal audio enregistré. Ces points principaux situés dans la plage d'audition humaine représentent des amplitudes données correspondant approximativement inversement à la sensibilité d'écoute à des fréquences associées. Une fois la procédure de conception exécutée, les points du domaine de fréquences sont mappés en coefficients de réponse temporelle, à l'aide de la transformée de Fourier discrète inverse. Ces coefficients de réponse temporels sont ensuite stockés pour une convolution en temps réel avec des échantillons audio enregistrés.
PCT/US2000/021912 1993-02-23 2000-08-10 Filtre numérique WO2001013515A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU66310/00A AU6631000A (en) 1999-08-13 2000-08-10 Digital filter
US09/887,606 US20050259833A1 (en) 1993-02-23 2001-06-22 Frequency responses, apparatus and methods for the harmonic enhancement of audio signals

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US14878799P 1999-08-13 1999-08-13
US60/148,787 1999-08-13
US43137199A 1999-11-01 1999-11-01
US09/431,371 1999-11-01

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US09/887,606 Continuation-In-Part US20050259833A1 (en) 1993-02-23 2001-06-22 Frequency responses, apparatus and methods for the harmonic enhancement of audio signals

Publications (1)

Publication Number Publication Date
WO2001013515A1 true WO2001013515A1 (fr) 2001-02-22

Family

ID=26846167

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/021912 WO2001013515A1 (fr) 1993-02-23 2000-08-10 Filtre numérique

Country Status (2)

Country Link
AU (1) AU6631000A (fr)
WO (1) WO2001013515A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116827308A (zh) * 2023-08-24 2023-09-29 上海力通通信有限公司 资源优化型fir滤波器及其实现方法

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
G.A NELSON ET AL: "High speed octave band digital filtering", IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS., vol. 20, no. 1, March 1972 (1972-03-01), IEEE INC. NEW YORK., US, pages 58 - 65, XP002156334 *
L.R.RABINER: "techniques for designing finite duration impilse response digital filters", IEEE TRANSACTIONS ON COMMUNICATION TECHNOLOGY., vol. 19, no. 2, April 1971 (1971-04-01), IEEE INC. NEW YORK., US, pages 188 - 195, XP002156333 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116827308A (zh) * 2023-08-24 2023-09-29 上海力通通信有限公司 资源优化型fir滤波器及其实现方法
CN116827308B (zh) * 2023-08-24 2023-11-24 上海力通通信有限公司 资源优化型fir滤波器及其实现方法

Also Published As

Publication number Publication date
AU6631000A (en) 2001-03-13

Similar Documents

Publication Publication Date Title
US7369906B2 (en) Digital audio signal processing
US6504935B1 (en) Method and apparatus for the modeling and synthesis of harmonic distortion
US6118879A (en) BTSC encoder
US5075880A (en) Method and apparatus for time domain interpolation of digital audio signals
US8094835B2 (en) Signal processing apparatus
JP3765622B2 (ja) オーディオ符号化復号化システム
KR20060004695A (ko) 오디오 신호의 대역을 확장하기 위한 방법 및 장치
JP2008191659A (ja) 音声強調方法及び音声再生システム
JP2010020356A (ja) オーディオ信号帯域拡張装置
US5267095A (en) Digital reconstructing of harmonics to extend band of frequency response
CN1550002A (zh) 声音信号的带宽扩展
JP2007178675A (ja) オーディオ再生の効果付加方法およびその装置
JPH06177688A (ja) オーディオ信号処理装置
Kulp Digital equalization using Fourier transform techniques
EP1895516B1 (fr) Appareil et methode pour elargir une bande de signal audio
EP1163721B1 (fr) Filtre pour convertisseurs analogiques-numériques
US7233833B2 (en) Method of modifying low frequency components of a digital audio signal
JPH0537298A (ja) デイジタルフイルタ
JP4070835B2 (ja) オーディオ信号のフィルタリングのための方法及び装置
US20050259833A1 (en) Frequency responses, apparatus and methods for the harmonic enhancement of audio signals
CN101422054A (zh) 声像定位装置
EP1117179B1 (fr) Egalisateur graphique/paramétrique numérique
WO2001013515A1 (fr) Filtre numérique
US8022289B2 (en) Harmonic sound generator and a method for producing harmonic sound
JP3873282B2 (ja) 音色制御フィルタ

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 09887606

Country of ref document: US

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP