WO2023024725A1 - 音频控制方法及装置、终端和存储介质 - Google Patents

音频控制方法及装置、终端和存储介质 Download PDF

Info

Publication number
WO2023024725A1
WO2023024725A1 PCT/CN2022/104501 CN2022104501W WO2023024725A1 WO 2023024725 A1 WO2023024725 A1 WO 2023024725A1 CN 2022104501 W CN2022104501 W CN 2022104501W WO 2023024725 A1 WO2023024725 A1 WO 2023024725A1
Authority
WO
WIPO (PCT)
Prior art keywords
loudness
signal
frequency
processed
audio signal
Prior art date
Application number
PCT/CN2022/104501
Other languages
English (en)
French (fr)
Inventor
许逸君
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Publication of WO2023024725A1 publication Critical patent/WO2023024725A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/60Substation equipment, e.g. for use by subscribers including speech amplifiers
    • H04M1/6033Substation equipment, e.g. for use by subscribers including speech amplifiers for providing handsfree use or a loudspeaker mode in telephone sets
    • H04M1/6041Portable telephones adapted for handsfree use
    • H04M1/605Portable telephones adapted for handsfree use involving control of the receiver volume to provide a dual operational mode at close or far distance from the user
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the present application relates to the technical field of audio processing, and in particular to an audio control method, an audio control device, a terminal, and a non-volatile computer-readable storage medium.
  • mobile terminals are capable of playing audio.
  • the present application provides an audio control method, an audio control device, a terminal and a non-volatile computer-readable storage medium.
  • the audio control method in the embodiment of the present application includes determining the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the preset mapping curve, the preset mapping curve is determined according to the equal loudness curve of different loudness, in the In the preset mapping curve, the signal of each frequency corresponds to the compensation; according to the signal of each frequency and the compensation corresponding to the signal of each frequency, the audio to be processed is calculated the loudness of the signal; and adjusting the loudness to a preset loudness range.
  • the audio control device in the embodiment of the present application includes a determination module, a calculation module and an adjustment module.
  • the determination module is used to determine the compensation corresponding to the signal of each frequency in the audio signal to be processed according to a preset mapping curve.
  • the preset mapping curve is determined according to equal loudness curves of different loudness.
  • the signal of each frequency corresponds to the compensation;
  • the calculation module is used to calculate the compensation according to the signal of each frequency and the compensation corresponding to the signal of each frequency the loudness of the audio signal to be processed; and the adjustment module is used to adjust the loudness to a preset loudness range.
  • the terminal in the embodiment of the present application includes a processor.
  • the processor is configured to: determine the compensation corresponding to the signal of each frequency in the audio signal to be processed according to a preset mapping curve, the preset mapping curve is determined according to equal loudness curves of different loudness, in the preset In the mapping curve, the signal of each frequency corresponds to the compensation; according to the signal of each frequency and the compensation corresponding to the signal of each frequency, the loudness of the audio signal to be processed is calculated ; and adjust the loudness to a preset loudness range.
  • the processors are made to execute the audio control method.
  • the audio control method includes determining the compensation corresponding to each frequency signal in the audio signal to be processed according to a preset mapping curve, the preset mapping curve is determined according to equal loudness curves of different loudness, in the preset In the mapping curve, the signal of each frequency corresponds to the compensation; according to the signal of each frequency and the compensation corresponding to the signal of each frequency, the loudness of the audio signal to be processed is calculated ; and adjust the loudness to a preset loudness range.
  • FIG. 1 is a schematic flowchart of an audio control method in some embodiments of the present application
  • FIG. 2 is a block diagram of an audio control device in some embodiments of the present application.
  • FIG. 3 is a schematic plan view of a terminal in some embodiments of the present application.
  • FIG. 4 and FIG. 5 are schematic diagrams of the principles of the audio control method in some embodiments of the present application.
  • FIG. 6 to 8 are schematic flowcharts of audio control methods in some embodiments of the present application.
  • Fig. 9 is a schematic diagram of connection between a processor and a computer-readable storage medium in some embodiments of the present application.
  • the audio control method in the embodiment of the present application includes determining the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the preset mapping curve.
  • the preset mapping curve is determined according to the equal loudness curve of different loudness.
  • the signal of each frequency corresponds to the compensation; calculate the loudness of the audio signal to be processed according to the signal of each frequency and the compensation corresponding to the signal of each frequency; and adjust the loudness to the preset loudness range.
  • the audio control method further includes acquiring an audio input signal, and dividing the audio input signal into a plurality of initial audio signals with a predetermined duration, where the initial audio signals are time-domain signals.
  • the audio control method further includes transforming the original audio signal to convert it into an audio signal to be processed, where the audio signal to be processed is a frequency domain signal.
  • the compensation corresponding to each frequency signal in the preset first frequency band in the mapping curve is a negative number, and is negatively correlated with the magnitude of the frequency; each signal in the preset second frequency band in the mapping curve The compensation corresponding to the frequency signal is a positive number and is positively correlated with the magnitude of the frequency; the compensation corresponding to the signal of each frequency in the third frequency band preset in the mapping curve is a positive number and a fixed value; the first frequency band, The second frequency band and the third frequency band do not overlap with each other, and the center frequency of the first frequency band, the center frequency of the second frequency band and the center frequency of the third frequency band increase sequentially.
  • adjusting the loudness to a preset loudness range includes:
  • the loudness corresponding to the largest amplitude in the audio signal to be processed is obtained, and a first gain is performed on the audio signal to be processed, so that the loudness corresponding to the largest amplitude reaches a predetermined loudness, and generating a first processed audio signal;
  • a second gain is performed on the first processed audio signal within the first loudness range to generate a second processed audio signal, the second processed audio signal
  • the loudness of is in the loudness range.
  • adjusting the loudness to a preset loudness range includes:
  • a first gain is performed on the audio signal to be processed to generate a first processed audio signal, and the loudness of the first audio signal is within the loudness range.
  • performing a second gain on a signal within the first loudness range in the first processed audio signal to generate a second processed audio signal includes:
  • a signal in the first frequency band, a signal in the second frequency band, and a signal in the third frequency band within the first loudness range are respectively subjected to a second gain to generate a second processed audio signal;
  • Performing a second gain on a signal within the second loudness range in the first processed audio signal to generate a second processed audio signal comprising:
  • a signal of the first frequency band, a signal of the second frequency band and a signal of the third frequency band within the second loudness range are respectively subjected to a second gain to generate a second processed audio signal, the first frequency band, The second frequency band and the third frequency band do not overlap with each other.
  • calculating the loudness of the audio signal to be processed according to the signal of each frequency and the compensation corresponding to the signal of each frequency includes:
  • the frequency band corresponding to the audio signal of the first frequency band is [0 Hz, 1000 Hz)
  • the frequency band corresponding to the audio signal of the second frequency band is [1000 Hz, 5000 Hz)
  • the audio signal of the third frequency band corresponds to The frequency bands are [5000 Hz, 16000 Hz).
  • the audio control device 10 includes a determination module 11 , a calculation module 12 and an adjustment module 13 .
  • the determination module 11 is used to determine the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the preset mapping curve.
  • the preset mapping curve is determined according to the equal loudness curves of different loudness. In the preset mapping curve, each There is a one-to-one correspondence between the signals of each frequency and the compensation; the calculation module 12 is used to calculate the loudness of the audio signal to be processed according to the signal of each frequency and the compensation corresponding to the signal of each frequency; the adjustment module 13 is used to adjust the loudness to a preset Loudness range.
  • the terminal 100 in the embodiment of the present application includes a processor 20 .
  • the processor 20 is used to determine the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the preset mapping curve.
  • the preset mapping curve is determined according to the equal loudness curves of different loudness.
  • each There is a one-to-one correspondence between the signals of each frequency and the compensation; the loudness of the audio signal to be processed is calculated according to the signal of each frequency and the compensation corresponding to the signal of each frequency; and the loudness is adjusted to a preset loudness range.
  • the processor 20 is further configured to obtain an audio input signal, and divide the audio input signal into a plurality of initial audio signals with a predetermined duration, where the initial audio signals are time-domain signals.
  • the processor 20 is further configured to perform transformation processing on the original audio signal to convert it into an audio signal to be processed, and the audio signal to be processed is a frequency domain signal.
  • the compensation corresponding to each frequency signal in the preset first frequency band in the mapping curve is a negative number, and is negatively correlated with the magnitude of the frequency; each signal in the preset second frequency band in the mapping curve The compensation corresponding to the frequency signal is a positive number and is positively correlated with the magnitude of the frequency; the compensation corresponding to the signal of each frequency in the third frequency band preset in the mapping curve is a positive number and a fixed value; the first frequency band, The second frequency band and the third frequency band do not overlap with each other, and the center frequency of the first frequency band, the center frequency of the second frequency band and the center frequency of the third frequency band increase sequentially.
  • the processor 20 is further configured to obtain the loudness corresponding to the largest amplitude in the audio signal to be processed when the loudness of the audio signal to be processed is smaller than the minimum value of the loudness range, and perform the first gain on the audio signal to be processed , so that the loudness corresponding to the maximum amplitude reaches a predetermined loudness, and a first processed audio signal is generated; when the loudness of the first processed audio signal is smaller than the minimum value of the loudness range, the first processed audio signal is located in the first loudness range The signal within is subjected to a second gain to generate a second processed audio signal, and the loudness of the second processed audio signal is within the loudness range.
  • the processor 20 is further configured to perform a first gain on the audio signal to be processed when the loudness of the audio signal to be processed is greater than the maximum value of the loudness range, so as to generate a first processed audio signal, the first audio signal Loudness is in the loudness range.
  • the processor 20 is further configured to perform a second gain on the first processed audio signal, the signal of the first frequency band, the signal of the second frequency band and the signal of the third frequency band within the first loudness range , to generate a second processed audio signal; in the first processed audio signal, the signal of the first frequency band, the signal of the second frequency band and the signal of the third frequency band within the second loudness range are respectively subjected to the second gain to generate the second Second, the audio signal is processed, and the first frequency band, the second frequency band and the third frequency band do not overlap with each other.
  • the processor 20 is further configured to calculate the loudness corresponding to the signal of each frequency according to the compensation and amplitude corresponding to the signal of each frequency, and use the sum of the loudness corresponding to the signal of each frequency as the audio to be processed The loudness of the signal.
  • the frequency band corresponding to the audio signal of the first frequency band is [0 Hz, 1000 Hz)
  • the frequency band corresponding to the audio signal of the second frequency band is [1000 Hz, 5000 Hz)
  • the audio signal of the third frequency band corresponds to The frequency bands are [5000 Hz, 16000 Hz).
  • one or more non-transitory computer-readable storage media 300 containing a computer program 302 when the computer program 302 is executed by one or more processors 20, the processors 20 can Execute the audio control method in any one of the above implementation manners.
  • the audio control method in the embodiment of the present application includes the following steps:
  • 011 Determine the compensation corresponding to each frequency signal in the audio signal to be processed according to the preset mapping curve.
  • the preset mapping curve is determined according to the equal loudness curve of different loudness. In the preset mapping curve, each frequency One-to-one correspondence between signal and compensation;
  • 012 Calculate the loudness of the audio signal to be processed according to the signal of each frequency and the compensation corresponding to the signal of each frequency;
  • the audio control device 10 in the embodiment of the present application includes a determination module 11 , a calculation module 12 and an adjustment module 13 .
  • the determination module 11 is used to determine the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the preset mapping curve.
  • the preset mapping curve is determined according to the equal loudness curves of different loudness. In the preset mapping curve, each There is a one-to-one correspondence between the signals of each frequency and the compensation; the calculation module 12 is used to calculate the loudness of the audio signal to be processed according to the signal of each frequency and the compensation corresponding to the signal of each frequency;
  • the adjustment module 13 is used to adjust the loudness to a preset Loudness range. That is to say, step 011 can be implemented by the determination module 11 , step 012 can be performed by the calculation module 12 and step 013 can be performed by the adjustment module 13 .
  • the terminal 100 in the embodiment of the present application includes a processor 20 .
  • the processor 20 is used to determine the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the preset mapping curve.
  • the preset mapping curve is determined according to the equal loudness curves of different loudness.
  • each There is a one-to-one correspondence between the signals of each frequency and the compensation; the loudness of the audio signal to be processed is calculated according to the signal of each frequency and the compensation corresponding to the signal of each frequency; and the loudness is adjusted to a preset loudness range. That is to say, step 011 , step 012 and step 013 may be executed by the processor 20 .
  • the terminal 100 may be a mobile phone, a smart watch, a tablet computer, a display device, a notebook computer, a teller machine, a gate, a head-mounted display device, a game machine, and the like.
  • the embodiment of the present application is described by taking the terminal 100 as an example of a mobile phone. It can be understood that the specific form of the terminal 100 is not limited to the mobile phone.
  • the human ear has different real hearing perceptions of different frequency bands at the same loudness, when audio of different frequencies is played at the same volume, the loudness heard by the human ear is different. For example, at the same volume, the loudness of the human ear to receive low-frequency audio signals is greater than that of high-frequency audio signals. For human ears, the loudness of low-frequency signals will be greater. Therefore, if you want to keep If the receiving loudness of the low-frequency audio signal and the high-frequency audio signal are consistent, it is necessary to process the low-frequency audio signal and/or the high-frequency audio signal, such as reducing the loudness of the low-frequency audio signal and/or increasing the high-frequency audio signal loudness.
  • the amplitude of reducing the loudness of the low-frequency audio signal and/or increasing the loudness of the high-frequency audio signal can be determined according to the equal-loudness curves corresponding to the actual loudness of signals of different frequencies, that is, the compensation of the audio signals of different frequencies can be determined.
  • Figure 4 is the equal loudness curve of different frequencies, the abscissa is the frequency, the ordinate is the actual loudness, and the curve indicates the actual loudness corresponding to different frequencies under the same received loudness. According to multiple equal loudness curves, a mapping curve can be fitted to determine the compensation corresponding to different frequencies.
  • the abscissa is the frequency
  • the ordinate is the compensation value.
  • the compensation values of the low-frequency audio signal are all negative values, that is, negative compensation, which means that in the same reception Under the loudness, the actual loudness of the low-frequency audio signal needs to be reduced, and the size of the negative compensation is positively related to the frequency, that is, the higher the frequency, the smaller the actual loudness needs to be reduced;
  • the compensation values for the intermediate frequency signal are all positive numbers , that is, positive compensation, which means that under the same received loudness, the actual loudness of the intermediate frequency audio signal needs to be increased, and the size of the positive compensation is positively correlated with the frequency, that is, the higher the frequency, the smaller the actual loudness needs to be increased; and
  • the compensation values are all positive numbers, that is, positive compensation, which means that under the same receiving loudness, the actual loudness
  • the first frequency band, the second frequency band and the third frequency band do not overlap with each other, and the center frequency of the first frequency band, the center frequency of the second frequency band and the center frequency of the third frequency band increase sequentially.
  • the frequency band corresponding to the audio signal of low frequency i.e. the audio signal of the first frequency band
  • the frequency band corresponding to the audio signal of intermediate frequency i.e. the audio signal of the second frequency band
  • the frequency band corresponding to the high-frequency audio signal that is, the audio signal in the third frequency band
  • the frequency band corresponding to the high-frequency audio signal that is, the audio signal in the third frequency band
  • the processor 20 can acquire the initial audio signal.
  • the initial audio signal is generally a time-domain signal. Therefore, in order to facilitate the acquisition of the amplitude corresponding to each frequency, it is necessary to first convert the initial audio signal. For example, Fourier transform is performed on the original audio signal, so as to convert the original audio signal into an audio signal to be processed in the frequency domain.
  • processor 20 determines the compensation corresponding to the signal of each frequency in the audio signal to be processed according to the mapping curve, and then calculates the loudness of the audio signal to be processed according to the compensation and amplitude corresponding to the signal of each frequency, because for each frequency All signals are compensated accordingly, so that the final calculated loudness of the audio signal to be processed can accurately represent the receiving loudness of the human ear. Then the processor 20 adjusts the loudness of the audio signal to be processed so that the loudness of the audio signal to be processed reaches a preset loudness range.
  • the preset loudness range is determined according to user input, for example, the preset loudness range is determined according to the volume adjusted by the user.
  • the adjusted loudness conforms to the preset loudness range, so as to ensure that different audio signals conform to the actual sound of the human ear after the above adjustments. sense of hearing.
  • the preset mapping curve can also be adjusted by the user's input.
  • a normal volume such as 50% of the maximum volume
  • the processor 20 can increase the overall compensation of the mapping curve according to the user's input.
  • the processor 20 can reduce the overall compensation of the mapping curve (for example, by 20%) according to the user's input, so that after the audio loudness is adjusted, it conforms to the current user's sense of hearing.
  • the audio control method, audio control device 10, and terminal 100 of the present application compensate the signal of each frequency in the audio signal according to the preset mapping curve determined by the equal loudness curves of different frequencies, thereby eliminating the noise caused by different frequencies.
  • the loudness difference makes the calculated loudness of the audio signal more accurate and conforms to the actual hearing of the human ear, so that the playback experience of the audio signal adjusted to the preset loudness range is better, preventing the audio from being played at the same volume
  • the human ear receives audio of different frequencies
  • the receiving loudness is different. For example, the receiving loudness of low-frequency signals will be higher than that of high-frequency signals.
  • the audio control method also includes:
  • 014 Acquire an audio input signal, and divide the audio input signal into multiple initial audio signals of a predetermined duration, where the initial audio signal is a time-domain signal.
  • the audio control device 10 further includes an acquisition module 14, and the acquisition module 14 is also used to acquire an audio input signal, and divide the audio input signal into a plurality of initial audio signals of a predetermined duration, and the initial audio signal is a time domain Signal. That is to say, step 014 can be executed by the obtaining module 14 .
  • the processor 20 is further configured to acquire an audio input signal, and divide the audio input signal into a plurality of initial audio signals with a predetermined duration, where the initial audio signals are time-domain signals. That is to say, step 014 may be executed by the processor 20 .
  • the processor 20 can receive an audio input signal. It can be understood that the audio input signal is a long time-domain signal. Therefore, in order to ensure the accuracy of loudness calculation of the audio signal to be processed, the audio input signal needs to be segmented to Multiple initial audio signals are generated, for example, divided by a predetermined duration (such as 10 milliseconds (ms), 15 ms, 20 ms, etc.). In order to prevent the length of the audio signal to be processed after the conversion processing of each initial audio signal from being too long, affecting the accuracy of loudness calculation of the audio signal to be processed.
  • the predetermined duration can also be comprehensively determined according to the duration of the audio input signal, the processing efficiency of the audio signal by the processor 20, etc., so as to achieve a balance between degree calculation accuracy and processing speed.
  • step 012 includes:
  • 0121 Calculate the loudness corresponding to the signal of each frequency according to the compensation and amplitude corresponding to the signal of each frequency, and use the sum of the loudness corresponding to the signal of each frequency as the loudness of the audio signal to be processed.
  • the calculation module 12 is also used to calculate the loudness corresponding to the signal of each frequency according to the compensation and amplitude corresponding to the signal of each frequency, and use the sum of the loudness corresponding to the signal of each frequency as the audio to be processed The loudness of the signal. That is to say, step 121 can be executed by the calculation module 12 .
  • the processor 20 is further configured to calculate the loudness corresponding to the signal of each frequency according to the compensation and amplitude corresponding to the signal of each frequency, and use the sum of the loudness corresponding to the signal of each frequency as the audio to be processed The loudness of the signal. That is to say, step 0121 may be executed by the processor 20 .
  • the compensation corresponding to the signal of each frequency in the audio signal to be processed can be determined according to the preset mapping curve, and according to the amplitude of the signal of each frequency and the corresponding compensation, That is, the loudness of the signal of each frequency can be calculated, and then the loudness of the signal of each frequency in the audio signal to be processed can be added to obtain the loudness of the audio signal to be processed.
  • the average value of the loudness of each frequency signal in the audio signal to be processed is taken as the loudness of the audio signal to be processed.
  • step 013 includes:
  • the adjustment module 13 is further configured to obtain the loudness corresponding to the largest amplitude in the audio signal to be processed when the loudness of the audio signal to be processed is smaller than the minimum value of the loudness range, and perform the first gain on the audio signal to be processed , so that the loudness corresponding to the maximum amplitude reaches a predetermined loudness, and a first processed audio signal is generated; when the loudness of the first processed audio signal is smaller than the minimum value of the loudness range, the first processed audio signal is located in the first loudness range The second gain is performed on the signal within to generate a second processed audio signal, the loudness of the second processed audio signal is within the loudness range; when the loudness of the audio signal to be processed is greater than the maximum value of the loudness range, the audio signal to be processed is subjected to the first gain to generate a first processed audio signal, the loudness of the first audio signal is within the loudness range. That is to say, step 0131 , step 0132 and step 0133 can be executed by the adjustment module
  • the processor 20 is further configured to obtain the loudness corresponding to the largest amplitude in the audio signal to be processed when the loudness of the audio signal to be processed is smaller than the minimum value of the loudness range, and perform the first gain on the audio signal to be processed , so that the loudness corresponding to the maximum amplitude reaches a predetermined loudness, and a first processed audio signal is generated; when the loudness of the first processed audio signal is smaller than the minimum value of the loudness range, the first processed audio signal is located in the first loudness range The second gain is performed on the signal within to generate a second processed audio signal, the loudness of the second processed audio signal is within the loudness range; when the loudness of the audio signal to be processed is greater than the maximum value of the loudness range, the audio signal to be processed is subjected to the first gain to generate a first processed audio signal, the loudness of the first audio signal is within the loudness range. That is to say, step 0131 , step 0132 and step 0133 may be executed by the processor 20 .
  • the overall gain will not affect the dynamic range of the audio signal to be processed, but excessive gain will easily cause wave clipping. problem, which leads to the distortion of the audio signal to be processed. Therefore, when the overall gain cannot adjust the loudness of the audio signal to be processed to the preset loudness range, part of the dynamic range of the audio signal to be processed can be appropriately sacrificed, and then the local gain method can be used. Gain part of the signal, and adjust the loudness of the audio signal to be processed to a preset loudness range under the condition that the audio signal to be processed is not distorted.
  • the processor 20 judges the relationship between the loudness of the audio signal to be processed and the preset loudness range. If the loudness of the audio signal to be processed is within the loudness range, it is not necessary to adjust the loudness of the audio signal to be processed. If it is less than the minimum value of the loudness range, it means that the loudness of the audio signal to be processed needs to be increased, but in order to prevent the wave clipping problem, the processor 20 first obtains the loudness corresponding to the maximum amplitude in the audio signal to be processed, so as to obtain the loudness corresponding to the largest amplitude in the audio signal to be processed.
  • the amplitude determines the first gain, such as the loudness corresponding to the maximum amplitude is -6db, and the maximum first gain that can prevent wave clipping is 6db.
  • the loudness corresponding to the maximum amplitude reaches the predetermined loudness (such as 0db), and then
  • the processor 20 may adjust the audio signal to be processed with the first gain, and gain 6db to all signals of the audio signal to be processed, thereby generating the first processed audio signal.
  • the processor 20 determines the signal compensation of each frequency in the first processed audio signal according to the mapping curve to calculate the loudness of the first processed audio signal, and then judges whether the loudness of the first processed audio signal is within the loudness range, if it is located Within the loudness range, there is no need to adjust the loudness of the processed audio signal. Therefore, the loudness of the audio signal to be processed is adjusted to a preset loudness range while ensuring that the audio signal to be processed is not distorted and does not affect the dynamic range.
  • the second gain is performed on the signal in the first loudness range of the first processed audio signal, such as dividing the signal into the first loudness signal and the second loudness signal according to the loudness.
  • the second loudness signal and the third loudness signal, the loudness range corresponding to the first loudness signal is the first loudness range (such as less than -75db), and the loudness range corresponding to the second loudness signal is the third loudness range (such as the interval [-75db,- 10db)), the loudness range corresponding to the third loudness signal is the second loudness range (such as interval [-10db, 0db]).
  • the processor 20 performs a second gain on the first loudness signal to generate a second processed audio signal.
  • the gain value of the second gain can be up to 75db (to ensure that the first loudness signal does not cancel the wave), and the promotion is relatively large, so the second gain is performed.
  • the loudness of the second processed audio signal can basically be adjusted within the loudness range, thereby completing the loudness adjustment of the audio signal to be processed.
  • the processor 20 can firstly convert the first processed audio signal into a time domain signal, and then obtained as the first processed audio signal of the time domain signal, the signal of the first frequency band, the signal of the second frequency band and the signal of the third frequency band (that is, the low frequency signal, the intermediate frequency signal and the high frequency signal) in the first loudness range signal), and then perform a second gain on the signal in the first frequency band, the signal in the second frequency band, and the signal in the third frequency band respectively, so as to generate a second processed audio signal.
  • the processor 20 may also perform the second gain on the second loudness signal and/or the third loudness signal, In this way, it is ensured that the loudness is adjusted within the loudness range under the condition that the audio signal to be processed is not distorted.
  • the processor 20 can first gain the audio signal to be processed.
  • the signal is adjusted, and the first gain is a negative gain. For example, if the signal is -6db and the first gain is -6db, the signal becomes -12db after the first gain adjustment, thereby reducing the loudness of the signal. In this way, it is only necessary to perform the first gain on the audio signal to be processed, and by setting the value of the first gain, it can be ensured that the loudness of the first audio signal after the first gain is within the loudness range.
  • the second gain can also be performed only on the first loudness signal, the second loudness signal or the third loudness signal in the audio signal to be processed, so that the adjusted loudness of the second audio signal changes Smaller, it is beneficial to realize precise loudness adjustment of the audio signal to be processed.
  • the processor 20 can firstly convert the second processed audio signal into a time domain signal, and then obtain the first processed audio signal as a time domain signal, the signal of the first frequency band, the signal of the second frequency band and the signal of the third frequency band (that is, the low frequency signal, the intermediate frequency signal and the high frequency signal) in the second loudness range signal), and then perform a second gain on the signal in the first frequency band, the signal in the second frequency band, and the signal in the third frequency band respectively, so as to generate a second processed audio signal.
  • the second gain of the signal in the first frequency band, the second gain of the signal in the second frequency band, and the second gain of the signal in the third frequency band can be the same or different, so as to realize the dynamic control of high, medium and low frequencies, as desired
  • one or more non-transitory computer-readable storage media 300 containing a computer program 302 when the computer program 302 is executed by one or more processors 20, the processors 20 can Execute the audio control method in any one of the above implementation manners.
  • the processors 20 are made to perform the following steps:
  • 011 Determine the compensation corresponding to each frequency signal in the audio signal to be processed according to the preset mapping curve, and the mapping curve is determined according to the equal loudness curve of different frequencies;
  • processors 20 may also perform the following steps:

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic System (AREA)

Abstract

一种音频控制方法、音频控制装置(10)、终端(100)和非易失性计算机可读存储介质(300)。方法包括(011)根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿;(012)根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及(013)调整所述响度至预设的响度范围。

Description

音频控制方法及装置、终端和存储介质
优先权信息
本申请请求2021年8月23日向中国国家知识产权局提交的、专利申请号为202110969111.3的专利申请的优先权和权益,并且通过参照将其全文并入此处。
技术领域
本申请涉及音频处理技术领域,特别涉及一种音频控制方法、音频控制装置、终端和非易失性计算机可读存储介质。
背景技术
目前,移动终端能够播放音频。
发明内容
本申请提供了一种音频控制方法、音频控制装置、终端和非易失性计算机可读存储介质。
本申请实施方式的音频控制方法包括根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及调整所述响度至预设的响度范围。
本申请实施方式的音频控制装置包括确定模块、计算模块和调整模块。所述确定模块用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;所述计算模块用于根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及所述调整模块用于整所述响度至预设的响度范围。
本申请实施方式的终端包括处理器。所述处理器用于:根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及调整所述响度至预设的响度范围。
本申请实施方式的一种包含计算机程序的非易失性计算机可读存储介质,当所述计算机程序被一个或多个处理器执行时,使得所述处理器执行所述音频控制方法。所述音频控制方法包括根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及调整所述响度至预设的响度范围。
本申请的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明 显,或通过本申请的实践了解到。
附图说明
为了更清楚地说明本申请实施方式或现有技术中的技术方案,下面将对实施方式或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是本申请某些实施方式的音频控制方法的流程示意图;
图2是本申请某些实施方式的音频控制装置的模块示意图;
图3是本申请某些实施方式的终端的平面示意图;
图4和图5是本申请某些实施方式的音频控制方法的原理示意图;
图6至图8是本申请某些实施方式的音频控制方法的流程示意图;及
图9是本申请某些实施方式的处理器和计算机可读存储介质的连接示意图。
具体实施方式
以下结合附图对本申请的实施方式作进一步说明。附图中相同或类似的标号自始至终表示相同或类似的元件或具有相同或类似功能的元件。另外,下面结合附图描述的本申请的实施方式是示例性的,仅用于解释本申请的实施方式,而不能理解为对本申请的限制。
本申请实施方式的音频控制方法包括根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,预设的映射曲线根据不同响度的等响曲线确定,在预设的映射曲线中,每个频率的信号和补偿一一对应;根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度;及调整响度至预设的响度范围。
在某些实施方式中,音频控制方法还包括获取音频输入信号,并将音频输入信号分为预定时长的多个初始音频信号,初始音频信号为时域信号。
在某些实施方式中,音频控制方法还包括对初始音频信号进行变换处理,以转化为待处理音频信号,待处理音频信号为频域信号。
在某些实施方式中,映射曲线中预设的第一频段内的每个频率的信号对应的补偿为负数,且与频率的大小负相关;映射曲线中预设的第二频段内的每个频率的信号对应的补偿为正数,且与频率的大小正相关;映射曲线中预设的第三频段内的每个频率的信号对应的补偿为正数,且为固定值;第一频段、第二频段和第三频段互不重合,且第一频段的中心频率、第二频段的中心频率和第三频段的中心频率依次增大。
在某些实施方式中,调整响度至预设的响度范围,包括:
在待处理音频信号的响度小于响度范围的最小值时,获取待处理音频信号中最大的振幅对应的响度,并对待处理音频信号进行第一增益,以使得最大的振幅对应的响度达到预定响度,并生成第一处理音频信号;
在第一处理音频信号的响度小于响度范围的最小值时,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,第二处理音频信号的响度位于响度范围内。
在某些实施方式中,调整响度至预设的响度范围,包括:
在待处理音频信号的响度大于响度范围的最大值时,对待处理音频信号进行第一增益,以生成第一处理音频信号,第一音频信号的响度位于响度范围内。
在某些实施方式中,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,包括:
对第一处理音频信号中,位于第一响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行第二增益,以生成第二处理音频信号;
对第一处理音频信号中,位于第二响度范围内的信号进行第二增益,以生成第二处理音频信号,包括:
对第一处理音频信号中,位于第二响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行第二增益,以生成第二处理音频信号,第一频段、第二频段和第三频段互不重合。
在某些实施方式中,根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度,包括:
根据每个频率的信号对应的补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响度。
在某些实施方式中,第一频段的音频信号对应的频段为[0赫兹,1000赫兹),第二频段的音频信号对应的频段为[1000赫兹,5000赫兹),第三频段的音频信号对应的频段为[5000赫兹,16000赫兹)。
请参阅图2,本申请实施方式的音频控制装置10包括确定模块11、计算模块12和调整模块13。确定模块11用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,预设的映射曲线根据不同响度的等响曲线确定,在预设的映射曲线中,每个频率的信号和补偿一一对应;计算模块12用于根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度;调整模块13用于调整响度至预设的响度范围。
请参阅图3,本申请实施方式的终端100包括处理器20。处理器20用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,预设的映射曲线根据不同响度的等响曲线确定,在预设的映射曲线中,每个频率的信号和补偿一一对应;根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度;及调整响度至预设的响度范围。
在某些实施方式中,处理器20还用于获取音频输入信号,并将音频输入信号分为预定时长的多个初始音频信号,初始音频信号为时域信号。
在某些实施方式中,处理器20还用于对初始音频信号进行变换处理,以转化为待处理音频信号,待处理音频信号为频域信号。
在某些实施方式中,映射曲线中预设的第一频段内的每个频率的信号对应的补偿为负数,且与频率的大小负相关;映射曲线中预设的第二频段内的每个频率的信号对应的补偿为正数,且与频率的大小正相关;映射曲线中预设的第三频段内的每个频率的信号对应的补偿为正数,且为固定值;第一频段、第二频段和第三频段互不重合,且第一频段的中心频率、第二频段的中心频率和第三频段的中心频率依次增大。
在某些实施方式中,处理器20还用于在待处理音频信号的响度小于响度范围的最小值时,获取待处理音频信号中最大的振幅对应的响度,并对待处理音频信号进行第一增益, 以使得最大的振幅对应的响度达到预定响度,并生成第一处理音频信号;在第一处理音频信号的响度小于响度范围的最小值时,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,第二处理音频信号的响度位于响度范围内。
在某些实施方式中,处理器20还用于在待处理音频信号的响度大于响度范围的最大值时,对待处理音频信号进行第一增益,以生成第一处理音频信号,第一音频信号的响度位于响度范围内。
在某些实施方式中,处理器20还用于对第一处理音频信号中,位于第一响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行第二增益,以生成第二处理音频信号;对第一处理音频信号中,位于第二响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行第二增益,以生成第二处理音频信号,第一频段、第二频段和第三频段互不重合。
在某些实施方式中,处理器20还用于根据每个频率的信号对应的补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响度。
在某些实施方式中,第一频段的音频信号对应的频段为[0赫兹,1000赫兹),第二频段的音频信号对应的频段为[1000赫兹,5000赫兹),第三频段的音频信号对应的频段为[5000赫兹,16000赫兹)。
请参阅图9,本申请实施方式的一个或多个包含计算机程序302的非易失性计算机可读存储介质300,当计算机程序302被一个或多个处理器20执行时,使得处理器20可执行上述任一实施方式的音频控制方法。
请参阅图1至图3,本申请实施方式的音频控制方法包括以下步骤:
011:根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,预设的映射曲线根据不同响度的等响曲线确定,在预设的映射曲线中,每个频率的信号和补偿一一对应;
012:根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度;及
013:调整响度至预设的响度范围。
本申请实施方式的音频控制装置10包括确定模块11、计算模块12和调整模块13。确定模块11用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,预设的映射曲线根据不同响度的等响曲线确定,在预设的映射曲线中,每个频率的信号和补偿一一对应;计算模块12用于根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度;调整模块13用于调整响度至预设的响度范围。也即是说,步骤011可以由确定模块11实现、步骤012可以由计算模块12执行和步骤013可以由调整模块13执行。
本申请实施方式的终端100包括处理器20。处理器20用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,预设的映射曲线根据不同响度的等响曲线确定,在预设的映射曲线中,每个频率的信号和补偿一一对应;根据每个频率的信号及每个频率的信号对应的补偿,计算待处理音频信号的响度;及调整响度至预设的响度范围。也即是说,步骤011、步骤012和步骤013可以由处理器20执行。
具体地,终端100可以是手机、智能手表、平板电脑、显示设备、笔记本电脑、柜员机、闸机、头显设备、游戏机等。如图2所示,本申请实施方式以终端100是手机为例进行说明,可以理解,终端100的具体形式并不限于手机。
可以理解,由于人耳对于相同响度下,各频段的真实听感不一致,导致不同频率的音频在相同音量下进行播放时,人耳听到的响度是不同的。例如,在相同音量下,人耳接收低频的音频信号的接收响度,比接收高频的音频信号的接收响度要大,对人耳来说,低频信号的响度会更大,因此,如要保持低频的音频信号和高频的音频信号的接收响度一致,则需要对低频的音频信号和/或高频的音频信号进行处理,如降低低频的音频信号的响度和/或提高高频的音频信号的响度。因此,可根据不同频率的信号的实际响度对应的等响曲线,来确定降低低频的音频信号的响度和/或提高高频的音频信号的响度的幅度,即确定不同频率的音频信号的补偿。
图4为不同频率的等响曲线,横坐标为频率,纵坐标为实际响度,曲线则表示相同的接收响度下,不同频率对应的实际响度。根据多个等响曲线,可拟合生成一条映射曲线,以确定不同频率对应的补偿。
如图5所示,为拟合生成的映射曲线,横坐标为频率,纵坐标为补偿值,可以看出,低频的音频信号的补偿值均为负值,即负补偿,表示在相同的接收响度下,低频的音频信号的实际响度需要降低,且负补偿的大小与频率的大小正相关,即频率越高,实际响度需要降低的幅度越小;而对于中频信号的补偿值均为正数,即正补偿,表示在相同的接收响度下,中频的音频信号的实际响度需要增加,且正补偿的大小与频率的大小正相关,即频率越高,实际响度需要增加的幅度越小;而对于高频信号(即第三频段,如5000HZ到16000HZ)的补偿值均为正数,即正补偿,表示在相同的接收响度下,高频的音频信号的实际响度需要增加,但由于不能过度补偿以导致音频失真,因此,对应高频的音频信号的补偿值可设置为固定值,如为中频的音频信号的最大正补偿(图5所示的4分贝(db))。
其中,第一频段、第二频段和第三频段互不重合,且第一频段的中心频率、第二频段的中心频率和第三频段的中心频率依次增大。如图5所示,低频的音频信号(即第一频段的音频信号)对应的频段为[0赫兹(HZ),1000HZ),中频的音频信号(即第二频段的音频信号)对应的频段为[1000HZ,5000HZ),高频的音频信号(即第三频段的音频信号)对应的频段为[5000HZ,16000HZ)。
在确定了映射曲线后,处理器20可获取初始音频信号,可以理解,初始音频信号一般为时域信号,因此,为了方便获取每个频率对应的振幅,需要先对初始音频信号进行变换处理,例如,对初始音频信号进行傅里叶变换,以将初始音频信号转化为频域的待处理音频信号。然后处理器20根据映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,然后根据每个频率的信号对应的补偿和振幅,来计算待处理音频信号的响度,由于对每个频率的信号都进行了对应的补偿,使得最终计算的待处理音频信号的响度即可准确表示人耳的接收响度。然后处理器20对待处理音频信号的响度进行调整,从而使得待处理音频信号的响度达到预设的响度范围。其中,预设的响度范围根据用户输入确定,如根据用户调整后的音量,确定预设的响度范围。如此,通过准确计算每个待处理音频的响度(即人耳的接收响度),使得调整后的响度符合预设的响度范围,从而保证不同的音频信号在经过上述调整后,符合人耳的实际听感。
不同用户随着年龄、身体因素等原因,导致听力存在差异,比如50db的响度在有的人的听感下,只能达到40db的响度,因此,预设的映射曲线还可由用户的输入进行调整。例如,用户在正常音量(如最大音量的50%)下,感觉音量偏小时,可将音量进度条调节到60%,处理器20根据用户的输入,即可将映射曲线的整体补偿均增大(如增大20%),从而使得音频进行响度调整后,符合当前用户的听感;再例如用户在正常音量(如最大音量的50%)下,感觉音量偏大时,可将音量进度条调节到40%,处理器20根据用户的输入,即可将映射曲线的整体补偿均减小(如减小20%),从而使得音频进行响度调整后,符合当前用户的听感。
由于音频中的不同频率的信号的比例的不同,在以相同的音量播放时,存在音频响度大小不一的问题。
本申请的音频控制方法、音频控制装置10和终端100,根据预设的由不同频率的等响曲线确定的映射曲线,对音频信号中每个频率的信号进行补偿,从而消除因频率不同导致的响度差异,使得计算得到的音频信号的响度的准确性较高,符合人耳的实际听感,从而使得调整到预设响度范围后的音频信号的播放体验较好,防止在以相同音量播放音频时,人耳接收不同频率的音频的接收响度不同的问题,如低频信号的接收响度会高于高频信号的接收响度。
请参阅图2、图3和图6,在某些实施方式中,音频控制方法还包括:
014:获取音频输入信号,并将音频输入信号分为预定时长的多个初始音频信号,初始音频信号为时域信号。
在某些实施方式中,音频控制装置10还包括获取模块14,获取模块14还用于获取音频输入信号,并将音频输入信号分为预定时长的多个初始音频信号,初始音频信号为时域信号。也即是说,步骤014可以由获取模块14执行。
在某些实施方式中,处理器20还用于获取音频输入信号,并将音频输入信号分为预定时长的多个初始音频信号,初始音频信号为时域信号。也即是说,步骤014可以由处理器20执行。
具体地,处理器20可接收音频输入信号,可以理解,音频输入信号为时长较长的时域信号,因此,为了保证待处理音频信号的响度计算准确性,需要对音频输入信号进行分割,以生成多个初始音频信号,如以预定时长(如10毫秒(ms)、15ms、20ms等)进行分割。以防止每个初始音频信号进行变换处理后的待处理音频信号的长度过长,影响待处理音频信号的响度计算准确性。当然,预定时长还可根据音频输入信号的时长、处理器20对音频信号的处理效率等综合进行确定,以实现度计算准确性和处理速度之间的平衡。
请参阅图2、图3和图7,在某些实施方式中,步骤012包括:
0121:根据每个频率的信号对应的补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响度。
在某些实施方式中,计算模块12还用于根据每个频率的信号对应的补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响度。也即是说,步骤121可以由计算模块12执行。
在某些实施方式中,处理器20还用于根据每个频率的信号对应的补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响 度。也即是说,步骤0121可以由处理器20执行。
具体的,在获取到频域的待处理音频信号后,根据预设的映射曲线可确定待处理音频信号中每个频率的信号对应的补偿,根据每个频率的信号的振幅和对应的补偿,即可计算得到每个频率的信号的响度,然后将待处理音频信号中每个频率的信号的响度加起来,即可得到待处理音频信号的响度。或者,将待处理音频信号中每个频率的信号的响度的平均值作为待处理音频信号的响度。
请参阅图2、图3和图8,在某些实施方式中,步骤013包括:
0131:在待处理音频信号的响度小于响度范围的最小值时,获取待处理音频信号中最大的振幅对应的响度,并对待处理音频信号进行第一增益,以使得最大的振幅对应的响度达到预定响度,并生成第一处理音频信号;
0132:在第一处理音频信号的响度小于响度范围的最小值时,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,第二处理音频信号的响度位于响度范围内;
0133:在待处理音频信号的响度大于响度范围的最大值时,对待处理音频信号进行第一增益,以生成第一处理音频信号,第一音频信号的响度位于响度范围内。
在某些实施方式中,调整模块13还用于在待处理音频信号的响度小于响度范围的最小值时,获取待处理音频信号中最大的振幅对应的响度,并对待处理音频信号进行第一增益,以使得最大的振幅对应的响度达到预定响度,并生成第一处理音频信号;在第一处理音频信号的响度小于响度范围的最小值时,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,第二处理音频信号的响度位于响度范围内;在待处理音频信号的响度大于响度范围的最大值时,对待处理音频信号进行第一增益,以生成第一处理音频信号,第一音频信号的响度位于响度范围内。也即是说,步骤0131、步骤0132和步骤0133可以由调整模块13执行。
在某些实施方式中,处理器20还用于在待处理音频信号的响度小于响度范围的最小值时,获取待处理音频信号中最大的振幅对应的响度,并对待处理音频信号进行第一增益,以使得最大的振幅对应的响度达到预定响度,并生成第一处理音频信号;在第一处理音频信号的响度小于响度范围的最小值时,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,第二处理音频信号的响度位于响度范围内;在待处理音频信号的响度大于响度范围的最大值时,对待处理音频信号进行第一增益,以生成第一处理音频信号,第一音频信号的响度位于响度范围内。也即是说,步骤0131、步骤0132和步骤0133可以由处理器20执行。
具体地,在对待处理音频信号进行响度调整时,可采用整体增益和局部增益结合的方式进行调整,其中,整体增益不会影响待处理音频信号的动态范围,但增益过大容易产生消波的问题,导致待处理音频信号失真,因此,在整体增益无法将待处理音频信号的响度调整到预设的响度范围内时,可适当牺牲待处理音频信号的部分动态范围,再使用局部增益的方式对部分信号进行增益,在保证待处理音频信号不失真的情况下,将待处理音频信号的响度调整到预设的响度范围内。
处理器20判断待处理音频信号的响度和预设的响度范围的关系,若待处理音频信号的响度位于响度范围内,则不需要对处理音频信号的响度进行调整,若待处理音频信号的 响度小于响度范围的最小值,则表示需要增加待处理音频信号的响度,但为了防止消波问题,处理器20先获取待处理音频信号中最大的振幅对应的响度,从而根据待处理音频信号中最大的振幅确定第一增益,如最大的振幅对应的响度为-6db,防止消波能够进行的最大的第一增益为6db,此时,最大的振幅对应的响度达到预定响度(如0db),然后处理器20可以第一增益对待处理音频信号进行调整,将待处理音频信号的所有信号均增益6db,从而生成第一处理音频信号。
然后处理器20再根据映射曲线确定第一处理音频信号中每个频率的信号的补偿,以计算第一处理音频信号的响度,然后判断第一处理音频信号的响度是否位于响度范围内,若位于响度范围内,则不需要对处理音频信号的响度进行调整。从而在保证待处理音频信号不失真且不影响动态范围的情况下,将待处理音频信号的响度调整到预设的响度范围。
若第一处理音频信号的响度仍小于响度范围的最小值,则对第一处理音频信号中,位于第一响度范围的信号进行第二增益,如根据响度将信号分为第一响度信号、第二响度信号和第三响度信号,第一响度信号对应的响度范围为第一响度范围(如小于-75db),第二响度信号对应的响度范围为第三响度范围(如区间[-75db,-10db)),第三响度信号对应的响度范围为第二响度范围(如区间[-10db,0db])。处理器20对第一响度信号进行第二增益,以生成第二处理音频信号,第二增益的增益值最大可为75db(保证第一响度信号不消波),提升较大,因此在进行第二增益后,第二处理音频信号的响度基本都能够调整到响度范围内,从而完成待处理音频信号的响度调整。
在对第一处理音频信号中,位于第一响度范围内的信号进行第二增益时,处理器20可首先将第一处理音频信号经过逆变换处理(如逆傅里叶变换),转换为时域信号,然后获取为时域信号的第一处理音频信号中,第一响度范围内的第一频段的信号、第二频段的信号和第三频段的信号(即低频信号、中频信号和高频信号),然后分别对第一频段的信号、第二频段的信号和第三频段的信号进行第二增益,从而生成第二处理音频信号。
当然,在以最大的第二增益调整后,第二处理音频信号的响度依旧无法达到响度范围内时,处理器20还可对第二响度信号和/或第三响度信号也进行第二增益,从而保证待处理音频信号不失真的情况下,响度调整到响度范围内。
若待处理音频信号的响度大于响度范围的最大值,则表示降低待处理音频信号的响度,此时不会出现消波问题,因此,为了保证动态范围,处理器20可以第一增益对待处理音频信号进行调整,第一增益为负增益,如信号为-6db,第一增益为-6db,则信号经过第一增益调整后变为-12db,从而实现对信号的响度的降低。如此,只需对待处理音频信号进行第一增益,通过设置第一增益的值,即可保证第一增益后的第一音频信号的响度位于响度范围内。
当然,为了实现更小程度的响度调整,同样可仅对待处理音频信号中的第一响度信号、第二响度信号或第三响度信号进行第二增益,这样调整后的第二音频信号的响度变化较小,有利于实现待处理音频信号精确的响度调整。
在对第一处理音频信号中,位于第二响度范围内的信号进行第二增益时,处理器20可首先将第二处理音频信号经过逆变换处理(如逆傅里叶变换),转换为时域信号,然后获取为时域信号的第一处理音频信号中,第二响度范围内的第一频段的信号、第二频段的信号和第三频段的信号(即低频信号、中频信号和高频信号),然后分别对第一频段的信 号、第二频段的信号和第三频段的信号进行第二增益,从而生成第二处理音频信号。
可以理解,第一频段的信号的第二增益、第二频段的信号的第二增益和第三频段的信号进行第二增益可以是相同或不同的,从而实现对高中低频的动态控制,如想要低音增强,则设置低频信号的第二增益较大,如想要高音增强,则设置高频信号的第二增益较大。
如此,在保证待处理音频信号不失真的前提下,实现动态控制,并将待处理音频信号的响度调整到预设的响度范围内,有利于提升播放体验。
请参阅图9,本申请实施方式的一个或多个包含计算机程序302的非易失性计算机可读存储介质300,当计算机程序302被一个或多个处理器20执行时,使得处理器20可执行上述任一实施方式的音频控制方法。
例如,请结合图1至图3,当计算机程序302被一个或多个处理器20执行时,使得处理器20执行以下步骤:
011:根据预设的映射曲线确定待处理音频信号中每个频率的信号对应的补偿,映射曲线根据不同频率的等响曲线确定;
012:根据每个频率的信号对应的振幅和补偿,计算待处理音频信号的响度;及
013:调整响度至预设的响度范围。
再例如,请结合图2、图3和图8,当计算机程序302被一个或多个处理器20执行时,处理器20还可以执行以下步骤:
0131:在待处理音频信号的响度小于响度范围的最小值时,获取待处理音频信号中最大的振幅对应的响度,并对待处理音频信号进行第一增益,以使得最大的振幅对应的响度达到预定响度,并生成第一处理音频信号;
0132:在第一处理音频信号的响度小于响度范围的最小值时,对第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,第二处理音频信号的响度位于响度范围内;
0133:在待处理音频信号的响度大于响度范围的最大值时,对待处理音频信号进行第一增益,以生成第一处理音频信号,第一音频信号的响度位于响度范围内。
在本说明书的描述中,参考术语“一个实施方式”、“一些实施方式”、“示意性实施方式”、“示例”、“具体示例”或“一些示例”等的描述意指结合实施方式或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施方式或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施方式或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施方式或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施方式或示例以及不同实施方式或示例的特征进行结合和组合。
流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现特定逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施方式所属技术领域的技术人员所理解。
尽管上面已经示出和描述了本申请的实施方式,可以理解的是,上述实施方式是示例性的,不能理解为对本申请的限制,本领域的普通技术人员在本申请的范围内可以对上述 实施方式进行变化、修改、替换和变型。

Claims (20)

  1. 一种音频控制方法,其特征在于,包括:
    根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;
    根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及
    调整所述响度至预设的响度范围。
  2. 根据权利要求1所述的音频控制方法,其特征在于,还包括:
    获取音频输入信号,并将所述音频输入信号分为预定时长的多个初始音频信号,所述初始音频信号为时域信号。
  3. 根据权利要求2所述的音频控制方法,其特征在于,还包括:
    对所述初始音频信号进行变换处理,以转化为所述待处理音频信号,所述待处理音频信号为频域信号。
  4. 根据权利要求1所述的音频控制方法,其特征在于,所述映射曲线中预设的第一频段内的所述每个频率的信号对应的所述补偿为负数,且与频率的大小负相关;所述映射曲线中预设的第二频段内的所述每个频率的信号对应的所述补偿为正数,且与频率的大小正相关;所述映射曲线中预设的第三频段内的所述每个频率的信号对应的所述补偿为正数,且为固定值;所述第一频段、所述第二频段和所述第三频段互不重合,且所述第一频段的中心频率、所述第二频段的中心频率和所述第三频段的中心频率依次增大。
  5. 根据权利要求1所述的音频控制方法,其特征在于,所述调整所述响度至预设的响度范围,包括:
    在所述待处理音频信号的响度小于所述响度范围的最小值时,获取所述待处理音频信号中最大的所述振幅对应的响度,并对所述待处理音频信号进行第一增益,以使得最大的所述振幅对应的响度达到预定响度,并生成第一处理音频信号;
    在所述第一处理音频信号的响度小于所述响度范围的最小值时,对所述第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,所述第二处理音频信号的响度位于所述响度范围内。
  6. 根据权利要求1所述的音频控制方法,其特征在于,所述调整所述响度至预设的响度范围,包括:
    在所述待处理音频信号的响度大于所述响度范围的最大值时,对所述待处理音频信号进行第一增益,以生成第一处理音频信号,所述第一音频信号的响度位于所述响度范围内。
  7. 根据权利要求6所述的音频控制方法,其特征在于,所述对所述第一处理音频信 号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,包括:
    对所述第一处理音频信号中,位于所述第一响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行所述第二增益,以生成所述第二处理音频信号;
    所述对所述第一处理音频信号中,位于第二响度范围内的信号进行第二增益,以生成第二处理音频信号,包括:
    对所述第一处理音频信号中,位于所述第二响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行所述第二增益,以生成所述第二处理音频信号,所述第一频段、所述第二频段和所述第三频段互不重合。
  8. 根据权利要求1所述的音频控制方法,其特征在于,所述根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度,包括:
    根据每个频率的信号对应的所述补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响度。
  9. 根据权利要求4所述的音频控制方法,其特征在于,所述第一频段的音频信号对应的频段为[0赫兹,1000赫兹),第二频段的音频信号对应的频段为[1000赫兹,5000赫兹),第三频段的音频信号对应的频段为[5000赫兹,16000赫兹)。
  10. 一种音频控制装置,其特征在于,包括:
    确定模块,用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;
    计算模块,用于根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及
    调整模块,用于调整所述响度至预设的响度范围。
  11. 一种终端,其特征在于,包括处理器,所述处理器用于根据预设的映射曲线确定待处理音频信号中,每个频率的信号对应的补偿,所述预设的映射曲线根据不同响度的等响曲线确定,在所述预设的映射曲线中,所述每个频率的信号和所述补偿一一对应;根据所述每个频率的信号及所述每个频率的信号对应的所述补偿,计算所述待处理音频信号的响度;及调整所述响度至预设的响度范围。
  12. 根据权利要求11所述的终端,其特征在于,所述处理器还用于获取音频输入信号,并将所述音频输入信号分为预定时长的多个初始音频信号,所述初始音频信号为时域信号。
  13. 根据权利要求12所述的终端,其特征在于,所述处理器还用于对所述初始音频信号进行变换处理,以转化为所述待处理音频信号,所述待处理音频信号为频域信号。
  14. 根据权利要求11所述的终端,其特征在于,所述映射曲线中预设的第一频段内的 所述每个频率的信号对应的所述补偿为负数,且与频率的大小负相关;所述映射曲线中预设的第二频段内的所述每个频率的信号对应的所述补偿为正数,且与频率的大小正相关;所述映射曲线中预设的第三频段内的所述每个频率的信号对应的所述补偿为正数,且为固定值;所述第一频段、所述第二频段和所述第三频段互不重合,且所述第一频段的中心频率、所述第二频段的中心频率和所述第三频段的中心频率依次增大。
  15. 根据权利要求11所述的终端,其特征在于,所述处理器还用于:
    在所述待处理音频信号的响度小于所述响度范围的最小值时,获取所述待处理音频信号中最大的所述振幅对应的响度,并对所述待处理音频信号进行第一增益,以使得最大的所述振幅对应的响度达到预定响度,并生成第一处理音频信号;
    在所述第一处理音频信号的响度小于所述响度范围的最小值时,对所述第一处理音频信号中,位于第一响度范围内的信号进行第二增益,以生成第二处理音频信号,所述第二处理音频信号的响度位于所述响度范围内。
  16. 根据权利要求11所述的终端,其特征在于,所述处理器还用于在所述待处理音频信号的响度大于所述响度范围的最大值时,对所述待处理音频信号进行第一增益,以生成第一处理音频信号,所述第一音频信号的响度位于所述响度范围内。
  17. 根据权利要求16所述的终端,其特征在于,所述处理器还用于:
    对所述第一处理音频信号中,位于所述第一响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行所述第二增益,以生成所述第二处理音频信号;
    对所述第一处理音频信号中,位于所述第二响度范围内的第一频段的信号、第二频段的信号和第三频段的信号分别进行所述第二增益,以生成所述第二处理音频信号,所述第一频段、所述第二频段和所述第三频段互不重合。
  18. 根据权利要求11所述的终端,其特征在于,所述处理器还用于根据每个频率的信号对应的所述补偿和振幅计算每个频率的信号对应的响度,并将每个频率的信号对应的响度的和作为待处理音频信号的响度。
  19. 根据权利要求14所述的终端,其特征在于,所述第一频段的音频信号对应的频段为[0赫兹,1000赫兹),第二频段的音频信号对应的频段为[1000赫兹,5000赫兹),第三频段的音频信号对应的频段为[5000赫兹,16000赫兹)。
  20. 一种包括计算机程序的非易失性计算机可读存储介质,所述计算机程序被处理器执行时,使得所述处理器执行权利要求1-9任意一项所述的音频控制方法。
PCT/CN2022/104501 2021-08-23 2022-07-08 音频控制方法及装置、终端和存储介质 WO2023024725A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110969111.3 2021-08-23
CN202110969111.3A CN113472938B (zh) 2021-08-23 2021-08-23 音频控制方法及装置、终端和存储介质

Publications (1)

Publication Number Publication Date
WO2023024725A1 true WO2023024725A1 (zh) 2023-03-02

Family

ID=77867018

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/104501 WO2023024725A1 (zh) 2021-08-23 2022-07-08 音频控制方法及装置、终端和存储介质

Country Status (2)

Country Link
CN (1) CN113472938B (zh)
WO (1) WO2023024725A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116129926A (zh) * 2023-04-19 2023-05-16 北京北信源软件股份有限公司 智能设备自然语言交互信息处理方法

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113472938B (zh) * 2021-08-23 2023-06-30 Oppo广东移动通信有限公司 音频控制方法及装置、终端和存储介质
CN114501233A (zh) * 2022-01-30 2022-05-13 联想(北京)有限公司 一种信号处理方法、装置及电子设备
CN114466285B (zh) * 2022-02-16 2023-09-29 深圳万兴软件有限公司 音频信号的响度调节方法、装置、设备及存储介质
CN114449413B (zh) * 2022-02-16 2023-12-22 深圳万兴软件有限公司 音频信号的响度控制方法、装置、设备及存储介质
CN115086837A (zh) * 2022-06-15 2022-09-20 北斗星通智联科技有限责任公司 一种音效增强方法及装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102378085A (zh) * 2010-08-21 2012-03-14 比亚迪股份有限公司 基于等响曲线的响度补偿方法、补偿装置及音频处理系统
CN102915741A (zh) * 2012-10-29 2013-02-06 上海大学 基于等响曲线的根据音量调整自动还原声音信号音色的方法
CN111970628A (zh) * 2020-08-31 2020-11-20 广州视源电子科技股份有限公司 音频信号的增强方法、装置、存储介质和处理器
WO2021038514A1 (en) * 2019-08-29 2021-03-04 Cosi Paolo Luigi Audio data processing method and system
CN113472938A (zh) * 2021-08-23 2021-10-01 Oppo广东移动通信有限公司 音频控制方法及装置、终端和存储介质
CN113992781A (zh) * 2021-10-28 2022-01-28 Oppo广东移动通信有限公司 音量控制方法及装置、电子设备及计算机可读存储介质

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105657608B (zh) * 2015-12-31 2018-09-04 深圳Tcl数字技术有限公司 音频信号频率响应补偿方法及装置
CN109040885B (zh) * 2017-06-09 2019-08-20 中国科学院声学研究所 一种基于音量的个性化响度补偿方法及耳机均衡系统
CN110446102A (zh) * 2018-05-04 2019-11-12 青岛海信电器股份有限公司 音频信号的响度调整方法及装置
CN109473115B (zh) * 2018-11-22 2022-04-12 四川长虹电器股份有限公司 数字音频信号音量等响度调节方法
CN109658942B (zh) * 2018-12-27 2021-04-30 腾讯音乐娱乐科技(深圳)有限公司 一种音频数据处理方法、装置以及相关设备
CN111966322A (zh) * 2020-08-31 2020-11-20 广州视源电子科技股份有限公司 音频信号处理方法、装置、设备及存储介质

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102378085A (zh) * 2010-08-21 2012-03-14 比亚迪股份有限公司 基于等响曲线的响度补偿方法、补偿装置及音频处理系统
CN102915741A (zh) * 2012-10-29 2013-02-06 上海大学 基于等响曲线的根据音量调整自动还原声音信号音色的方法
WO2021038514A1 (en) * 2019-08-29 2021-03-04 Cosi Paolo Luigi Audio data processing method and system
CN111970628A (zh) * 2020-08-31 2020-11-20 广州视源电子科技股份有限公司 音频信号的增强方法、装置、存储介质和处理器
CN113472938A (zh) * 2021-08-23 2021-10-01 Oppo广东移动通信有限公司 音频控制方法及装置、终端和存储介质
CN113992781A (zh) * 2021-10-28 2022-01-28 Oppo广东移动通信有限公司 音量控制方法及装置、电子设备及计算机可读存储介质

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116129926A (zh) * 2023-04-19 2023-05-16 北京北信源软件股份有限公司 智能设备自然语言交互信息处理方法
CN116129926B (zh) * 2023-04-19 2023-06-09 北京北信源软件股份有限公司 智能设备自然语言交互信息处理方法

Also Published As

Publication number Publication date
CN113472938A (zh) 2021-10-01
CN113472938B (zh) 2023-06-30

Similar Documents

Publication Publication Date Title
WO2023024725A1 (zh) 音频控制方法及装置、终端和存储介质
US8515087B2 (en) Apparatus for processing an audio signal and method thereof
CN109686347B (zh) 音效处理方法、音效处理装置、电子设备和可读介质
CN106713794B (zh) 一种调节音频均衡的方法及提供均衡调节的音频系统
CN112954115B (zh) 一种音量调节方法、装置、电子设备及存储介质
US10380989B1 (en) Methods and apparatus for processing stereophonic audio content
WO2021238458A1 (zh) 扬声设备音质的优化方法
WO2024032133A1 (zh) 听力检测方法、装置、电子设备及存储介质
WO2019033943A1 (zh) 音量调节方法、装置、移动终端及存储介质
WO2019033940A1 (zh) 音量调节方法、装置、终端设备及存储介质
CN111508510A (zh) 音频处理方法、装置、存储介质及电子设备
WO2019033941A1 (zh) 音量调节方法、装置、终端设备及存储介质
US11190875B2 (en) Adjustment of dynamic range and tone mapping in audio system
CN102576560B (zh) 电子音频设备
US10666217B2 (en) Dynamically extending loudspeaker capabilities
US11528549B2 (en) Method for improving electrical endurance of batteries of wireless headphones and the wireless headphones
WO2023010691A1 (zh) 一种耳机虚拟空间声回放方法、装置、存储介质及耳机
US20210384879A1 (en) Acoustic signal processing device, acoustic signal processing method, and non-transitory computer-readable recording medium therefor
CN115185479A (zh) 音量调节方法、装置、设备和存储介质
CN113613122A (zh) 音量调节方法、装置、耳机及存储介质
WO2024061436A1 (en) Adaptive audio enhancement system
WO2020034853A1 (zh) 一种语音信号处理方法及装置、存储介质
KR20210053236A (ko) 주파수 스펙트럼 보정을 위한 오디오 신호 처리 방법 및 장치
CN114679658A (zh) 一种音频装置的动态低音补偿系统及方法
CN112911460A (zh) 音箱及其音效控制方法、音效控制系统

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22860071

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE