CN116980804B

CN116980804B - Volume adjustment method, device, equipment and readable storage medium

Info

Publication number: CN116980804B
Application number: CN202311239752.9A
Authority: CN
Inventors: 梁俊斌
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2023-09-25
Filing date: 2023-09-25
Publication date: 2024-01-26
Anticipated expiration: 2043-09-25
Also published as: CN116980804A

Abstract

The application provides a volume adjustment method, apparatus, device and readable storage medium. The volume adjustment method comprises: when a set volume value reaches a first upper limit volume value and a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction; determining audio category information according to an original audio signal, and determining adjustment reference data according to the audio category information; determining a frequency domain signal from the original audio signal; performing gain calculation according to auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data; performing gain processing on the frequency domain signal using the gain data to obtain a gain frequency domain signal; and determining a gain audio signal according to the gain frequency domain signal, wherein the perceived volume corresponding to the gain audio signal matches the target volume value. The method enables the perceived volume of the volume-adjusted audio signal to match the target volume value and achieves a good volume adjustment effect.

Description

Volume adjustment method, device, equipment and readable storage medium

Technical Field

The present disclosure relates to the field of computer technologies, and in particular, to a method, an apparatus, a device, and a readable storage medium for adjusting volume.

Background

The volume of audio, also called loudness, refers to the subjective perception of the intensity of the sound heard by the human ear. The volume of the audio is related to the amplitude of the sound. For some audio playback devices (e.g., cell phones, personal computers, speakers, etc.), the volume of the audio they play may be adjusted by setting a volume value.

Currently, volume amplification of audio is generally performed by digital adjustment or analog adjustment. However, digital adjustment can make the adjusted audio sound audibly distorted (a "broken" sound), so the adjustment effect is poor; analog adjustment is limited by the amplification capability of the hardware, so the adjustable volume range is small and the adjustment effect is likewise poor.

Disclosure of Invention

The embodiment of the application provides a volume adjustment method, a volume adjustment device, volume adjustment equipment and a readable storage medium, which can enable perceived volume corresponding to a volume-adjusted audio signal to be matched with a target volume value, and have a good volume adjustment effect.

In one aspect, an embodiment of the present application provides a method for adjusting volume, where the method includes:

when the set volume value reaches a first upper limit volume value, if a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value;

performing category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information;

converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal;

performing gain calculation according to auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data;

and performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, and converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In one aspect, an embodiment of the present application provides a volume adjustment device, including:

the determining unit is used for, when the set volume value reaches a first upper limit volume value and a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value;

the processing unit is used for carrying out category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information;

the processing unit is further used for converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal;

the processing unit is further used for performing gain calculation according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data;

the conversion unit is used for performing gain processing on the frequency domain signal by utilizing the gain data to obtain a gain frequency domain signal, and converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In one aspect, embodiments of the present application provide a computer device, including: the device comprises a processor, a communication interface and a memory, wherein the processor, the communication interface and the memory are connected with each other, executable program codes are stored in the memory, and the processor is used for calling the executable program codes to realize the volume adjustment method provided by the embodiment of the application.

Accordingly, the embodiment of the application also provides a computer readable storage medium, wherein the computer readable storage medium stores instructions, which when running on a computer, cause the computer to implement the volume adjustment method provided by the embodiment of the application.

Accordingly, embodiments of the present application also provide a computer program product comprising a computer program or computer instructions stored in a computer readable storage medium. The processor of the computer device reads the computer program or the computer instructions from the computer readable storage medium, and the processor executes the computer program or the computer instructions, so that the computer device realizes the volume adjustment method provided by the embodiment of the application.

In the application, when the set volume value reaches the first upper limit volume value, if the volume increasing instruction is detected, a target volume value can be determined according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to the second upper limit volume value; performing category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information; converting an original audio signal from a time domain to a frequency domain to obtain a frequency domain signal; gain calculation is carried out according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data; gain processing is carried out on the frequency domain signal by utilizing the gain data to obtain a gain frequency domain signal; and converting the gain frequency domain signal from the frequency domain to the time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value. 
According to the volume adjustment method provided by the embodiments of the application, when the set volume value reaches the first upper limit volume value and a volume increasing instruction is detected, the volume of the original audio signal is adjusted so that it can be increased further, meeting the user's volume requirement. Category analysis is performed on the original audio signal to obtain adjustment reference data, so that audio signals of different categories can be adjusted in a targeted manner. The original audio signal is converted into a frequency domain signal so that gain processing can be performed in the frequency domain; the processed audio signal therefore sounds better and is not clipped. Gain data is determined according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal, and because the target volume value has a larger value range, the adjustable volume range of the original audio signal is larger. The gain frequency domain signal is determined according to the gain data, and a gain audio signal is obtained whose perceived volume matches the target volume value. The method can therefore further amplify an audio signal whose volume has already reached the first upper limit volume value while preserving its clarity; it has a good volume adjustment effect, and the adjusted audio signal sounds good, with no audible distortion. In addition, because the method can adjust audio signals of many different categories and the target volume value has a larger value range, it can be applied in a variety of usage scenarios (for example, noisy scenarios) and has good universality.

Drawings

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

Fig. 1 is a schematic system architecture diagram of a volume adjustment system according to an embodiment of the present application;

Fig. 2 is a flow chart of a volume adjustment method according to an embodiment of the present application;

Fig. 3 is a flow chart of another volume adjustment method according to an embodiment of the present application;

Fig. 4 is a schematic diagram of a volume adjustment method according to an embodiment of the present application;

Fig. 5 is a schematic diagram of an acoustic equal loudness curve according to an embodiment of the present application;

Fig. 6 is a schematic diagram of auditory perception weighting data provided by an embodiment of the present application;

Fig. 7 is a block diagram of a volume adjusting device according to an embodiment of the present application;

Fig. 8 is a block diagram of a computer device according to an embodiment of the present application.

Detailed Description

The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all, of the embodiments of the present application. All other embodiments, which can be made by one of ordinary skill in the art based on the embodiments herein without making any inventive effort, are intended to be within the scope of the present application.

It should be noted that the descriptions of "first," "second," and the like in the embodiments of the present application are for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a technical feature defining "first", "second" may include at least one such feature, either explicitly or implicitly.

For some audio playback devices, the volume of audio that it plays may be adjusted by setting a volume value. Currently, the method for adjusting the volume of an audio signal is mainly a digital adjusting method and an analog adjusting method. However, both methods have poor volume adjustment effect, and the volume adjustable range of the audio is small.

Based on this, the embodiments of the application provide a volume adjustment method. When the set volume value reaches a first upper limit volume value and a volume increasing instruction is detected, a target volume value is determined according to the volume increasing instruction, wherein the target volume value is greater than the first upper limit volume value and less than or equal to a second upper limit volume value, and the second upper limit volume value is greater than the first upper limit volume value; category analysis processing is performed on the original audio signal to obtain audio category information, and adjustment reference data is determined according to the audio category information; the original audio signal is converted from the time domain to the frequency domain to obtain a frequency domain signal corresponding to the original audio signal; gain calculation is performed according to auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data; and gain processing is performed on the frequency domain signal using the gain data to obtain a gain frequency domain signal, which is converted from the frequency domain to the time domain to obtain a gain audio signal whose perceived volume matches the target volume value. With this method, the adjusted audio signal has a good volume adjustment effect and sounds good, with no audible distortion. Moreover, because the target volume value has a larger value range in the embodiments of the application, the volume of the original audio signal can be amplified further, and the method has good universality.

The volume adjustment method provided by the embodiments of the application can be applied to the field of intelligent transportation. An intelligent traffic system (Intelligent Traffic System, ITS), also called an intelligent transportation system (Intelligent Transportation System), is a comprehensive transportation system that effectively and comprehensively applies advanced technologies (information technology, computer technology, data communication technology, sensor technology, electronic control technology, automatic control theory, operations research, artificial intelligence, etc.) to transportation, service control and vehicle manufacturing, strengthening the connection among vehicles, roads and users so as to guarantee safety, improve efficiency, improve the environment and save energy. While driving, external noise may prevent the intelligent traffic system from communicating well with the user. The volume adjustment method provided by the embodiments of the application can be used to adjust the volume of the audio output by the intelligent traffic system so that the output audio is perceived more clearly, which facilitates good human-machine interaction.

The volume adjustment method provided by the embodiments of the application can also be applied to the field of artificial intelligence. Artificial intelligence (Artificial Intelligence, AI) is the theory, method, technology and application system that uses a digital computer, or a machine controlled by a digital computer, to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use that knowledge to obtain optimal results. The field of artificial intelligence includes speech technology (Speech Technology), whose key technologies are automatic speech recognition (Automatic Speech Recognition, ASR), speech synthesis (Text To Speech, TTS) and voiceprint recognition. In speech synthesis, the method provided by the embodiments of the application can be used to adjust the volume of the synthesized audio signal. The original audio signal is subjected to gain processing according to the auditory perception weighting data and the target volume value to obtain a gain audio signal whose perceived volume matches the target volume value, so that audio generated by speech synthesis is louder and of higher quality.

The architecture of the volume adjustment system provided in the embodiments of the present application will be described below with reference to the accompanying drawings.

Referring to Fig. 1, the system architecture of a volume adjustment system provided in the embodiments of the application includes an audio acquisition device 101, a volume adjustment device 102 and a database 103. The volume adjustment device 102 may interact with the audio acquisition device 101 and the database 103, and includes a gain data module 1021 and a gain frequency domain signal module 1022. Wherein:

The audio acquisition device 101 may generate an audio signal or receive an audio signal transmitted by another device, and may transmit that signal to the volume adjustment device 102 as the original audio signal. The audio acquisition device 101 may be a device independent of the volume adjustment device 102 or a module disposed in the volume adjustment device 102. The audio acquisition device 101 may be, but is not limited to, a handheld device (e.g., a smart phone or a tablet computer), a computing device (e.g., a personal computer (Personal Computer, PC)), an in-vehicle terminal, a smart voice interaction device, a wearable device, or another smart appliance having audio generation and communication functions.

The volume adjustment device 102 may receive the original audio signal transmitted from the audio acquisition device 101 and perform volume adjustment processing on it. The volume adjustment device 102 includes a gain data module 1021, which is configured to generate gain data, and a gain frequency domain signal module 1022, which is configured to generate a gain audio signal according to the gain data and the original audio signal. The volume adjustment device 102 may be a terminal device or a server. When it is a terminal device, it may be, but is not limited to, a mobile phone, a computer, an intelligent voice interaction device, an intelligent home appliance, a vehicle-mounted terminal, an aircraft, or the like. When it is a server, it may be an independent physical server, a server cluster or distributed system formed by a plurality of physical servers, or a cloud server providing basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communication, middleware services, domain name services, security services, a content delivery network (Content Delivery Network, CDN), big data, and artificial intelligence platforms.

The database 103 is used to store data related to the volume adjustment device 102, such as the auditory perception weighting data. The database 103 may be a local database in the volume adjustment device 102, or a cloud database (i.e., a database deployed in the cloud) associated with the volume adjustment device 102. A cloud database may be deployed on any of a private cloud, a public cloud, a hybrid cloud, an edge cloud, and the like, and its emphasis differs depending on the deployment. For example, a database deployed in a private cloud belongs to the user's own equipment and mainly serves a small group of users, whereas a database deployed in a public cloud is built on a cloud platform provided by a third party, so the stored data can be shared: data of any user can be stored in the database, and data in the database can be used by any user.

The principle of operation of the volume adjustment system shown in fig. 1 will be described in detail as follows:

The audio acquisition device 101 transmits the original audio signal to the volume adjustment device 102. When the set volume value reaches the first upper limit volume value and the volume adjustment device 102 detects a volume increasing instruction, the volume adjustment device 102 determines a target volume value according to the volume increasing instruction, wherein the target volume value is greater than the first upper limit volume value and less than or equal to the second upper limit volume value, and the second upper limit volume value is greater than the first upper limit volume value. The volume adjustment device 102 may perform category analysis processing on the received original audio signal to obtain audio category information, and determine adjustment reference data according to the audio category information. The volume adjustment device 102 may convert the original audio signal from the time domain to the frequency domain to obtain a frequency domain signal corresponding to the original audio signal. The volume adjustment device 102 may obtain the auditory perception weighting data from the database 103, and the gain data module 1021 may perform gain calculation according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data. The gain data module 1021 sends the gain data to the gain frequency domain signal module 1022, which may perform gain processing on the frequency domain signal using the gain data to obtain a gain frequency domain signal and convert the gain frequency domain signal from the frequency domain to the time domain to obtain a gain audio signal whose perceived volume matches the target volume value. The volume adjustment device 102 may send the gain audio signal to the audio acquisition device 101. With the volume adjustment method provided by the embodiments of the application, the adjusted audio signal has a good volume adjustment effect and sounds good, with no audible distortion; moreover, because the target volume value has a larger value range, the volume of the original audio signal can be amplified further.

It will be understood that the architecture diagram of the volume adjustment system described in the embodiments of the present application is for more clearly describing the volume adjustment method of the embodiments of the present application, and does not constitute a limitation of the volume adjustment method provided in the embodiments of the present application. For example, the volume adjustment method provided by the embodiment of the present application may be performed by other devices that are different from the volume adjustment device 102 and that are capable of communicating with the audio acquisition device 101 and the database 103, in addition to the volume adjustment device 102. Those of ordinary skill in the art will appreciate that the number of audio acquisition devices 101, volume adjustment devices 102, and databases 103 in fig. 1 are merely illustrative. Any number of devices may be configured as desired for a service implementation. Moreover, with the evolution of the system architecture and the appearance of new service scenarios, the volume adjustment method provided by the embodiment of the application is also applicable to similar technical problems.

It should be noted that, in actual applications, the collection and processing of related data (for example, the original audio signal) in the present application should strictly comply with the requirements of relevant laws and regulations: the informed consent or separate consent of the personal information subject should be obtained, and subsequent data use and processing should be carried out within the scope authorized by laws and regulations and by the personal information subject.

Referring to Fig. 2, Fig. 2 is a flow chart of a volume adjustment method according to an embodiment of the present application. The volume adjustment method may be implemented by the volume adjustment device 102 described above, or may be implemented by another device. The flow of the volume adjustment method provided in the embodiment of the application includes, but is not limited to:

S201, when the set volume value reaches a first upper limit volume value, if a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value.

In this embodiment of the present application, the set volume value may be a volume value input by a user. The first upper limit volume value may be the upper limit of the audio volume when an existing volume adjustment method (for example, digital adjustment or analog adjustment) is used; the second upper limit volume value may be the upper limit of the audio volume when the volume adjustment method provided in the embodiments of the application is used. Once the set volume value reaches the first upper limit volume value, the existing volume adjustment method cannot increase the volume any further. If a volume increasing instruction is still detected, a target volume value can be determined according to that instruction, and the volume is increased further using the method provided by the embodiments of the application. For example, suppose the first upper limit volume value of the existing method is 100 and the second upper limit volume value is 120. When the set volume value reaches 100 and a volume increasing instruction is detected, the target volume value is determined according to the instruction, and its value range is (100, 120], since the target volume value is greater than the first upper limit volume value and less than or equal to the second. The method provided by the embodiments of the application thus extends the audio volume beyond the existing volume limit, so that the audio signal remains clear and recognizable to the human ear in a noisy environment.
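The range check in S201 can be sketched as follows. This is only an illustration under stated assumptions: the limits 100 and 120 come from the example above, while the function name `target_volume_on_increase` and the per-instruction increment `STEP` are hypothetical and are not specified by the application.

```python
FIRST_UPPER_LIMIT = 100   # example value from the text
SECOND_UPPER_LIMIT = 120  # example value from the text
STEP = 5                  # hypothetical increment per volume-up instruction


def target_volume_on_increase(set_volume: int, step: int = STEP):
    """Return the target volume value for one volume-increase instruction.

    Returns None while the set volume is still below the first upper limit,
    because the conventional adjustment path still has headroom there.
    """
    if set_volume < FIRST_UPPER_LIMIT:
        return None
    # Extended range: strictly above the first limit, capped at the second.
    return min(set_volume + step, SECOND_UPPER_LIMIT)
```

With these assumed values, a set volume of 100 yields a target of 105, and any request that would exceed 120 is capped at 120.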

S202, carrying out category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information.

In this embodiment of the present application, category analysis processing may be performed on the original audio signal to obtain its audio category information. For example, the audio category information may be a voice category, a music category, a noise category, etc., or it may be a vocal category, a wind instrument category, a string category, a percussion category, etc. Adjustment reference data may be determined from the audio category information and used to guide the volume adjustment of the original audio signal. The adjustment reference data corresponding to original audio signals of different categories may differ. Because the adjustment reference data is determined according to the category of the original audio signal, audio signals of different categories can be adjusted in a targeted manner, which gives the method better universality.

S203, converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal.

In this embodiment of the present application, the original audio signal may be an analog signal that is continuous in both time and amplitude, for example the audio signal of a video's soundtrack, of a piece of music, or of a call. The time domain and the frequency domain are two different perspectives from which a signal can be analyzed. The time domain describes how a physical signal (e.g., an audio signal) varies with time; for example, the time-domain waveform of an audio signal shows how the signal changes over time. The frequency domain is a coordinate system used to describe the frequency characteristics of a signal. For an audio signal, the way its signal strength changes over time is its time-domain characteristic, and the set of single-frequency components from which it is synthesized is its frequency-domain characteristic. Existing volume adjustment methods mainly use digital adjustment or analog adjustment to scale the audio signal linearly in the time domain. Once the audio is amplified beyond a certain extent, this approach may cause clipping (the peaks of the waveform are truncated), which is heard as broken, distorted sound, and the achievable volume extension is limited by the power of the device. In this embodiment of the present application, the original audio signal may be converted from the time domain to the frequency domain to obtain the corresponding frequency domain signal, which is equivalent to decomposing a time-domain audio signal whose strength varies with time into the single-frequency components that make it up. With the method provided by this embodiment, the energy of the audio signal can therefore be adjusted in the frequency domain, so that the audio volume is increased without clipping.
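The clipping problem attributed above to linear time-domain adjustment can be illustrated with a minimal sketch. It assumes 16-bit PCM samples and a constant gain of 2.0; the helper names `linear_gain_int16` and `count_clipped` and the test tone are illustrative choices, not part of the described method.

```python
import math

INT16_MAX = 32767
INT16_MIN = -32768

def linear_gain_int16(samples, gain):
    """Scale 16-bit samples by a constant gain and clamp to the int16 range."""
    out = []
    for s in samples:
        scaled = int(round(s * gain))
        # Anything outside the representable range is truncated (clipped).
        out.append(max(INT16_MIN, min(INT16_MAX, scaled)))
    return out

def count_clipped(samples):
    """Count samples sitting at either end of the int16 range."""
    return sum(1 for s in samples if s in (INT16_MIN, INT16_MAX))

# Assumed test signal: a 440 Hz sine at 16 kHz with peak amplitude 20000.
sample_rate = 16000
tone = [int(round(20000 * math.sin(2 * math.pi * 440 * n / sample_rate)))
        for n in range(1600)]

louder = linear_gain_int16(tone, 2.0)
clipped_before = count_clipped(tone)
clipped_after = count_clipped(louder)
```

In this sketch the original tone has no clipped samples, while the doubled tone has its peaks flattened at the int16 limits, which is the waveform truncation the text describes.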

S204, performing gain calculation according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data.

In this embodiment of the application, the auditory perception weighting data describes how sensitive the human ear is to sounds of different frequencies. The frequency domain signal corresponding to the original audio signal may contain a plurality of different frequencies. The human ear is very sensitive to some of these frequencies and less sensitive to others, so the frequencies to which the ear is sensitive can be selectively enhanced and the frequencies to which it is insensitive appropriately weakened, which raises the volume of the resulting audio signal as perceived by the human ear. The target volume value may be an arbitrarily set integer greater than or equal to 0 and is used to control the degree of volume adjustment. Gain calculation can be performed according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data, and the gain data can be used to adjust the frequencies in the frequency domain signal. With the method provided by this embodiment, the gain data can be determined accurately, which facilitates the subsequent gain processing of the frequency domain signal and thus the volume adjustment of the audio signal.

S205, performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, and converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In this embodiment of the present application, the gain data may include a plurality of gain coefficients, the frequency domain signal may include a plurality of different frequencies, and different frequencies may correspond to different gain coefficients in the gain data. Some gain coefficients boost their corresponding frequencies and others attenuate them. The gain data can be used to perform gain processing on the frequency domain signal, so that the frequencies to which the human ear is sensitive are boosted and the frequencies to which it is insensitive are attenuated, yielding the gain frequency domain signal. The gain frequency domain signal may then be converted from the frequency domain back to the time domain to obtain the gain audio signal. The perceived volume corresponding to the gain audio signal matches the target volume value. For example: when an existing volume adjustment method is used, the upper limit volume value may be 100; when the volume adjustment method provided by this embodiment is used, a gain audio signal with a larger perceived volume can be obtained, and the upper limit of the audio volume may then be 120. With the method provided by this embodiment, the frequencies to which the human ear is sensitive are selectively enhanced and the frequencies to which it is insensitive are weakened, so that the volume perceived by the human ear increases without excessively amplifying the overall energy of the audio signal. This meets the user's need to extend the volume of the device, and allows the volume of the audio signal to be raised further in a noisy environment so that the audio signal remains recognizable.
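The gain processing and inverse transform of S205 can be sketched as follows. This is a minimal illustration, not the patented implementation: it uses a naive O(N²) discrete Fourier transform on one short frame, and the band split at bin 8 and the gains 0.5 and 1.1 in `example_gain` are assumed values chosen only to show a low band being attenuated and a higher band being boosted.

```python
import cmath
import math

def dft(samples):
    """Naive discrete Fourier transform; adequate for a short illustrative frame."""
    n = len(samples)
    return [sum(samples[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft_real(spectrum):
    """Inverse transform, keeping the real part (the input frame was real)."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]

def apply_gains(samples, gain_for_bin):
    """Multiply each frequency bin by its gain coefficient, then go back to time domain."""
    n = len(samples)
    spectrum = dft(samples)
    # Bins k and n-k carry the same frequency of a real signal, so they share one gain.
    shaped = [spectrum[k] * gain_for_bin(min(k, n - k)) for k in range(n)]
    return idft_real(shaped)

def example_gain(bin_index):
    """Assumed gain rule: attenuate bins below 8, boost the rest."""
    return 0.5 if bin_index < 8 else 1.1

frame_len = 64
low = [math.sin(2 * math.pi * 2 * t / frame_len) for t in range(frame_len)]
high = [math.sin(2 * math.pi * 20 * t / frame_len) for t in range(frame_len)]
original = [a + b for a, b in zip(low, high)]

boosted = apply_gains(original, example_gain)
energy_before = sum(s * s for s in original)
energy_after = sum(s * s for s in boosted)
```

In this example the higher-frequency component is boosted while the total energy of the frame decreases, which is consistent with the claim that perceived volume can rise without amplifying the overall signal energy. A practical implementation would typically process overlapping windowed frames with a fast Fourier transform.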

Based on the above embodiment, the beneficial effects of the present application are as follows. With the volume adjustment method provided by the embodiment of the application, when the set volume value has reached the first upper limit volume value and a volume increasing instruction is received, the volume of the original audio signal can still be adjusted, so that the volume is further increased and the user's volume requirement is met. The adjustment reference data can be determined according to the category information of the original audio signal, so that original audio signals of different categories can each be adjusted appropriately, giving the method both generality and specificity. The original audio signal can be converted into a frequency domain signal, so that the energy of the signal is adjusted in the frequency domain and clipping is avoided while the volume is increased. The frequencies to which the human ear is sensitive can be selectively enhanced and the frequencies to which it is insensitive weakened, so that the perceived volume of the audio signal is improved without excessively amplifying the overall energy of the signal. The volume of the audio signal can be increased in a noisy environment, the intelligibility of the audio signal is preserved, and the resources consumed to increase the volume are effectively reduced.

Referring to fig. 3, fig. 3 is a flow chart of another volume adjustment method according to an embodiment of the present application. The volume adjustment method may be implemented by the volume adjustment device 102 described above, or may be implemented by another device. The flow of the volume adjustment method provided in the embodiment of the application includes, but is not limited to:

S301, when the set volume value reaches a first upper limit volume value, if a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value.

In this embodiment of the present application, the set volume value may be a volume value input by a user, the first upper limit volume value may be the upper limit of the audio volume when an existing volume adjustment method is used, and the second upper limit volume value may be the upper limit of the audio volume when the volume adjustment method provided by this embodiment is used. When the set volume value reaches the first upper limit volume value, the existing volume adjustment method cannot increase the volume any further. If a volume increasing instruction is still detected, a target volume value can be determined according to the volume increasing instruction, and the volume of the audio is further increased with the volume adjustment method provided by this embodiment. For example: with the existing method the first upper limit volume value is 100; when the set volume value reaches the first upper limit volume value and a volume increasing instruction is detected, the method provided by this embodiment can further increase the volume of the audio signal, the second upper limit volume value can be 120, and the target volume value determined according to the volume increasing instruction can lie in the range (100, 120]. Please refer to fig. 4, which is a schematic diagram of a volume adjustment method according to an embodiment of the present application. The "volume" label in fig. 4 indicates that the current interface is a volume adjustment interface. When the volume value set by the user is less than 100, the volume of the audio can be adjusted with the existing digital adjustment or analog adjustment method, so that the volume of the audio matches the set volume value. When, in a noisy environment, the volume value set by the user is greater than or equal to 100 (the volume value set in fig. 4 is 110), the volume adjustment method provided by this embodiment can be used to adjust the volume of the audio signal, so that the perceived volume of the adjusted audio signal matches the set volume value and the volume of the audio is amplified further. The volume adjustment method provided by this embodiment can thus extend the volume beyond the existing limit, so that the audio signal stays clear and recognizable to the human ear in a noisy environment.
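The decision in S301 can be sketched as a small function. The limits 100 and 120 come from the example above; the function name `target_volume_on_increase`, the `step` parameter, and the choice to return `None` below the first upper limit are assumptions made for illustration.

```python
FIRST_UPPER_LIMIT = 100   # limit of the existing method in the example above
SECOND_UPPER_LIMIT = 120  # extended limit in the example above

def target_volume_on_increase(set_volume, step=1):
    """Return the target volume value after a volume-up instruction.

    Returns None while the set volume is still below the first upper limit,
    where the existing adjustment method is assumed to handle the request.
    `step` is a hypothetical per-instruction increment.
    """
    if set_volume < FIRST_UPPER_LIMIT:
        return None
    # Above the first limit the target may grow, but never past the second limit.
    return min(set_volume + step, SECOND_UPPER_LIMIT)
```

For instance, under these assumptions a volume-up instruction at a set volume of 100 yields a target of 101, and a request that would exceed the extended range is capped at 120.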

S302, carrying out category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information.

In this embodiment of the present application, category analysis processing may be performed on the original audio signal to obtain its audio category information. For example, the audio category information may be a voice category, a music category, a noise category, etc., or it may be a vocal category, a wind instrument category, a string instrument category, a percussion category, etc. The adjustment reference data may be determined according to the audio category information and is used to guide the volume adjustment of the original audio signal, so that the components of the corresponding frequency domain signal to which the human ear is sensitive are enhanced and the components to which it is insensitive are weakened. Original audio signals of different categories may correspond to different adjustment reference data. For example: if the audio category information of the original audio signal is voice, the original audio signal contains a human voice, and since the frequency of the human voice is generally within 100 to 1000 hertz, adjustment reference data with a smaller value can be determined according to the audio category information; if the audio category information is percussion, the original audio signal contains a percussion instrument, and since the frequency of a percussion instrument (for example, a drum) is generally about 2500 hertz, adjustment reference data with a larger value can be determined according to the audio category information. With the method provided by this embodiment, the adjustment reference data is determined according to the category information of the original audio signal, so that the volume of audio signals of different categories can be adjusted in a targeted way, and the method has good generality.
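A lookup from audio category to adjustment reference data might look like the sketch below. The text only states that voice maps to a smaller value and percussion to a larger one; every number in `ADJUSTMENT_REFERENCE`, the default value, and the function name are hypothetical placeholders.

```python
# Hypothetical values: only the ordering (voice smaller, percussion larger)
# is taken from the description; the numbers themselves are placeholders.
ADJUSTMENT_REFERENCE = {
    "voice": 1.0,
    "music": 1.5,
    "percussion": 2.0,
}
DEFAULT_REFERENCE = 1.5  # assumed fallback for categories not listed above

def adjustment_reference_for(category):
    """Return the adjustment reference data for an audio category."""
    return ADJUSTMENT_REFERENCE.get(category, DEFAULT_REFERENCE)
```

Keeping the mapping in a table makes it straightforward to tune each category separately, which matches the stated goal of targeted adjustment per category.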

In an embodiment, the category analysis processing of the original audio signal to obtain the audio category information may be implemented as follows: acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal; and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain the audio category information corresponding to the original audio signal, wherein the audio category information is used for indicating the sound-producing object corresponding to the original audio signal. For example: the reference audio signal and the original audio signal belong to the same piece of music, the reference audio signal is the audio from the 0th second to the 10th second of the music, and the original audio signal is the audio from the 10th second to the 11th second, so the time corresponding to the reference audio signal (0 to 10 seconds) is earlier than the time corresponding to the original audio signal (10 to 11 seconds). The feature analysis processing may be performed on the reference audio signal and the original audio signal with a method such as a neural network model. For example, the audio category information of the original audio signal may be percussion, i.e., the sound-producing object corresponding to the original audio signal is a percussion instrument. With the method provided by this embodiment, the audio category information of the original audio signal can be determined accurately, which facilitates the subsequent determination of the adjustment reference data according to the audio category information, so that audio signals of different categories can each be given appropriate volume adjustment.

S303, converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal.

In this embodiment of the present application, the original audio signal may be an analog signal that is continuous in both time and amplitude, for example the audio signal of a video's soundtrack, of a piece of music, or of a call. The time domain and the frequency domain are two different perspectives from which a signal can be analyzed. The time domain describes how a physical signal (e.g., an audio signal) varies with time; for example, the time-domain waveform of an audio signal shows how the signal changes over time. The frequency domain is a coordinate system used to describe the frequency characteristics of a signal. For an audio signal, the way its signal strength changes over time is its time-domain characteristic, and the set of single-frequency components from which it is synthesized is its frequency-domain characteristic. Existing volume adjustment methods mainly use digital adjustment or analog adjustment to scale the audio signal linearly in the time domain in order to increase the volume. This approach may cause clipping, heard as broken, distorted sound, in the adjusted audio, and the achievable volume extension is limited by the power of the device. In this embodiment of the present application, the original audio signal may be converted from the time domain to the frequency domain to obtain the corresponding frequency domain signal, which is equivalent to decomposing a time-domain audio signal whose strength varies with time into the single-frequency components that make it up. With the method provided by the application, the energy of the audio signal can therefore be adjusted in the frequency domain, so that the audio volume is increased without clipping.

In an embodiment, the original audio signal may be converted from the time domain to the frequency domain by performing a Fourier transform on it. The Fourier transform is a signal analysis method that decomposes a signal into its frequency components, and these components can in turn be used to synthesize the signal. It is mainly applied to stationary signals: the Fourier transform reveals which frequency components a segment of the signal contains overall, but not when each component occurs. The method provided by this embodiment can convert the original audio signal from the time domain to the frequency domain quickly, which effectively improves the efficiency of the volume adjustment.
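The remark that a Fourier transform reveals which frequencies are present but not when they occur can be checked with a short sketch. It uses a naive O(N²) discrete Fourier transform for self-containment; the frame length of 64 and the two tone bins (4 and 16) are assumed values chosen so that each tone completes whole cycles in half a frame.

```python
import cmath
import math

def dft(samples):
    """Naive discrete Fourier transform; adequate for a short illustrative frame."""
    n = len(samples)
    return [sum(samples[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

frame_len = 64
half = frame_len // 2
tone_a = [math.sin(2 * math.pi * 4 * t / frame_len) for t in range(frame_len)]
tone_b = [math.sin(2 * math.pi * 16 * t / frame_len) for t in range(frame_len)]

# Same two tones, opposite order in time.
a_then_b = tone_a[:half] + tone_b[half:]
b_then_a = tone_b[:half] + tone_a[half:]

mag_1 = [abs(c) for c in dft(a_then_b)]
mag_2 = [abs(c) for c in dft(b_then_a)]
largest_difference = max(abs(x - y) for x, y in zip(mag_1, mag_2))
```

Both frames show energy at bins 4 and 16, and their magnitude spectra agree to within floating-point error even though the tones occur in opposite order; the ordering is carried only by the phase. Real-time processing therefore usually applies the transform to short successive frames.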

S304, determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequencies and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency.

In this embodiment of the application, the auditory perception weighting data describes how sensitive the human ear is to sounds of different frequencies. The frequency domain signal corresponding to the original audio signal may contain a plurality of different frequencies. The human ear is very sensitive to some of these frequencies and less sensitive to others, so the frequencies to which the ear is sensitive can be selectively enhanced and the frequencies to which it is insensitive appropriately weakened, which raises the overall perceived volume of the resulting audio signal. The auditory perception weighting data includes a correspondence between signal frequencies and auditory perception weighting coefficients, for example: when the signal frequency is 4000 hertz, the corresponding auditory perception weighting coefficient can be 2.6; when the signal frequency is 2000 hertz, the corresponding auditory perception weighting coefficient can be 1.2. The adjustment reference data indicates the degree of volume adjustment for the original audio signal. Relative auditory perception weighting data is determined based on the auditory perception weighting data and the adjustment reference data. It comprises a plurality of relative auditory perception weighting coefficients: the coefficient corresponding to a first signal frequency is smaller than a set value, the coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency. The relative auditory perception weighting data is used to boost or attenuate signals of different frequencies. For example: the set value may be 1, the relative auditory perception weighting coefficient corresponding to the first signal frequency may be 0.5, and the relative auditory perception weighting coefficient corresponding to the second signal frequency may be 1.1; the former attenuates the first signal frequency and the latter boosts the second signal frequency. With the method provided by this embodiment, the relative auditory perception weighting data can be determined accurately, which facilitates the subsequent determination of the gain data and thus the volume increase of the original audio signal.

In one embodiment, the auditory perception weighting data may be determined from acoustic equal-loudness curves, which makes auditory perception quantifiable. The primary basis of auditory perception is the loudness of the audio, which varies with the intensity and frequency of the sound; audio of the same intensity but different frequencies is perceived differently. Referring to fig. 5, acoustic equal-loudness curves according to an embodiment of the present application are shown. In fig. 5, the abscissa is the signal frequency in hertz (Hz), ranging from 20 Hz to 2 kHz, and the ordinate is the sound pressure level in decibels (dB SPL), ranging from -10 dB to 130 dB. Each curve in fig. 5 is an equal-loudness curve, which describes the relationship between sound pressure level and signal frequency under equal-loudness conditions, i.e., the sound pressure level that audio signals of different frequencies must reach for a listener to perceive the same loudness. Fig. 5 contains 6 equal-loudness curves, from top to bottom: the curve for a loudness level of 100 (labeled 100 phon in fig. 5), the curve for 80 (labeled 80), the curve for 60 (labeled 60), the curve for 40 (labeled 40), the curve for 20 (labeled 20), and the threshold curve, i.e., the equal-loudness curve of the lowest audible loudness (labeled threshold). For any one of the curves in fig. 5, when the signal frequency is in the low-to-mid range (below 1 kHz), the lower the signal frequency, the higher the sound pressure level (i.e., energy) required for equal loudness, that is, more sound energy is needed for the human ear to have the same auditory sensation; when the signal frequency is in the mid-to-high range (above 1 kHz), different frequency bands have different perceptual characteristics. In some cases, the frequencies of a human voice audio signal are concentrated in the low-to-mid range (e.g., 1500 Hz or less), and as can be seen from fig. 5, the human ear is relatively insensitive to low-frequency signals (below 500 Hz). Compared with the mid-to-high-frequency band to which the human ear is sensitive (for example, 3 to 4 kHz), a low-frequency signal needs many times more absolute physical energy to produce a similar perceived loudness. Therefore, the input frequency domain signal can be adjusted by attenuating the components to which auditory perception is insensitive and enhancing the components to which it is sensitive, so that the human ear perceives the audio corresponding to the adjusted frequency domain signal as louder and the audio volume is increased. In some cases, psychoacoustic equal-loudness curve data based on the BS3383 standard (BS3383, Specification for normal equal-loudness level contours for pure tones under free-field listening conditions) may be used to calculate the auditory perception weighting data. The calculation can be represented by the following formulas (1), (2), (3) and (4):

（1）

（2）

（3）

（4）

Here k is the input frequency value, and ff, af, bf and cf are data from the equal-loudness curve data table published in the BS3383 standard. Using formulas (1), (2), (3) and (4), the existing equal-loudness curve data is linearly interpolated to obtain the loudness value loud corresponding to the target frequency point k. After the loudness value has been calculated with formulas (1), (2), (3) and (4), the auditory perception weighting data may be determined according to the following formula (5):

（5）

In formula (5), cof(k) represents the auditory perception weighting coefficient corresponding to frequency k, and loud represents the loudness value corresponding to frequency k. The auditory perception weighting coefficient corresponding to each frequency can be determined with formula (5). Referring to fig. 6, a schematic diagram of auditory perception weighting data according to an embodiment of the present application is shown. The auditory perception weighting data includes a correspondence between signal frequencies and auditory perception weighting coefficients. In fig. 6, the abscissa is the signal frequency in hertz (Hz), ranging from 0 Hz to 8000 Hz, and the ordinate is the auditory perception weighting coefficient, ranging from 0 to 3. The curve in fig. 6 shows the value of the auditory perception weighting coefficient at each signal frequency. As can be seen from fig. 6, different frequencies correspond to different auditory perception weighting coefficients, and signals between 2500 Hz and 4500 Hz correspond to auditory perception weighting coefficients greater than 2. With the method provided by this embodiment, the auditory perception weighting coefficients can be determined accurately, and the frequency domain signal corresponding to the original audio signal can be given targeted gain adjustment according to these coefficients, so that the volume is adjusted accurately.
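Once weighting coefficients are known at a set of tabulated frequencies, a coefficient for any other frequency can be looked up by linear interpolation. The sketch below is an assumption-laden illustration: it interpolates the coefficients directly rather than the BS3383 equal-loudness table used by formulas (1) to (4), the three table points are only the example values quoted in this description (2000 Hz, 3000 Hz and 4000 Hz), and clamping outside the table is an arbitrary choice.

```python
from bisect import bisect_right

# Example points quoted in the description; a real table would be much denser.
WEIGHT_TABLE = [(2000.0, 1.2), (3000.0, 2.2), (4000.0, 2.6)]

def weighting_coefficient(freq_hz, table=WEIGHT_TABLE):
    """Linearly interpolate a weighting coefficient for freq_hz.

    Frequencies outside the table are clamped to the nearest end point
    (an assumption made for this sketch).
    """
    freqs = [f for f, _ in table]
    if freq_hz <= freqs[0]:
        return table[0][1]
    if freq_hz >= freqs[-1]:
        return table[-1][1]
    i = bisect_right(freqs, freq_hz)
    (f0, c0), (f1, c1) = table[i - 1], table[i]
    return c0 + (c1 - c0) * (freq_hz - f0) / (f1 - f0)
```

With these points, a query at 2500 Hz falls halfway between 1.2 and 2.2 and returns approximately 1.7.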

In an embodiment, the relative auditory perception weighting data may be determined from the auditory perception weighting data and the adjustment reference data as follows: acquiring the plurality of auditory perception weighting coefficients in the auditory perception weighting data; for any one of these coefficients, dividing the coefficient by the adjustment reference data to obtain the corresponding relative auditory perception weighting coefficient; and, once the relative auditory perception weighting coefficient of every auditory perception weighting coefficient has been determined, forming the relative auditory perception weighting data from these relative coefficients. The adjustment reference data determined according to the original audio signal serves to make the relative auditory perception weighting coefficients of low-frequency signals, to which the human ear is insensitive, smaller than 1, so that the low-frequency signals are attenuated, and to make the relative auditory perception weighting coefficients of high-frequency signals, to which the human ear is sensitive, larger than 1, so that the high-frequency signals are boosted. The relative auditory perception weighting coefficient may be determined as shown in the following formula (6):

$$\mathrm{cof}_{rel}(freq) = \frac{\mathrm{cof}(freq)}{b_0}$$ （6）

In formula (6), b0 represents the adjustment reference data, cof(freq) represents the auditory perception weighting coefficient corresponding to the frequency freq, and cof_rel(freq) represents the relative auditory perception weighting coefficient corresponding to the frequency freq. After the relative auditory perception weighting coefficient of each of the plurality of auditory perception weighting coefficients has been determined, the relative auditory perception weighting data can be determined from these relative coefficients. With the method provided by this embodiment, the auditory perception weighting data and the relative auditory perception weighting data can be determined accurately, which facilitates gain processing of the frequency domain signal according to the relative auditory perception weighting data, so that the perceived volume of the audio signal is adjusted, the quality of the audio signal is preserved, and clipping of the adjusted audio signal is avoided.
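The division described for formula (6) can be written as a short sketch. The weighting coefficients used here (2000 Hz: 1.2, 3000 Hz: 2.2, 4000 Hz: 2.6) and the adjustment reference value 2.0 are the example values quoted in this description; the function name `relative_weights` and the dictionary layout are assumptions.

```python
def relative_weights(weights, b0):
    """Divide each auditory perception weighting coefficient by b0.

    weights maps a frequency in Hz to its weighting coefficient;
    b0 is the adjustment reference data and must be positive.
    """
    if b0 <= 0:
        raise ValueError("adjustment reference data must be positive")
    return {freq: cof / b0 for freq, cof in weights.items()}

weights = {2000: 1.2, 3000: 2.2, 4000: 2.6}
rel = relative_weights(weights, 2.0)
```

With these values the 2000 Hz coefficient falls below 1 (attenuation) while the 3000 Hz and 4000 Hz coefficients stay above 1 (gain), so the adjustment reference data effectively sets the boundary between attenuated and boosted frequencies.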

S305, obtaining frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies, obtaining a relative auditory perception weighting coefficient corresponding to any frequency from the relative auditory perception weighting data according to any frequency, and determining a gain coefficient corresponding to any frequency according to the relative auditory perception weighting coefficient corresponding to any frequency and a target volume value.

In this embodiment of the present application, frequency information corresponding to the frequency domain signal may be obtained, where the frequency information may include a plurality of frequencies. For any one of the plurality of frequencies, the relative auditory perception weighting coefficient corresponding to that frequency may be obtained from the relative auditory perception weighting data. For example: the frequency information includes a frequency of 3000 Hz, and as can be seen from fig. 6 the auditory perception weighting coefficient corresponding to this signal frequency is 2.2; if the adjustment reference data is 2.0, the relative auditory perception weighting data can be calculated, and the relative auditory perception weighting coefficient corresponding to this frequency (3000 Hz) is determined from it to be 1.1. The gain coefficient for the frequency may then be determined from the relative auditory perception weighting coefficient corresponding to the frequency and the target volume value. The method provided by this embodiment can determine the relative auditory perception weighting coefficient corresponding to each frequency in the frequency domain signal, which makes it convenient to determine the gain coefficient of each frequency and thus to perform gain processing on the frequency domain signal.

In an embodiment, the gain coefficient corresponding to any frequency may be determined according to the relative auditory perception weighting coefficient corresponding to that frequency and the target volume value as follows: converting the target volume value to obtain volume control data; and performing a power operation with the relative auditory perception weighting coefficient corresponding to the frequency as the base and the volume control data as the exponent to obtain the gain coefficient corresponding to the frequency. The target volume value may be an expected volume value input by a user. The gain coefficient corresponding to a given frequency can be determined as shown in the following formula (7):

$$\mathrm{gain}(k) = \mathrm{power}\big(\mathrm{cof}_{\mathrm{rel}}(k),\ \mathrm{func}(q)\big) \tag{7}$$

In formula (7), q represents the target volume value, func(q) represents the volume control data corresponding to the target volume value, cof_rel(k) represents the relative auditory perception weighting coefficient corresponding to the frequency k, power(a, b) represents a power operation with a as the base and b as the exponent, and gain(k) represents the resulting gain coefficient corresponding to the frequency k. With the method provided by the embodiment of the application, a gain coefficient associated with both the target volume value and the frequency can be determined, so that gain processing of the corresponding frequency domain signal can be performed according to the gain coefficient; frequency components to which the human ear is sensitive are thereby enhanced, and frequency components to which the human ear is insensitive are attenuated.
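
The power operation of formula (7) can be illustrated with a short sketch. The description does not define func(q) in this passage, so the linear mapping in `volume_control` below (0 at a first upper limit of 100, `max_exponent` at a second upper limit of 120) is purely a hypothetical choice for illustration, as are the function names and default values.

```python
def volume_control(q, first_limit=100.0, second_limit=120.0, max_exponent=2.0):
    """Hypothetical func(q): map the target volume value to an exponent.

    Returns 0 at first_limit and max_exponent at second_limit. This linear
    mapping is an assumption; the source text does not specify func(q).
    """
    if not first_limit <= q <= second_limit:
        raise ValueError("target volume value outside the supported range")
    return max_exponent * (q - first_limit) / (second_limit - first_limit)


def gain_coefficient(rel_coef, q):
    """Power operation: relative coefficient as base, func(q) as exponent."""
    return rel_coef ** volume_control(q)


g_sensitive = gain_coefficient(1.1, 120.0)    # base > 1, so gain > 1
g_insensitive = gain_coefficient(0.8, 110.0)  # base < 1, so gain < 1
g_neutral = gain_coefficient(1.1, 100.0)      # exponent 0, so gain == 1
print(g_sensitive, g_insensitive, g_neutral)
```

Whatever form func(q) takes, a positive exponent pushes bases above 1 further above 1 and bases below 1 further below 1, so one exponent derived from the target volume value steers both the boost and the attenuation.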

It should be noted that the volume adjustment method provided in the embodiment of the present application may be used in combination with other volume adjustment methods, or may be used alone. For example, when the method is used in combination with an existing volume adjustment method, the value range of the set volume value may be [0,120]: when the set volume value input by the user is smaller than 100, the volume of the audio signal may be adjusted by the existing volume adjustment method, and when the set volume value input by the user is greater than 100, the volume of the audio signal may be adjusted by the volume adjustment method provided in the embodiment of the present application, so that the perceived volume of the audio signal is greater and no audible distortion occurs. As another example, when the method is used alone, the value range of the set volume value may be [0,100]: when the user inputs a set volume value, the volume of the audio can be adjusted according to the set volume value by the volume adjustment method provided in the embodiment of the present application, so that the adjusted audio sounds better and clearer.
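
The two usage modes described above can be sketched as a simple dispatcher. The names `choose_method`, `FIRST_UPPER_LIMIT` and `SECOND_UPPER_LIMIT` are hypothetical, and routing a set volume value of exactly 100 to the existing method is an assumption, because the text only covers values smaller than and greater than 100.

```python
FIRST_UPPER_LIMIT = 100   # upper limit of the existing method (from the example)
SECOND_UPPER_LIMIT = 120  # extended upper limit (from the example)


def choose_method(set_volume, combined=True):
    """Return which adjustment method handles a given set volume value.

    combined=True  -> range [0, 120], existing method up to 100 (assumed
                      inclusive), perceptual method above 100.
    combined=False -> range [0, 100], perceptual method for every value.
    """
    upper = SECOND_UPPER_LIMIT if combined else FIRST_UPPER_LIMIT
    if not 0 <= set_volume <= upper:
        raise ValueError(f"set volume value must be within [0, {upper}]")
    if combined and set_volume <= FIRST_UPPER_LIMIT:
        return "existing"
    return "perceptual"


print(choose_method(80), choose_method(110), choose_method(80, combined=False))
```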

S306, after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

In this embodiment of the present application, the frequency information corresponding to the frequency domain signal includes a plurality of frequencies, and after determining gain coefficients corresponding to each of the plurality of frequencies, gain data may be determined according to the gain coefficients corresponding to each of the plurality of frequencies. By the method provided by the embodiment of the application, the gain data corresponding to the frequency domain signal can be determined, and subsequent gain processing of the frequency domain signal can be conveniently realized according to the gain data.
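
As a rough sketch of how gain data might be assembled for one frame, the example below computes the frequency of each bin and collects one gain coefficient per bin. The bin spacing `k * sample_rate / frame_size` is the usual discrete Fourier transform layout; the band limits and coefficients in the hypothetical `rel_coef_for`, and the fixed exponent standing in for the volume control data, are assumptions made only for illustration.

```python
def bin_frequencies(n_bins, sample_rate, frame_size):
    """Frequency in Hz of each of the first n_bins DFT bins."""
    return [k * sample_rate / frame_size for k in range(n_bins)]


def rel_coef_for(freq):
    """Hypothetical relative weighting: > 1 in an ear-sensitive band."""
    return 1.1 if 1000.0 <= freq <= 5000.0 else 0.9


def build_gain_data(freqs, rel_coef, exponent):
    """One gain coefficient per frequency: rel_coef(freq) ** exponent."""
    return [rel_coef(f) ** exponent for f in freqs]


# 8-sample frame at 8000 Hz: bins 0..4 cover 0, 1000, 2000, 3000, 4000 Hz.
freqs = bin_frequencies(n_bins=5, sample_rate=8000, frame_size=8)
gain_data = build_gain_data(freqs, rel_coef_for, exponent=2.0)
print(list(zip(freqs, gain_data)))
```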

S307, performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal.

In this embodiment of the present application, the gain data may include a plurality of gain coefficients, the frequency domain signal may include a plurality of different frequencies, and different frequencies may correspond to different gain coefficients in the gain data. Some gain coefficients increase the signal at the corresponding frequencies, and some gain coefficients decrease it. Gain processing can be performed on the frequency domain signal using the gain data, so that frequencies to which the human ear is sensitive are boosted and frequencies to which the human ear is insensitive are attenuated, yielding the gain frequency domain signal. With the method provided by the embodiment of the application, the frequencies to which the human ear is sensitive can be enhanced in a targeted manner and the frequencies to which the human ear is insensitive can be reduced, so that the volume perceived by the human ear is increased without excessively amplifying the overall energy of the audio signal, which meets the user's need to extend the volume range of the corresponding device.

In an embodiment, the frequency information corresponding to the frequency domain signal includes a plurality of frequencies. For any one of the plurality of frequencies, a signal value corresponding to the frequency is obtained from the frequency domain signal, and the signal value is weighted (for example, multiplied) by the gain coefficient corresponding to that frequency in the gain data to obtain a gain-processed signal value corresponding to the frequency. After the gain-processed signal value corresponding to each of the plurality of frequencies has been determined, the gain frequency domain signal is determined from the gain-processed signal values corresponding to the respective frequencies.

In this embodiment of the present application, different frequencies have different gain coefficients, and the gain-processed signal values differ accordingly. For a frequency to which the human ear is sensitive, the gain coefficient is greater than 1, so the gain-processed signal value obtained by weighting the signal value with the gain coefficient is greater than the original signal value. For a frequency to which the human ear is insensitive, the gain coefficient is smaller than 1, so the gain-processed signal value is smaller than the original signal value. With the method provided by the embodiment of the application, the gain frequency domain signal can be accurately determined: frequency components to which the human ear is sensitive are boosted, frequency components to which the human ear is insensitive are reduced, and gain processing of the frequency domain signal as a whole is achieved.
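
The weighting step can be sketched as an element-wise multiplication of complex spectrum values by real gain coefficients. The spectrum values and gain coefficients below are invented for illustration, and the name `apply_gain` is hypothetical.

```python
def apply_gain(spectrum, gain_data):
    """Multiply each frequency-domain value by its gain coefficient."""
    if len(spectrum) != len(gain_data):
        raise ValueError("spectrum and gain data must have the same length")
    return [value * g for value, g in zip(spectrum, gain_data)]


# Made-up spectrum values (one complex number per frequency) and gains.
spectrum = [complex(4, 0), complex(0, -2), complex(1, 1)]
gain_data = [0.81, 1.21, 1.21]

gained = apply_gain(spectrum, gain_data)
for before, after in zip(spectrum, gained):
    print(abs(before), "->", abs(after))
```

Multiplying a complex value by a positive real gain scales its magnitude and leaves its phase unchanged, so only the strength of each frequency component is altered.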

S308, converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In the embodiment of the application, the gain frequency domain signal can be converted from the frequency domain to the time domain to obtain the gain audio signal. The perceived volume corresponding to the gain audio signal matches the target volume value; that is, with the volume adjustment method provided by the application, the perceived volume of the gain audio signal can be made to match the target volume value. For example, when an existing volume adjustment method is used to adjust the volume of the audio signal, the upper limit volume value is set to 100; when the volume adjustment method provided by the embodiment of the application is used, a gain audio signal with a larger perceived volume can be obtained, and the upper limit of the audio volume can then be 120. The method provided by the embodiment of the application can further improve the perceived volume of the audio signal, so that the volume of the audio signal can be further increased in a noisy environment and the intelligibility of the audio signal is ensured.

In an embodiment, the gain frequency domain signal may be converted from the frequency domain to the time domain by an inverse Fourier transform. One way to compute the inverse Fourier transform is to take the complex conjugate of the frequency domain data, apply a forward Fourier transform, and then take the complex conjugate of the result and divide by the number of points, so that the frequency domain signal is converted into the time domain.
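
The conjugation approach to the inverse transform can be checked with a small sketch. A naive O(N²) discrete Fourier transform is used here only to keep the example self-contained; a real implementation would normally use a fast Fourier transform library. The function names are hypothetical.

```python
import cmath


def dft(x):
    """Naive forward discrete Fourier transform."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft_via_conjugate(spectrum):
    """Inverse DFT computed as conj(DFT(conj(X))) / N."""
    n = len(spectrum)
    forward = dft([v.conjugate() for v in spectrum])
    return [v.conjugate() / n for v in forward]


# Round trip: time domain -> frequency domain -> time domain.
signal = [0.0, 1.0, 0.0, -1.0]
recovered = idft_via_conjugate(dft(signal))
print([round(v.real, 6) for v in recovered])
```

The round trip recovers the original samples up to floating-point error, with imaginary parts close to zero because the input is real.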

It should be noted that the embodiment of the present application mainly illustrates the volume increasing scenario in the volume adjustment of the original audio signal; the method provided by the embodiment of the present application is also applicable to the volume decreasing scenario in the volume adjustment of the original audio signal.

Based on the above embodiment, the beneficial effects of the present application are as follows: with the volume adjustment method provided by the embodiment of the application, when the set volume value reaches the first upper limit volume value and a volume increase instruction is received, the volume of the original audio signal can still be adjusted, so that the volume is further increased and the volume requirement of the user is met; the adjustment reference data can be determined according to the category information of the original audio signal, so that volume adjustment is achieved for original audio signals of different categories, giving better generality and pertinence; the original audio signal can be converted into a frequency domain signal, so that the energy of the signal is adjusted in the frequency domain and clipping is avoided while the volume is adjusted; the relative auditory perception weighting coefficients corresponding to different frequencies can be determined, so that the strength of auditory perception is quantified, frequency components to which the human ear is sensitive can be enhanced in a targeted manner and frequency components to which the human ear is insensitive can be weakened, the perceived volume of the audio signal is improved without greatly amplifying the overall energy of the signal, and the processing efficiency of volume adjustment is also improved; and the volume of the audio signal can be increased in a noisy environment, the intelligibility of the audio signal is ensured, and the resources consumed by the volume increase are effectively reduced.

Referring to fig. 7, fig. 7 is a block diagram illustrating a volume adjusting device according to an embodiment of the present application. The device comprises:

a determining unit 701, configured to, if a volume increase instruction is detected when the set volume value reaches a first upper limit volume value, determine a target volume value according to the volume increase instruction, where the target volume value is greater than the first upper limit volume value and less than or equal to a second upper limit volume value, and the second upper limit volume value is greater than the first upper limit volume value;

the processing unit 702 is configured to perform category analysis processing on an original audio signal, obtain audio category information, and determine adjustment reference data according to the audio category information;

the processing unit 702 is further configured to convert the original audio signal from a time domain to a frequency domain, so as to obtain a frequency domain signal corresponding to the original audio signal;

the processing unit 702 is further configured to perform gain calculation according to auditory perception weighted data, the target volume value, the adjustment reference data, and the frequency domain signal, so as to obtain gain data;

and a conversion unit 703, configured to perform gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, and convert the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, where a perceived volume corresponding to the gain audio signal is matched with the target volume value.

In one embodiment, the processing unit 702 is specifically configured to, when performing gain calculation according to the auditory sense weighted data, the target volume value, the adjustment reference data, and the frequency domain signal to obtain gain data: determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequency and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency; and performing gain calculation according to the relative auditory perception weighted data, the target volume value and the frequency domain signal to obtain gain data.

In one embodiment, the processing unit 702 is specifically configured to, when performing gain calculation according to the relative auditory sense weighted data, the target volume value, and the frequency domain signal to obtain gain data: acquiring frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies; for any frequency of the plurality of frequencies, acquiring a relative auditory perception weighting coefficient corresponding to the any frequency from the relative auditory perception weighting data, and determining a gain coefficient corresponding to the any frequency according to the relative auditory perception weighting coefficient corresponding to the any frequency and the target volume value; after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

In an embodiment, the processing unit 702 is specifically configured to, when performing gain processing on the frequency domain signal using the gain data to obtain a gain frequency domain signal: for any frequency of the plurality of frequencies, acquiring a signal value corresponding to the any frequency from the frequency domain signal; weighting the signal value corresponding to any frequency by using the gain coefficient corresponding to any frequency in the gain data to obtain a signal value after gain processing corresponding to any frequency; after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal according to the gain-processed signal values corresponding to each of the frequencies.

In an embodiment, the processing unit 702 is specifically configured to, when determining the gain coefficient corresponding to the arbitrary frequency according to the relative auditory perception weighting coefficient corresponding to the arbitrary frequency and the target volume value: converting the target volume value to obtain volume control data; and performing power operation by taking the relative auditory perception weighting coefficient corresponding to any frequency as a base number and taking the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency.

In one embodiment, the processing unit 702 is specifically configured to, when determining the relative auditory sense weighting data according to the auditory sense weighting data and the adjustment reference data: acquiring a plurality of auditory perception weighting coefficients in the auditory perception weighting data, and dividing and calculating any auditory perception weighting coefficient with the adjustment reference data according to any auditory perception weighting coefficient in the auditory perception weighting coefficients to obtain a relative auditory perception weighting coefficient corresponding to the any auditory perception weighting coefficient; after determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, determining relative auditory sense weighting data according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient.

In an embodiment, the processing unit 702 is specifically configured to, when performing a class analysis process on an original audio signal to obtain audio class information: acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal; and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal.

It may be understood that the functions of each functional unit of the volume adjusting device in the embodiment of the present application may be specifically implemented according to the volume adjusting method in the embodiment of the method, and the specific implementation process may refer to the related description in the embodiment of the volume adjusting method, which is not repeated herein.

Referring to fig. 8, fig. 8 is a block diagram of a computer device according to an embodiment of the present application. The computer device described in the embodiment of the present application includes: a processor 801, a communication interface 802, and a memory 803. The processor 801, the communication interface 802, and the memory 803 may be connected by a bus or by other means; connection by a bus is taken as an example in the embodiment of the present application.

The processor 801 (or central processing unit, CPU) is the computing core and control core of the computer device; it can parse various instructions in the computer device and process various data of the computer device. For example, the CPU can parse a power-on or power-off instruction sent by a user to the computer device and control the computer device to power on or off; as another example, the CPU may transmit various types of interaction data between internal structures of the computer device, and so on. The communication interface 802 may optionally include a standard wired interface and a wireless interface (e.g., Wi-Fi, a mobile communication interface, etc.), and is controlled by the processor 801 to transmit and receive data. The memory 803 is a storage device in the computer device for storing programs and data. It will be appreciated that the memory 803 here may include both the built-in memory of the computer device and extended memory supported by the computer device. The memory 803 provides storage space that stores the operating system of the computer device, which may include, but is not limited to: an Android system, an iOS system, a Windows Phone system, etc., which is not limited in this application.

In the present embodiment, the processor 801 performs the following operations by executing executable program code in the memory 803:

In one embodiment, the processor 801 is specifically configured to, when performing gain calculation according to the auditory perception weighting data, the target volume value, the adjustment reference data, and the frequency domain signal to obtain gain data: determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequency and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency; and performing gain calculation according to the relative auditory perception weighting data, the target volume value and the frequency domain signal to obtain gain data.

In one embodiment, the processor 801 is specifically configured to, when performing gain calculation according to the relative auditory sense weighted data, the target volume value, and the frequency domain signal to obtain gain data: acquiring frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies; for any frequency of the plurality of frequencies, acquiring a relative auditory perception weighting coefficient corresponding to the any frequency from the relative auditory perception weighting data, and determining a gain coefficient corresponding to the any frequency according to the relative auditory perception weighting coefficient corresponding to the any frequency and the target volume value; after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

In an embodiment, the processor 801 is specifically configured to, when performing gain processing on the frequency domain signal using the gain data to obtain a gain frequency domain signal: for any frequency of the plurality of frequencies, acquiring a signal value corresponding to the any frequency from the frequency domain signal; weighting the signal value corresponding to any frequency by using the gain coefficient corresponding to any frequency in the gain data to obtain a signal value after gain processing corresponding to any frequency; after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal according to the gain-processed signal values corresponding to each of the frequencies.

In an embodiment, the processor 801 is specifically configured to, when determining the gain coefficient corresponding to the arbitrary frequency according to the relative auditory perception weighting coefficient corresponding to the arbitrary frequency and the target volume value: converting the target volume value to obtain volume control data; and performing power operation by taking the relative auditory perception weighting coefficient corresponding to any frequency as a base number and taking the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency.

In one embodiment, the processor 801 is specifically configured to, when determining the relative auditory sense weighting data based on the auditory sense weighting data and the adjustment reference data: acquiring a plurality of auditory perception weighting coefficients in the auditory perception weighting data, and dividing and calculating any auditory perception weighting coefficient with the adjustment reference data according to any auditory perception weighting coefficient in the auditory perception weighting coefficients to obtain a relative auditory perception weighting coefficient corresponding to the any auditory perception weighting coefficient; after determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, determining relative auditory sense weighting data according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient.

In one embodiment, the processor 801 is specifically configured to, when performing a class analysis process on an original audio signal to obtain audio class information: acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal; and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal.

In a specific implementation, the processor 801, the communication interface 802, and the memory 803 described in the embodiments of the present application may execute the implementation described in the volume adjustment method provided in the embodiments of the present application, or may execute the implementation described in the volume adjustment device provided in the embodiments of the present application, which is not described herein again.

The embodiments of the present application also provide a computer-readable storage medium having a computer program stored therein, which when run on a computer, causes the computer to perform the volume adjustment method according to the embodiments of the present application. The specific implementation manner may refer to the foregoing description, and will not be repeated here.

Embodiments of the present application also provide a computer program product comprising a computer program or computer instructions stored in a computer-readable storage medium. A processor of a computer device reads the computer program or computer instructions from the computer readable storage medium, and the processor executes the computer program or computer instructions to cause the computer device to perform a volume adjustment method as described in embodiments of the present application. The specific implementation manner may refer to the foregoing description, and will not be repeated here.

It should be noted that, for simplicity of description, the foregoing method embodiments are all expressed as a series of action combinations, but it should be understood by those skilled in the art that the present application is not limited by the described order of actions, as some steps may be performed in another order or simultaneously according to the present application. Further, those skilled in the art will also appreciate that the embodiments described in the specification are all preferred embodiments, and that the acts and modules referred to are not necessarily required in the present application.

Those of ordinary skill in the art will appreciate that all or part of the steps in the various methods of the above embodiments may be implemented by a program instructing related hardware. The program may be stored in a computer-readable storage medium, and the storage medium may include: a flash disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disk, and the like.

The foregoing disclosure is only illustrative of some of the embodiments of the present application and is not to be construed as limiting the scope of the appended claims; all changes that come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims

1. A method of volume adjustment, the method comprising:

determining relative auditory sense weighting data based on the auditory sense weighting data and the adjustment reference data; the auditory perception weighting data are used for describing the sensitivity degree of human ears to sounds with different frequencies, and comprise the corresponding relation between signal frequencies and auditory perception weighting coefficients;

acquiring a relative auditory perception weighting coefficient corresponding to the frequency of the frequency domain signal from the relative auditory perception weighting data, performing power operation according to the relative auditory perception weighting coefficient and volume control data corresponding to the target volume value to obtain a gain coefficient corresponding to the frequency, and determining gain data according to the gain coefficient corresponding to the frequency;

2. The method of claim 1, wherein, in the relative auditory perception weighting data, the relative auditory perception weighting coefficient corresponding to a first signal frequency is less than a set value, and the relative auditory perception weighting coefficient corresponding to a second signal frequency is greater than or equal to the set value, the second signal frequency being greater than the first signal frequency.

3. The method according to claim 2, wherein the obtaining the relative auditory sense weighting coefficient corresponding to the frequency of the frequency domain signal from the relative auditory sense weighting data, performing a power operation according to the relative auditory sense weighting coefficient and the volume control data corresponding to the target volume value to obtain the gain coefficient corresponding to the frequency, and determining the gain data according to the gain coefficient corresponding to the frequency, includes:

acquiring frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies;

for any frequency of the plurality of frequencies, acquiring a relative auditory perception weighting coefficient corresponding to the any frequency from the relative auditory perception weighting data;

converting the target volume value to obtain volume control data;

taking the relative auditory perception weighting coefficient corresponding to any frequency as a base, and performing power operation on the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency;

after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.
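The per-frequency power operation recited in claim 3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names and the dictionary representation are hypothetical, and the linear mapping in `volume_to_control` is purely an assumption, since the claim only states that the target volume value is converted into volume control data.

```python
def volume_to_control(target_volume, upper_limit=100.0):
    """Hypothetical conversion of a target volume value into volume control data.

    Assumption: the control value is the fraction by which the target
    exceeds the upper-limit volume. The claim does not specify the mapping.
    """
    return (target_volume - upper_limit) / upper_limit


def gain_coefficients(relative_weights, control):
    """Compute gain(f) = relative_weight(f) ** control for every frequency.

    relative_weights: dict mapping frequency (Hz) -> relative weighting coefficient
    control: volume control data, used as the exponent
    """
    gains = {}
    for freq, weight in relative_weights.items():
        if weight <= 0:
            # A non-positive base with a fractional exponent is not a real number.
            raise ValueError(f"relative weight at {freq} Hz must be positive")
        gains[freq] = weight ** control
    return gains


# Example values are made up for illustration only.
example_weights = {100.0: 0.5, 1000.0: 1.0, 4000.0: 2.0}
example_gains = gain_coefficients(example_weights, volume_to_control(150.0))
```

With a positive exponent, a relative coefficient above 1 yields a gain above 1 and a coefficient below 1 yields a gain below 1, so the boost is concentrated at the frequencies that carry larger relative weights.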

4. The method according to claim 3, wherein the performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal comprises:

for any frequency of the plurality of frequencies, acquiring a signal value corresponding to that frequency from the frequency domain signal;

weighting the signal value corresponding to that frequency by using the gain coefficient corresponding to that frequency in the gain data, to obtain a gain-processed signal value corresponding to that frequency;

after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal according to the gain-processed signal values corresponding to each of the frequencies.
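The per-frequency weighting in claim 4 can be illustrated with a short sketch. It assumes that "weighting" means multiplying each frequency-domain signal value by its gain coefficient, and that the frequency domain signal is held as a mapping from frequency to a complex bin value; both the representation and the name `apply_gain` are hypothetical.

```python
def apply_gain(spectrum, gains):
    """Scale each frequency-domain signal value by its gain coefficient.

    spectrum: dict mapping frequency (Hz) -> complex signal value
    gains: dict mapping frequency (Hz) -> real gain coefficient
    Returns a new dict holding the gain-processed signal values.
    """
    missing = [freq for freq in spectrum if freq not in gains]
    if missing:
        raise KeyError(f"no gain coefficient for frequencies: {missing}")
    # Multiplying by a real gain changes the magnitude and keeps the phase.
    return {freq: value * gains[freq] for freq, value in spectrum.items()}


# Example values are made up for illustration only.
example_spectrum = {100.0: 1 + 1j, 1000.0: 2 + 0j}
example_gained = apply_gain(example_spectrum, {100.0: 0.5, 1000.0: 2.0})
```

Because the gains are real and positive in this sketch, only the magnitude of each bin changes, which is why the gain frequency domain signal can be converted back to a time-domain signal without phase distortion from this step.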

5. The method according to claim 3 or 4, wherein the determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data comprises:

acquiring a plurality of auditory perception weighting coefficients in the auditory perception weighting data, and, for any auditory perception weighting coefficient of the plurality of auditory perception weighting coefficients, performing a division operation on that auditory perception weighting coefficient and the adjustment reference data to obtain a relative auditory perception weighting coefficient corresponding to that auditory perception weighting coefficient;

after determining the relative auditory perception weighting coefficient corresponding to each auditory perception weighting coefficient in the plurality of auditory perception weighting coefficients, determining relative auditory perception weighting data according to the relative auditory perception weighting coefficient corresponding to each auditory perception weighting coefficient.
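The division step in claim 5 can be sketched as below. The sketch assumes that each auditory perception weighting coefficient is the dividend and the adjustment reference data is a single scalar divisor; the claim wording does not fix the direction of the division, and the function name and sample values are hypothetical.

```python
def relative_weighting_data(weights, reference):
    """Normalize auditory perception weighting coefficients by a reference value.

    weights: dict mapping signal frequency (Hz) -> auditory perception weighting coefficient
    reference: adjustment reference data, assumed here to be one scalar
               chosen according to the audio class information
    """
    if reference == 0:
        raise ValueError("adjustment reference data must be non-zero")
    return {freq: coeff / reference for freq, coeff in weights.items()}


# Example values are made up for illustration only.
example_relative = relative_weighting_data(
    {100.0: 0.2, 1000.0: 1.0, 4000.0: 1.3}, reference=0.5
)
```

Under this assumption, frequencies whose coefficient equals the reference map to a relative coefficient of exactly 1, so the reference acts as the pivot between frequencies that are boosted and frequencies that are not.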

6. The method according to any one of claims 1-4, wherein the performing a class analysis process on the original audio signal to obtain audio class information includes:

acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal;

and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal.

7. A volume adjustment device, the device comprising:

the processing unit is further configured to determine relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, acquire a relative auditory perception weighting coefficient corresponding to the frequency of the frequency domain signal from the relative auditory perception weighting data, perform a power operation according to the relative auditory perception weighting coefficient and volume control data corresponding to the target volume value to obtain a gain coefficient corresponding to the frequency, and determine gain data according to the gain coefficient corresponding to the frequency; the auditory perception weighting data describe the sensitivity of human ears to sounds of different frequencies and comprise a correspondence between signal frequencies and auditory perception weighting coefficients;

8. A computer device, comprising a processor, a communication interface and a memory, wherein the processor, the communication interface and the memory are connected to one another, the memory stores executable program code, and the processor is configured to call the executable program code to implement the volume adjustment method according to any one of claims 1-6.

9. A computer-readable storage medium, wherein the computer-readable storage medium stores computer instructions which, when run on a computer, cause the computer to implement the volume adjustment method according to any one of claims 1-6.