CN111796790B - Sound effect adjusting method and device, readable storage medium and terminal equipment - Google Patents

Sound effect adjusting method and device, readable storage medium and terminal equipment Download PDF

Info

Publication number
CN111796790B
CN111796790B CN201910281281.5A CN201910281281A CN111796790B CN 111796790 B CN111796790 B CN 111796790B CN 201910281281 A CN201910281281 A CN 201910281281A CN 111796790 B CN111796790 B CN 111796790B
Authority
CN
China
Prior art keywords
noise
sound effect
signal
training
terminal equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910281281.5A
Other languages
Chinese (zh)
Other versions
CN111796790A (en
Inventor
杨廉萍
吴海全
唐大勇
张恩勤
曹磊
师瑞文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Grandsun Electronics Co Ltd
Original Assignee
Shenzhen Grandsun Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Grandsun Electronics Co Ltd filed Critical Shenzhen Grandsun Electronics Co Ltd
Priority to CN201910281281.5A priority Critical patent/CN111796790B/en
Publication of CN111796790A publication Critical patent/CN111796790A/en
Application granted granted Critical
Publication of CN111796790B publication Critical patent/CN111796790B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/162Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Abstract

The present application relates to the field of sound effect adjustment technologies, and in particular, to a sound effect adjustment method, a sound effect adjustment device, a computer readable storage medium, and a terminal device. The sound effect adjusting method provided by the application comprises the following steps: acquiring a noise signal of an environment where terminal equipment is located, and determining the noise power of the noise signal; acquiring the frequency spectrum characteristic of the noise signal, and determining the noise type of the noise signal according to the frequency spectrum characteristic; and determining a target sound effect matched with the environment according to the noise type and the noise power, and adjusting the current sound effect of the terminal equipment according to the target sound effect so as to automatically adjust the playing sound effect of the terminal equipment according to the noise power and the frequency spectrum characteristic of external environment noise, thereby improving the convenience and the intelligence of the terminal equipment and greatly improving the use experience of users.

Description

Sound effect adjusting method and device, readable storage medium and terminal equipment
Technical Field
The present application relates to the field of sound effect adjustment technologies, and in particular, to a sound effect adjustment method, a sound effect adjustment device, a computer readable storage medium, and a terminal device.
Background
At present, terminal devices such as intelligent sound boxes are becoming important branches in the field of artificial intelligence, and also occupy an important place in the consumer electronics market. Terminal equipment such as present intelligent audio amplifier often provides multiple audio effects and supplies user selection or adjustment etc. under the circumstances of broadcast sound, but in terminal equipment such as current intelligent audio amplifier, selection or adjustment etc. often need the user to carry out by hand, has reduced terminal equipment's such as intelligent audio amplifier convenience and intelligence, has greatly influenced user's use experience.
Disclosure of Invention
The embodiment of the application provides a sound effect adjusting method, a sound effect adjusting device, a computer readable storage medium and terminal equipment, which can realize automatic adjustment of sound effects in the terminal equipment, and improve convenience and intelligence of the terminal equipment so as to improve use experience of users.
In a first aspect of an embodiment of the present application, there is provided an audio adjustment method, including:
acquiring a noise signal of an environment where terminal equipment is located, and determining the noise power of the noise signal;
acquiring the frequency spectrum characteristic of the noise signal, and determining the noise type of the noise signal according to the frequency spectrum characteristic;
and determining a target sound effect matched with the environment according to the noise type and the noise power, and adjusting the current sound effect of the terminal equipment according to the target sound effect.
Further, the acquiring the noise signal of the environment where the terminal device is located includes:
acquiring a first voice signal of an environment where the terminal equipment is located;
determining a second voice signal which is positioned in a preset frequency spectrum range in the first voice signal;
deleting the second voice signal from the first voice signal, and determining the first voice signal from which the second voice signal is deleted as the noise signal.
Preferably, the determining the noise power of the noise signal includes:
carrying out framing treatment on the noise signals, and obtaining vibration amplitude corresponding to the noise signals in each frame;
and summing and accumulating the vibration amplitudes, and determining the noise power of the noise signal according to the result of the summation and accumulation.
Optionally, the acquiring the spectral characteristic of the noise signal includes:
preprocessing the noise signal, and acquiring a noise spectrum of the preprocessed noise signal through fast Fourier change;
converting the noise spectrum into a corresponding Mel spectrum by using a Mel filter;
and filtering and discrete cosine transforming the Mel frequency spectrum to obtain the frequency spectrum characteristic of the noise signal.
Further, the determining a target sound effect matched with the environment according to the noise type and the noise power comprises:
and inputting the noise type and the noise power into the trained sound effect matching model to obtain a matching result output by the sound effect matching model, and determining the matching result as a target sound effect matched with the environment.
Preferably, the training process of the sound effect matching model includes:
acquiring training noise types and training noise power for training, and marking standard matching results of the training noise types and the corresponding training noise power;
respectively inputting each training noise type and corresponding training noise power into an initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model;
determining a training error of the current training according to the training matching result and the standard matching result;
if the training error meets a preset condition, determining that the training of the sound effect matching model is completed;
and if the training errors do not meet the preset conditions, adjusting model parameters of the sound effect matching model, determining the sound effect matching model with the adjusted model parameters as an initial sound effect matching model, and returning to execute the step of respectively inputting each training noise type and corresponding training noise power into the initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model and the subsequent step.
In a second aspect of an embodiment of the present application, there is provided an audio effect adjustment apparatus, including:
the noise power determining module is used for acquiring a noise signal of the environment where the terminal equipment is located and determining the noise power of the noise signal;
the noise type determining module is used for acquiring the frequency spectrum characteristic of the noise signal and determining the noise type of the noise signal according to the frequency spectrum characteristic;
and the sound effect adjusting module is used for determining a target sound effect matched with the environment according to the noise type and the noise power and adjusting the current sound effect of the terminal equipment according to the target sound effect.
In a third aspect of the embodiments of the present application, there is provided a terminal device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, the processor implementing the steps of the sound effect adjustment method according to the first aspect as described above when executing the computer program.
In a fourth aspect of the embodiments of the present application, there is provided a computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the sound effect adjustment method as described in the foregoing first aspect.
From the above technical solutions, the embodiment of the present application has the following advantages:
in the embodiment of the application, the noise signal of the environment where the terminal equipment is located can be firstly obtained, and the noise power of the noise signal can be determined; secondly, the frequency spectrum characteristic of the noise signal can be obtained, and the noise type of the noise signal is determined according to the frequency spectrum characteristic; finally, a target sound effect matched with the environment can be determined according to the noise type and the noise power, and the current sound effect of the terminal equipment can be adjusted according to the target sound effect. In the embodiment of the application, the playing sound effect of the terminal equipment can be automatically adjusted according to the noise power and the frequency spectrum characteristic of the external environment noise, and the convenience and the intelligence of the terminal equipment are improved, so that the use experience of a user is greatly improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of an embodiment of a method for adjusting sound effects according to an embodiment of the present application;
fig. 2 is a schematic flow chart of acquiring a noise signal in an application scenario according to an embodiment of the present application;
fig. 3 is a schematic flow chart of acquiring spectral characteristics under an application scenario in an audio adjustment method according to an embodiment of the present application;
FIG. 4 is a schematic flow chart of training an audio matching model in an application scenario according to an embodiment of the present application;
FIG. 5 is a block diagram of an embodiment of an audio adjustment device according to an embodiment of the present application;
fig. 6 is a schematic diagram of a terminal device according to an embodiment of the present application.
Detailed Description
The embodiment of the application provides a sound effect adjusting method, a sound effect adjusting device, a computer readable storage medium and terminal equipment, which are used for realizing automatic adjustment of sound effects in the terminal equipment and improving convenience and intelligence of the terminal equipment so as to improve use experience of a user.
In order to make the objects, features and advantages of the present application more comprehensible, the technical solutions in the embodiments of the present application are described in detail below with reference to the accompanying drawings, and it is apparent that the embodiments described below are only some embodiments of the present application, but not all embodiments of the present application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
Referring to fig. 1, an embodiment of the present application provides an audio adjustment method, which includes:
step S101, acquiring a noise signal of an environment where terminal equipment is located, and determining the noise power of the noise signal;
the execution main body of the embodiment of the application is terminal equipment, for example, the execution main body can be an intelligent sound box, a microphone array is arranged in the terminal equipment such as the intelligent sound box, and the microphone array can collect voice signals in the current environment of the terminal equipment such as the intelligent sound box from various angles and can send the collected voice signals to a processing device in the terminal equipment such as the intelligent sound box, and the processing device can extract noise signals from the voice signals and determine noise power corresponding to the noise signals.
Specifically, as shown in fig. 2, in the embodiment of the present application, the acquiring the noise signal of the environment where the terminal device is located may include:
step S201, a first voice signal of the environment where the terminal equipment is located is obtained;
it can be understood that when terminal devices such as an intelligent sound box play or begin to play music, the microphone array in the terminal devices such as the intelligent sound box can collect a first voice signal in the current environment of the terminal devices such as the intelligent sound box, wherein the collected first voice signal can include a noise signal in an external environment and a second voice signal generated by playing music in the terminal devices such as the intelligent sound box.
Step S202, determining a second voice signal which is positioned in a preset frequency spectrum range in the first voice signal;
step S203, deleting the second voice signal from the first voice signal, and determining the first voice signal from which the second voice signal is deleted as the noise signal.
For the above steps S202 and S203, in the embodiment of the present application, the sound effect adjustment is mainly to adjust the music playing effect of the terminal device such as the intelligent sound box according to the external environmental noise in the current environment of the terminal device such as the intelligent sound box, so when the first voice signal collected by the microphone array includes the second voice signal generated by playing music in the terminal device such as the intelligent sound box, the second voice signal can be filtered, that is, the second voice signal can be deleted from the first voice signal, for example, the spectral component corresponding to the second voice signal can be filtered and deleted by a specific filter, and the first voice signal after the second voice signal is filtered and deleted can be determined as the external noise signal of the current environment of the terminal device such as the intelligent sound box.
Here, the second voice signal is a normal music signal played by a terminal device such as an intelligent sound box, where a spectrum of the normal music signal is located in a preset spectrum range, and a determining process of the preset spectrum range may specifically be: and placing terminal equipment such as an intelligent sound box and the like in an experimental environment without external noise for music playing, collecting voice signals generated by playing, and determining the preset frequency spectrum range by analyzing the voice signals generated by playing.
Further, in an embodiment of the present application, the determining the noise power of the noise signal may include:
step a, carrying out framing treatment on the noise signals, and obtaining vibration amplitude corresponding to the noise signals in each frame;
and b, summing and accumulating the vibration amplitudes, and determining the noise power of the noise signal according to the result of summing and accumulating.
For the above steps a and b, it may be understood that after determining the noise signal corresponding to the external environment, the vibration amplitude corresponding to the noise signal may be further obtained, so that the noise power of the noise signal may be determined according to the vibration amplitude, for example, the noise signal may be first subjected to framing processing, then the vibration amplitude corresponding to the noise signal in each frame may be obtained through fourier transformation, and finally the vibration amplitudes corresponding to each frame may be summed and accumulated, and the result obtained by the summation and accumulation may be determined as the noise power of the noise signal, where the obtained noise power may be used to characterize the volume of noise.
Step S102, acquiring the frequency spectrum characteristic of the noise signal, and determining the noise type of the noise signal according to the frequency spectrum characteristic;
in the embodiment of the application, after the noise signal of the current environment of the terminal equipment such as the intelligent sound box is obtained, the frequency spectrum characteristic of the noise signal can be further obtained, for example, the frequency spectrum characteristic of the noise signal can be obtained through a Mel frequency cepstrum coefficient method, so that the noise type of the noise signal can be determined according to the frequency spectrum characteristic, and the environment type of the current environment of the terminal equipment such as the intelligent sound box can be determined.
Further, as shown in fig. 3, in the embodiment of the present application, the obtaining the spectral characteristic of the noise signal may include:
step S301, preprocessing the noise signal, and acquiring a noise spectrum of the preprocessed noise signal through fast Fourier change;
step S302, converting the noise spectrum into a corresponding Mel spectrum by using a Mel filter;
step S303, filtering and discrete cosine transforming the Mel spectrum to obtain the spectrum characteristic of the noise signal.
As to the above steps S301 to S303, it may be understood that, when obtaining the spectral characteristics of the noise signal, the noise signal may be first subjected to preprocessing such as windowing and framing, then, for each analysis window, the noise spectrum in the analysis window may be obtained through fast fourier transform, after obtaining the noise spectrum in each analysis window, each noise spectrum may be converted into a corresponding Mel spectrum by using a Mel filter bank, and finally, the Mel frequency cepstrum coefficient corresponding to each Mel spectrum may be obtained by filtering and discrete cosine transforming the Mel spectrum, thereby obtaining the spectral characteristics corresponding to the noise signal.
Here, after determining the spectral characteristics of the noise signal, the noise type of the noise signal may be determined according to the spectral characteristics, such as determining whether the noise type of the noise signal is natural white noise or speech mechanical noise, or the like.
It should be noted that the spectrum characteristics may include a frequency coverage range, a degree of change of the spectrum characteristics with time, and the like, where the noise signal may be classified into full-band noise and narrow-band noise according to the frequency coverage range, and may be classified into periodic noise, impulse noise, stationary noise, and mild variable noise according to the degree of change of the spectrum characteristics with time. In the embodiment of the application, the noise type of the noise signal is determined according to the spectrum characteristics, mainly according to the spectrum coverage and the degree of change of the spectrum characteristics along with time, for example, when the noise signal is determined to be full-band noise and stable noise, the noise type of the noise signal can be determined to be natural white noise, and the environment where terminal equipment such as an intelligent sound box and the like are currently located can be determined to be natural white noise environment; and when the noise signal is determined to be the narrow-band noise and the periodic noise, the noise type of the noise signal can be determined to be the voice mechanical noise, and the environment where the terminal equipment such as the intelligent sound box and the like is currently located can be determined to be the voice mechanical noise environment.
And step 103, determining a target sound effect matched with the environment according to the noise type and the noise power, and adjusting the current sound effect of the terminal equipment according to the target sound effect.
In the embodiment of the application, after the noise type of the noise signal is determined and the noise power of the noise signal is obtained, the target sound effect matched with the environment can be determined according to the noise type and the noise power, so that the current sound effect of terminal equipment such as an intelligent sound box can be adjusted according to the target sound effect, and the playing effect of the terminal equipment such as the intelligent sound box can be improved.
It should be noted that, the sound effects may include a volume, a heavy bass effect, and a music equalizing effect that is most suitable for receiving by the human ear in different environments, where the volume in the target sound effect is related to the noise power, and the heavy bass effect and the music equalizing effect are related to the noise type, for example, the heavy bass effect required by the natural white noise environment is stronger, and the sound wave band of the intermediate frequency part is most suitable for receiving by the human ear in the speech mechanical noise environment.
Specifically, the volume in the target sound effect can be determined according to the noise power and the current playing sound power of the terminal equipment such as the intelligent sound box, when the noise power exceeds or is close to the current playing sound power of the terminal equipment such as the intelligent sound box, if the noise power reaches 70% or more of the sound power, the noise power can be considered to be close to the sound power, namely, the current playing volume of the terminal equipment such as the intelligent sound box needs to be adjusted, and at the moment, the volume in the determined target sound effect can be larger than the current playing volume of the terminal equipment such as the intelligent sound box.
When the noise type is determined to be natural white noise, the determined heavy bass effect of the target sound effect is stronger, the heavy bass effect in the current playing of the terminal equipment such as the intelligent sound box can be enhanced according to the target sound effect, meanwhile, the volume in the target sound effect can be determined according to the noise power and the current playing sound power of the terminal equipment such as the intelligent sound box, so that the current playing volume of the terminal equipment such as the intelligent sound box can be adjusted according to the determined volume, and if the noise power exceeds or is close to the sound power, the current playing volume can be increased, and the sound power of the current playing volume exceeds the noise power; when the noise type is determined to be voice mechanical noise, the determined voice wave band of the middle frequency part of the target sound effect is higher, at the moment, the voice wave band of the middle frequency part in the current playing music of the terminal equipment such as the intelligent sound box can be automatically lifted according to the target sound effect, the voice wave bands of the high frequency part and the low frequency part can be properly reduced, and meanwhile, the volume of the target sound effect can be determined according to the noise power and the current playing voice power of the terminal equipment such as the intelligent sound box, so that the current playing volume of the terminal equipment such as the intelligent sound box can be adjusted according to the determined volume.
It can be understood that in an application scenario, a corresponding relation between a target sound effect and a noise type and a corresponding noise power may be preset, and the set corresponding relation is stored in a storage device of a terminal device such as an intelligent sound box, so when the noise type and the noise power are determined, the corresponding target sound effect may be found according to the corresponding relation, and then the current sound effect of the terminal device such as the intelligent sound box is adjusted according to the found target sound effect. In another application scenario, the corresponding target sound effect can be matched by constructing a sound effect matching model, wherein the input parameters of the sound effect matching model are noise type and noise power, and the output result is specific sound effect type.
Specifically, in the embodiment of the present application, the determining, according to the noise type and the noise power, the target sound effect matched with the environment may include:
and inputting the noise type and the noise power into the trained sound effect matching model to obtain a matching result output by the sound effect matching model, and determining the matching result as a target sound effect matched with the environment.
Further, as shown in fig. 4, in an embodiment of the present application, the training process of the sound effect matching model may include:
step S401, obtaining training noise types and training noise power for training, and marking standard matching results of the training noise types and the corresponding training noise power;
step S402, inputting the training noise type and the corresponding training noise power into an initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model;
step S403, determining a training error of the current training according to the training matching result and the standard matching result;
step S404, judging whether the training error meets a preset condition;
step 405, if the training error meets the preset condition, determining that the training of the sound effect matching model is completed;
step S406, if the training error does not meet the preset condition, adjusting the model parameters of the sound effect matching model, determining the sound effect matching model after the model parameters are adjusted as an initial sound effect matching model, and returning to execute the step of inputting each training noise type and the corresponding training noise power into the initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model and the subsequent step.
For the above step S401, it may be understood that before the sound effect matching model is trained, the training noise type and the training noise power for training need to be obtained in advance, and the standard matching result of each training noise type and the corresponding training noise power may be marked according to multiple test results, for example, music playing may be performed in the noise environment corresponding to each training noise type and each training noise power, and sound effect adjustment may be performed according to feedback of the relevant person on the music playing effect, so as to determine the sound effect corresponding to the best playing effect fed back by the relevant person as the standard matching result corresponding to the training noise type and the corresponding training noise power.
For the step S402, after the standard matching results of each training noise type and the corresponding training noise power are marked, each training noise type and the corresponding training noise power may be input into the initial sound effect matching model, so as to obtain the training matching result of each training noise type and the corresponding training noise power output by the initial sound effect matching model.
For the above steps S403 and S404, after each training matching result is obtained, a training error of the current training may be calculated according to the training matching result and the standard matching result, and whether the training error meets a preset condition may be determined, for example, whether the training error is less than 5%. The preset condition may be determined when a specific sound effect matching model is trained, for example, the preset condition may be set such that the training error is smaller than a specific threshold.
For the above step S405 and step S406, it may be understood that when the training error meets the preset condition, for example, when the training error is less than 5%, it may be determined that the training of the sound effect matching model is completed. When the training error does not meet the preset condition, for example, when the training error is 8%, the model parameters of the sound effect matching model can be adjusted, the sound effect matching model with the adjusted model parameters can be used as an initial sound effect matching model, then training is performed again, and training errors of subsequent training are minimized until the training error meets the preset condition by repeatedly adjusting the model parameters of the sound effect matching model and performing training for a plurality of times.
In this scene, after constructing with training audio matching model, accessible audio matching model confirms with terminal equipment such as intelligent audio amplifier present environment assorted target audio for can accurately, carry out the regulation of audio fast, improve the accuracy and the efficiency of audio regulation, realize terminal equipment such as intelligent audio amplifier's convenience and intelligence, thereby improve user's use experience.
In the embodiment of the application, the noise signal of the environment where the terminal equipment is located can be firstly obtained, and the noise power of the noise signal can be determined; secondly, the frequency spectrum characteristic of the noise signal can be obtained, and the noise type of the noise signal is determined according to the frequency spectrum characteristic; finally, a target sound effect matched with the environment can be determined according to the noise type and the noise power, and the current sound effect of the terminal equipment can be adjusted according to the target sound effect. In the embodiment of the application, the playing sound effect of the terminal equipment can be automatically adjusted according to the noise power and the frequency spectrum characteristic of the external environment noise, and the convenience and the intelligence of the terminal equipment are improved, so that the use experience of a user is greatly improved.
It should be understood that the sequence number of each step in the foregoing embodiment does not mean that the execution sequence of each process should be determined by the function and the internal logic, and should not limit the implementation process of the embodiment of the present application.
The foregoing has mainly described a sound effect adjusting method, and a sound effect adjusting apparatus will be described in detail.
Fig. 5 is a block diagram showing an embodiment of an audio adjusting apparatus according to an embodiment of the present application. As shown in fig. 5, the sound effect adjusting device includes:
a noise power determining module 501, configured to obtain a noise signal of an environment where a terminal device is located, and determine a noise power of the noise signal;
a noise type determining module 502, configured to obtain a spectral characteristic of the noise signal, and determine a noise type of the noise signal according to the spectral characteristic;
and the sound effect adjusting module 503 is configured to determine a target sound effect matched with the environment according to the noise type and the noise power, and adjust the current sound effect of the terminal device according to the target sound effect.
Further, the noise power determining module 501 may include:
the first voice signal acquisition unit is used for acquiring a first voice signal of the environment where the terminal equipment is located;
a second voice signal determining unit, configured to determine a second voice signal located in a preset spectrum range in the first voice signal;
and a noise signal determining unit configured to delete the second speech signal from the first speech signal and determine the first speech signal from which the second speech signal is deleted as the noise signal.
Preferably, the noise power determining module 501 may further include:
the vibration amplitude acquisition unit is used for carrying out framing processing on the noise signals and acquiring vibration amplitudes corresponding to the noise signals in each frame;
and the noise power determining unit is used for carrying out summation accumulation on the vibration amplitudes and determining the noise power of the noise signal according to the summation accumulation result.
Optionally, the noise type determining module 502 may include:
the noise spectrum acquisition unit is used for preprocessing the noise signals and acquiring the noise spectrum of the preprocessed noise signals through fast Fourier change;
a noise spectrum conversion unit for converting the noise spectrum into a corresponding Mel spectrum by using a Mel filter;
and the spectrum characteristic acquisition unit is used for filtering the Mel spectrum and performing discrete cosine transform to obtain the spectrum characteristic of the noise signal.
Further, the sound effect adjusting module 503 may include:
the target sound effect determining unit is used for inputting the noise type and the noise power into the trained sound effect matching model to obtain a matching result output by the sound effect matching model, and determining the matching result as a target sound effect matched with the environment.
Preferably, the sound effect adjusting module 503 may further include:
the training noise acquisition unit is used for acquiring training noise types and training noise power for training and marking standard matching results of the training noise types and the corresponding training noise power;
the training unit is used for respectively inputting the training noise type and the corresponding training noise power into an initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model;
the training error determining unit is used for determining the training error of the current training according to the training matching result and the standard matching result;
the training completion determining unit is used for determining that the training of the sound effect matching model is completed if the training error meets a preset condition;
and the model parameter adjusting unit is used for adjusting the model parameters of the sound effect matching model if the training errors do not meet the preset conditions, determining the sound effect matching model after the model parameters are adjusted as an initial sound effect matching model, and returning to execute the step of respectively inputting the training noise types and the corresponding training noise power into the initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model and the subsequent step.
Fig. 6 is a schematic diagram of a terminal device according to an embodiment of the present application. As shown in fig. 6, the terminal device 6 of this embodiment includes: a processor 60, a memory 61 and a computer program 62, such as a sound effect modifier program, stored in the memory 61 and executable on the processor 60. The steps of the various embodiments of the sound effect adjustment method described above, such as steps S101 to S103 shown in fig. 1, are implemented when the processor 60 executes the computer program 62. Alternatively, the processor 60, when executing the computer program 62, performs the functions of the modules/units of the apparatus embodiments described above, such as the functions of modules 501-503 shown in fig. 5.
Illustratively, the computer program 62 may be partitioned into one or more modules/units that are stored in the memory 61 and executed by the processor 60 to complete the present application. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions for describing the execution of the computer program 62 in the terminal device 6. For example, the computer program 62 may be divided into a noise power determination module, a noise type determination module, and an audio effect adjustment module, each of which functions specifically as follows:
the noise power determining module is used for acquiring a noise signal of the environment where the terminal equipment is located and determining the noise power of the noise signal;
the noise type determining module is used for acquiring the frequency spectrum characteristic of the noise signal and determining the noise type of the noise signal according to the frequency spectrum characteristic;
and the sound effect adjusting module is used for determining a target sound effect matched with the environment according to the noise type and the noise power and adjusting the current sound effect of the terminal equipment according to the target sound effect.
The terminal device 6 may be a desktop computer, a notebook computer, a palm computer, an intelligent speaker, a robot, or the like, which can output sound. The terminal device may include, but is not limited to, a processor 60, a memory 61. It will be appreciated by those skilled in the art that fig. 6 is merely an example of the terminal device 6 and does not constitute a limitation of the terminal device 6, and may include more or less components than illustrated, or may combine certain components, or different components, e.g., the terminal device may further include an input-output device, a network access device, a bus, etc.
The processor 60 may be a central processing unit (Central Processing Unit, CPU), but may also be other general purpose processors, digital signal processors (Digital Signal Processor, DSPs), application specific integrated circuits (Application Specific Integrated Circuit, ASICs), off-the-shelf programmable gate arrays (Field-Programmable Gate Array, FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The memory 61 may be an internal storage unit of the terminal device 6, such as a hard disk or a memory of the terminal device 6. The memory 61 may be an external storage device of the terminal device 6, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card) or the like, which are provided on the terminal device 6. Further, the memory 61 may also include both an internal storage unit and an external storage device of the terminal device 6. The memory 61 is used for storing the computer program and other programs and data required by the terminal device. The memory 61 may also be used for temporarily storing data that has been output or is to be output.
It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein.
In the foregoing embodiments, the descriptions of the embodiments are emphasized, and in part, not described or illustrated in any particular embodiment, reference is made to the related descriptions of other embodiments.
Those of ordinary skill in the art will appreciate that the modules, units, and/or method steps of the various embodiments described in connection with the embodiments disclosed herein can be implemented as electronic hardware, or as a combination of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices, and methods may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the present application may implement all or part of the flow of the method of the above embodiment, or may be implemented by a computer program to instruct related hardware, where the computer program may be stored in a computer readable storage medium, and when the computer program is executed by a processor, the computer program may implement the steps of each of the method embodiments described above. Wherein the computer program comprises computer program code which may be in source code form, object code form, executable file or some intermediate form etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth. It should be noted that the computer readable medium contains content that can be appropriately scaled according to the requirements of jurisdictions in which such content is subject to legislation and patent practice, such as in certain jurisdictions in which such content is subject to legislation and patent practice, the computer readable medium does not include electrical carrier signals and telecommunication signals.
The above embodiments are only for illustrating the technical solution of the present application, and not for limiting the same; although the application has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (8)

1. A method of sound effect adjustment, comprising:
acquiring a noise signal of an environment where terminal equipment is located, and determining the noise power of the noise signal; the obtaining the noise signal of the environment where the terminal equipment is located includes:
acquiring a first voice signal of an environment where the terminal equipment is located; the acquired first voice signals comprise noise signals in an external environment and second voice signals generated by playing music in the terminal equipment;
determining a second voice signal which is positioned in a preset frequency spectrum range in the first voice signal; the second voice signal is a normal music signal played by the terminal equipment, and the frequency spectrum of the normal music signal is located in a preset frequency spectrum range; the determining process of the preset frequency spectrum range comprises the following steps: the terminal equipment is placed in an experimental environment without external noise for music playing, voice signals generated by playing are collected, and the preset frequency spectrum range is determined by analyzing the voice signals generated by playing;
deleting the second voice signal from the first voice signal, and determining the first voice signal from which the second voice signal is deleted as the noise signal;
acquiring the frequency spectrum characteristic of the noise signal, and determining the noise type of the noise signal according to the frequency spectrum characteristic;
determining a target sound effect matched with the environment according to the noise type and the noise power, and adjusting the current sound effect of the terminal equipment according to the target sound effect; the sound effects comprise sound volume, heavy bass effect and music equalization effect which is most suitable for receiving by human ears under different environments, wherein the sound volume in the target sound effect is related to noise power, and the heavy bass effect and the music equalization effect are related to noise types; when the noise type is determined to be natural white noise, enhancing the heavy bass effect in the current playing of the terminal equipment; when the noise type is determined to be voice mechanical noise, the voice wave band of the middle frequency part in the current playing music of the terminal equipment is lifted, and the voice wave bands of the high frequency part and the low frequency part are properly reduced.
2. The sound effect adjustment method of claim 1, wherein said determining the noise power of the noise signal comprises:
carrying out framing treatment on the noise signals, and obtaining vibration amplitude corresponding to the noise signals in each frame;
and summing and accumulating the vibration amplitudes, and determining the noise power of the noise signal according to the result of the summation and accumulation.
3. The sound effect adjusting method according to claim 1, wherein the acquiring the spectral characteristics of the noise signal includes:
preprocessing the noise signal, and acquiring a noise spectrum of the preprocessed noise signal through fast Fourier change;
converting the noise spectrum into a corresponding Mel spectrum by using a Mel filter;
and filtering and discrete cosine transforming the Mel frequency spectrum to obtain the frequency spectrum characteristic of the noise signal.
4. A sound effect adjustment method according to any one of claims 1 to 3, characterized in that said determining a target sound effect matching the environment from the noise type and the noise power comprises:
and inputting the noise type and the noise power into the trained sound effect matching model to obtain a matching result output by the sound effect matching model, and determining the matching result as a target sound effect matched with the environment.
5. The sound effect adjustment method of claim 4, wherein the training process of the sound effect matching model comprises:
acquiring training noise types and training noise power for training, and marking standard matching results of the training noise types and the corresponding training noise power;
respectively inputting each training noise type and corresponding training noise power into an initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model;
determining a training error of the current training according to the training matching result and the standard matching result;
if the training error meets a preset condition, determining that the training of the sound effect matching model is completed;
and if the training errors do not meet the preset conditions, adjusting model parameters of the sound effect matching model, determining the sound effect matching model with the adjusted model parameters as an initial sound effect matching model, and returning to execute the step of respectively inputting each training noise type and corresponding training noise power into the initial sound effect matching model to obtain a training matching result output by the initial sound effect matching model and the subsequent step.
6. An audio conditioning apparatus, comprising:
the noise power determining module is used for acquiring a noise signal of the environment where the terminal equipment is located and determining the noise power of the noise signal; wherein, the noise power determination module includes:
the first voice signal acquisition unit is used for acquiring a first voice signal of the environment where the terminal equipment is located; the acquired first voice signals comprise noise signals in an external environment and second voice signals generated by playing music in the terminal equipment;
a second voice signal determining unit, configured to determine a second voice signal located in a preset spectrum range in the first voice signal; the second voice signal is a normal music signal played by the terminal equipment, and the frequency spectrum of the normal music signal is located in a preset frequency spectrum range; the determining process of the preset frequency spectrum range comprises the following steps: the terminal equipment is placed in an experimental environment without external noise for music playing, voice signals generated by playing are collected, and the preset frequency spectrum range is determined by analyzing the voice signals generated by playing;
a noise signal determination unit configured to delete the second voice signal from the first voice signal and determine the first voice signal from which the second voice signal is deleted as the noise signal;
the noise type determining module is used for acquiring the frequency spectrum characteristic of the noise signal and determining the noise type of the noise signal according to the frequency spectrum characteristic;
the sound effect adjusting module is used for determining a target sound effect matched with the environment according to the noise type and the noise power and adjusting the current sound effect of the terminal equipment according to the target sound effect; the sound effects comprise sound volume, heavy bass effect and music equalization effect which is most suitable for receiving by human ears under different environments, wherein the sound volume in the target sound effect is related to noise power, and the heavy bass effect and the music equalization effect are related to noise types; when the noise type is determined to be natural white noise, enhancing the heavy bass effect in the current playing of the terminal equipment; when the noise type is determined to be voice mechanical noise, the voice wave band of the middle frequency part in the current playing music of the terminal equipment is lifted, and the voice wave bands of the high frequency part and the low frequency part are properly reduced.
7. Terminal device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the sound effect adjustment method according to any one of claims 1 to 5 when the computer program is executed.
8. A computer-readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the sound effect adjustment method according to any one of claims 1 to 5.
CN201910281281.5A 2019-04-09 2019-04-09 Sound effect adjusting method and device, readable storage medium and terminal equipment Active CN111796790B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910281281.5A CN111796790B (en) 2019-04-09 2019-04-09 Sound effect adjusting method and device, readable storage medium and terminal equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910281281.5A CN111796790B (en) 2019-04-09 2019-04-09 Sound effect adjusting method and device, readable storage medium and terminal equipment

Publications (2)

Publication Number Publication Date
CN111796790A CN111796790A (en) 2020-10-20
CN111796790B true CN111796790B (en) 2023-09-08

Family

ID=72805158

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910281281.5A Active CN111796790B (en) 2019-04-09 2019-04-09 Sound effect adjusting method and device, readable storage medium and terminal equipment

Country Status (1)

Country Link
CN (1) CN111796790B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112506341B (en) * 2020-12-01 2022-05-03 瑞声新能源发展(常州)有限公司科教城分公司 Vibration effect generation method and device, terminal equipment and storage medium
CN113793623B (en) * 2021-08-17 2023-08-18 咪咕音乐有限公司 Sound effect setting method, device, equipment and computer readable storage medium
CN114666706A (en) * 2021-11-30 2022-06-24 北京达佳互联信息技术有限公司 Sound effect enhancement method, device and system
CN114449339B (en) * 2022-02-16 2024-04-12 深圳万兴软件有限公司 Background sound effect conversion method and device, computer equipment and storage medium

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104064196A (en) * 2014-06-20 2014-09-24 哈尔滨工业大学深圳研究生院 Method for improving speech recognition accuracy on basis of voice leading end noise elimination
CN104104800A (en) * 2014-07-25 2014-10-15 Tcl通讯(宁波)有限公司 Method for adjusting volume output of mobile terminal according to environment noises and mobile terminal
CN105448289A (en) * 2015-11-16 2016-03-30 努比亚技术有限公司 Speech synthesis method, speech synthesis device, speech deletion method, speech deletion device and speech deletion and synthesis method
CN105827853A (en) * 2016-04-29 2016-08-03 维沃移动通信有限公司 Method for controlling terminal loudspeaker volume output and mobile terminal
CN107371102A (en) * 2017-08-18 2017-11-21 广东欧珀移动通信有限公司 Control method, device and the storage medium and mobile terminal of audio broadcast sound volume
CN107766030A (en) * 2017-11-13 2018-03-06 百度在线网络技术(北京)有限公司 Volume adjusting method, device, equipment and computer-readable medium
CN107786709A (en) * 2017-11-09 2018-03-09 广东欧珀移动通信有限公司 Call noise-reduction method, device, terminal device and computer-readable recording medium
CN107910013A (en) * 2017-11-10 2018-04-13 广东欧珀移动通信有限公司 The output processing method and device of a kind of voice signal
CN108012035A (en) * 2017-12-26 2018-05-08 奇酷互联网络科技(深圳)有限公司 The tinkle of bells adjusting method, device, readable storage medium storing program for executing and intelligent terminal
CN108766454A (en) * 2018-06-28 2018-11-06 浙江飞歌电子科技有限公司 A kind of voice noise suppressing method and device
CN109327752A (en) * 2018-11-01 2019-02-12 深圳市智驾实业有限公司 Bluetooth sound pick-up, mobile unit and automobile
CN109559755A (en) * 2018-12-25 2019-04-02 沈阳品尚科技有限公司 A kind of sound enhancement method based on DNN noise classification

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI252049B (en) * 2004-07-23 2006-03-21 Inventec Corp Sound control system and method

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104064196A (en) * 2014-06-20 2014-09-24 哈尔滨工业大学深圳研究生院 Method for improving speech recognition accuracy on basis of voice leading end noise elimination
CN104104800A (en) * 2014-07-25 2014-10-15 Tcl通讯(宁波)有限公司 Method for adjusting volume output of mobile terminal according to environment noises and mobile terminal
CN105448289A (en) * 2015-11-16 2016-03-30 努比亚技术有限公司 Speech synthesis method, speech synthesis device, speech deletion method, speech deletion device and speech deletion and synthesis method
CN105827853A (en) * 2016-04-29 2016-08-03 维沃移动通信有限公司 Method for controlling terminal loudspeaker volume output and mobile terminal
CN107371102A (en) * 2017-08-18 2017-11-21 广东欧珀移动通信有限公司 Control method, device and the storage medium and mobile terminal of audio broadcast sound volume
CN107786709A (en) * 2017-11-09 2018-03-09 广东欧珀移动通信有限公司 Call noise-reduction method, device, terminal device and computer-readable recording medium
CN107910013A (en) * 2017-11-10 2018-04-13 广东欧珀移动通信有限公司 The output processing method and device of a kind of voice signal
CN107766030A (en) * 2017-11-13 2018-03-06 百度在线网络技术(北京)有限公司 Volume adjusting method, device, equipment and computer-readable medium
CN108012035A (en) * 2017-12-26 2018-05-08 奇酷互联网络科技(深圳)有限公司 The tinkle of bells adjusting method, device, readable storage medium storing program for executing and intelligent terminal
CN108766454A (en) * 2018-06-28 2018-11-06 浙江飞歌电子科技有限公司 A kind of voice noise suppressing method and device
CN109327752A (en) * 2018-11-01 2019-02-12 深圳市智驾实业有限公司 Bluetooth sound pick-up, mobile unit and automobile
CN109559755A (en) * 2018-12-25 2019-04-02 沈阳品尚科技有限公司 A kind of sound enhancement method based on DNN noise classification

Also Published As

Publication number Publication date
CN111796790A (en) 2020-10-20

Similar Documents

Publication Publication Date Title
CN111796790B (en) Sound effect adjusting method and device, readable storage medium and terminal equipment
CN110265052B (en) Signal-to-noise ratio determining method and device for radio equipment, storage medium and electronic device
CN109643554A (en) Adaptive voice Enhancement Method and electronic equipment
CN110706693B (en) Method and device for determining voice endpoint, storage medium and electronic device
CN110111769B (en) Electronic cochlea control method and device, readable storage medium and electronic cochlea
CN112565981B (en) Howling suppression method, howling suppression device, hearing aid, and storage medium
CN112820315A (en) Audio signal processing method, audio signal processing device, computer equipment and storage medium
CN110428835B (en) Voice equipment adjusting method and device, storage medium and voice equipment
CN108234793B (en) Communication method, communication device, electronic equipment and storage medium
CN109905808B (en) Method and apparatus for adjusting intelligent voice device
CN112562624B (en) Active noise reduction filter design method, noise reduction method, system and electronic equipment
CN110956973A (en) Echo cancellation method and device and intelligent terminal
CN107509155A (en) A kind of bearing calibration of array microphone, device, equipment and storage medium
CN116405823B (en) Intelligent audio denoising enhancement method for bone conduction earphone
JP2017062460A (en) System modelling properties of musical instrument
JP2017090888A (en) Method for modeling characteristic of instrument
CN114187922A (en) Audio detection method and device and terminal equipment
CN107481732B (en) Noise reduction method and device in spoken language evaluation and terminal equipment
US11297430B2 (en) Terminal device, apparatus and method for simulating the characteristic of an audio processor, and computer-readable storage medium
CN117133307A (en) Low-power consumption mono voice noise reduction method, computer device and computer readable storage medium
CN1355916A (en) Signal noise reduction by time-domain spectral substraction
CN111968620A (en) Algorithm testing method and device, electronic equipment and storage medium
CN110992969B (en) Filter bank configuration method and device of electronic cochlea
CN112233693B (en) Sound quality evaluation method, device and equipment
CN113657026A (en) Simulation design method, device, equipment and storage medium of filter

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant