WO2023070899A1 - Self-adaptive noise reduction method, system and device, and storage medium - Google Patents

Self-adaptive noise reduction method, system and device, and storage medium Download PDF

Info

Publication number
WO2023070899A1
WO2023070899A1 PCT/CN2021/138987 CN2021138987W WO2023070899A1 WO 2023070899 A1 WO2023070899 A1 WO 2023070899A1 CN 2021138987 W CN2021138987 W CN 2021138987W WO 2023070899 A1 WO2023070899 A1 WO 2023070899A1
Authority
WO
WIPO (PCT)
Prior art keywords
noise reduction
noise
parameters
adaptive
audio data
Prior art date
Application number
PCT/CN2021/138987
Other languages
French (fr)
Chinese (zh)
Inventor
刘际滨
Original Assignee
歌尔科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 歌尔科技有限公司 filed Critical 歌尔科技有限公司
Publication of WO2023070899A1 publication Critical patent/WO2023070899A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1785Methods, e.g. algorithms; Devices
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1787General system configurations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/10Applications
    • G10K2210/108Communication systems, e.g. where useful sound is kept and noise is cancelled
    • G10K2210/1081Earphones, e.g. for telephones, ear protectors or headsets
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/30Means
    • G10K2210/301Computational
    • G10K2210/3038Neural networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/01Hearing devices using active noise cancellation

Definitions

  • the present application relates to the field of active noise reduction earphones, in particular to an adaptive noise reduction method, system, device and storage medium.
  • Noise-cancelling headphones are divided into active noise-cancelling headphones and passive noise-cancelling headphones according to the different noise-cancelling principles they adopt. Among them, active noise-cancelling headphones are divided into three types: feed-forward noise reduction, feedback noise reduction, and hybrid noise reduction. Active noise-canceling headphones generally receive ambient noise through one or more microphones, and then generate a signal that is in phase opposite to the noise sound wave through an electronic circuit to eliminate external noise entering the ear canal. However, the noise recognition rate of the prior art is low, which seriously affects the noise reduction effect of the earphone.
  • embodiments of the present application provide an adaptive noise reduction method, system, device, and storage medium to solve the problem of low noise recognition rate.
  • An embodiment of the present application provides an adaptive noise reduction method, the method comprising:
  • the anti-phase sound wave signal is generated by using the noise reduction parameters to perform adaptive noise reduction on the current noise audio data.
  • the step of obtaining the noise reduction parameters corresponding to the current noise scene includes:
  • the noise frequency domain response is debugged through cascaded filters to obtain noise reduction parameters corresponding to different noise scenarios.
  • the noise frequency domain response is debugged in the form of a cascaded filter
  • the noise reduction parameters corresponding to different noise scenarios it also includes:
  • noise reduction parameters to eliminate and verify the scene audio data of different noise scenes, and obtain the noise reduction error value
  • the noise reduction parameter is optimized by using the noise reduction error value until the number of reverse iterations reaches the preset number, then the optimization of the noise reduction parameter is stopped, the noise reduction optimization parameter is obtained, and the noise reduction optimization parameter is used as the noise reduction parameter.
  • noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters, obtain the noise reduction optimization parameters, and use the noise reduction optimization parameters as the noise reduction parameters, including:
  • the average value of the noise reduction optimization parameters is used as the noise reduction parameter.
  • noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters, obtain the noise reduction optimization parameters, and use the noise reduction optimization parameters as the noise reduction parameters, including:
  • noise reduction optimization parameters generated after the last reverse iteration operation are obtained, and the noise reduction optimization parameters are used as the noise reduction parameters.
  • the scene audio data is converted into a corresponding noise frequency domain response by a preset method, including:
  • the average value of multiple noise frequency domain responses is obtained as the noise frequency domain response corresponding to the scene audio data.
  • the noise reduction parameters are used to perform adaptive noise reduction on the current noise audio data, including:
  • the noise reduction mode of the adaptive noise reduction device is feed-forward noise reduction, then use the feed-forward parameters in the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data;
  • the noise reduction mode of the adaptive noise reduction device is feedback noise reduction, then use the feedback parameters in the noise reduction parameters to generate an inverse sound wave signal to perform adaptive noise reduction on the current noise audio data;
  • the noise reduction mode of the adaptive noise reduction device is hybrid noise reduction
  • the hybrid parameter in the noise reduction parameters is used to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data.
  • an adaptive noise reduction system includes:
  • a data acquisition module configured to acquire current noise audio data
  • the noise reduction parameter acquisition module is used to input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
  • the adaptive noise reduction module is configured to use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data based on the noise reduction mode of the adaptive noise reduction device.
  • an adaptive noise reduction device includes: a memory, a processor, and an adaptive noise reduction method program stored in the memory and operable on the processor, the adaptive noise reduction method When the noise reduction method program is executed by the processor, the steps of any one of the above adaptive noise reduction methods are realized.
  • a computer storage medium on which an adaptive noise reduction method program is stored.
  • the adaptive noise reduction method program is executed by a processor, any one of the above adaptive noise reduction methods can be realized. step.
  • One or more technical solutions provided in the embodiments of the present application have at least the following technical effects or advantages: obtain the current noise audio data; input the current noise audio data into the pre-trained audio noise reduction model to obtain the current noise scene corresponding The noise reduction parameters; through the pre-trained audio noise reduction model, improve the noise recognition rate and obtain the noise reduction parameters under the high noise recognition rate for adaptive noise reduction equipment to perform noise reduction;
  • the anti-phase sound wave signal is generated by using the noise reduction parameters to perform adaptive noise reduction on the current noise audio data; among them, the noise reduction mode based on the adaptive noise reduction device is used in the noise reduction parameters
  • the parameters corresponding to the noise reduction mode accurately generate the anti-phase sound wave signal to complete the accurate adaptive noise reduction for the current noise audio data, thereby improving the noise reduction effect of the adaptive noise reduction device.
  • FIG. 1 is a schematic flow chart of a first embodiment of the adaptive noise reduction method of the present application
  • FIG. 2 is a schematic flowchart of a second embodiment of the adaptive noise reduction method of the present application
  • FIG. 3 is a schematic flowchart of specific implementation steps of step S220 in the second embodiment of the adaptive noise reduction method of the present application;
  • FIG. 4 is a schematic flowchart of another specific implementation step diagram of step S220 in the second embodiment of the adaptive noise reduction method of the present application;
  • FIG. 5 is a schematic flowchart of specific implementation steps of step S225' of the adaptive noise reduction method of the present application.
  • FIG. 6 is a schematic flowchart of another specific implementation step of step S225' of the adaptive noise reduction method of the present application.
  • FIG. 7 is a schematic flow chart of specific implementation steps of step S222 of the adaptive noise reduction method of the present application.
  • FIG. 8 is a schematic diagram of the adaptive noise reduction system of the present application.
  • FIG. 9 is a schematic diagram of an adaptive noise reduction device of the present application.
  • the main solution of the embodiment of the present application is: obtain the current noise audio data; input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene; Noise reduction mode, using noise reduction parameters to generate anti-phase sound wave signals to perform adaptive noise reduction on the current noisy audio data; this application solves the problem of low noise recognition rate and improves the noise reduction effect of adaptive noise reduction equipment.
  • Fig. 1 is the first embodiment of the adaptive noise reduction method of the present application, the method includes:
  • Step S110 Acquiring current noise audio data
  • the audio that can be picked up in the current environment where the adaptive noise reduction device is located is preprocessed to obtain current noise audio data, wherein the noise audio data includes at least information such as different spectral features and specific speech duration.
  • Step S120 Input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
  • the pre-trained audio noise reduction model can be a noise reduction model based on a deep neural network (DNN), which can be used for uplink call noise reduction.
  • DNN deep neural network
  • the method based on spectral mapping Perform noise cancellation.
  • the deep neural network realizes the processing output of the original sound through a multi-layer learning network (each network has multiple nodes), and realizes the separation of speech under different input conditions.
  • Step S130 Based on the noise reduction mode of the adaptive noise reduction device, use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noisy audio data.
  • the adaptive noise reduction device in this embodiment can be an earphone, and the earphone can be an in-ear type or a semi-in-ear type. This is not limited; in addition, the adaptive noise reduction device may also be applied to virtual reality (Virtual Reality, or VR), augmented reality (Augmented Reality, or AR) or mediated reality (Mediated Reality, or MR).
  • VR Virtual Reality
  • AR Augmented Reality
  • MR Mediated Reality
  • the adaptive noise reduction device can be used to generate sound wave signals with the same signal strength and opposite directions by using the noise reduction parameters to offset the current noisy audio data, so as to achieve the effect of adaptive noise reduction.
  • the current noise audio data is obtained; the current noise audio data is input into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene; through the pre-trained audio noise reduction model, the improved Noise recognition rate and obtain noise reduction parameters under high noise recognition rate for adaptive noise reduction equipment to perform noise reduction;
  • the anti-phase sound wave signal is generated by using the noise reduction parameters to perform adaptive noise reduction on the current noise audio data; among them, the noise reduction mode based on the adaptive noise reduction device is used in the noise reduction parameters
  • the parameters corresponding to the noise reduction mode accurately generate the anti-phase sound wave signal to complete the accurate adaptive noise reduction for the current noise audio data, thereby improving the noise reduction effect of the adaptive noise reduction device.
  • FIG. 2 is the second embodiment of the adaptive noise reduction method of the present application. Before the step of inputting the current noise audio data into the pre-trained audio noise reduction model and obtaining the noise reduction parameters corresponding to the current noise scene, it includes:
  • Step S210 Acquiring current noise audio data
  • Step S220 building an audio noise reduction model
  • an audio noise reduction model it can be a noise reduction model based on a deep neural network (DNN), or a hybrid noise reduction model based on DNN-LSTM, or a combination of multiple neural network models in the prior art.
  • DNN deep neural network
  • the integrated learning noise reduction model is used to improve the noise recognition effect of the audio noise reduction model, which is not limited here.
  • the audio noise reduction model can be deployed on the cloud, or on the adaptive noise reduction device side, that is, the local side, which is not limited here and adjusted according to specific settings.
  • Step S230 Input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
  • Step S240 Based on the noise reduction mode of the adaptive noise reduction device, use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noisy audio data.
  • noise reduction parameters are used to perform adaptive noise reduction on the current noise audio data, including:
  • the noise reduction mode of the adaptive noise reduction device is feed-forward noise reduction, then use the feed-forward parameters in the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data;
  • the noise reduction mode of the adaptive noise reduction device is feedback noise reduction, then use the feedback parameters in the noise reduction parameters to generate an inverse sound wave signal to perform adaptive noise reduction on the current noise audio data;
  • the noise reduction mode of the adaptive noise reduction device is hybrid noise reduction
  • the hybrid parameter in the noise reduction parameters is used to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data.
  • the residual noise signal is obtained, and the residual noise signal is fed back to the audio noise reduction model.
  • the second embodiment includes step S220, other steps have been described in the first embodiment, and will not be repeated here.
  • FIG. 3 shows the specific implementation steps of step S220 in the second embodiment of the adaptive noise reduction method of the present application, and constructs an audio noise reduction model, including:
  • Step S221 acquiring scene audio data of different noise scenes
  • a large amount of scene audio data of different noise scenes can be obtained as a training set to ensure the comprehensiveness of the audio noise reduction model training.
  • Step S222 Convert the scene audio data into corresponding noise frequency domain responses through a preset method
  • the scene audio data can be transformed into corresponding noise frequency domain responses through Fourier transform.
  • the Fourier transform may be a continuous Fourier transform or a discrete Fourier transform, which is not limited here.
  • Step S223 Debug the noise reduction parameters through the noise frequency domain response in the form of a cascaded filter to obtain noise reduction parameters corresponding to different noise scenarios.
  • the cascaded filter can be a plurality of filters connected together by cascading, wherein, if more than two identical filters are cascaded, the filtering effect can be enhanced; if more than two different If the filters are cascaded, the filtering frequency domain can be extended.
  • the above-mentioned cascading manner is not limited, and it can be dynamically adjusted as required.
  • the debugging of the corresponding combined noise reduction parameters is performed in the form of cascaded filters, so as to accurately obtain the noise reduction parameters corresponding to different noise scenarios.
  • Fig. 4 is another specific implementation step of step S220 in the second embodiment of the adaptive noise reduction method of the present application.
  • the frequency domain response of the noise is debugged in the form of a cascaded filter to obtain different noise
  • the step of denoising parameters corresponding to the scene it also includes:
  • Step S221' Acquiring scene audio data of different noise scenes
  • Step S222' convert the scene audio data into corresponding noise frequency domain responses through a preset method
  • Step S223' Debug the noise reduction parameters through the noise frequency domain response in the form of a cascaded filter to obtain noise reduction parameters corresponding to different noise scenarios;
  • Step S224' use the noise reduction parameters to eliminate and verify the scene audio data of different noise scenes, and obtain the noise reduction error value;
  • the elimination verification may be to use the noise reduction parameters to enable the adaptive noise reduction device to generate sound wave signals with the same intensity but opposite directions to perform noise elimination verification on the scene audio data of different noise scenes.
  • Step S225' Use the noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters, obtain the noise reduction optimization parameters, and use the noise reduction optimization parameters as the noise reduction parameter.
  • the preset times are not limited here, and are dynamically adjusted according to the specific audio noise reduction model and the quality of the training set.
  • the optimization of the noise reduction parameters can be stopped, the current noise reduction optimization parameters can be obtained, and the noise reduction optimization parameters can be used as the noise reduction parameters. Noise parameter.
  • this embodiment specifically includes step S224' and step S225', and other steps have been described in the previous embodiment, and will not be repeated here.
  • an optimization operation on the noise reduction parameters is added to improve the accuracy of the noise reduction parameters, thereby further ensuring the noise reduction effect of the adaptive noise reduction device.
  • Fig. 5 is the specific implementation steps of step S225' of the self-adaptive noise reduction method of the present application.
  • the noise reduction parameters are optimized by using the noise reduction error value until the number of reverse iterations reaches the preset number of times, then the noise reduction parameters are stopped. Perform optimization to obtain noise reduction optimization parameters, and use the noise reduction optimization parameters as noise reduction parameters, including:
  • Step S225'-1 Perform reverse iteration operation on the noise reduction error value until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters;
  • the reverse iteration operation can be implemented based on a reverse iterator; the preset number of times is not limited in this embodiment, and the preset number of times is positively correlated with the optimization effect of the noise reduction parameter, that is, the more the preset times are, the more optimized The higher the effect, but it will cause a waste of computing resources, so the setting of the preset number of times is adjusted according to the noise reduction effect.
  • Step S225'-2 Obtain the noise reduction optimization parameters generated after each reverse iteration operation, and obtain the average value of the noise reduction optimization parameters based on the noise reduction optimization parameters and the number of reverse iterations;
  • noise reduction optimization parameters generated after each reverse iteration operation it is also possible to obtain the noise reduction optimization parameters generated after each reverse iteration operation, then remove the maximum and minimum values of the noise reduction optimization parameters, and obtain the average value of the remaining noise reduction optimization parameters to ensure that the noise reduction parameters rationality and correctness.
  • Step S225'-3 Use the average value of the optimized noise reduction parameters as the noise reduction parameters.
  • the average value of the optimized noise reduction parameters is obtained as the noise reduction parameters to make the noise reduction parameters more reasonable, thereby ensuring the noise recognition rate of the audio noise reduction model.
  • Fig. 6 is another specific implementation step of step S225' of the self-adaptive noise reduction method of the present application.
  • the noise reduction parameter is optimized by using the noise reduction error value until the number of reverse iterations reaches the preset number of times, then the noise reduction is stopped.
  • the noise parameters are optimized, the noise reduction optimization parameters are obtained, and the noise reduction optimization parameters are used as the noise reduction parameters, including:
  • Step S225'-11 Perform reverse iteration operation on the noise reduction error value until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters;
  • Step S225'-12 Obtain the noise reduction optimization parameters generated after the last reverse iteration operation, and use the noise reduction optimization parameters as the noise reduction parameters.
  • the noise reduction optimization parameters generated after the last reverse iteration operation are obtained as the noise reduction parameters to ensure that the noise reduction parameters are optimal values, thereby ensuring the noise recognition rate of the audio noise reduction model, and further improving the automatic Adapt to the noise reduction effect of the noise reduction device.
  • Fig. 7 is the specific implementation steps of step S222 of the adaptive noise reduction method of the present application, which converts the scene audio data into corresponding noise frequency domain responses through a preset method, including:
  • Step S222-1 converting the audio data of each scene into multiple noise frequency domain responses through a preset method
  • Step S222-2 Obtain the average value of multiple noise frequency domain responses as the noise frequency domain response corresponding to the scene audio data.
  • the average value of multiple noise frequency domain responses is obtained as the noise frequency domain response corresponding to the scene audio data, so that the noise frequency domain response is more reasonable and accurate.
  • This application also protects an adaptive noise reduction system, said system 02, including:
  • Data acquisition module 21 for acquiring current noise audio data
  • the noise reduction parameter acquisition module 22 is used to input the current noise audio data into the pre-trained audio noise reduction model to obtain the corresponding noise reduction parameters of the current noise scene;
  • the adaptive noise reduction module 23 is configured to use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noisy audio data based on the noise reduction mode of the adaptive noise reduction device.
  • the system shown in FIG. 8 includes a data acquisition module 21, a noise reduction parameter acquisition module 22, and an adaptive noise reduction module 23.
  • the system can execute the methods of the embodiments shown in FIGS. 1 to 7.
  • the parts not described in detail in this embodiment Reference may be made to relevant descriptions of the embodiments shown in FIGS. 1 to 7 .
  • For the execution process and technical effect of this technical solution refer to the description in the embodiment shown in FIG. 1 to FIG. 7 , and details are not repeated here.
  • the present application also protects an adaptive noise reduction device.
  • the adaptive noise reduction device includes: a memory, a processor, and an adaptive noise reduction method program stored on the memory and operable on the processor. When the program of the adaptive noise reduction method is executed by the processor, the steps of any one of the above adaptive noise reduction methods are realized.
  • An adaptive noise reduction device 10 involved in the present application includes at least one processor 12 and a memory 11 as shown in FIG. 9 .
  • the processor 12 may be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above method can be completed by an integrated logic circuit of hardware in the processor 12 or instructions in the form of software.
  • the above-mentioned processor 12 may be a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • DSP digital signal processor
  • ASIC application-specific integrated circuit
  • FPGA field programmable gate array
  • a general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
  • the software module can be located in a mature storage medium in the field such as random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, register.
  • the storage medium is located in the memory 11, and the processor 12 reads the information in the memory 11, and completes the steps of the above method in combination with its hardware.
  • the memory 11 in the embodiment of the present application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memories.
  • the non-volatile memory can be read-only memory (Read Only Memory, ROM), programmable read-only memory (Programmable ROM, PROM), erasable programmable read-only memory (Erasable PROM, EPROM), electrically erasable In addition to programmable read-only memory (Electrically EPROM, EEPROM) or flash memory.
  • the volatile memory can be Random Access Memory (RAM), which acts as external cache memory.
  • RAM Static Random Access Memory
  • SRAM Static Random Access Memory
  • DRAM Dynamic Random Access Memory
  • Synchronous Dynamic Random Access Memory Synchronous Dynamic Random Access Memory
  • SDRAM double data rate synchronous dynamic random access memory
  • Double DataRate SDRAM, DDRSDRAM enhanced synchronous dynamic random access memory
  • Enhanced SDRAM, ESDRAM synchronous connection dynamic random access memory
  • Synch link DRAM, SLDRAM Direct Memory Bus Random Access Memory
  • Direct Rambus RAM Direct Rambus RAM
  • the memory 11 of the system and method described in the embodiments of the present application is intended to include but not limited to these and any other suitable types of memory.
  • the present application also protects a computer storage medium, on which an adaptive noise reduction method program is stored, and when the adaptive noise reduction method program is executed by a processor, the adaptive noise reduction method described in any one of the above is realized method steps.
  • the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
  • computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
  • These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the instructions
  • the device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.
  • any reference signs placed between parentheses shall not be construed as limiting the claim.
  • the word “comprising” does not exclude the presence of elements or steps not listed in a claim.
  • the word “a” or “an” preceding an element does not exclude the presence of a plurality of such elements.
  • the application can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware.
  • the use of the words first, second, and third, etc. does not indicate any order. These words can be interpreted as names.

Abstract

The present application discloses a self-adaptive noise reduction method, system and device, and a storage medium. The method comprises the following steps: obtaining current noise audio data; inputting the current noise audio data into a pre-trained audio noise reduction model to obtain a noise reduction parameter corresponding to a current noise scene; and on the basis of a noise reduction mode of a self-adaptive noise reduction device, using the noise reduction parameter to generate a phase-inverted sound wave signal to perform self-adaptive noise reduction on the current noise audio data. The present application solves the problem of low noise recognition rate, and improves the noise reduction effect of the self-adaptive noise reduction device.

Description

自适应降噪方法、系统、设备及存储介质Adaptive noise reduction method, system, device and storage medium
本申请要求于2021年10月28日提交中国专利局、申请号202111267424.0、申请名称为“自适应降噪方法、系统、设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to a Chinese patent application filed with the China Patent Office on October 28, 2021, with application number 202111267424.0 and titled "Adaptive Noise Reduction Method, System, Device, and Storage Medium," the entire contents of which are incorporated by reference in this application.
技术领域technical field
本申请涉及主动降噪耳机领域,尤其涉及一种自适应降噪方法、系统、设备及存储介质。The present application relates to the field of active noise reduction earphones, in particular to an adaptive noise reduction method, system, device and storage medium.
背景技术Background technique
降噪耳机根据其采用的降噪原理不同,分为主动降噪耳机和被动降噪耳机,其中主动降噪耳机又分为前馈式降噪、反馈时降噪以及混合式降噪三种。主动降噪耳机一般通过一个或多个麦克风来接收环境噪声,然后通过电子电路产生与噪声声波相位相反的信号,以此来消除外界进入耳道的噪声。但是现有技术的噪声识别率低,严重影响了耳机的降噪效果。Noise-cancelling headphones are divided into active noise-cancelling headphones and passive noise-cancelling headphones according to the different noise-cancelling principles they adopt. Among them, active noise-cancelling headphones are divided into three types: feed-forward noise reduction, feedback noise reduction, and hybrid noise reduction. Active noise-canceling headphones generally receive ambient noise through one or more microphones, and then generate a signal that is in phase opposite to the noise sound wave through an electronic circuit to eliminate external noise entering the ear canal. However, the noise recognition rate of the prior art is low, which seriously affects the noise reduction effect of the earphone.
发明内容Contents of the invention
有鉴于此,本申请实施例提供一种自适应降噪方法、系统、设备及存储介质,解决噪声识别率低的问题。In view of this, embodiments of the present application provide an adaptive noise reduction method, system, device, and storage medium to solve the problem of low noise recognition rate.
本申请实施例提供了一种自适应降噪方法,所述方法包括:An embodiment of the present application provides an adaptive noise reduction method, the method comprising:
获取当前的噪声音频数据;Get the current noise audio data;
将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;Input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。Based on the noise reduction mode of the adaptive noise reduction device, the anti-phase sound wave signal is generated by using the noise reduction parameters to perform adaptive noise reduction on the current noise audio data.
可选地,将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数的步骤之前,包括:Optionally, before the step of inputting the current noise audio data into the pre-trained audio noise reduction model, the step of obtaining the noise reduction parameters corresponding to the current noise scene includes:
构建音频降噪模型,具体包括:Build an audio noise reduction model, specifically including:
获取不同噪声场景的场景音频数据;Obtain scene audio data of different noise scenes;
将场景音频数据通过预设方法转换为对应的噪声频域响应;Convert the scene audio data to the corresponding noise frequency domain response through the preset method;
将噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数。The noise frequency domain response is debugged through cascaded filters to obtain noise reduction parameters corresponding to different noise scenarios.
可选地,将噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数的步骤之后,还包括:Optionally, after the noise frequency domain response is debugged in the form of a cascaded filter, after obtaining the noise reduction parameters corresponding to different noise scenarios, it also includes:
利用降噪参数对不同噪声场景的场景音频数据进行消除验证,获得降噪误差值;Use the noise reduction parameters to eliminate and verify the scene audio data of different noise scenes, and obtain the noise reduction error value;
利用降噪误差值对降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化,获得降噪优化参数,并将降噪优化参数作为降噪参数。The noise reduction parameter is optimized by using the noise reduction error value until the number of reverse iterations reaches the preset number, then the optimization of the noise reduction parameter is stopped, the noise reduction optimization parameter is obtained, and the noise reduction optimization parameter is used as the noise reduction parameter.
可选地,利用降噪误差值对降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化,获得降噪优化参数,并将降噪优化参数作为降噪参数,包括:Optionally, use the noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters, obtain the noise reduction optimization parameters, and use the noise reduction optimization parameters as the noise reduction parameters, including:
对降噪误差值进行逆向迭代操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化;Carry out reverse iteration operation on the noise reduction error value until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters;
获得每次逆向迭代操作后生成的降噪优化参数,并基于降噪优化参数以及逆向迭代次数获得降噪优化参数的平均值;Obtain the noise reduction optimization parameters generated after each reverse iteration operation, and obtain the average value of the noise reduction optimization parameters based on the noise reduction optimization parameters and the number of reverse iterations;
将降噪优化参数的平均值作为降噪参数。The average value of the noise reduction optimization parameters is used as the noise reduction parameter.
可选地,利用降噪误差值对降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化,获得降噪优化参数,并将降噪优化参数作为降噪参数,还包括:Optionally, use the noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters, obtain the noise reduction optimization parameters, and use the noise reduction optimization parameters as the noise reduction parameters, including:
对降噪误差值进行逆向迭代操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化;Carry out reverse iteration operation on the noise reduction error value until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters;
获得最后一次逆向迭代操作后生成的降噪优化参数,并将降噪优化参数作为降噪参数。The noise reduction optimization parameters generated after the last reverse iteration operation are obtained, and the noise reduction optimization parameters are used as the noise reduction parameters.
可选地,将场景音频数据通过预设方法转换为对应的噪声频域响应,包括:Optionally, the scene audio data is converted into a corresponding noise frequency domain response by a preset method, including:
将每个场景音频数据通过预设方法转换为多个噪声频域响应;Convert the audio data of each scene into multiple noise frequency domain responses through a preset method;
获得多个噪声频域响应的平均值作为场景音频数据对应的噪声频域响应。The average value of multiple noise frequency domain responses is obtained as the noise frequency domain response corresponding to the scene audio data.
可选地,基于自适应降噪设备的降噪模式,利用降噪参数对当前的噪声音频数据进行自适应降噪,包括:Optionally, based on the noise reduction mode of the adaptive noise reduction device, the noise reduction parameters are used to perform adaptive noise reduction on the current noise audio data, including:
若自适应降噪设备的降噪模式为前馈式降噪,则利用降噪参数中的前馈式参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;If the noise reduction mode of the adaptive noise reduction device is feed-forward noise reduction, then use the feed-forward parameters in the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data;
若自适应降噪设备的降噪模式为反馈式降噪,则利用降噪参数中的反馈式参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;If the noise reduction mode of the adaptive noise reduction device is feedback noise reduction, then use the feedback parameters in the noise reduction parameters to generate an inverse sound wave signal to perform adaptive noise reduction on the current noise audio data;
若自适应降噪设备的降噪模式为混合式降噪,则利用降噪参数中的混合式参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。If the noise reduction mode of the adaptive noise reduction device is hybrid noise reduction, the hybrid parameter in the noise reduction parameters is used to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data.
为实现上述目的,还提供一种自适应降噪系统,所述系统,包括:In order to achieve the above purpose, an adaptive noise reduction system is also provided, and the system includes:
数据获取模块,用于获取当前的噪声音频数据;A data acquisition module, configured to acquire current noise audio data;
降噪参数获取模块,用于将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;The noise reduction parameter acquisition module is used to input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
自适应降噪模块,用于基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。The adaptive noise reduction module is configured to use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data based on the noise reduction mode of the adaptive noise reduction device.
为实现上述目的,还提供一种自适应降噪设备,所述自适应降噪设备包括:存储器、处理器及存储在存储器上并可在处理器上运行的自适应降噪方法程序,自适应降噪方法程序被处理器执行时实现上述任一项自适应降噪方法的步骤。In order to achieve the above object, an adaptive noise reduction device is also provided, the adaptive noise reduction device includes: a memory, a processor, and an adaptive noise reduction method program stored in the memory and operable on the processor, the adaptive noise reduction method When the noise reduction method program is executed by the processor, the steps of any one of the above adaptive noise reduction methods are realized.
为实现上述目的,还提供一种计算机存储介质,所述计算机存储介质上存储有自适应降噪方法程序,自适应降噪方法程序被处理器执行时实现上述任一项自适应降噪方法的步骤。In order to achieve the above object, a computer storage medium is also provided, on which an adaptive noise reduction method program is stored. When the adaptive noise reduction method program is executed by a processor, any one of the above adaptive noise reduction methods can be realized. step.
本申请实施例中提供的一个或多个技术方案,至少具有如下技术效果或优点:获取当前的噪声音频数据;将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;通过预先训练好的音频降噪模型,提高噪声识别率并获取高噪声识别率下的降噪参数,以供自适应降噪设备进行降噪;One or more technical solutions provided in the embodiments of the present application have at least the following technical effects or advantages: obtain the current noise audio data; input the current noise audio data into the pre-trained audio noise reduction model to obtain the current noise scene corresponding The noise reduction parameters; through the pre-trained audio noise reduction model, improve the noise recognition rate and obtain the noise reduction parameters under the high noise recognition rate for adaptive noise reduction equipment to perform noise reduction;
基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;其中,基于自适应降噪设备的降噪模式,利用降噪参数中降噪模式对应的参数,准确的生成反相声波信号,以完成对当前的噪声音频数据准确的自适应降噪,从而提高自适应降噪设备的降噪效果。Based on the noise reduction mode of the adaptive noise reduction device, the anti-phase sound wave signal is generated by using the noise reduction parameters to perform adaptive noise reduction on the current noise audio data; among them, the noise reduction mode based on the adaptive noise reduction device is used in the noise reduction parameters The parameters corresponding to the noise reduction mode accurately generate the anti-phase sound wave signal to complete the accurate adaptive noise reduction for the current noise audio data, thereby improving the noise reduction effect of the adaptive noise reduction device.
附图说明Description of drawings
图1为本申请自适应降噪方法的第一实施例的流程示意图;FIG. 1 is a schematic flow chart of a first embodiment of the adaptive noise reduction method of the present application;
图2为本申请自适应降噪方法第二实施例的流程示意图;FIG. 2 is a schematic flowchart of a second embodiment of the adaptive noise reduction method of the present application;
图3为本申请自适应降噪方法第二实施例中步骤S220的具体实施步骤的流程示意图;FIG. 3 is a schematic flowchart of specific implementation steps of step S220 in the second embodiment of the adaptive noise reduction method of the present application;
图4为本申请自适应降噪方法第二实施例中步骤S220的另一具体实施步骤图的流程示意图;FIG. 4 is a schematic flowchart of another specific implementation step diagram of step S220 in the second embodiment of the adaptive noise reduction method of the present application;
图5为本申请自适应降噪方法步骤S225'的具体实施步骤的流程示意图;FIG. 5 is a schematic flowchart of specific implementation steps of step S225' of the adaptive noise reduction method of the present application;
图6为本申请自适应降噪方法步骤S225'的另一具体实施步骤的流程示意图;FIG. 6 is a schematic flowchart of another specific implementation step of step S225' of the adaptive noise reduction method of the present application;
图7为本申请自适应降噪方法步骤S222的具体实施步骤的流程示意图;FIG. 7 is a schematic flow chart of specific implementation steps of step S222 of the adaptive noise reduction method of the present application;
图8为本申请自适应降噪系统的示意图;FIG. 8 is a schematic diagram of the adaptive noise reduction system of the present application;
图9为本申请自适应降噪设备的示意图。FIG. 9 is a schematic diagram of an adaptive noise reduction device of the present application.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.
本申请实施例的主要解决方案是:获取当前的噪声音频数据;将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;本申请解决了噪声识别率低的问题,提高了自适应降噪设备的降噪效果。The main solution of the embodiment of the present application is: obtain the current noise audio data; input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene; Noise reduction mode, using noise reduction parameters to generate anti-phase sound wave signals to perform adaptive noise reduction on the current noisy audio data; this application solves the problem of low noise recognition rate and improves the noise reduction effect of adaptive noise reduction equipment.
为了更好的理解上述技术方案,下面将结合说明书附图以及具体的实施方式对上述技术方案进行详细的说明。In order to better understand the above-mentioned technical solution, the above-mentioned technical solution will be described in detail below in conjunction with the accompanying drawings and specific implementation methods.
参照图1,图1为本申请自适应降噪方法的第一实施例,所述方法包括:Referring to Fig. 1, Fig. 1 is the first embodiment of the adaptive noise reduction method of the present application, the method includes:
步骤S110:获取当前的噪声音频数据;Step S110: Acquiring current noise audio data;
具体地,对自适应降噪设备所处当前环境中能拾取到的音频进行预处理,获得当前的噪声音频数据,其中,噪声音频数据至少包括不同频谱特征、特定语音时长等信息。Specifically, the audio that can be picked up in the current environment where the adaptive noise reduction device is located is preprocessed to obtain current noise audio data, wherein the noise audio data includes at least information such as different spectral features and specific speech duration.
步骤S120:将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;Step S120: Input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
具体地,预先训练好的音频降噪模型可以是基于深度神经网络(DNN)的降噪模型,可以用于上行通话降噪,通过设计适合语音增强领域的专用神经网络模型,基于谱映射的方法进行噪声消除。深度神经网络通过多层的学习网络(每个网络中拥有多个节点)实现对原始声音的处理输出,实现不同输入条件下语音分离。Specifically, the pre-trained audio noise reduction model can be a noise reduction model based on a deep neural network (DNN), which can be used for uplink call noise reduction. By designing a special neural network model suitable for the field of speech enhancement, the method based on spectral mapping Perform noise cancellation. The deep neural network realizes the processing output of the original sound through a multi-layer learning network (each network has multiple nodes), and realizes the separation of speech under different input conditions.
步骤S130:基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。Step S130: Based on the noise reduction mode of the adaptive noise reduction device, use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noisy audio data.
需要另外说明的是,在本实施例中的自适应降噪设备可以是耳机,其中耳机可以是入耳式,也可以是半入耳式,耳机形态不限于耳塞,也可以是头戴式耳机,在此并不限定;另外自适应降噪设备也可以是应用到虚拟现实(Virtual Reality,或VR),増强现实(Augmented Reality,或AR)或者介导现实(Mediated Realtiy,或MR)。It should be noted that the adaptive noise reduction device in this embodiment can be an earphone, and the earphone can be an in-ear type or a semi-in-ear type. This is not limited; in addition, the adaptive noise reduction device may also be applied to virtual reality (Virtual Reality, or VR), augmented reality (Augmented Reality, or AR) or mediated reality (Mediated Reality, or MR).
具体地,可以利用自适应降噪设备利用降噪参数生成信号强度相同且方向相反的声波信号以抵消当前的噪声音频数据,以起到自适应降噪的效果。Specifically, the adaptive noise reduction device can be used to generate sound wave signals with the same signal strength and opposite directions by using the noise reduction parameters to offset the current noisy audio data, so as to achieve the effect of adaptive noise reduction.
在本实施例中,获取当前的噪声音频数据;将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;通过预先训练好的音频降噪模型,提高噪声识别率并获取高噪声识别率下的降噪参数,以供自适应降噪设备进行降噪;In this embodiment, the current noise audio data is obtained; the current noise audio data is input into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene; through the pre-trained audio noise reduction model, the improved Noise recognition rate and obtain noise reduction parameters under high noise recognition rate for adaptive noise reduction equipment to perform noise reduction;
基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;其中,基于自适应降噪设备的降噪模式,利用降噪参数中降噪模式对应的参数,准确的生成反相声波信号,以完成对当前的噪声音频数据准确的自适应降噪,从而提高自适应降噪设备的降噪效果。Based on the noise reduction mode of the adaptive noise reduction device, the anti-phase sound wave signal is generated by using the noise reduction parameters to perform adaptive noise reduction on the current noise audio data; among them, the noise reduction mode based on the adaptive noise reduction device is used in the noise reduction parameters The parameters corresponding to the noise reduction mode accurately generate the anti-phase sound wave signal to complete the accurate adaptive noise reduction for the current noise audio data, thereby improving the noise reduction effect of the adaptive noise reduction device.
参照图2,图2为本申请自适应降噪方法第二实施例,将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数的步骤之前,包括:Referring to FIG. 2, FIG. 2 is the second embodiment of the adaptive noise reduction method of the present application. Before the step of inputting the current noise audio data into the pre-trained audio noise reduction model and obtaining the noise reduction parameters corresponding to the current noise scene, it includes:
步骤S210:获取当前的噪声音频数据;Step S210: Acquiring current noise audio data;
步骤S220:构建音频降噪模型;Step S220: building an audio noise reduction model;
具体地,通过构建音频降噪模型,可以是基于深度神经网络(DNN)的降噪模型,也可 以是基于DNN-LSTM的混合降噪模型,也可以是现有技术中多个神经网络模型的集成学习降噪模型,以提高音频降噪模型的噪声识别效果,在此并不限定。Specifically, by constructing an audio noise reduction model, it can be a noise reduction model based on a deep neural network (DNN), or a hybrid noise reduction model based on DNN-LSTM, or a combination of multiple neural network models in the prior art. The integrated learning noise reduction model is used to improve the noise recognition effect of the audio noise reduction model, which is not limited here.
需要另外说明的是,在音频降噪模型映射学习的过程中,算法优化前期,学习率较大会加速学习效果,但在算法优化后期学习率较大会造成较大波动,出现围绕最优值徘徊而无法收敛的情况,因此,在本实施例中,音频降噪模型的训练过程中,随着训练的进行,学习率逐渐衰减。It should be noted that in the process of audio noise reduction model mapping learning, in the early stage of algorithm optimization, a large learning rate will accelerate the learning effect, but in the later stage of algorithm optimization, a large learning rate will cause large fluctuations, and there will be a problem of wandering around the optimal value. Therefore, in this embodiment, during the training process of the audio noise reduction model, the learning rate gradually decays as the training progresses.
具体地,音频降噪模型可以部署在云端,也可以部署在自适应降噪设备端,也就是本地端,在此并不作限定,根据具体设定进行调整。Specifically, the audio noise reduction model can be deployed on the cloud, or on the adaptive noise reduction device side, that is, the local side, which is not limited here and adjusted according to specific settings.
步骤S230:将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;Step S230: Input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
步骤S240:基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。Step S240: Based on the noise reduction mode of the adaptive noise reduction device, use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noisy audio data.
在其中一个实施例中,基于自适应降噪设备的降噪模式,利用降噪参数对当前的噪声音频数据进行自适应降噪,包括:In one of the embodiments, based on the noise reduction mode of the adaptive noise reduction device, noise reduction parameters are used to perform adaptive noise reduction on the current noise audio data, including:
若自适应降噪设备的降噪模式为前馈式降噪,则利用降噪参数中的前馈式参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;If the noise reduction mode of the adaptive noise reduction device is feed-forward noise reduction, then use the feed-forward parameters in the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data;
若自适应降噪设备的降噪模式为反馈式降噪,则利用降噪参数中的反馈式参数生成反相声波信号对当前的噪声音频数据进行自适应降噪;If the noise reduction mode of the adaptive noise reduction device is feedback noise reduction, then use the feedback parameters in the noise reduction parameters to generate an inverse sound wave signal to perform adaptive noise reduction on the current noise audio data;
若自适应降噪设备的降噪模式为混合式降噪,则利用降噪参数中的混合式参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。If the noise reduction mode of the adaptive noise reduction device is hybrid noise reduction, the hybrid parameter in the noise reduction parameters is used to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data.
其中,在反馈式降噪模式以及混合式降噪模式中,在生成反相声波信息后,获取残余噪声信号,并将残余噪声信号反馈至音频降噪模型中。Wherein, in the feedback noise reduction mode and the hybrid noise reduction mode, after the inverse sound wave information is generated, the residual noise signal is obtained, and the residual noise signal is fed back to the audio noise reduction model.
第二实施例与第一实施例相比,包括步骤S220,其他步骤在第一实施例中已经进行了阐述,在此不再赘述。Compared with the first embodiment, the second embodiment includes step S220, other steps have been described in the first embodiment, and will not be repeated here.
在本实施例中,通过构建音频降噪模型以调高噪声识别的准确率,从而达到更好的降噪效果。In this embodiment, a better noise reduction effect is achieved by constructing an audio noise reduction model to increase the accuracy of noise recognition.
参照图3,图3为本申请自适应降噪方法第二实施例中步骤S220的具体实施步骤,构建音频降噪模型,包括:Referring to FIG. 3, FIG. 3 shows the specific implementation steps of step S220 in the second embodiment of the adaptive noise reduction method of the present application, and constructs an audio noise reduction model, including:
步骤S221:获取不同噪声场景的场景音频数据;Step S221: acquiring scene audio data of different noise scenes;
具体地,可以获取海量的不同噪声场景的场景音频数据作为训练集,以保证音频降噪模型训练的全面性。Specifically, a large amount of scene audio data of different noise scenes can be obtained as a training set to ensure the comprehensiveness of the audio noise reduction model training.
步骤S222:将场景音频数据通过预设方法转换为对应的噪声频域响应;Step S222: Convert the scene audio data into corresponding noise frequency domain responses through a preset method;
具体地,将场景音频数据可以通过傅里叶变换转换为对应的噪声频域响应。其中,傅里叶变换可以是连续傅里叶变换和离散傅里叶变换,在此并不作限定。Specifically, the scene audio data can be transformed into corresponding noise frequency domain responses through Fourier transform. Wherein, the Fourier transform may be a continuous Fourier transform or a discrete Fourier transform, which is not limited here.
步骤S223:将噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数。Step S223: Debug the noise reduction parameters through the noise frequency domain response in the form of a cascaded filter to obtain noise reduction parameters corresponding to different noise scenarios.
具体地,级联滤波器可以是将多个滤波器通过级联的方式连接在一起,其中,如果是两个以上完全相同滤波器级联,则可以增强其滤波效果;如果是两个以上不同的滤波器级联,则可以扩展其滤波频域,在本实施例中,并不限定上述的级联方式,可以根据需要动态调整。Specifically, the cascaded filter can be a plurality of filters connected together by cascading, wherein, if more than two identical filters are cascaded, the filtering effect can be enhanced; if more than two different If the filters are cascaded, the filtering frequency domain can be extended. In this embodiment, the above-mentioned cascading manner is not limited, and it can be dynamically adjusted as required.
在本实施例中,通过级联滤波器的形式进行对应的组合降噪参数的调试,以准确获得不同噪声场景对应的降噪参数。In this embodiment, the debugging of the corresponding combined noise reduction parameters is performed in the form of cascaded filters, so as to accurately obtain the noise reduction parameters corresponding to different noise scenarios.
参照图4,图4为本申请自适应降噪方法第二实施例中步骤S220的另一具体实施步骤,将噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数的步骤之后,还包括:Referring to Fig. 4, Fig. 4 is another specific implementation step of step S220 in the second embodiment of the adaptive noise reduction method of the present application. The frequency domain response of the noise is debugged in the form of a cascaded filter to obtain different noise After the step of denoising parameters corresponding to the scene, it also includes:
步骤S221':获取不同噪声场景的场景音频数据;Step S221': Acquiring scene audio data of different noise scenes;
步骤S222':将场景音频数据通过预设方法转换为对应的噪声频域响应;Step S222': convert the scene audio data into corresponding noise frequency domain responses through a preset method;
步骤S223':将噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数;Step S223': Debug the noise reduction parameters through the noise frequency domain response in the form of a cascaded filter to obtain noise reduction parameters corresponding to different noise scenarios;
步骤S224':利用降噪参数对不同噪声场景的场景音频数据进行消除验证,获得降噪误差值;Step S224': use the noise reduction parameters to eliminate and verify the scene audio data of different noise scenes, and obtain the noise reduction error value;
具体地,消除验证可以是利用降噪参数使自适应降噪设备产生强度相同,方向相反的声波信号对不同噪声场景的场景音频数据进行噪声消除的验证。Specifically, the elimination verification may be to use the noise reduction parameters to enable the adaptive noise reduction device to generate sound wave signals with the same intensity but opposite directions to perform noise elimination verification on the scene audio data of different noise scenes.
步骤S225':利用降噪误差值对降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化,获得降噪优化参数,并将降噪优化参数作为降噪参数。Step S225': Use the noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters, obtain the noise reduction optimization parameters, and use the noise reduction optimization parameters as the noise reduction parameter.
具体地,预设次数在此并不限定,根据具体的音频降噪模型以及训练集的质量进行动态调整。Specifically, the preset times are not limited here, and are dynamically adjusted according to the specific audio noise reduction model and the quality of the training set.
具体地,在另一实施例中,也可以是当降噪误差值在预设误差范围内时,停止对降噪 参数进行优化,获得当前的降噪优化参数,并将降噪优化参数作为降噪参数。Specifically, in another embodiment, when the noise reduction error value is within the preset error range, the optimization of the noise reduction parameters can be stopped, the current noise reduction optimization parameters can be obtained, and the noise reduction optimization parameters can be used as the noise reduction parameters. Noise parameter.
与上一实施例相比,本实施例具体包含了步骤S224',以及步骤S225',其他步骤在上一实施例中已经进行了阐述,在此不再赘述。Compared with the previous embodiment, this embodiment specifically includes step S224' and step S225', and other steps have been described in the previous embodiment, and will not be repeated here.
在本实施例中,增加了对降噪参数的优化操作,提高降噪参数的精确度,从而进一步保证自适应降噪设备的降噪效果。In this embodiment, an optimization operation on the noise reduction parameters is added to improve the accuracy of the noise reduction parameters, thereby further ensuring the noise reduction effect of the adaptive noise reduction device.
参照图5,图5为本申请自适应降噪方法步骤S225'的具体实施步骤,利用降噪误差值对降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化,获得降噪优化参数,并将降噪优化参数作为降噪参数,包括:Referring to Fig. 5, Fig. 5 is the specific implementation steps of step S225' of the self-adaptive noise reduction method of the present application. The noise reduction parameters are optimized by using the noise reduction error value until the number of reverse iterations reaches the preset number of times, then the noise reduction parameters are stopped. Perform optimization to obtain noise reduction optimization parameters, and use the noise reduction optimization parameters as noise reduction parameters, including:
步骤S225'-1:对降噪误差值进行逆向迭代操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化;Step S225'-1: Perform reverse iteration operation on the noise reduction error value until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters;
具体地,逆向迭代操作可以是基于逆向迭代器实现;其中预设次数在本实施例中并不限定,预设次数设置与降噪参数的优化效果成正相关,即预设次数越多,则优化效果越高,但会造成计算资源的浪费,所以预设次数的设置根据降噪效果来进行调整。Specifically, the reverse iteration operation can be implemented based on a reverse iterator; the preset number of times is not limited in this embodiment, and the preset number of times is positively correlated with the optimization effect of the noise reduction parameter, that is, the more the preset times are, the more optimized The higher the effect, but it will cause a waste of computing resources, so the setting of the preset number of times is adjusted according to the noise reduction effect.
步骤S225'-2:获得每次逆向迭代操作后生成的降噪优化参数,并基于降噪优化参数以及逆向迭代次数获得降噪优化参数的平均值;Step S225'-2: Obtain the noise reduction optimization parameters generated after each reverse iteration operation, and obtain the average value of the noise reduction optimization parameters based on the noise reduction optimization parameters and the number of reverse iterations;
具体地,也可以是获得每次逆向迭代操作后生成的降噪优化参数,然后去掉降噪优化参数中的最大值以及最小值,求得剩余降噪优化参数的平均值,以保证降噪参数的合理性以及正确性。Specifically, it is also possible to obtain the noise reduction optimization parameters generated after each reverse iteration operation, then remove the maximum and minimum values of the noise reduction optimization parameters, and obtain the average value of the remaining noise reduction optimization parameters to ensure that the noise reduction parameters rationality and correctness.
步骤S225'-3:将降噪优化参数的平均值作为降噪参数。Step S225'-3: Use the average value of the optimized noise reduction parameters as the noise reduction parameters.
在本实施例中通过获取降噪优化参数的平均值作为降噪参数,使降噪参数更加的合理,从而保证音频降噪模型的噪声识别率。In this embodiment, the average value of the optimized noise reduction parameters is obtained as the noise reduction parameters to make the noise reduction parameters more reasonable, thereby ensuring the noise recognition rate of the audio noise reduction model.
参照图6,图6为本申请自适应降噪方法步骤S225'的另一具体实施步骤,利用降噪误差值对降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化,获得降噪优化参数,并将降噪优化参数作为降噪参数,还包括:Referring to Fig. 6, Fig. 6 is another specific implementation step of step S225' of the self-adaptive noise reduction method of the present application. The noise reduction parameter is optimized by using the noise reduction error value until the number of reverse iterations reaches the preset number of times, then the noise reduction is stopped. The noise parameters are optimized, the noise reduction optimization parameters are obtained, and the noise reduction optimization parameters are used as the noise reduction parameters, including:
步骤S225'-11:对降噪误差值进行逆向迭代操作,直至逆向迭代次数达到预设次数,则停止对降噪参数进行优化;Step S225'-11: Perform reverse iteration operation on the noise reduction error value until the number of reverse iterations reaches the preset number, then stop optimizing the noise reduction parameters;
步骤S225'-12:获得最后一次逆向迭代操作后生成的降噪优化参数,并将降噪优化参数作为降噪参数。Step S225'-12: Obtain the noise reduction optimization parameters generated after the last reverse iteration operation, and use the noise reduction optimization parameters as the noise reduction parameters.
具体地,在本实施例中通过获取最后一次逆向迭代操作后生成的降噪优化参数作 为降噪参数,保证降噪参数为最优值,从而保证音频降噪模型的噪声识别率,进一步提高自适应降噪设备的降噪效果。Specifically, in this embodiment, the noise reduction optimization parameters generated after the last reverse iteration operation are obtained as the noise reduction parameters to ensure that the noise reduction parameters are optimal values, thereby ensuring the noise recognition rate of the audio noise reduction model, and further improving the automatic Adapt to the noise reduction effect of the noise reduction device.
参照图7,图7为本申请自适应降噪方法步骤S222的具体实施步骤,将场景音频数据通过预设方法转换为对应的噪声频域响应,包括:Referring to Fig. 7, Fig. 7 is the specific implementation steps of step S222 of the adaptive noise reduction method of the present application, which converts the scene audio data into corresponding noise frequency domain responses through a preset method, including:
步骤S222-1:将每个场景音频数据通过预设方法转换为多个噪声频域响应;Step S222-1: converting the audio data of each scene into multiple noise frequency domain responses through a preset method;
步骤S222-2:获得多个噪声频域响应的平均值作为场景音频数据对应的噪声频域响应。Step S222-2: Obtain the average value of multiple noise frequency domain responses as the noise frequency domain response corresponding to the scene audio data.
具体地,在本实施例中通过获取多个噪声频域响应的平均值作为场景音频数据对应的噪声频域响应,使噪声频域响应更加合理以及准确。Specifically, in this embodiment, the average value of multiple noise frequency domain responses is obtained as the noise frequency domain response corresponding to the scene audio data, so that the noise frequency domain response is more reasonable and accurate.
本申请还保护一种自适应降噪系统,所述系统02,包括:This application also protects an adaptive noise reduction system, said system 02, including:
数据获取模块21,用于获取当前的噪声音频数据; Data acquisition module 21, for acquiring current noise audio data;
降噪参数获取模块22,用于将当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;The noise reduction parameter acquisition module 22 is used to input the current noise audio data into the pre-trained audio noise reduction model to obtain the corresponding noise reduction parameters of the current noise scene;
自适应降噪模块23,用于基于自适应降噪设备的降噪模式,利用降噪参数生成反相声波信号对当前的噪声音频数据进行自适应降噪。The adaptive noise reduction module 23 is configured to use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noisy audio data based on the noise reduction mode of the adaptive noise reduction device.
图8所示系统包括数据获取模块21、降噪参数获取模块22、自适应降噪模块23,该系统可以执行图1至图7所示实施例的方法,本实施例未详细描述的部分,可参考对图1至图7所示实施例的相关说明。该技术方案的执行过程和技术效果参见图1至图7所示实施例中的描述,在此不再赘述。The system shown in FIG. 8 includes a data acquisition module 21, a noise reduction parameter acquisition module 22, and an adaptive noise reduction module 23. The system can execute the methods of the embodiments shown in FIGS. 1 to 7. The parts not described in detail in this embodiment, Reference may be made to relevant descriptions of the embodiments shown in FIGS. 1 to 7 . For the execution process and technical effect of this technical solution, refer to the description in the embodiment shown in FIG. 1 to FIG. 7 , and details are not repeated here.
本申请还保护一种自适应降噪设备,所述自适应降噪设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的自适应降噪方法程序,所述自适应降噪方法程序被所述处理器执行时实现上述任一项所述的自适应降噪方法的步骤。The present application also protects an adaptive noise reduction device. The adaptive noise reduction device includes: a memory, a processor, and an adaptive noise reduction method program stored on the memory and operable on the processor. When the program of the adaptive noise reduction method is executed by the processor, the steps of any one of the above adaptive noise reduction methods are realized.
本申请涉及的一种自适应降噪设备10包括如图9所示:至少一个处理器12、存储器11。An adaptive noise reduction device 10 involved in the present application includes at least one processor 12 and a memory 11 as shown in FIG. 9 .
处理器12可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器12中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器12可以是通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介 质中。该存储介质位于存储器11,处理器12读取存储器11中的信息,结合其硬件完成上述方法的步骤。The processor 12 may be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above method can be completed by an integrated logic circuit of hardware in the processor 12 or instructions in the form of software. The above-mentioned processor 12 may be a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. Various methods, steps, and logic block diagrams disclosed in the embodiments of the present application may be implemented or executed. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like. The software module can be located in a mature storage medium in the field such as random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, register. The storage medium is located in the memory 11, and the processor 12 reads the information in the memory 11, and completes the steps of the above method in combination with its hardware.
可以理解,本申请实施例中的存储器11可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。其中,非易失性存储器可以是只读存储器(Read Only Memory,ROM)、可编程只读存储器(Programmable ROM,PROM)、可擦除可编程只读存储器(Erasable PROM,EPROM)、电可擦除可编程只读存储器(Electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(Random Access Memory,RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(Static RAM,SRAM)、动态随机存取存储器(Dynamic RAM,DRAM)、同步动态随机存取存储器(Synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(Double DataRate SDRAM,DDRSDRAM)、增强型同步动态随机存取存储器(Enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(Synch link DRAM,SLDRAM)和直接内存总线随机存取存储器(Direct Rambus RAM,DRRAM)。本申请实施例描述的系统和方法的存储器11旨在包括但不限于这些和任意其它适合类型的存储器。It can be understood that the memory 11 in the embodiment of the present application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memories. Among them, the non-volatile memory can be read-only memory (Read Only Memory, ROM), programmable read-only memory (Programmable ROM, PROM), erasable programmable read-only memory (Erasable PROM, EPROM), electrically erasable In addition to programmable read-only memory (Electrically EPROM, EEPROM) or flash memory. The volatile memory can be Random Access Memory (RAM), which acts as external cache memory. By way of illustration and not limitation, many forms of RAM are available, such as Static Random Access Memory (Static RAM, SRAM), Dynamic Random Access Memory (Dynamic RAM, DRAM), Synchronous Dynamic Random Access Memory (Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (Double DataRate SDRAM, DDRSDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchronous connection dynamic random access memory (Synch link DRAM, SLDRAM) And Direct Memory Bus Random Access Memory (Direct Rambus RAM, DRRAM). The memory 11 of the system and method described in the embodiments of the present application is intended to include but not limited to these and any other suitable types of memory.
本申请还保护一种计算机存储介质,所述计算机存储介质上存储有自适应降噪方法程序,所述自适应降噪方法程序被处理器执行时实现上述任一项所述的自适应降噪方法的步骤。The present application also protects a computer storage medium, on which an adaptive noise reduction method program is stored, and when the adaptive noise reduction method program is executed by a processor, the adaptive noise reduction method described in any one of the above is realized method steps.
本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowcharts and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a An apparatus for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the instructions The device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby The instructions provide steps for implementing the functions specified in the flow chart or blocks of the flowchart and/or the block or blocks of the block diagrams.
应当注意的是,在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的部件或步骤。位于部件之前的单词“一”或“一个”不排除存在多个这样的部件。本申请可以借助于包括有若干不同部件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It should be noted that, in the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The application can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The use of the words first, second, and third, etc. does not indicate any order. These words can be interpreted as names.
尽管已描述了本申请的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本申请范围的所有变更和修改。While preferred embodiments of the present application have been described, additional changes and modifications to these embodiments can be made by those skilled in the art once the basic inventive concept is appreciated. Therefore, the appended claims are intended to be construed to cover the preferred embodiment and all changes and modifications which fall within the scope of the application.
显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的精神和范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the application without departing from the spirit and scope of the application. In this way, if these modifications and variations of the present application fall within the scope of the claims of the present application and their equivalent technologies, the present application is also intended to include these modifications and variations.

Claims (10)

  1. 一种自适应降噪方法,其特征在于,所述方法包括:An adaptive noise reduction method, characterized in that the method comprises:
    获取当前的噪声音频数据;Get the current noise audio data;
    将所述当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;Inputting the current noise audio data into a pre-trained audio noise reduction model to obtain noise reduction parameters corresponding to the current noise scene;
    基于自适应降噪设备的降噪模式,利用所述降噪参数生成反相声波信号对所述当前的噪声音频数据进行自适应降噪。Based on the noise reduction mode of the adaptive noise reduction device, the noise reduction parameters are used to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data.
  2. 如权利要求1所述的自适应降噪方法,其特征在于,所述将所述当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数的步骤之前,包括:The adaptive noise reduction method according to claim 1, wherein before the step of inputting the current noise audio data into a pre-trained audio noise reduction model and obtaining the noise reduction parameters corresponding to the current noise scene, include:
    构建所述音频降噪模型,具体包括:Construct the audio noise reduction model, specifically including:
    获取不同噪声场景的场景音频数据;Obtain scene audio data of different noise scenes;
    将所述场景音频数据通过预设方法转换为对应的噪声频域响应;Converting the scene audio data into a corresponding noise frequency domain response by a preset method;
    将所述噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数。The noise frequency domain response is adjusted through cascaded filters to obtain noise reduction parameters corresponding to different noise scenarios.
  3. 如权利要求2所述的自适应降噪方法,其特征在于,所述将所述噪声频域响应通过级联滤波器的形式进行降噪参数的调试,获得不同噪声场景对应的降噪参数的步骤之后,还包括:The adaptive noise reduction method according to claim 2, wherein the noise frequency domain response is adjusted in the form of a cascaded filter to obtain noise reduction parameters corresponding to different noise scenes After the steps, also include:
    利用所述降噪参数对所述不同噪声场景的场景音频数据进行消除验证,获得降噪误差值;Using the noise reduction parameters to eliminate and verify the scene audio data of the different noise scenes to obtain a noise reduction error value;
    利用所述降噪误差值对所述降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对所述降噪参数进行优化,获得降噪优化参数,并将所述降噪优化参数作为所述降噪参数。Use the noise reduction error value to optimize the noise reduction parameters until the number of reverse iterations reaches the preset number of times, then stop optimizing the noise reduction parameters, obtain noise reduction optimization parameters, and optimize the noise reduction parameter as the denoising parameter.
  4. 如权利要求3所述的自适应降噪方法,其特征在于,所述利用所述降噪误差值对所述降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对所述降噪参数进行优化,获得降噪优化参数,并将所述降噪优化参数作为所述降噪参数,包括:The adaptive noise reduction method according to claim 3, wherein the noise reduction parameter is optimized by using the noise reduction error value, until the number of reverse iterations reaches a preset number of times, then stop performing an operation on the The noise reduction parameters are optimized to obtain the noise reduction optimization parameters, and the noise reduction optimization parameters are used as the noise reduction parameters, including:
    对所述降噪误差值进行逆向迭代操作,直至逆向迭代次数达到预设次数,则停止对所述降噪参数进行优化;Perform reverse iteration operations on the noise reduction error value until the number of reverse iterations reaches a preset number of times, then stop optimizing the noise reduction parameters;
    获得每次逆向迭代操作后生成的降噪优化参数,并基于所述降噪优化参数以及所述逆向迭代次数获得所述降噪优化参数的平均值;Obtaining the noise reduction optimization parameters generated after each reverse iteration operation, and obtaining the average value of the noise reduction optimization parameters based on the noise reduction optimization parameters and the number of reverse iterations;
    将所述降噪优化参数的平均值作为所述降噪参数。The average value of the noise reduction optimization parameters is used as the noise reduction parameter.
  5. 如权利要求3所述的自适应降噪方法,其特征在于,所述利用所述降噪误差值对所述降噪参数进行优化操作,直至逆向迭代次数达到预设次数,则停止对所述降噪参数进行优化,获得降噪优化参数,并将所述降噪优化参数作为所述降噪参数,还包括:The adaptive noise reduction method according to claim 3, wherein the noise reduction parameter is optimized by using the noise reduction error value, until the number of reverse iterations reaches a preset number of times, then stop performing an operation on the Optimizing the noise reduction parameters to obtain the noise reduction optimization parameters, and using the noise reduction optimization parameters as the noise reduction parameters, also includes:
    对所述降噪误差值进行逆向迭代操作,直至逆向迭代次数达到预设次数,则停止对所述降噪参数进行优化;Perform reverse iteration operations on the noise reduction error value until the number of reverse iterations reaches a preset number of times, then stop optimizing the noise reduction parameters;
    获得最后一次逆向迭代操作后生成的降噪优化参数,并将所述降噪优化参数作为所述降噪参数。The noise reduction optimization parameters generated after the last reverse iteration operation are obtained, and the noise reduction optimization parameters are used as the noise reduction parameters.
  6. 如权利要求2所述的自适应降噪方法,其特征在于,所述将所述场景音频数据通过预设方法转换为对应的噪声频域响应,包括:The adaptive noise reduction method according to claim 2, wherein said converting said scene audio data into a corresponding noise frequency domain response by a preset method comprises:
    将每个所述场景音频数据通过预设方法转换为多个噪声频域响应;converting the audio data of each scene into a plurality of noise frequency domain responses through a preset method;
    获得所述多个噪声频域响应的平均值作为所述场景音频数据对应的噪声频域响应。Obtain an average value of the plurality of noise frequency domain responses as the noise frequency domain response corresponding to the scene audio data.
  7. 如权利要求1所述的自适应降噪方法,其特征在于,所述基于自适应降噪设备的降噪模式,利用所述降噪参数对所述当前的噪声音频数据进行自适应降噪,包括:The adaptive noise reduction method according to claim 1, wherein the noise reduction mode based on the adaptive noise reduction device uses the noise reduction parameters to perform adaptive noise reduction on the current noise audio data, include:
    若所述自适应降噪设备的降噪模式为前馈式降噪,则利用所述降噪参数中的前馈式参数生成反相声波信号对所述当前的噪声音频数据进行自适应降噪;If the noise reduction mode of the adaptive noise reduction device is feed-forward noise reduction, use the feed-forward parameters in the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data ;
    若所述自适应降噪设备的降噪模式为反馈式降噪,则利用所述降噪参数中的反馈式参数生成反相声波信号对所述当前的噪声音频数据进行自适应降噪;If the noise reduction mode of the adaptive noise reduction device is feedback noise reduction, using the feedback parameters in the noise reduction parameters to generate an inverse sound wave signal to perform adaptive noise reduction on the current noise audio data;
    若所述自适应降噪设备的降噪模式为混合式降噪,则利用所述降噪参数中的混合式参数生成反相声波信号对所述当前的噪声音频数据进行自适应降噪。If the noise reduction mode of the adaptive noise reduction device is a hybrid noise reduction, use the hybrid parameters in the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data.
  8. 一种自适应降噪系统,其特征在于,所述系统,包括:An adaptive noise reduction system, characterized in that the system includes:
    数据获取模块,用于获取当前的噪声音频数据;A data acquisition module, configured to acquire current noise audio data;
    降噪参数获取模块,用于将所述当前的噪声音频数据输入预先训练好的音频降噪模型,获得当前噪声场景对应的降噪参数;The noise reduction parameter acquisition module is used to input the current noise audio data into the pre-trained audio noise reduction model to obtain the noise reduction parameters corresponding to the current noise scene;
    自适应降噪模块,用于基于自适应降噪设备的降噪模式,利用所述降噪参数生成反相声波信号对所述当前的噪声音频数据进行自适应降噪。The adaptive noise reduction module is configured to use the noise reduction parameters to generate an anti-phase sound wave signal to perform adaptive noise reduction on the current noise audio data based on the noise reduction mode of the adaptive noise reduction device.
  9. 一种自适应降噪设备,其特征在于,所述自适应降噪设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的自适应降噪方法程序,所述自适应降噪方法程序被所述处理器执行时实现如权利要求1至7中任一项所述的自适应降噪方法的步骤。An adaptive noise reduction device, characterized in that the adaptive noise reduction device includes: a memory, a processor, and an adaptive noise reduction method program stored in the memory and operable on the processor, the When the adaptive noise reduction method program is executed by the processor, the steps of the adaptive noise reduction method according to any one of claims 1 to 7 are realized.
  10. 一种计算机存储介质,其特征在于,所述计算机存储介质上存储有自适应降噪方法程序,所述自适应降噪方法程序被处理器执行时实现权利要求1至7中任一项所述的自适应降噪方法的步骤。A computer storage medium, characterized in that an adaptive noise reduction method program is stored on the computer storage medium, and when the adaptive noise reduction method program is executed by a processor, any one of claims 1 to 7 is implemented. The steps of the adaptive noise reduction method.
PCT/CN2021/138987 2021-10-28 2021-12-17 Self-adaptive noise reduction method, system and device, and storage medium WO2023070899A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111267424.0 2021-10-28
CN202111267424.0A CN114120951A (en) 2021-10-28 2021-10-28 Adaptive noise reduction method, system, device and storage medium

Publications (1)

Publication Number Publication Date
WO2023070899A1 true WO2023070899A1 (en) 2023-05-04

Family

ID=80377464

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/138987 WO2023070899A1 (en) 2021-10-28 2021-12-17 Self-adaptive noise reduction method, system and device, and storage medium

Country Status (2)

Country Link
CN (1) CN114120951A (en)
WO (1) WO2023070899A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150081287A1 (en) * 2013-09-13 2015-03-19 Advanced Simulation Technology, inc. ("ASTi") Adaptive noise reduction for high noise environments
CN111800688A (en) * 2020-03-24 2020-10-20 深圳市豪恩声学股份有限公司 Active noise reduction method and device, electronic equipment and storage medium
CN112423176A (en) * 2019-08-20 2021-02-26 Oppo广东移动通信有限公司 Earphone noise reduction method and device, storage medium and noise reduction earphone
CN113132847A (en) * 2021-04-13 2021-07-16 北京安声科技有限公司 Noise reduction parameter determination method and device for active noise reduction earphone and active noise reduction method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150081287A1 (en) * 2013-09-13 2015-03-19 Advanced Simulation Technology, inc. ("ASTi") Adaptive noise reduction for high noise environments
CN112423176A (en) * 2019-08-20 2021-02-26 Oppo广东移动通信有限公司 Earphone noise reduction method and device, storage medium and noise reduction earphone
CN111800688A (en) * 2020-03-24 2020-10-20 深圳市豪恩声学股份有限公司 Active noise reduction method and device, electronic equipment and storage medium
CN113132847A (en) * 2021-04-13 2021-07-16 北京安声科技有限公司 Noise reduction parameter determination method and device for active noise reduction earphone and active noise reduction method

Also Published As

Publication number Publication date
CN114120951A (en) 2022-03-01

Similar Documents

Publication Publication Date Title
US9633654B2 (en) Low power voice detection
JP5718251B2 (en) System and method for reconstruction of decomposed audio signals
CN111800723B (en) Active noise reduction earphone testing method and device, terminal equipment and storage medium
CN111031448B (en) Echo cancellation method, echo cancellation device, electronic equipment and storage medium
US8031883B2 (en) Crosstalk cancellation system with sound quality preservation and parameter determining method thereof
CN113436643A (en) Method, device, equipment and storage medium for training and applying speech enhancement model
US11538487B2 (en) Voice signal enhancing method and device
CN111508519A (en) Method and device for enhancing voice of audio signal
US11393443B2 (en) Apparatuses and methods for creating noise environment noisy data and eliminating noise
WO2023287773A1 (en) Speech enhancement
CN103745729B (en) A kind of audio frequency denoising method and system
CN112309426A (en) Voice processing model training method and device and voice processing method and device
WO2022134351A1 (en) Noise reduction method and system for monophonic speech, and device and readable storage medium
JP6966750B2 (en) Methods, devices and electronic devices for blind signal separation
WO2023070899A1 (en) Self-adaptive noise reduction method, system and device, and storage medium
WO2023124984A1 (en) Method and device for generating speech enhancement model, and speech enhancement method and device
WO2023287782A1 (en) Data augmentation for speech enhancement
CN111968620A (en) Algorithm testing method and device, electronic equipment and storage medium
US11404046B2 (en) Audio processing device for speech recognition
CN110992966B (en) Human voice separation method and system
WO2021175267A1 (en) Method for implementing active noise cancellation, apparatus, and electronic device
WO2022213741A1 (en) Image processing method and apparatus, storage medium, and electronic device
KR102218151B1 (en) Target voice signal output apparatus for improving voice recognition and method thereof
CN114220453B (en) Multi-channel non-negative matrix decomposition method and system based on frequency domain convolution transfer function
CN115376501B (en) Voice enhancement method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21962242

Country of ref document: EP

Kind code of ref document: A1