CN113689900A - Method and device for reducing noise of audio file, server and storage medium - Google Patents
Method and device for reducing noise of audio file, server and storage medium Download PDFInfo
- Publication number
- CN113689900A CN113689900A CN202110961787.8A CN202110961787A CN113689900A CN 113689900 A CN113689900 A CN 113689900A CN 202110961787 A CN202110961787 A CN 202110961787A CN 113689900 A CN113689900 A CN 113689900A
- Authority
- CN
- China
- Prior art keywords
- scene
- audio file
- noise reduction
- audio
- recording rule
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 230000009467 reduction Effects 0.000 claims abstract description 148
- 238000012545 processing Methods 0.000 claims abstract description 25
- 238000004422 calculation algorithm Methods 0.000 claims description 20
- 230000000694 effects Effects 0.000 abstract description 10
- 230000001960 triggered effect Effects 0.000 abstract description 7
- 230000005540 biological transmission Effects 0.000 description 22
- 230000005236 sound signal Effects 0.000 description 18
- 238000010586 diagram Methods 0.000 description 10
- 238000004891 communication Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 6
- 230000006870 function Effects 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 238000013528 artificial neural network Methods 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000000737 periodic effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 210000004704 glottis Anatomy 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11C—STATIC STORES
- G11C7/00—Arrangements for writing information into, or reading information out from, a digital store
- G11C7/16—Storage of analogue signals in digital stores using an arrangement comprising analogue/digital [A/D] converters, digital memories and digital/analogue [D/A] converters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Abstract
The application relates to the technical field of audio processing, and discloses a method for reducing noise of an audio file, which comprises the following steps: receiving a first audio file; determining a scene corresponding to a first audio file; determining a recording rule corresponding to a scene; determining a noise reduction parameter corresponding to the recording rule; sending the noise reduction parameters to the audio acquisition equipment, and triggering the audio acquisition equipment to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file. Therefore, the scene corresponding to the first audio file is determined, the recording rule corresponding to the scene is determined, the noise reduction parameter under the recording rule is determined, the noise reduction parameter corresponds to the scene, the audio acquisition equipment is triggered to reduce the noise of the second audio file under the same scene by using the noise reduction parameter corresponding to the scene, the pertinence of the noise reduction parameter is improved, and the noise reduction effect is improved. The application also discloses a device for reducing noise of the audio file, a server and a storage medium.
Description
Technical Field
The present application relates to the field of audio processing technologies, and for example, to a method and an apparatus for noise reduction of an audio file, a server, and a storage medium.
Background
At present, many companies require employees to record in work by using intelligent voice cards, so that the working state of the employees can be known through audio files or the preference of users can be counted. Under the condition of recording by using the intelligent voice work card, the voice of a plurality of people and various environmental noises often exist in the recording environment, and in order to reduce calculation, noise reduction processing needs to be carried out on an audio file to eliminate unnecessary voice.
In the process of implementing the embodiments of the present disclosure, it is found that at least the following problems exist in the related art: the audio acquisition equipment in the prior art adopts fixed noise reduction parameters to reduce noise of an audio file, and influence of a scene on the noise reduction parameters is not considered, so that the noise reduction effect is poor.
Disclosure of Invention
The following presents a simplified summary in order to provide a basic understanding of some aspects of the disclosed embodiments. This summary is not an extensive overview nor is intended to identify key/critical elements or to delineate the scope of such embodiments but rather as a prelude to the more detailed description that is presented later.
The embodiment of the disclosure provides a method and a device for noise reduction of an audio file, a server and a storage medium, so as to improve the noise reduction effect.
In some embodiments, the method comprises: receiving a first audio file; determining a scene corresponding to a first audio file; determining a recording rule corresponding to a scene; determining a noise reduction parameter corresponding to the recording rule; sending the noise reduction parameters to the audio acquisition equipment, and triggering the audio acquisition equipment to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
In some embodiments, the apparatus comprises: a receiving module configured to receive a first audio file; the first determining module is configured to determine a scene corresponding to the first audio file; the second determining module is configured to determine a recording rule corresponding to the scene; a third determining module configured to determine a noise reduction parameter under the recording rule; the sending module is configured to send the noise reduction parameters to the audio acquisition equipment and trigger the audio acquisition equipment to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
In some embodiments, the server comprises: a processor and a memory storing program instructions, the processor being configured, upon execution of the program instructions, to perform the method for noise reduction of an audio file as described above.
In some embodiments, the storage medium stores program instructions that, when executed, perform the above-described method for noise reduction of an audio file.
The method and the device for reducing the noise of the audio file, the server and the storage medium provided by the embodiment of the disclosure can realize the following technical effects: the method comprises the steps of determining a scene corresponding to a first audio file, determining a recording rule corresponding to the scene, determining a noise reduction parameter under the recording rule, enabling the noise reduction parameter to correspond to the scene, and triggering an audio acquisition device to perform noise reduction processing on a second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
The foregoing general description and the following description are exemplary and explanatory only and are not restrictive of the application.
Drawings
One or more embodiments are illustrated by way of example in the accompanying drawings, which correspond to the accompanying drawings and not in limitation thereof, in which elements having the same reference numeral designations are shown as like elements and not in limitation thereof, and wherein:
FIG. 1 is a schematic diagram of a method for noise reduction of an audio file according to an embodiment of the present disclosure;
FIG. 2 is a timing diagram of a method for noise reduction of an audio file provided by an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of an apparatus for noise reduction of audio files according to an embodiment of the present disclosure;
fig. 4 is a schematic diagram of a server of an embodiment of the present disclosure.
Detailed Description
So that the manner in which the features and elements of the disclosed embodiments can be understood in detail, a more particular description of the disclosed embodiments, briefly summarized above, may be had by reference to the embodiments, some of which are illustrated in the appended drawings. In the following description of the technology, for purposes of explanation, numerous details are set forth in order to provide a thorough understanding of the disclosed embodiments. However, one or more embodiments may be practiced without these details. In other instances, well-known structures and devices may be shown in simplified form in order to simplify the drawing.
The terms "first," "second," and the like in the description and in the claims, and the above-described drawings of embodiments of the present disclosure, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the present disclosure described herein may be made. Furthermore, the terms "comprising" and "having," as well as any variations thereof, are intended to cover non-exclusive inclusions.
The term "plurality" means two or more unless otherwise specified.
In the embodiment of the present disclosure, the character "/" indicates that the preceding and following objects are in an or relationship. For example, A/B represents: a or B.
The term "and/or" is an associative relationship that describes objects, meaning that three relationships may exist. For example, a and/or B, represents: a or B, or A and B.
The term "correspond" may refer to an association or binding relationship, and a corresponds to B refers to an association or binding relationship between a and B.
With reference to fig. 1, an embodiment of the present disclosure provides a method for noise reduction of an audio file, including:
in step S101, the server receives a first audio file.
Step S102, the server determines a scene corresponding to the first audio file.
Step S103, the server determines a recording rule corresponding to the scene of the first audio file.
And step S104, the server determines the noise reduction parameters corresponding to the recording rules.
Step S105, the server sends the noise reduction parameters to the audio acquisition equipment, and the audio acquisition equipment is triggered to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
By adopting the method for reducing the noise of the audio file provided by the embodiment of the disclosure, the recording rule corresponding to the scene can be determined by determining the scene corresponding to the first audio file, the noise reduction parameter under the recording rule is determined, so that the noise reduction parameter can correspond to the scene, and the audio acquisition device is triggered to carry out noise reduction processing on the second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
Optionally, the server receives the first audio file via an audio transmission device. Optionally, the audio transmission device is configured to receive a first audio file sent by the audio acquisition device, and send the first audio file to the server. Optionally, the audio transmission device is further configured to store and process the first audio file. Optionally, the audio capturing device is configured to capture a first audio file and send the first audio file to the audio transmitting device.
Optionally, the audio acquisition device is an intelligent voice card; the intelligent voice work board is provided with a microphone array, and audio files are collected through the microphone array; the intelligent voice work card stores a noise reduction algorithm to reduce noise of the audio file. In some embodiments, the audio capture mode of the audio capture device is controlled primarily by the opening/closing of mechanical keys.
In some embodiments, a plurality of audio capture devices are connected to one audio transmission device, i.e., the plurality of audio capture devices communicate with the server through the same audio transmission device. Or, one audio acquisition device corresponds to one audio transmission device, that is, different audio acquisition devices communicate with the server through different audio transmission devices.
Optionally, the audio transmission device is an edge end device. Optionally, the edge device includes an acquisition station or an intelligent terminal. Optionally, the smart terminal includes a terminal device such as a smart phone, a tablet or a computer, which can communicate with the server. Optionally, the server is a cloud server.
In some embodiments, the edge device can perform low-power consumption intelligent operation and establish communication connection with the cloud server and the audio acquisition device; the edge terminal equipment receives a first audio file sent by the audio acquisition equipment and uploads the first audio file to the cloud server.
In some embodiments, the audio transmission device is a collection station, the audio collection device establishes a connection with the collection station through a USB port, and the audio collection device sends the first audio file to the audio transmission device using the established connection.
In some embodiments, the audio transmission device is an intelligent terminal, the audio acquisition device establishes a connection with the acquisition station through a bluetooth or USB port, and the audio acquisition device sends the first audio file to the audio transmission device by using the established connection.
Optionally, sending the noise reduction parameters to the audio acquisition device includes: and sending the noise reduction parameters to audio acquisition equipment through audio transmission equipment. Like this, audio acquisition equipment can obtain through server and audio transmission equipment and fall the parameter of making an uproar, and it needs the manual work to return the mill to obtain the parameter of making an uproar to go to the collection place and utilize audio acquisition equipment to record after the audio file to compare among the prior art, and this application passes through server and audio transmission equipment and obtains the parameter of making an uproar, does not need the manual work to come and go between collection place and mill, has reduced human cost and time cost, has improved the efficiency of sending the parameter of making an uproar.
Optionally, determining a scene corresponding to the first audio file includes: and determining a scene corresponding to the first audio file according to the keywords in the first audio file. Therefore, the scene corresponding to the first audio file can be determined more accurately through the keywords.
Optionally, determining a scene corresponding to the first audio file according to the keyword in the first audio file includes: acquiring a keyword in a first audio file; performing table look-up operation on the keywords in the first audio file by using a preset keyword database to obtain a scene corresponding to the keywords in the first audio file; the keyword database stores the corresponding relation between scenes and keywords; and determining the scene corresponding to the keyword in the first audio file as the scene corresponding to the first audio file.
In some embodiments, the keywords in the first audio file include: sales promotion, discount, price and commodity, and finding out that the scene corresponding to the keyword is a supermarket in the keyword database; the keywords in the first audio file include: jewelry, diamonds and silver, and finding out that the scene corresponding to the keyword is a jewelry shop in a keyword database; the keywords in the first audio file include: rice, meat and vegetables, and finding out the scene corresponding to the keyword from the keyword database as a restaurant.
Optionally, determining a scene corresponding to the first audio file includes: converting the first audio file into an audio signal model corresponding to the first audio file; performing table look-up operation on the audio signal model by using a preset audio signal model database to obtain a scene corresponding to the audio signal model; the audio signal model database stores the corresponding relation between the scene and the audio signal model; and determining the scene corresponding to the audio signal model as the scene corresponding to the first audio file.
Optionally, an audio signal model is used to characterize the noisiness of the scene. In some embodiments, if the number of the dialog persons in the first audio file is 10, the audio signal model corresponding to the first audio file is a noisy audio signal model, and a scene corresponding to the noisy audio signal model is found in the audio signal model database as a noisy scene; if the number of the dialog characters in the first audio file is 2, the audio signal model corresponding to the first audio file is a quiet audio signal model, and a scene corresponding to the quiet audio signal model is found out in the audio signal model database to be a quiet scene.
Optionally, determining a recording rule corresponding to a scene of the first audio file includes: performing table look-up operation on a scene corresponding to the first audio file by using a preset scene data table to obtain a recording rule corresponding to the scene; the scene data table stores the corresponding relationship between the scene and the recording rule.
Optionally, the recording rule comprises determining a sound signal characteristic parameter. Optionally, the sound signal characteristic parameter comprises a sound to be emphasized and/or a sound to be suppressed.
In some embodiments, the scene is a supermarket, and the recording rule corresponding to the scene is matched in the scene data table to suppress the loudspeaker sound; the scene is a jewelry shop, and the recording rule corresponding to the scene is matched in the scene data table to suppress the voice of a person not wearing the audio acquisition equipment and enhance the voice of the person wearing the audio acquisition equipment; the scene is a restaurant, and the recording rule corresponding to the scene is matched in the scene data table to suppress the sound of the person not wearing the audio acquisition equipment and enhance the sound of the person wearing the audio acquisition equipment.
Optionally, determining a recording rule corresponding to a scene of the first audio file includes: determining noise in a scene of a first audio file; and determining a recording rule corresponding to the scene of the first audio file according to the noise in the scene of the first audio file. Therefore, the noise in the scene can be determined according to the requirements of the user, the recording rule corresponding to the scene is customized, the noise reduction parameters obtained according to the recording rule can better meet the requirements of the wearer, and the use experience of the wearer is improved.
In some embodiments, the scene is a quiet scene, and if the noise in the quiet scene is determined to be male sound, the recording rule corresponding to the quiet scene is to suppress the male sound and enhance the female sound; the scene is a noisy scene, the noise in the noisy scene is determined to be the sound, the background sound and the system sound inside the audio acquisition equipment of a person who does not wear the audio acquisition equipment according to the requirement of a wearer, and then the recording rule corresponding to the noisy scene is to suppress the sound, the background sound and the system sound inside the audio acquisition equipment of the person who does not wear the audio acquisition equipment.
Optionally, determining a noise reduction parameter corresponding to the recording rule includes: and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule. Therefore, the noise reduction parameters corresponding to the recording rules are obtained by calculating according to the preset noise reduction parameter algorithm by utilizing the recording rules and the first audio files, so that the recording rules are matched with the noise reduction parameters, the noise reduction parameters correspond to scenes due to the fact that the recording rules correspond to the scenes, then the noise reduction parameters are sent to the audio acquisition equipment, the audio acquisition equipment is triggered to reduce the noise of the second audio files in the same scene by utilizing the noise reduction parameters corresponding to the scenes, the purpose of reducing the noise specifically according to the scenes corresponding to the audio files is achieved, and the noise reduction effect is improved. Meanwhile, the audio acquisition equipment utilizes the noise reduction parameters to perform noise reduction processing on the acquired second audio file, so that unnecessary audio, namely audio data which the wearer does not want to be recorded, is suppressed, and the privacy of the wearer is protected.
Optionally, calculating by using the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule, including: and inputting the recording rule and the first audio file into a preset CNN (Neural Network) algorithm for calculation to obtain a noise reduction parameter corresponding to the recording rule.
Optionally, calculating by using the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule, including: and inputting the recording rule and the first audio file into a preset RNN (recurrent neural network) algorithm for calculation to obtain a noise reduction parameter corresponding to the recording rule.
In some embodiments, and as shown in conjunction with FIG. 2, FIG. 2 is a timing diagram of a method for noise reduction of an audio file. The method comprises the following steps:
in step S201, the audio capture device obtains a first audio file.
Step S202, the audio acquisition device sends a first audio file to the audio transmission device.
Step S203, the audio transmission device receives the first audio file and sends the first audio file to the server.
Step S204, the server receives the first audio file and determines a scene corresponding to the first audio file.
In step S205, the server determines a recording rule corresponding to the scene.
In step S206, the server determines a noise reduction parameter corresponding to the recording rule.
Step S207, the server sends the noise reduction parameter to the audio transmission device.
And step S208, the audio transmission equipment receives the noise reduction parameters and sends the noise reduction parameters to the audio acquisition equipment.
Step S209, the audio capture device receives the noise reduction parameter and obtains a second audio file.
And step S210, the audio acquisition equipment performs noise reduction processing on the second audio file by using the noise reduction parameters.
In some embodiments, the audio capture device improves the quality of sound collection by adding an arrangement of microphone arrays and/or by adding sound sources by increasing the number of microphones before acquiring the first audio file.
Optionally, the audio acquiring device performs noise reduction processing on the second audio file by using the noise reduction parameter, including: and the audio acquisition equipment sets the noise reduction parameters in a preset noise reduction algorithm model to obtain a complete noise reduction algorithm, and the noise reduction algorithm is utilized to carry out noise reduction processing on the second audio file.
In some embodiments, in a supermarket scene, a salesperson wears an audio acquisition device, exhales air through a lung, then the air passes through a glottis and generates a periodic signal according to the opening and closing of the glottis, and the periodic signal modulates the periodic signal through the change of the shape of a sound channel to obtain a voice signal, so as to make a sound. The voice emitted by the salesperson is collected by the audio collecting device to generate a second audio file. The audio acquisition equipment utilizes the noise reduction parameters corresponding to the supermarket scene sent by the server to set the noise reduction parameters in an internal noise reduction algorithm model to obtain a complete noise reduction algorithm, and utilizes the noise reduction algorithm to perform noise reduction processing on the acquired second audio file, so that the noise reduction effect is improved.
The audio acquisition equipment uploads a first audio file to the server through the audio transmission equipment, the server determines a scene corresponding to the first audio file, the server determines a recording rule corresponding to the scene, and determines noise reduction parameters under the recording rule, so that the noise reduction parameters can correspond to the scene; the server sends the noise reduction parameters to the audio acquisition equipment, the audio acquisition equipment utilizes the received noise reduction parameters corresponding to the scene to reduce noise of the second audio file in the same scene, therefore, even if the scene is changed, the audio acquisition equipment can also reacquire the first audio file in a new scene and upload the first audio file to the server, the server acquires the new noise reduction parameters corresponding to the scene according to the scene of the new first audio file and sends the new noise reduction parameters to the audio acquisition equipment, the audio acquisition equipment utilizes the new noise reduction parameters to reduce noise of the second audio file in the new scene, the audio acquisition equipment does not need to be replaced again, and the utilization rate of the audio acquisition equipment is improved.
Optionally, the noise reduction parameter algorithm stored by the server and the noise reduction algorithm obtained by the audio acquisition device are the same type of algorithm.
Referring to fig. 3, an apparatus for denoising an audio file according to an embodiment of the present disclosure includes a receiving module 1, a first determining module 2, a second determining module 3, a third determining module 4, and a transmitting module 5. The receiving module 1 is configured to receive a first audio file; the first determining module 2 is configured to determine a scene corresponding to the first audio file; the second determining module 3 is configured to determine a recording rule corresponding to the scene; the third determining module 4 is configured to determine a noise reduction parameter corresponding to the recording rule; the sending module 5 is configured to send the noise reduction parameters to the audio acquisition device, and trigger the audio acquisition device to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
By adopting the device for reducing the noise of the audio file, which is provided by the embodiment of the disclosure, the recording rule corresponding to the scene can be determined by determining the scene corresponding to the first audio file, the noise reduction parameter under the recording rule is determined, so that the noise reduction parameter can correspond to the scene, and the audio acquisition equipment is triggered to carry out noise reduction processing on the second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
Optionally, the second determining module is configured to determine the recording rule corresponding to the scene by: matching a recording rule corresponding to a scene in a preset scene data table; the scene data table stores the corresponding relationship between the scene and the recording rule.
Optionally, the third determining module is configured to determine the noise reduction parameter corresponding to the recording rule by: and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule.
Optionally, the first determining module determines the scene corresponding to the first audio file by: and determining a scene corresponding to the first audio file according to the keywords in the first audio file.
The device for denoising the audio file, provided by the embodiment of the disclosure, is applied to the technical field of intelligent equipment, and is characterized in that a first audio file uploaded by audio transmission equipment is received, a scene corresponding to the first audio file is determined, a recording rule corresponding to the scene is then determined, and denoising parameters under the recording rule are determined, so that the denoising parameters can correspond to the scene, then the denoising parameters are sent to audio acquisition equipment, the audio acquisition equipment is triggered to denoise a second audio file under the same scene by utilizing the denoising parameters corresponding to the scene, the denoising parameters correspond to the scene, and the denoising efficiency is improved. Meanwhile, the first audio file needs to go to a collection place manually for collection, and then the first audio file is uploaded to the device for reducing the noise of the audio file through the audio transmission equipment, so that man-machine cooperation is realized through interaction between a user and different equipment.
As shown in fig. 4, an embodiment of the present disclosure provides a server including a processor (processor)100 and a memory (memory) 101. Optionally, the server may also include a Communication Interface (Communication Interface)102 and a bus 103. The processor 100, the communication interface 102, and the memory 101 may communicate with each other via a bus 103. The communication interface 102 may be used for information transfer. The processor 100 may call logic instructions in the memory 101 to perform the method for audio file noise reduction of the above embodiment.
In addition, the logic instructions in the memory 101 may be implemented in the form of software functional units and stored in a computer readable storage medium when the logic instructions are sold or used as independent products.
By adopting the server provided by the embodiment of the disclosure, the recording rule corresponding to the scene can be determined by determining the scene corresponding to the first audio file, the noise reduction parameter under the recording rule is determined, so that the noise reduction parameter can correspond to the scene, and the audio acquisition device is triggered to perform noise reduction processing on the second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
The memory 101, which is a computer-readable storage medium, may be used for storing software programs, computer-executable programs, such as program instructions/modules corresponding to the methods in the embodiments of the present disclosure. The processor 100 executes functional applications and data processing, i.e., implements the method for noise reduction of audio files in the above-described embodiments, by executing program instructions/modules stored in the memory 101.
The memory 101 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal device, and the like. In addition, the memory 101 may include a high-speed random access memory, and may also include a nonvolatile memory.
Embodiments of the present disclosure provide a storage medium having stored thereon computer-executable instructions configured to perform the above-described method for noise reduction of an audio file.
Embodiments of the present disclosure provide a computer program product comprising a computer program stored on a computer readable storage medium, the computer program comprising program instructions which, when executed by a computer, cause the computer to perform the above-described method for noise reduction of an audio file.
The computer-readable storage medium described above may be a transitory computer-readable storage medium or a non-transitory computer-readable storage medium.
The technical solution of the embodiments of the present disclosure may be embodied in the form of a software product, where the computer software product is stored in a storage medium and includes one or more instructions to enable a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method of the embodiments of the present disclosure. And the aforementioned storage medium may be a non-transitory storage medium comprising: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes, and may also be a transient storage medium.
The above description and drawings sufficiently illustrate embodiments of the disclosure to enable those skilled in the art to practice them. Other embodiments may incorporate structural, logical, electrical, process, and other changes. The examples merely typify possible variations. Individual components and functions are optional unless explicitly required, and the sequence of operations may vary. Portions and features of some embodiments may be included in or substituted for those of others. Furthermore, the words used in the specification are words of description only and are not intended to limit the claims. As used in the description of the embodiments and the claims, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. Similarly, the term "and/or" as used in this application is meant to encompass any and all possible combinations of one or more of the associated listed. Furthermore, the terms "comprises" and/or "comprising," when used in this application, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. Without further limitation, an element defined by the phrase "comprising an …" does not exclude the presence of other like elements in a process, method or apparatus that comprises the element. In this document, each embodiment may be described with emphasis on differences from other embodiments, and the same and similar parts between the respective embodiments may be referred to each other. For methods, products, etc. of the embodiment disclosures, reference may be made to the description of the method section for relevance if it corresponds to the method section of the embodiment disclosure.
Those of skill in the art would appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software may depend upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments. It can be clearly understood by the skilled person that, for convenience and brevity of description, the specific working processes of the system, the apparatus and the unit described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the embodiments disclosed herein, the disclosed methods, products (including but not limited to devices, apparatuses, etc.) may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units may be merely a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form. The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to implement the present embodiment. In addition, functional units in the embodiments of the present disclosure may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. In the description corresponding to the flowcharts and block diagrams in the figures, operations or steps corresponding to different blocks may also occur in different orders than disclosed in the description, and sometimes there is no specific order between the different operations or steps. For example, two sequential operations or steps may in fact be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. Each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Claims (10)
1. A method for noise reduction of an audio file, comprising:
receiving a first audio file;
determining a scene corresponding to the first audio file;
determining a recording rule corresponding to the scene;
determining a noise reduction parameter corresponding to the recording rule;
sending the noise reduction parameters to audio acquisition equipment, and triggering the audio acquisition equipment to perform noise reduction processing on a second audio file by using the noise reduction parameters; and the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
2. The method of claim 1, wherein determining the recording rule corresponding to the scene comprises:
performing table look-up operation on the scene by using a preset scene data table to obtain a recording rule corresponding to the scene; the scene data table stores the corresponding relationship between the scene and the recording rule.
3. The method of claim 1, wherein determining the noise reduction parameter corresponding to the recording rule comprises:
and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule.
4. The method of claim 1, wherein determining the scene corresponding to the first audio file comprises:
and determining a scene corresponding to the first audio file according to the keywords in the first audio file.
5. An apparatus for noise reduction of an audio file, comprising:
a receiving module configured to receive a first audio file;
a first determining module configured to determine a scene corresponding to the first audio file;
the second determining module is configured to determine a recording rule corresponding to the scene;
a third determining module configured to determine a noise reduction parameter corresponding to the recording rule;
the sending module is configured to send the noise reduction parameters to audio acquisition equipment and trigger the audio acquisition equipment to perform noise reduction processing on a second audio file by using the noise reduction parameters; and the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
6. The apparatus of claim 5, wherein the second determining module is configured to determine the recording rule corresponding to the scene by:
performing table look-up operation on the scene by using a preset scene data table to obtain a recording rule corresponding to the scene; the scene data table stores the corresponding relationship between the scene and the recording rule.
7. The apparatus of claim 5, wherein the third determining module is configured to determine the noise reduction parameter corresponding to the recording rule by:
and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule.
8. The apparatus of claim 5, wherein the first determining module determines the scene corresponding to the first audio file by:
and determining a scene corresponding to the first audio file according to the keywords in the first audio file.
9. A server comprising a processor and a memory storing program instructions, characterized in that the processor is configured to execute the method for noise reduction of an audio file according to any of claims 1 to 4 when executing the program instructions.
10. A storage medium storing program instructions which, when executed, perform a method for noise reduction of an audio file as claimed in any one of claims 1 to 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110961787.8A CN113689900A (en) | 2021-08-20 | 2021-08-20 | Method and device for reducing noise of audio file, server and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110961787.8A CN113689900A (en) | 2021-08-20 | 2021-08-20 | Method and device for reducing noise of audio file, server and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113689900A true CN113689900A (en) | 2021-11-23 |
Family
ID=78581051
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110961787.8A Pending CN113689900A (en) | 2021-08-20 | 2021-08-20 | Method and device for reducing noise of audio file, server and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113689900A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109712628A (en) * | 2019-03-15 | 2019-05-03 | 哈尔滨理工大学 | A kind of voice de-noising method and audio recognition method based on RNN |
CN109817236A (en) * | 2019-02-01 | 2019-05-28 | 安克创新科技股份有限公司 | Audio defeat method, apparatus, electronic equipment and storage medium based on scene |
CN112216300A (en) * | 2020-09-25 | 2021-01-12 | 三一专用汽车有限责任公司 | Noise reduction method and device for sound in driving cab of mixer truck and mixer truck |
-
2021
- 2021-08-20 CN CN202110961787.8A patent/CN113689900A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109817236A (en) * | 2019-02-01 | 2019-05-28 | 安克创新科技股份有限公司 | Audio defeat method, apparatus, electronic equipment and storage medium based on scene |
CN109712628A (en) * | 2019-03-15 | 2019-05-03 | 哈尔滨理工大学 | A kind of voice de-noising method and audio recognition method based on RNN |
CN112216300A (en) * | 2020-09-25 | 2021-01-12 | 三一专用汽车有限责任公司 | Noise reduction method and device for sound in driving cab of mixer truck and mixer truck |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102379954B1 (en) | Image processing apparatus and method | |
US20210012777A1 (en) | Context acquiring method and device based on voice interaction | |
CN110853646B (en) | Conference speaking role distinguishing method, device, equipment and readable storage medium | |
JP6469252B2 (en) | Account addition method, terminal, server, and computer storage medium | |
CN105096937A (en) | Voice data processing method and terminal | |
CN104753766A (en) | Expression sending method and device | |
CN105100366A (en) | Method, device and system for confirming harassment telephone number | |
CN104408402A (en) | Face identification method and apparatus | |
CN104021350A (en) | Privacy-information hiding method and device | |
CN111091835B (en) | Model training method, voiceprint recognition method, system, device and medium | |
KR20100062207A (en) | Method and apparatus for providing animation effect on video telephony call | |
CN104021398A (en) | Wearable intelligent device and method for assisting identity recognition | |
CN110321863A (en) | Age recognition methods and device, storage medium | |
CN110400566B (en) | Identification method and electronic equipment | |
CN110992502A (en) | Face recognition attendance checking method and system and readable storage medium | |
JP2022168015A (en) | Responding robot, responding method and program | |
CN110223696A (en) | A kind of acquisition method of voice signal, device and terminal device | |
EP3510523A1 (en) | Person centric trait specific photo match ranking engine | |
WO2018107731A1 (en) | Data processing method and device, and robot | |
CN108364346B (en) | Method, apparatus and computer readable storage medium for constructing three-dimensional face model | |
CN113689900A (en) | Method and device for reducing noise of audio file, server and storage medium | |
CN112750518A (en) | Conversation (inquiry) method based on psychological behavior characteristic recognition | |
CN111210810A (en) | Model training method and device | |
CN104112130A (en) | Optical character recognition method and device | |
CN113282783A (en) | Method and device for information recommendation, electronic equipment and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |