CN113689900A - Method and device for reducing noise of audio file, server and storage medium - Google Patents

Method and device for reducing noise of audio file, server and storage medium Download PDF

Info

Publication number
CN113689900A
CN113689900A CN202110961787.8A CN202110961787A CN113689900A CN 113689900 A CN113689900 A CN 113689900A CN 202110961787 A CN202110961787 A CN 202110961787A CN 113689900 A CN113689900 A CN 113689900A
Authority
CN
China
Prior art keywords
scene
audio file
noise reduction
audio
recording rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110961787.8A
Other languages
Chinese (zh)
Inventor
李健
陈爽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Minglue Artificial Intelligence Group Co Ltd
Original Assignee
Shanghai Minglue Artificial Intelligence Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Minglue Artificial Intelligence Group Co Ltd filed Critical Shanghai Minglue Artificial Intelligence Group Co Ltd
Priority to CN202110961787.8A priority Critical patent/CN113689900A/en
Publication of CN113689900A publication Critical patent/CN113689900A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C7/00Arrangements for writing information into, or reading information out from, a digital store
    • G11C7/16Storage of analogue signals in digital stores using an arrangement comprising analogue/digital [A/D] converters, digital memories and digital/analogue [D/A] converters 
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)

Abstract

The application relates to the technical field of audio processing, and discloses a method for reducing noise of an audio file, which comprises the following steps: receiving a first audio file; determining a scene corresponding to a first audio file; determining a recording rule corresponding to a scene; determining a noise reduction parameter corresponding to the recording rule; sending the noise reduction parameters to the audio acquisition equipment, and triggering the audio acquisition equipment to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file. Therefore, the scene corresponding to the first audio file is determined, the recording rule corresponding to the scene is determined, the noise reduction parameter under the recording rule is determined, the noise reduction parameter corresponds to the scene, the audio acquisition equipment is triggered to reduce the noise of the second audio file under the same scene by using the noise reduction parameter corresponding to the scene, the pertinence of the noise reduction parameter is improved, and the noise reduction effect is improved. The application also discloses a device for reducing noise of the audio file, a server and a storage medium.

Description

Method and device for reducing noise of audio file, server and storage medium
Technical Field
The present application relates to the field of audio processing technologies, and for example, to a method and an apparatus for noise reduction of an audio file, a server, and a storage medium.
Background
At present, many companies require employees to record in work by using intelligent voice cards, so that the working state of the employees can be known through audio files or the preference of users can be counted. Under the condition of recording by using the intelligent voice work card, the voice of a plurality of people and various environmental noises often exist in the recording environment, and in order to reduce calculation, noise reduction processing needs to be carried out on an audio file to eliminate unnecessary voice.
In the process of implementing the embodiments of the present disclosure, it is found that at least the following problems exist in the related art: the audio acquisition equipment in the prior art adopts fixed noise reduction parameters to reduce noise of an audio file, and influence of a scene on the noise reduction parameters is not considered, so that the noise reduction effect is poor.
Disclosure of Invention
The following presents a simplified summary in order to provide a basic understanding of some aspects of the disclosed embodiments. This summary is not an extensive overview nor is intended to identify key/critical elements or to delineate the scope of such embodiments but rather as a prelude to the more detailed description that is presented later.
The embodiment of the disclosure provides a method and a device for noise reduction of an audio file, a server and a storage medium, so as to improve the noise reduction effect.
In some embodiments, the method comprises: receiving a first audio file; determining a scene corresponding to a first audio file; determining a recording rule corresponding to a scene; determining a noise reduction parameter corresponding to the recording rule; sending the noise reduction parameters to the audio acquisition equipment, and triggering the audio acquisition equipment to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
In some embodiments, the apparatus comprises: a receiving module configured to receive a first audio file; the first determining module is configured to determine a scene corresponding to the first audio file; the second determining module is configured to determine a recording rule corresponding to the scene; a third determining module configured to determine a noise reduction parameter under the recording rule; the sending module is configured to send the noise reduction parameters to the audio acquisition equipment and trigger the audio acquisition equipment to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
In some embodiments, the server comprises: a processor and a memory storing program instructions, the processor being configured, upon execution of the program instructions, to perform the method for noise reduction of an audio file as described above.
In some embodiments, the storage medium stores program instructions that, when executed, perform the above-described method for noise reduction of an audio file.
The method and the device for reducing the noise of the audio file, the server and the storage medium provided by the embodiment of the disclosure can realize the following technical effects: the method comprises the steps of determining a scene corresponding to a first audio file, determining a recording rule corresponding to the scene, determining a noise reduction parameter under the recording rule, enabling the noise reduction parameter to correspond to the scene, and triggering an audio acquisition device to perform noise reduction processing on a second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
The foregoing general description and the following description are exemplary and explanatory only and are not restrictive of the application.
Drawings
One or more embodiments are illustrated by way of example in the accompanying drawings, which correspond to the accompanying drawings and not in limitation thereof, in which elements having the same reference numeral designations are shown as like elements and not in limitation thereof, and wherein:
FIG. 1 is a schematic diagram of a method for noise reduction of an audio file according to an embodiment of the present disclosure;
FIG. 2 is a timing diagram of a method for noise reduction of an audio file provided by an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of an apparatus for noise reduction of audio files according to an embodiment of the present disclosure;
fig. 4 is a schematic diagram of a server of an embodiment of the present disclosure.
Detailed Description
So that the manner in which the features and elements of the disclosed embodiments can be understood in detail, a more particular description of the disclosed embodiments, briefly summarized above, may be had by reference to the embodiments, some of which are illustrated in the appended drawings. In the following description of the technology, for purposes of explanation, numerous details are set forth in order to provide a thorough understanding of the disclosed embodiments. However, one or more embodiments may be practiced without these details. In other instances, well-known structures and devices may be shown in simplified form in order to simplify the drawing.
The terms "first," "second," and the like in the description and in the claims, and the above-described drawings of embodiments of the present disclosure, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the present disclosure described herein may be made. Furthermore, the terms "comprising" and "having," as well as any variations thereof, are intended to cover non-exclusive inclusions.
The term "plurality" means two or more unless otherwise specified.
In the embodiment of the present disclosure, the character "/" indicates that the preceding and following objects are in an or relationship. For example, A/B represents: a or B.
The term "and/or" is an associative relationship that describes objects, meaning that three relationships may exist. For example, a and/or B, represents: a or B, or A and B.
The term "correspond" may refer to an association or binding relationship, and a corresponds to B refers to an association or binding relationship between a and B.
With reference to fig. 1, an embodiment of the present disclosure provides a method for noise reduction of an audio file, including:
in step S101, the server receives a first audio file.
Step S102, the server determines a scene corresponding to the first audio file.
Step S103, the server determines a recording rule corresponding to the scene of the first audio file.
And step S104, the server determines the noise reduction parameters corresponding to the recording rules.
Step S105, the server sends the noise reduction parameters to the audio acquisition equipment, and the audio acquisition equipment is triggered to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
By adopting the method for reducing the noise of the audio file provided by the embodiment of the disclosure, the recording rule corresponding to the scene can be determined by determining the scene corresponding to the first audio file, the noise reduction parameter under the recording rule is determined, so that the noise reduction parameter can correspond to the scene, and the audio acquisition device is triggered to carry out noise reduction processing on the second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
Optionally, the server receives the first audio file via an audio transmission device. Optionally, the audio transmission device is configured to receive a first audio file sent by the audio acquisition device, and send the first audio file to the server. Optionally, the audio transmission device is further configured to store and process the first audio file. Optionally, the audio capturing device is configured to capture a first audio file and send the first audio file to the audio transmitting device.
Optionally, the audio acquisition device is an intelligent voice card; the intelligent voice work board is provided with a microphone array, and audio files are collected through the microphone array; the intelligent voice work card stores a noise reduction algorithm to reduce noise of the audio file. In some embodiments, the audio capture mode of the audio capture device is controlled primarily by the opening/closing of mechanical keys.
In some embodiments, a plurality of audio capture devices are connected to one audio transmission device, i.e., the plurality of audio capture devices communicate with the server through the same audio transmission device. Or, one audio acquisition device corresponds to one audio transmission device, that is, different audio acquisition devices communicate with the server through different audio transmission devices.
Optionally, the audio transmission device is an edge end device. Optionally, the edge device includes an acquisition station or an intelligent terminal. Optionally, the smart terminal includes a terminal device such as a smart phone, a tablet or a computer, which can communicate with the server. Optionally, the server is a cloud server.
In some embodiments, the edge device can perform low-power consumption intelligent operation and establish communication connection with the cloud server and the audio acquisition device; the edge terminal equipment receives a first audio file sent by the audio acquisition equipment and uploads the first audio file to the cloud server.
In some embodiments, the audio transmission device is a collection station, the audio collection device establishes a connection with the collection station through a USB port, and the audio collection device sends the first audio file to the audio transmission device using the established connection.
In some embodiments, the audio transmission device is an intelligent terminal, the audio acquisition device establishes a connection with the acquisition station through a bluetooth or USB port, and the audio acquisition device sends the first audio file to the audio transmission device by using the established connection.
Optionally, sending the noise reduction parameters to the audio acquisition device includes: and sending the noise reduction parameters to audio acquisition equipment through audio transmission equipment. Like this, audio acquisition equipment can obtain through server and audio transmission equipment and fall the parameter of making an uproar, and it needs the manual work to return the mill to obtain the parameter of making an uproar to go to the collection place and utilize audio acquisition equipment to record after the audio file to compare among the prior art, and this application passes through server and audio transmission equipment and obtains the parameter of making an uproar, does not need the manual work to come and go between collection place and mill, has reduced human cost and time cost, has improved the efficiency of sending the parameter of making an uproar.
Optionally, determining a scene corresponding to the first audio file includes: and determining a scene corresponding to the first audio file according to the keywords in the first audio file. Therefore, the scene corresponding to the first audio file can be determined more accurately through the keywords.
Optionally, determining a scene corresponding to the first audio file according to the keyword in the first audio file includes: acquiring a keyword in a first audio file; performing table look-up operation on the keywords in the first audio file by using a preset keyword database to obtain a scene corresponding to the keywords in the first audio file; the keyword database stores the corresponding relation between scenes and keywords; and determining the scene corresponding to the keyword in the first audio file as the scene corresponding to the first audio file.
In some embodiments, the keywords in the first audio file include: sales promotion, discount, price and commodity, and finding out that the scene corresponding to the keyword is a supermarket in the keyword database; the keywords in the first audio file include: jewelry, diamonds and silver, and finding out that the scene corresponding to the keyword is a jewelry shop in a keyword database; the keywords in the first audio file include: rice, meat and vegetables, and finding out the scene corresponding to the keyword from the keyword database as a restaurant.
Optionally, determining a scene corresponding to the first audio file includes: converting the first audio file into an audio signal model corresponding to the first audio file; performing table look-up operation on the audio signal model by using a preset audio signal model database to obtain a scene corresponding to the audio signal model; the audio signal model database stores the corresponding relation between the scene and the audio signal model; and determining the scene corresponding to the audio signal model as the scene corresponding to the first audio file.
Optionally, an audio signal model is used to characterize the noisiness of the scene. In some embodiments, if the number of the dialog persons in the first audio file is 10, the audio signal model corresponding to the first audio file is a noisy audio signal model, and a scene corresponding to the noisy audio signal model is found in the audio signal model database as a noisy scene; if the number of the dialog characters in the first audio file is 2, the audio signal model corresponding to the first audio file is a quiet audio signal model, and a scene corresponding to the quiet audio signal model is found out in the audio signal model database to be a quiet scene.
Optionally, determining a recording rule corresponding to a scene of the first audio file includes: performing table look-up operation on a scene corresponding to the first audio file by using a preset scene data table to obtain a recording rule corresponding to the scene; the scene data table stores the corresponding relationship between the scene and the recording rule.
Optionally, the recording rule comprises determining a sound signal characteristic parameter. Optionally, the sound signal characteristic parameter comprises a sound to be emphasized and/or a sound to be suppressed.
In some embodiments, the scene is a supermarket, and the recording rule corresponding to the scene is matched in the scene data table to suppress the loudspeaker sound; the scene is a jewelry shop, and the recording rule corresponding to the scene is matched in the scene data table to suppress the voice of a person not wearing the audio acquisition equipment and enhance the voice of the person wearing the audio acquisition equipment; the scene is a restaurant, and the recording rule corresponding to the scene is matched in the scene data table to suppress the sound of the person not wearing the audio acquisition equipment and enhance the sound of the person wearing the audio acquisition equipment.
Optionally, determining a recording rule corresponding to a scene of the first audio file includes: determining noise in a scene of a first audio file; and determining a recording rule corresponding to the scene of the first audio file according to the noise in the scene of the first audio file. Therefore, the noise in the scene can be determined according to the requirements of the user, the recording rule corresponding to the scene is customized, the noise reduction parameters obtained according to the recording rule can better meet the requirements of the wearer, and the use experience of the wearer is improved.
In some embodiments, the scene is a quiet scene, and if the noise in the quiet scene is determined to be male sound, the recording rule corresponding to the quiet scene is to suppress the male sound and enhance the female sound; the scene is a noisy scene, the noise in the noisy scene is determined to be the sound, the background sound and the system sound inside the audio acquisition equipment of a person who does not wear the audio acquisition equipment according to the requirement of a wearer, and then the recording rule corresponding to the noisy scene is to suppress the sound, the background sound and the system sound inside the audio acquisition equipment of the person who does not wear the audio acquisition equipment.
Optionally, determining a noise reduction parameter corresponding to the recording rule includes: and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule. Therefore, the noise reduction parameters corresponding to the recording rules are obtained by calculating according to the preset noise reduction parameter algorithm by utilizing the recording rules and the first audio files, so that the recording rules are matched with the noise reduction parameters, the noise reduction parameters correspond to scenes due to the fact that the recording rules correspond to the scenes, then the noise reduction parameters are sent to the audio acquisition equipment, the audio acquisition equipment is triggered to reduce the noise of the second audio files in the same scene by utilizing the noise reduction parameters corresponding to the scenes, the purpose of reducing the noise specifically according to the scenes corresponding to the audio files is achieved, and the noise reduction effect is improved. Meanwhile, the audio acquisition equipment utilizes the noise reduction parameters to perform noise reduction processing on the acquired second audio file, so that unnecessary audio, namely audio data which the wearer does not want to be recorded, is suppressed, and the privacy of the wearer is protected.
Optionally, calculating by using the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule, including: and inputting the recording rule and the first audio file into a preset CNN (Neural Network) algorithm for calculation to obtain a noise reduction parameter corresponding to the recording rule.
Optionally, calculating by using the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule, including: and inputting the recording rule and the first audio file into a preset RNN (recurrent neural network) algorithm for calculation to obtain a noise reduction parameter corresponding to the recording rule.
In some embodiments, and as shown in conjunction with FIG. 2, FIG. 2 is a timing diagram of a method for noise reduction of an audio file. The method comprises the following steps:
in step S201, the audio capture device obtains a first audio file.
Step S202, the audio acquisition device sends a first audio file to the audio transmission device.
Step S203, the audio transmission device receives the first audio file and sends the first audio file to the server.
Step S204, the server receives the first audio file and determines a scene corresponding to the first audio file.
In step S205, the server determines a recording rule corresponding to the scene.
In step S206, the server determines a noise reduction parameter corresponding to the recording rule.
Step S207, the server sends the noise reduction parameter to the audio transmission device.
And step S208, the audio transmission equipment receives the noise reduction parameters and sends the noise reduction parameters to the audio acquisition equipment.
Step S209, the audio capture device receives the noise reduction parameter and obtains a second audio file.
And step S210, the audio acquisition equipment performs noise reduction processing on the second audio file by using the noise reduction parameters.
In some embodiments, the audio capture device improves the quality of sound collection by adding an arrangement of microphone arrays and/or by adding sound sources by increasing the number of microphones before acquiring the first audio file.
Optionally, the audio acquiring device performs noise reduction processing on the second audio file by using the noise reduction parameter, including: and the audio acquisition equipment sets the noise reduction parameters in a preset noise reduction algorithm model to obtain a complete noise reduction algorithm, and the noise reduction algorithm is utilized to carry out noise reduction processing on the second audio file.
In some embodiments, in a supermarket scene, a salesperson wears an audio acquisition device, exhales air through a lung, then the air passes through a glottis and generates a periodic signal according to the opening and closing of the glottis, and the periodic signal modulates the periodic signal through the change of the shape of a sound channel to obtain a voice signal, so as to make a sound. The voice emitted by the salesperson is collected by the audio collecting device to generate a second audio file. The audio acquisition equipment utilizes the noise reduction parameters corresponding to the supermarket scene sent by the server to set the noise reduction parameters in an internal noise reduction algorithm model to obtain a complete noise reduction algorithm, and utilizes the noise reduction algorithm to perform noise reduction processing on the acquired second audio file, so that the noise reduction effect is improved.
The audio acquisition equipment uploads a first audio file to the server through the audio transmission equipment, the server determines a scene corresponding to the first audio file, the server determines a recording rule corresponding to the scene, and determines noise reduction parameters under the recording rule, so that the noise reduction parameters can correspond to the scene; the server sends the noise reduction parameters to the audio acquisition equipment, the audio acquisition equipment utilizes the received noise reduction parameters corresponding to the scene to reduce noise of the second audio file in the same scene, therefore, even if the scene is changed, the audio acquisition equipment can also reacquire the first audio file in a new scene and upload the first audio file to the server, the server acquires the new noise reduction parameters corresponding to the scene according to the scene of the new first audio file and sends the new noise reduction parameters to the audio acquisition equipment, the audio acquisition equipment utilizes the new noise reduction parameters to reduce noise of the second audio file in the new scene, the audio acquisition equipment does not need to be replaced again, and the utilization rate of the audio acquisition equipment is improved.
Optionally, the noise reduction parameter algorithm stored by the server and the noise reduction algorithm obtained by the audio acquisition device are the same type of algorithm.
Referring to fig. 3, an apparatus for denoising an audio file according to an embodiment of the present disclosure includes a receiving module 1, a first determining module 2, a second determining module 3, a third determining module 4, and a transmitting module 5. The receiving module 1 is configured to receive a first audio file; the first determining module 2 is configured to determine a scene corresponding to the first audio file; the second determining module 3 is configured to determine a recording rule corresponding to the scene; the third determining module 4 is configured to determine a noise reduction parameter corresponding to the recording rule; the sending module 5 is configured to send the noise reduction parameters to the audio acquisition device, and trigger the audio acquisition device to perform noise reduction processing on the second audio file by using the noise reduction parameters; the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
By adopting the device for reducing the noise of the audio file, which is provided by the embodiment of the disclosure, the recording rule corresponding to the scene can be determined by determining the scene corresponding to the first audio file, the noise reduction parameter under the recording rule is determined, so that the noise reduction parameter can correspond to the scene, and the audio acquisition equipment is triggered to carry out noise reduction processing on the second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
Optionally, the second determining module is configured to determine the recording rule corresponding to the scene by: matching a recording rule corresponding to a scene in a preset scene data table; the scene data table stores the corresponding relationship between the scene and the recording rule.
Optionally, the third determining module is configured to determine the noise reduction parameter corresponding to the recording rule by: and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule.
Optionally, the first determining module determines the scene corresponding to the first audio file by: and determining a scene corresponding to the first audio file according to the keywords in the first audio file.
The device for denoising the audio file, provided by the embodiment of the disclosure, is applied to the technical field of intelligent equipment, and is characterized in that a first audio file uploaded by audio transmission equipment is received, a scene corresponding to the first audio file is determined, a recording rule corresponding to the scene is then determined, and denoising parameters under the recording rule are determined, so that the denoising parameters can correspond to the scene, then the denoising parameters are sent to audio acquisition equipment, the audio acquisition equipment is triggered to denoise a second audio file under the same scene by utilizing the denoising parameters corresponding to the scene, the denoising parameters correspond to the scene, and the denoising efficiency is improved. Meanwhile, the first audio file needs to go to a collection place manually for collection, and then the first audio file is uploaded to the device for reducing the noise of the audio file through the audio transmission equipment, so that man-machine cooperation is realized through interaction between a user and different equipment.
As shown in fig. 4, an embodiment of the present disclosure provides a server including a processor (processor)100 and a memory (memory) 101. Optionally, the server may also include a Communication Interface (Communication Interface)102 and a bus 103. The processor 100, the communication interface 102, and the memory 101 may communicate with each other via a bus 103. The communication interface 102 may be used for information transfer. The processor 100 may call logic instructions in the memory 101 to perform the method for audio file noise reduction of the above embodiment.
In addition, the logic instructions in the memory 101 may be implemented in the form of software functional units and stored in a computer readable storage medium when the logic instructions are sold or used as independent products.
By adopting the server provided by the embodiment of the disclosure, the recording rule corresponding to the scene can be determined by determining the scene corresponding to the first audio file, the noise reduction parameter under the recording rule is determined, so that the noise reduction parameter can correspond to the scene, and the audio acquisition device is triggered to perform noise reduction processing on the second audio file under the same scene by using the noise reduction parameter corresponding to the scene. Therefore, the audio file passing through the scene is subjected to noise reduction processing through the noise reduction parameters corresponding to the scene, the pertinence of the noise reduction parameters can be improved, and the noise reduction effect is improved.
The memory 101, which is a computer-readable storage medium, may be used for storing software programs, computer-executable programs, such as program instructions/modules corresponding to the methods in the embodiments of the present disclosure. The processor 100 executes functional applications and data processing, i.e., implements the method for noise reduction of audio files in the above-described embodiments, by executing program instructions/modules stored in the memory 101.
The memory 101 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal device, and the like. In addition, the memory 101 may include a high-speed random access memory, and may also include a nonvolatile memory.
Embodiments of the present disclosure provide a storage medium having stored thereon computer-executable instructions configured to perform the above-described method for noise reduction of an audio file.
Embodiments of the present disclosure provide a computer program product comprising a computer program stored on a computer readable storage medium, the computer program comprising program instructions which, when executed by a computer, cause the computer to perform the above-described method for noise reduction of an audio file.
The computer-readable storage medium described above may be a transitory computer-readable storage medium or a non-transitory computer-readable storage medium.
The technical solution of the embodiments of the present disclosure may be embodied in the form of a software product, where the computer software product is stored in a storage medium and includes one or more instructions to enable a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method of the embodiments of the present disclosure. And the aforementioned storage medium may be a non-transitory storage medium comprising: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes, and may also be a transient storage medium.
The above description and drawings sufficiently illustrate embodiments of the disclosure to enable those skilled in the art to practice them. Other embodiments may incorporate structural, logical, electrical, process, and other changes. The examples merely typify possible variations. Individual components and functions are optional unless explicitly required, and the sequence of operations may vary. Portions and features of some embodiments may be included in or substituted for those of others. Furthermore, the words used in the specification are words of description only and are not intended to limit the claims. As used in the description of the embodiments and the claims, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. Similarly, the term "and/or" as used in this application is meant to encompass any and all possible combinations of one or more of the associated listed. Furthermore, the terms "comprises" and/or "comprising," when used in this application, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. Without further limitation, an element defined by the phrase "comprising an …" does not exclude the presence of other like elements in a process, method or apparatus that comprises the element. In this document, each embodiment may be described with emphasis on differences from other embodiments, and the same and similar parts between the respective embodiments may be referred to each other. For methods, products, etc. of the embodiment disclosures, reference may be made to the description of the method section for relevance if it corresponds to the method section of the embodiment disclosure.
Those of skill in the art would appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software may depend upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments. It can be clearly understood by the skilled person that, for convenience and brevity of description, the specific working processes of the system, the apparatus and the unit described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the embodiments disclosed herein, the disclosed methods, products (including but not limited to devices, apparatuses, etc.) may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units may be merely a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form. The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to implement the present embodiment. In addition, functional units in the embodiments of the present disclosure may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. In the description corresponding to the flowcharts and block diagrams in the figures, operations or steps corresponding to different blocks may also occur in different orders than disclosed in the description, and sometimes there is no specific order between the different operations or steps. For example, two sequential operations or steps may in fact be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. Each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

Claims (10)

1. A method for noise reduction of an audio file, comprising:
receiving a first audio file;
determining a scene corresponding to the first audio file;
determining a recording rule corresponding to the scene;
determining a noise reduction parameter corresponding to the recording rule;
sending the noise reduction parameters to audio acquisition equipment, and triggering the audio acquisition equipment to perform noise reduction processing on a second audio file by using the noise reduction parameters; and the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
2. The method of claim 1, wherein determining the recording rule corresponding to the scene comprises:
performing table look-up operation on the scene by using a preset scene data table to obtain a recording rule corresponding to the scene; the scene data table stores the corresponding relationship between the scene and the recording rule.
3. The method of claim 1, wherein determining the noise reduction parameter corresponding to the recording rule comprises:
and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule.
4. The method of claim 1, wherein determining the scene corresponding to the first audio file comprises:
and determining a scene corresponding to the first audio file according to the keywords in the first audio file.
5. An apparatus for noise reduction of an audio file, comprising:
a receiving module configured to receive a first audio file;
a first determining module configured to determine a scene corresponding to the first audio file;
the second determining module is configured to determine a recording rule corresponding to the scene;
a third determining module configured to determine a noise reduction parameter corresponding to the recording rule;
the sending module is configured to send the noise reduction parameters to audio acquisition equipment and trigger the audio acquisition equipment to perform noise reduction processing on a second audio file by using the noise reduction parameters; and the scene corresponding to the first audio file is the same as the scene corresponding to the second audio file.
6. The apparatus of claim 5, wherein the second determining module is configured to determine the recording rule corresponding to the scene by:
performing table look-up operation on the scene by using a preset scene data table to obtain a recording rule corresponding to the scene; the scene data table stores the corresponding relationship between the scene and the recording rule.
7. The apparatus of claim 5, wherein the third determining module is configured to determine the noise reduction parameter corresponding to the recording rule by:
and calculating by utilizing the recording rule and the first audio file according to a preset noise reduction parameter algorithm to obtain a noise reduction parameter corresponding to the recording rule.
8. The apparatus of claim 5, wherein the first determining module determines the scene corresponding to the first audio file by:
and determining a scene corresponding to the first audio file according to the keywords in the first audio file.
9. A server comprising a processor and a memory storing program instructions, characterized in that the processor is configured to execute the method for noise reduction of an audio file according to any of claims 1 to 4 when executing the program instructions.
10. A storage medium storing program instructions which, when executed, perform a method for noise reduction of an audio file as claimed in any one of claims 1 to 4.
CN202110961787.8A 2021-08-20 2021-08-20 Method and device for reducing noise of audio file, server and storage medium Pending CN113689900A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110961787.8A CN113689900A (en) 2021-08-20 2021-08-20 Method and device for reducing noise of audio file, server and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110961787.8A CN113689900A (en) 2021-08-20 2021-08-20 Method and device for reducing noise of audio file, server and storage medium

Publications (1)

Publication Number Publication Date
CN113689900A true CN113689900A (en) 2021-11-23

Family

ID=78581051

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110961787.8A Pending CN113689900A (en) 2021-08-20 2021-08-20 Method and device for reducing noise of audio file, server and storage medium

Country Status (1)

Country Link
CN (1) CN113689900A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109712628A (en) * 2019-03-15 2019-05-03 哈尔滨理工大学 A kind of voice de-noising method and audio recognition method based on RNN
CN109817236A (en) * 2019-02-01 2019-05-28 安克创新科技股份有限公司 Audio defeat method, apparatus, electronic equipment and storage medium based on scene
CN112216300A (en) * 2020-09-25 2021-01-12 三一专用汽车有限责任公司 Noise reduction method and device for sound in driving cab of mixer truck and mixer truck

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109817236A (en) * 2019-02-01 2019-05-28 安克创新科技股份有限公司 Audio defeat method, apparatus, electronic equipment and storage medium based on scene
CN109712628A (en) * 2019-03-15 2019-05-03 哈尔滨理工大学 A kind of voice de-noising method and audio recognition method based on RNN
CN112216300A (en) * 2020-09-25 2021-01-12 三一专用汽车有限责任公司 Noise reduction method and device for sound in driving cab of mixer truck and mixer truck

Similar Documents

Publication Publication Date Title
KR102379954B1 (en) Image processing apparatus and method
US20210012777A1 (en) Context acquiring method and device based on voice interaction
CN110853646B (en) Conference speaking role distinguishing method, device, equipment and readable storage medium
JP6469252B2 (en) Account addition method, terminal, server, and computer storage medium
CN105096937A (en) Voice data processing method and terminal
CN104753766A (en) Expression sending method and device
CN105100366A (en) Method, device and system for confirming harassment telephone number
CN104408402A (en) Face identification method and apparatus
CN104021350A (en) Privacy-information hiding method and device
CN111091835B (en) Model training method, voiceprint recognition method, system, device and medium
KR20100062207A (en) Method and apparatus for providing animation effect on video telephony call
CN104021398A (en) Wearable intelligent device and method for assisting identity recognition
CN110321863A (en) Age recognition methods and device, storage medium
CN110400566B (en) Identification method and electronic equipment
CN110992502A (en) Face recognition attendance checking method and system and readable storage medium
JP2022168015A (en) Responding robot, responding method and program
CN110223696A (en) A kind of acquisition method of voice signal, device and terminal device
EP3510523A1 (en) Person centric trait specific photo match ranking engine
WO2018107731A1 (en) Data processing method and device, and robot
CN108364346B (en) Method, apparatus and computer readable storage medium for constructing three-dimensional face model
CN113689900A (en) Method and device for reducing noise of audio file, server and storage medium
CN112750518A (en) Conversation (inquiry) method based on psychological behavior characteristic recognition
CN111210810A (en) Model training method and device
CN104112130A (en) Optical character recognition method and device
CN113282783A (en) Method and device for information recommendation, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination