CN108198565B

CN108198565B - Mixing processing method, mixing processing device, computer equipment and storage medium

Info

Publication number: CN108198565B
Application number: CN201711458475.5A
Authority: CN
Inventors: 王艳芬; 黄维; 何斌
Original assignee: Shenzhen Tendzone Intelligent Technology Co ltd
Current assignee: Shenzhen Tendzone Intelligent Technology Co ltd
Priority date: 2017-12-28
Filing date: 2017-12-28
Publication date: 2020-11-17
Anticipated expiration: 2037-12-28
Also published as: CN108198565A

Abstract

The invention relates to a sound mixing processing method, a sound mixing processing device, a computer device and a storage medium, wherein the method comprises the following steps: acquiring a plurality of microphone identifications, and screening the plurality of microphone identifications according to a preset condition to obtain a plurality of microphone identifications for estimating environmental noise; acquiring a first audio signal and a weight corresponding to a plurality of microphone identifiers for estimating ambient noise; estimating the environmental noise of the current environment where the plurality of microphones are located by using the first audio signal and the weight; and when the ratio of the second audio signals of the plurality of microphones to the estimated ambient noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones. By adopting the method, the influence of the environmental noise on the sound mixing can be reduced.

Description

Mixing processing method, mixing processing device, computer equipment and storage medium

Technical Field

The present application relates to the field of computer technologies, and in particular, to a method and an apparatus for processing audio mixing, a computer device, and a storage medium.

Background

In daily life, there are cases where it is necessary to use a plurality of microphones simultaneously in a large conference room. In current conference system, mostly adopt the mode of exporting after carrying out the audio mixing with many microphones, however, after many microphones audio mixing export to same sound system, ambient noise can increase, and a large amount of ambient noise leads to the signal-to-noise ratio after the audio mixing low (the ratio of signal and noise), and is great to the sound source interference, can not be fine the requirement of satisfying the meeting. How to reduce the influence of the ambient noise on the audio mixing becomes a technical problem to be solved at present.

Disclosure of Invention

In view of the above, it is desirable to provide a mixing processing method, an apparatus, a computer device and a readable storage medium capable of reducing the influence of ambient noise on mixing.

A mixing processing method, comprising:

acquiring a plurality of microphone identifications, and screening the plurality of microphone identifications according to a preset condition to obtain a plurality of microphone identifications for estimating environmental noise;

acquiring first audio signals and weights corresponding to the plurality of microphone identifiers for estimating the ambient noise;

estimating the environmental noise of the current environment where the plurality of microphones are located by using the first audio signal and the weight;

and when the ratio of the second audio signals of the plurality of microphones to the estimated ambient noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, before the step of estimating the ambient noise of the current environment in which the plurality of microphones are located by using the processed first audio signal and the weight, the method further includes:

acquiring first audio signals corresponding to the multiple microphone identifiers for estimating the environmental noise, and smoothing amplitudes corresponding to the multiple first audio signals;

the step of estimating the ambient noise of the current environment in which the plurality of microphones are located using the first audio signal and the weights comprises:

and estimating the environmental noise of the current environment where the plurality of microphones are positioned by using the amplitude values and the corresponding weights after the plurality of first audio signals are subjected to smoothing processing.

In one embodiment, the method further comprises:

acquiring a correction coefficient corresponding to the current environment, and correcting the estimated environmental noise by using the correction coefficient;

and when the ratio of the second audio signals of the plurality of microphones to the corrected environmental noise is larger than a second threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, before the step of acquiring the plurality of microphone identifications, the method further includes:

selecting a plurality of microphone identifications and a plurality of environment identifications;

and acquiring a grouping instruction, and mapping the plurality of microphone identifications to the environments corresponding to the plurality of environment identifications respectively by using the grouping instruction.

In one embodiment, the method further comprises:

acquiring a plurality of device identifications, and generating a corresponding device group according to the device identifications, wherein the device group has an environment identification; the device group comprises a master device and a slave device;

respectively acquiring microphone identifications with the environment identifications in a plurality of devices according to the device identifications;

and when the microphones corresponding to the slave equipment generate second audio signals and the ratio of the amplitude of the second audio signals to the estimated environmental noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, the method further comprises:

acquiring parameter adjustment instructions corresponding to one or more microphone identifiers;

adjusting parameters corresponding to the microphone identification according to the parameter adjusting instruction;

mapping the adjusted microphone identification to other environments.

An apparatus for processing a mixed sound, the apparatus comprising:

the screening module is used for acquiring a plurality of microphone identifications, screening the plurality of microphone identifications according to a preset condition and obtaining a plurality of microphone identifications used for estimating the environmental noise;

the acquisition module is used for acquiring first audio signals and weights corresponding to the plurality of microphone identifiers for estimating the environmental noise;

the calculation module is used for estimating the environmental noise of the current environment where the microphones are positioned by using the first audio signal and the weight;

and the output module is used for outputting the second audio signals of the plurality of microphones when the ratio of the second audio signals of the plurality of microphones to the estimated environmental noise is larger than a first threshold value.

In one embodiment, the apparatus further comprises:

the smoothing module is used for acquiring first audio signals corresponding to a plurality of microphone identifiers for estimating environmental noise and smoothing amplitudes corresponding to the first audio signals;

the calculation module is further configured to estimate an ambient noise of a current environment in which the plurality of microphones are located by using the smoothed amplitudes of the plurality of first audio signals and the corresponding weights.

A computer device comprising a memory and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the mixing processing method provided in the above embodiments of the present invention when executing the program.

A computer-readable storage medium on which a computer program is stored, which when executed by a processor, implements the steps of the mixing processing method provided in the above-described embodiments of the present invention.

According to the sound mixing processing method, the sound mixing processing device, the computer equipment and the readable storage medium, the terminal screens the obtained multiple microphone identifications by using the preset conditions, and obtains the first audio signals and the weights corresponding to the multiple microphone identifications used for estimating the environmental noise, which are obtained after the screening. The terminal can process each first audio signal and estimate the environmental noise of the current environment by using the processed first audio signals and the corresponding weights. And when the amplitude of the first audio signals of the plurality of microphones in the current environment is larger than the estimated environmental noise, mixing the first audio signals of the plurality of microphones for outputting. Since the microphones are screened, the ambient noise with the ambient noise closer to the current actual noise is estimated. And estimating by using the first audio signal and the weight so as to obtain the environmental noise of the current environment where the plurality of microphones are positioned. The accuracy of the environmental noise estimation is effectively improved. Therefore, the influence of the environmental noise on the mixed sound is reduced, and the problem that the environmental noise has a large influence on the mixed sound is solved.

Drawings

Fig. 1 is a schematic application environment diagram of a mixing processing method according to an embodiment;

FIG. 2 is a flowchart illustrating a mixing method according to an embodiment;

FIG. 3 is a schematic diagram showing a configuration of a mixing processing apparatus according to an embodiment;

FIG. 4 is a block diagram of a computer device in one embodiment.

Detailed Description

The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

It will be understood that, as used herein, the terms "first," "second," and the like may be used herein to describe various elements, but these elements are not limited by these terms. These terms are only used to distinguish one element from another. For example, a first client may be referred to as a second client, and similarly, a second client may be referred to as a first client, without departing from the scope of the present invention. Both the first client and the second client are clients, but they are not the same client.

Fig. 1 is a schematic application environment diagram of a mixing processing method and apparatus according to an embodiment. As shown in fig. 1, the application environment includes a terminal 102 and a microphone 104, and the terminal 102 is connected to the microphone 104. The terminal 102 may be a desktop computer, a personal laptop computer, a tablet computer, a smart phone, a personal digital assistant, or the like. The number of microphones 104 may be multiple, and the terminal 102 may manage multiple microphones 104 in the device, control the microphones, and set parameters. The application scene of the sound mixing processing can be a multifunctional conference hall or a multifunctional banquet hall and the like.

Fig. 2 is a flow diagram of mixing processing according to an embodiment, which is performed at a terminal controlling a microphone. It should be understood that, although the steps in the flowchart of fig. 2 are shown in sequence as indicated by the arrows, the steps are not necessarily performed in sequence as indicated by the arrows. Moreover, at least a portion of the steps in fig. 2 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, in different orders, and may be performed alternately or at least partially with respect to other steps or sub-steps of other steps. Taking the application of the method to the terminal as an example for explanation, the method specifically comprises the following steps:

step S202, a plurality of microphone identifications are obtained, and the plurality of microphone identifications are screened according to preset conditions to obtain a plurality of microphone identifications used for estimating the environmental noise.

There may be multiple microphones in the same environment, and the multiple microphones may need to be mixed to output, and before output, the ambient noise in the current environment needs to be estimated. It will be appreciated that the microphone may also be other audio input devices. The preset condition may mean that the power to the microphone has been turned on but not involved in mixing. The method comprises the steps that a terminal acquires a plurality of microphone identifications in the same environment, the microphone identifications correspond to microphones one by one, the microphones corresponding to the acquired microphone identifications are screened, and the microphones which are not involved in sound mixing, namely the microphones corresponding to the microphone identifications without effective audio signals are selected when the power supply of the microphones is turned on. Since the first audio signals picked up by these microphones are closest to the ambient noise of the current environment, the first audio signals picked up by these microphones can be used to estimate the ambient noise.

In step S204, a plurality of microphone identifications for estimating the ambient noise are obtained corresponding to the first audio signal and the weight.

In step S206, the first audio signal and the weight are used to estimate the environmental noise of the current environment where the plurality of microphones are located.

The terminal respectively acquires a first audio signal of a microphone corresponding to each microphone identifier obtained according to preset conditions and the weight of the microphone corresponding to each microphone identifier. And the terminal performs primary processing on the acquired first audio signal to obtain an amplitude corresponding to the first audio signal. Specifically, RMS (Root Mean Square) processing may be performed on each of the first audio signals. Calculating the corresponding amplitude of the audio signal of the kth frame (a plurality of sampling points can be set) can adopt the following formula (1):

where y (k) is the calculated RMS amplitude and x (i) is the acquired first audio signal. And estimating the environmental noise of the current environment where the plurality of microphones are positioned by using the processed first audio signal and the weight. For example, a total number of the plurality of microphone identifiers used for estimating the ambient noise is 4, and a total number of the microphone identifiers used for estimating the ambient noise is 4, and after the amplitude values corresponding to the microphones corresponding to the 4 microphone identifiers in the k-th frame of the first audio signal are simultaneously calculated by the terminal, the ambient noise of the current environment is estimated by using a weighted summation method.

Further, in order to make the estimation of the ambient noise of the current environment more accurate, the terminal may further perform smoothing processing on y (k) calculated in formula (1) after calculating the amplitude corresponding to the first audio signal of the kth frame. Specifically, the terminal may perform smoothing processing using equation (2) mentioned below. Thereby reducing the possibility of data mutation of the current frame. Specifically, the weighting method may also be an average weighting method to estimate the ambient noise of the current environment.

In step S208, when the ratio of the second audio signals of the plurality of microphones to the estimated ambient noise is greater than the first threshold, the second audio signals of the plurality of microphones are output.

The terminal can accurately reflect the actual situation of the environmental noise of the current environment by utilizing the first audio signal and the weight to estimate the environmental noise of the current environment by the plurality of microphones. And when the ratio of the amplitude of the second audio signal of the microphone to the estimated environmental noise is larger than a first threshold value, outputting the second audio signal of the microphone corresponding to the microphone identifier. And when the microphones exist, the ratio of the amplitude of the second audio signals to the estimated environmental noise is larger than a first threshold value, and then the audio models of the microphones are subjected to sound mixing output.

Conventionally, the method of estimating the ambient noise of the current environment is usually a noise estimation using a single microphone or a conventional noise estimation using each microphone. Noise estimation by using a single microphone may not be accurate enough, the risk is high, and the possibility of misoperation exists. The traditional noise estimation is adopted for each microphone, for example, a minimum tracking algorithm and a time recursive average algorithm are adopted, and the consumption of the internal memory is high and the feasibility is low when the environmental noise estimation is carried out in such a mode.

In this embodiment, the terminal screens the obtained multiple microphone identifiers by using a preset condition, and obtains first audio signals and weights corresponding to the multiple microphone identifiers used for estimating the environmental noise, which are obtained after the selection. The terminal can process each first audio signal and estimate the environmental noise of the current environment by using the processed first audio signals and the corresponding weights. And when the amplitude of the first audio signals of the plurality of microphones in the current environment is larger than the estimated environmental noise, mixing the first audio signals of the plurality of microphones for outputting. Since the microphones are screened, the ambient noise with the ambient noise closer to the current actual noise is estimated. And estimating by using the first audio signal and the weight so as to obtain the environmental noise of the current environment where the plurality of microphones are positioned. The accuracy of the environmental noise estimation is effectively improved. Therefore, the influence of the environmental noise on the mixed sound is reduced, and the problem that the environmental noise has a large influence on the mixed sound is solved.

In one embodiment, before the step of estimating the ambient noise of the current environment in which the plurality of microphones are located by using the processed first audio signal and the weight, the method further includes: acquiring first audio signals corresponding to a plurality of microphone identifiers for estimating environmental noise, and smoothing amplitudes corresponding to the plurality of first audio signals; the step of estimating the environmental noise of the current environment in which the plurality of microphones are located using the audio signals and the weights comprises: and estimating the environmental noise of the current environment where the plurality of microphones are positioned by using the amplitude values and the corresponding weights after the plurality of first audio signals are subjected to smoothing processing.

When the screened microphones are used for noise estimation of the current environment, the power supplies of the screened microphones are already turned on but do not participate in sound mixing, and the microphones may also collect unnecessary noise which affects the estimation of the environmental noise, for example, a certain microphone suddenly collects sharp cough sound in the environment. Specifically, the following formula can be adopted:

y(k)＝α*y(k)+(1-α)*y(k-1) (2)

wherein y (k) corresponds to the amplitude corresponding to the first audio signal of the k-th frame, y (k-1) corresponds to the amplitude corresponding to the first audio signal of the k-1-th frame, and α represents a smoothing factor. The smaller α, the smaller the magnitude of y (k) is in relation to the magnitude of the current frame, the larger α, and the larger the magnitude of y (k) is in relation to the magnitude of the current frame. For example, when k is 1, y (k-1) may be preliminarily set to 0.

In this embodiment, the terminal performs smoothing processing on the amplitudes corresponding to the plurality of first audio signals, and estimates the environmental noise of the current environment where the plurality of microphones are located by using the amplitudes and the corresponding weights after the smoothing processing of the plurality of first audio signals. And calculating the amplitude corresponding to the first audio signal of the current frame again by combining the result of the previous frame, so that when other noises influencing the estimation of the environmental noise exist, the first audio signal changes suddenly and influences the estimation of the environmental noise, the possibility of sudden change can be weakened, and the estimated environmental noise is more accurate. The influence of the environmental noise on the sound mixing is reduced, and the sound mixing effect can better meet the requirement on the audio quality of the microphone.

The correction coefficient is a value which is set in advance and used for correcting the environmental noise, the terminal obtains the correction coefficient corresponding to the current environment, and the environmental noise obtained through estimation is corrected by using the correction coefficient. The smaller the correction coefficient value, the closer the estimated value of the environmental noise to the current environment is to the real environmental noise. The setting of the correction factor is relatively flexible and can in most cases be set empirically, for example the value of the correction factor can be set to 6. And calculating by using the correction coefficient to obtain the corresponding corrected environmental noise. Specifically, the correction coefficient and the ambient noise may be summed to obtain the corrected ambient noise. And when the ratio of the second audio signals of the plurality of microphones to the corrected environmental noise is larger than a second threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, the step of obtaining the plurality of microphone identifications further comprises, before the step of obtaining the plurality of microphone identifications: selecting a plurality of microphone identifications and a plurality of environment identifications; and acquiring a grouping instruction, and mapping the plurality of microphone identifications to the environments corresponding to the plurality of environment identifications by using the grouping instruction.

The multiple microphones managed by the same device can be grouped, each group is equivalent to one environment, each environment has a corresponding environment identifier, namely, a conference can be carried out in each environment, and different multiple environments can be managed through the microphones of the terminal management device. Specifically, the number of environments may be set according to the requirements of the conference, and may be set through an interface at the front end. The microphone parameters, environmental parameters, etc. in the environment can be set and managed. The terminal can obtain a grouping instruction, and the grouping instruction comprises a corresponding microphone identifier and an environment identifier. The terminal maps the microphone identification to the environment corresponding to the environment identification by using the grouping instruction. The mapping process is a process of grouping the microphones. The environment identification can be environment A, environment B, environment C etc., can each other noninterference between the different environment, relatively independent, and every environment estimates the ambient noise value in own environment alone, carries out the audio mixing in own environment separately, also makes the ambient noise who estimates like this more accurate, has reduced the influence of ambient noise to the audio mixing, has solved the great problem of ambient noise to the audio mixing influence.

And the terminal maps the microphones to the environment corresponding to the environment identification according to the grouping instruction, so that the plurality of microphones mapped to the same group can perform sound mixing. For example, the terminal may group microphones corresponding to different microphone identifiers according to the grouping instruction, and map the grouped microphones to corresponding environments for use. For example, 32 microphones are managed by the same device, 4 conferences that need to be performed simultaneously exist, the user can manage the microphones through an interface of the terminal, and the terminal allocates the microphones to corresponding environments respectively according to the obtained grouping instruction, for example, 8 microphones are mapped into the environment a, 12 microphones are mapped into the environment B, 6 microphones are mapped into the environment C, and 4 microphones are mapped into the environment D. The number of the microphones and the corresponding environment setting can be set and adjusted on the front-end interface by a user, and the environment identification can be changed and mapped again after the microphone grouping mapping.

In the embodiment, the terminal groups the microphones by acquiring the grouping instruction sent by the user, maps the microphone identifiers to the environments corresponding to the environment identifiers according to the grouping instruction, different environments can independently carry out illegal conferences, the microphones in different environments can independently carry out sound mixing, the flexibility of environment setting is increased, meanwhile, the grouping setting of the microphones is relatively flexible, the environments to which the microphones belong can be set according to actual needs, and more application scenes used by the microphones are increased by the method, so that the requirements of different scenes in life can be met better.

In one embodiment, the mixing processing method further includes: acquiring a plurality of equipment identifications, and generating a corresponding equipment group according to the equipment identifications, wherein the equipment group has an environment identification; the device group comprises a master device and a slave device; respectively acquiring microphone identifications with environment identifications in a plurality of devices according to the device identifications; and when the second audio signal is generated from the microphone corresponding to the equipment and the ratio of the amplitude of the second audio signal to the estimated environmental noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In general, during use, microphones of multiple devices are combined in one environment for use, such as a large conference site. For example, the device 1 manages 20 microphones, the device 2 manages 20 microphones, and a large conference site may need to use the 40 microphones managed by the two devices together for mixing. The method comprises the steps that a terminal acquires a plurality of equipment identifiers needing sound mixing in one environment, a corresponding equipment group is generated according to the equipment identifiers, microphones in the same equipment group are mapped to the same environment by the terminal, the equipment group comprises a main device and a slave device, and audio signals of the slave device can be transmitted to the main device through a channel to be output. The plurality of devices corresponding to the same environment identifier usually have only one device as a master device and the rest devices as slave devices. And the terminal respectively acquires the microphone identifications with the environment identifications in the multiple devices according to the device identifications and connects the multiple devices by using a connection protocol. The microphones with corresponding microphone identifications are the microphones needing to be mixed in the corresponding environment identifications. The terminal maps all the microphone identifications to an environment according to the environment identifications, then carries out estimation on the environment noise of the current environment to obtain a corresponding self-adaptive threshold value, the method for estimating the environment noise can adopt the method in the embodiment, the environment noise can be calculated in real time, the first audio signals of the microphones corresponding to the plurality of microphone identifications for estimating the environment noise are obtained in real time, and the environment noise of the current environment is estimated in real time. And when the microphone corresponding to the slave equipment generates a second audio signal and the ratio of the amplitude of the second audio signal to the estimated environmental noise is greater than a first threshold value, sending the second audio signal of the microphone of the slave equipment to the master equipment, and carrying out sound mixing output on the slave equipment and the microphone managed by the master equipment.

In the conventional technology, generally, microphones of a plurality of devices are used in one environment by uniformly managing all the microphones of different devices through Dante (a modern high-performance digital media transmission system operating on a standard IP network). However, because Dante is expensive, each microphone corresponds to a channel for transmission, and needs more channels for transmission, which is relatively expensive, and thus, the method is not suitable for the needs of large conferences.

In one embodiment, the second audio signals of the multiple microphones transmitted to the device by the slave device through the channel may also be sorted according to the priorities of the microphones corresponding to the microphone identifications, the specific priorities may be converted into corresponding weights, the slave device sorts the microphones by combining the amplitudes and weights of the second audio signals of different microphones of the slave device, and selects the second audio signals of the microphones corresponding to the microphones with the top-ranked microphone identifications to be transmitted to the master device for mixing and outputting. The method can reduce the use of channels and the accuracy of the selected microphone is higher.

In this embodiment, after the terminal acquires the multiple device identifiers, the terminal generates a corresponding device group with the environment identifier according to the device identifiers, the second audio signal of the slave device may be sent to the master device, and the microphone identifiers with the environment identifiers in the multiple devices are respectively acquired according to the device identifiers. The ambient noise of the current environment is estimated for the plurality of microphones using the first audio signal and the weights. And when the ratio of the audio signal in the second slave device to the estimated environmental noise is larger than a first threshold value, sending the second audio signals of the plurality of microphones to the master device through a transmission channel for mixed sound output. And only sending the second audio signal of the microphone with the valid audio signal to the master device by screening the second audio signal of the microphone corresponding to the microphone identification of the slave device. Under the condition of connecting a plurality of devices, the method reduces the required transmission channels and greatly reduces the cost.

In one embodiment, the terminal may further obtain a plurality of device identifiers, and generate a corresponding device group according to the device identifiers, where the device group includes a master device and a slave device; respectively acquiring microphone identifications of a plurality of devices according to the device identifications; and outputting the second audio signals of the plurality of microphones when the second audio signals are generated from the microphones corresponding to the equipment and the ratio of the amplitude of the second audio signals to the estimated environmental noise is larger than a second threshold value. The second threshold is

In one embodiment, different devices may also group different device microphones to form multiple different environments when performing online processing. For example, device 1 manages 20 microphones, device 2 manages 20 microphones, device 1 serves as a master device, and device 2 serves as a slave device. It is possible to manage that 10 microphones are mapped to environment a and 10 microphones are mapped to environment B by device 1, and 10 microphones are mapped to environment a and 10 microphones are mapped to environment B by device 2, respectively. The different meeting's of adaptation that like this can be more nimble requirement, in the environmental noise of independent calculation current environment in the environment of difference, can not each other not influence between the environment of difference, and then the environmental noise who obtains is more accurate, has reduced the influence of environmental noise to the audio mixing, has solved the environmental noise and has influenced great problem to the audio mixing.

In one embodiment, the mixing processing method further includes: acquiring parameter adjustment instructions corresponding to one or more microphone identifiers; adjusting parameters corresponding to the microphone identification according to the parameter adjusting instruction; mapping the adjusted microphone identification to other environments.

Parameters need to be set and adjusted in the sound mixing process, the parameters include parameters in three aspects, and the parameters of a single microphone, parameters of an environment, parameters of equipment after environment division and the like can be set and managed. The user can set and adjust the parameters through the interface of the terminal. Wherein the microphone parameters include: silence, volume adjustment, priority, etc. The microphone with the higher priority level is set, for example, the second audio signal of the microphone with the host authority can be directly mixed and output, and the priority level can better meet the requirement in practical application. The unified setting of some parameters can be carried out on the microphones belonging to the same environment by setting the environment parameters. The environmental parameters include the size of the output volume, the limit of the number of microphones, etc. Device parameters after the split context include output mute, reset, etc. The device parameters may control microphones belonging to a plurality of different environments managed by the unified device, for example, when the reset function is started, the microphone parameters in different environments of the whole device are reset. Of course, common parameters or independent parameters can be set between different environments, and the setting can be performed according to the needs of users.

In this embodiment, the terminal adjusts the parameters corresponding to the one or more microphone identifiers according to the parameter adjustment instruction corresponding to the one or more microphone identifiers, and maps the adjusted microphone identifiers to other environments. The terminal can adjust the microphone corresponding to the microphone identification by using the adjusted parameter value according to the adjusted parameter type, so that the terminal can realize free mapping of the microphone between different environments through the adjustment instruction, and the adjusted microphone can meet the requirements of users better.

In one embodiment, as shown in fig. 3, there is provided a mixing processing apparatus including: a screening module 302, an acquisition module 304, a calculation module 306, and an output module 308, wherein:

the system comprises an acquisition module, a processing module and a processing module, wherein the acquisition module is used for acquiring first audio signals and weights corresponding to a plurality of microphone identifiers for estimating ambient noise;

the computing module is used for estimating the environmental noise of the current environment where the microphones are located by utilizing the first audio signals and the weights;

and the output module is used for outputting the second audio signals of the microphones when the ratio of the second audio signals of the microphones to the estimated environmental noise is larger than a first threshold value.

In one embodiment, the mixing processing apparatus further includes: the smoothing module is used for acquiring first audio signals corresponding to a plurality of microphone identifiers for estimating environmental noise and smoothing amplitudes corresponding to the first audio signals; the calculation module is further configured to estimate an ambient noise of a current environment in which the plurality of microphones are located by using the smoothed amplitudes of the plurality of first audio signals and the corresponding weights.

In one embodiment, the obtaining module is further configured to obtain a correction coefficient corresponding to the current environment, and correct the estimated environmental noise by using the correction coefficient; the output module is further used for outputting the second audio signals of the plurality of microphones when the ratio of the second audio signals of the plurality of microphones to the corrected ambient noise is larger than a second threshold.

In one embodiment, the obtaining module is further configured to select a plurality of microphone identifications and a plurality of environment identifications; and the mapping module is used for acquiring the grouping instruction and mapping the plurality of microphone identifications to the environments corresponding to the plurality of environment identifications by using the grouping instruction.

In one embodiment, the obtaining module is further configured to obtain a plurality of device identifiers, and generate a corresponding device group according to the device identifiers, where the device group includes a master device and a slave device; respectively acquiring microphone identifications of a plurality of devices according to the device identifications; and the output module is used for outputting the second audio signals of the plurality of microphones when the second audio signals are generated from the microphones corresponding to the equipment and the ratio of the amplitude of the second audio signals to the estimated environmental noise is larger than a first threshold value.

In one embodiment, the obtaining module is further configured to obtain a parameter adjustment instruction corresponding to one or more microphone identifiers; the adjusting module is used for adjusting the parameters corresponding to the microphone identification according to the parameter adjusting instruction; and the mapping module is used for mapping the adjusted microphone identification to other environments.

In one embodiment, a computer device is provided, the computer device comprising a memory and a computer program stored on the memory and executable on a processor, the processor performing the steps of: acquiring a plurality of microphone identifications, and screening the plurality of microphone identifications according to a preset condition to obtain a plurality of microphone identifications for estimating environmental noise; acquiring a first audio signal and a weight corresponding to a plurality of microphone identifiers for estimating ambient noise; estimating the environmental noise of the current environment where the plurality of microphones are located by using the first audio signal and the weight; and when the ratio of the second audio signals of the plurality of microphones to the estimated ambient noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, before the step of estimating the ambient noise of the current environment in which the plurality of microphones are located by using the processed first audio signal and the weight, the processor further performs the following steps: acquiring first audio signals corresponding to a plurality of microphone identifiers for estimating environmental noise, and smoothing amplitudes corresponding to the plurality of first audio signals; the step of estimating the ambient noise of the current environment in which the plurality of microphones are located using the first audio signal and the weight includes: and estimating the environmental noise of the current environment where the plurality of microphones are positioned by using the amplitude values and the corresponding weights after the plurality of first audio signals are subjected to smoothing processing.

In one embodiment, the processor further performs the steps of: acquiring a correction coefficient corresponding to the current environment, and correcting the estimated environmental noise by using the correction coefficient; and when the ratio of the second audio signals of the plurality of microphones to the corrected environmental noise is larger than a second threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, the processor further performs the steps of: selecting a plurality of microphone identifications and a plurality of environment identifications; and acquiring a grouping instruction, and mapping the plurality of microphone identifications to the environments corresponding to the plurality of environment identifications by using the grouping instruction.

In one embodiment, the processor further performs the steps of: acquiring a plurality of equipment identifications, and generating a corresponding equipment group according to the equipment identifications, wherein the equipment group has an environment identification; the device group comprises a master device and a slave device; respectively acquiring microphone identifications with environment identifications in a plurality of devices according to the device identifications; and when the second audio signal is generated from the microphone corresponding to the equipment and the ratio of the amplitude of the second audio signal to the estimated environmental noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, the processor further performs the steps of: acquiring parameter adjustment instructions corresponding to one or more microphone identifiers; adjusting parameters corresponding to the microphone identification according to the parameter adjusting instruction; mapping the adjusted microphone identification to other environments.

In one embodiment, a computer-readable storage medium is proposed, on which a computer program is stored which, when being executed by a processor, carries out the steps of: acquiring a plurality of microphone identifications, and screening the plurality of microphone identifications according to a preset condition to obtain a plurality of microphone identifications for estimating environmental noise; acquiring a first audio signal and a weight corresponding to a plurality of microphone identifiers for estimating ambient noise; estimating the environmental noise of the current environment where the plurality of microphones are located by using the first audio signal and the weight; and when the ratio of the second audio signals of the plurality of microphones to the estimated ambient noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, before the step of estimating the ambient noise of the current environment in which the plurality of microphones are located using the processed first audio signal and the weight, the program when executed by the processor implements the steps of: acquiring first audio signals corresponding to a plurality of microphone identifiers for estimating environmental noise, and smoothing amplitudes corresponding to the plurality of first audio signals; the step of estimating the ambient noise of the current environment in which the plurality of microphones are located using the first audio signal and the weight includes: and estimating the environmental noise of the current environment where the plurality of microphones are positioned by using the amplitude values and the corresponding weights after the plurality of first audio signals are subjected to smoothing processing.

In one embodiment, the program when executed by a processor implements the steps of: selecting a plurality of microphone identifications and a plurality of environment identifications; and acquiring a grouping instruction, and mapping the plurality of microphone identifications to the environments corresponding to the plurality of environment identifications by using the grouping instruction.

In one embodiment, the step of obtaining the plurality of microphone identifications is preceded by the program when executed by the processor implementing the steps of: selecting a plurality of microphone identifications and a plurality of environment identifications; and acquiring a grouping instruction, and mapping the plurality of microphone identifications to the environments corresponding to the plurality of environment identifications by using the grouping instruction.

In one embodiment, the program when executed by a processor implements the steps of: acquiring a plurality of equipment identifications, and generating a corresponding equipment group according to the equipment identifications, wherein the equipment group has an environment identification; the device group comprises a master device and a slave device; respectively acquiring microphone identifications with environment identifications in a plurality of devices according to the device identifications; and when the second audio signal is generated from the microphone corresponding to the equipment and the ratio of the amplitude of the second audio signal to the estimated environmental noise is larger than a first threshold value, outputting the second audio signals of the plurality of microphones.

In one embodiment, the program when executed by a processor implements the steps of: acquiring parameter adjustment instructions corresponding to one or more microphone identifiers; adjusting parameters corresponding to the microphone identification according to the parameter adjusting instruction; mapping the adjusted microphone identification to other environments.

In one embodiment, a computer device is provided, as shown in fig. 4, comprising a processor, a non-volatile storage medium, an internal memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The nonvolatile storage medium of the computer device stores an operating system and a computer program, and the internal memory of the computer device provides an environment for the operating system and the computer program in the nonvolatile storage medium to run. The computer program is executed by a processor to implement a mixing processing method. The network interface of the terminal is used for communicating with an external network interface. The display screen of the terminal can be a touch screen, and the input device can be a touch layer covered on the display screen, and can also be a key, a track ball, a touch pad, an external keyboard, a touch pad or a mouse, and the like, which are set on the shell of the terminal. The computer equipment can be a computer, a mobile phone, a tablet computer and the like, and not only comprises a terminal, but also comprises a server. The structure shown in fig. 4 is a block diagram of only a part of the structure related to the scheme of the present invention, and does not constitute a limitation of the terminal to which the scheme of the present invention is applied, and a specific terminal may include more or less components than those shown in the figure, or combine some components, or have a different arrangement of components.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the computer program is executed. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), or the like. The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The above examples only show some embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims

1. A mixing processing method, comprising:

acquiring a plurality of microphone identifications, and selecting identifications corresponding to microphones without effective audio signals from the plurality of microphone identifications to obtain a plurality of microphone identifications for estimating environmental noise;

2. The method of claim 1, further comprising, before the step of estimating the ambient noise of the current environment in which the plurality of microphones are located using the processed first audio signal and the weights:

3. The method of claim 1, further comprising:

4. The method of claim 1, wherein the step of obtaining the plurality of microphone identifications is preceded by:

5. The method of claim 1, further comprising:

acquiring a plurality of equipment identifications, and generating a corresponding equipment group according to the equipment identifications, wherein the equipment group comprises a master device and a slave device;

respectively acquiring microphone identifications of a plurality of devices according to the device identifications;

6. The method of claim 1, further comprising:

mapping the adjusted microphone identification to other environments.

7. An apparatus for processing a mixed sound, the apparatus comprising:

the screening module is used for acquiring a plurality of microphone identifications, selecting an identification corresponding to a microphone without an effective audio signal from the plurality of microphone identifications to obtain a plurality of microphone identifications for estimating the environmental noise;

8. The apparatus of claim 7, further comprising:

the smoothing module is used for acquiring first audio signals corresponding to the multiple microphone identifiers for estimating the environmental noise and smoothing the amplitudes corresponding to the multiple first audio signals;

9. A computer device comprising a memory and a computer program stored on the memory and executable on a processor, characterized in that the processor implements the mixing processing method according to any one of claims 1 to 6 when executing the computer program.

10. A computer-readable storage medium on which a computer program is stored, the program, when executed by a processor, implementing the mixing processing method according to any one of claims 1 to 6.