WO2023070792A1 - Procédé et dispositif d'équilibrage de volume pour sonnette de porte conversationnelle, et support de stockage lisible - Google Patents

Procédé et dispositif d'équilibrage de volume pour sonnette de porte conversationnelle, et support de stockage lisible Download PDF

Info

Publication number
WO2023070792A1
WO2023070792A1 PCT/CN2021/132597 CN2021132597W WO2023070792A1 WO 2023070792 A1 WO2023070792 A1 WO 2023070792A1 CN 2021132597 W CN2021132597 W CN 2021132597W WO 2023070792 A1 WO2023070792 A1 WO 2023070792A1
Authority
WO
WIPO (PCT)
Prior art keywords
volume
volume value
audio signal
value
average
Prior art date
Application number
PCT/CN2021/132597
Other languages
English (en)
Chinese (zh)
Inventor
杨华泽
赵江涛
张文松
Original Assignee
歌尔科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 歌尔科技有限公司 filed Critical 歌尔科技有限公司
Publication of WO2023070792A1 publication Critical patent/WO2023070792A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • G10L21/034Automatic adjustment
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers
    • H03G3/20Automatic control
    • H03G3/30Automatic control in amplifiers having semiconductor devices
    • H03G3/3005Automatic control in amplifiers having semiconductor devices in amplifiers suitable for low-frequencies, e.g. audio amplifiers

Definitions

  • the present application relates to the technical field of audio processing, in particular to a volume equalization method, device and readable storage medium of a talking doorbell.
  • Talking doorbell is a doorbell product that can talk. Users make voice calls inside and outside the house through the microphone and speaker fixed on the doorbell. Usually, the volume value of the audio signal is increased uniformly with a fixed gain to ensure the transmission of the sound signal. However, due to It is impossible to limit the distance between the sound source and the microphone, the volume of the sound source, the processing of the sound source by the input side of the sound source, etc., so that we cannot guarantee the volume value of the input audio.
  • the volume of the input audio cannot be estimated, if the fixed gain is large , when the volume of the input audio itself is high, the volume may be too high, which will not only affect the normal hearing effect and cause noise, distortion, etc., but may also cause hardware loss and shorten the service life of the hardware device, but if the fixed gain is small , for the case where the volume of the input audio itself is low, the receiver may not be able to hear the specific information of the voice clearly, and cannot accurately convey the voice information, that is, the volume after the gain adjustment is unbalanced.
  • the main purpose of this application is to provide a volume equalization method, device and readable storage medium of a talking doorbell, aiming to solve the technical problem of unbalanced volume after gain adjustment in the prior art.
  • the present application provides a volume equalization method of a talking doorbell
  • the volume equalization method of the talking doorbell includes:
  • the initial volume value is adjusted according to the volume gain to obtain a first balanced volume value corresponding to the audio signal.
  • the present application also provides a volume equalization device for a talking doorbell, which includes:
  • An audio signal acquisition module configured to acquire an audio signal and an initial volume value of the audio signal
  • a sample audio signal acquisition module configured to acquire a sample audio signal and an average volume value of the sample audio signal from the audio signal
  • a volume interval determination module configured to determine the preset volume interval to which the average volume value belongs
  • a volume gain determination module configured to determine the volume gain corresponding to the average volume value according to the preset volume interval
  • a volume equalization module configured to adjust the initial volume value according to the volume gain to obtain a first equalized volume value corresponding to the audio signal.
  • the present application also provides a volume equalization device for a talking doorbell, which includes a processor and a volume equalization device for a talking doorbell that is stored in the processor and can run on the processor. program, wherein, when the volume equalization program of the talking doorbell is executed by the processor, the steps of the above-mentioned volume equalization method for the talking doorbell are realized.
  • the present application also provides a readable storage medium, on which a program for realizing the volume equalization method of the talking doorbell is stored, and the program for realizing the volume equalizing method of the talking doorbell is executed by a processor to realize the following: The steps of the volume equalization method of the above-mentioned talking doorbell.
  • the present application provides a volume equalization method, device and readable storage medium of a call-type doorbell.
  • the initial volume value of the sample audio signal and the average volume value of the sample audio signal are obtained from the audio signal to realize the determination of a representative sample audio signal, by determining the preset volume to which the average volume value belongs interval, the volume gain corresponding to the average volume value is determined according to the preset volume interval, and different volume gains are determined for sample audio signals belonging to different volume intervals, and the initial volume value is adjusted according to the volume gain , the first equalized volume value corresponding to the audio signal is obtained, and the volume gain determined by the average volume value of the sample audio signal is used to balance and adjust the initial volume value of the entire audio signal, and then the volume gain of the sample audio signal is realized.
  • the volume value determines the volume gain required for the volume value of the overall audio signal, overcomes the technical problem that the volume value of the output audio may be unbalanced due to the difference in the source of the fixed gain, and realizes the automatic volume of audio signals with different volume values balanced.
  • Fig. 1 is the schematic flow chart of an embodiment of the volume equalization method of the application's talking doorbell
  • Fig. 2 is the schematic flow chart of another embodiment of the volume equalization method of the talking doorbell of the present application
  • FIG. 3 is a schematic diagram of the device structure of the hardware operating environment involved in the volume equalization of the talking doorbell in the embodiment of the present application.
  • the volume equalization method of the talking doorbell is mainly used in the short conversation talking doorbell that uses the speaker to play and the microphone to receive.
  • the distance between the person and the microphone and the volume of the sound may be different, but the volume of the voice in the same conversation is usually relatively stable. Therefore, the volume value of the representative target audio signal represents the volume value of the entire audio, and according to the The volume value is used as the reference to adjust the gain of the entire audio volume value.
  • the implementation process is simple and fast, does not rely on hardware, does not require new accessories, devices, etc., and can effectively balance the volume values of different sizes before outputting.
  • the embodiment of the present application provides a volume equalization method for a talking doorbell.
  • the volume equalizing method for a talking doorbell includes:
  • Step S10 acquiring an audio signal and an initial volume value of the audio signal
  • sound is a wave generated by the vibration of an object, and this wave can be transformed into a time-continuous voltage wave, and the change law of this voltage wave and its corresponding original wave is Consistent, so the sound signal can be simulated by this voltage wave.
  • This simulated sound signal is the audio signal.
  • the audio signal is the information carrier of the voltage wave wavelength and intensity change.
  • the sound signal is simulated by the recording device It is converted into an electrical signal, that is, an audio signal, and then the audio signal is transmitted to the speaker, and then the speaker vibrates to generate sound, the audio signal can be converted into sound, and then the sound is transferred from one audio device to another.
  • the volume equalization method of the talking doorbell in this embodiment can be used to convert the voice signal received by the own microphone of the talking doorbell into an audio signal and then process it, and can also process other audio signals received by the own speaker of the talking doorbell.
  • the audio signal transmitted by the audio equipment is processed; the volume is also called sound intensity or loudness, which refers to the subjective feeling of the human ear on the strength of the sound heard.
  • the objective evaluation scale is the amplitude of the sound. The larger the amplitude, the stronger the intensity. Higher, the louder the sound, the greater the volume; the volume value is a value used to quantify the volume, which can be represented by sound pressure level, sound intensity level, loudness value or amplitude value, etc., this embodiment does not be restricted.
  • the audio signal is obtained, and the amplitude of each audio sample of the audio signal is obtained, and the amplitude is used as the initial volume value of the audio signal, or the amplitude is calculated and converted into a sound pressure level, a sound intensity level or a loudness level , the sound pressure level, sound intensity level or loudness level is used as the initial volume value of the audio signal.
  • the initial audio signal generated by the recording device can be processed.
  • the initial volume value is the volume value of the audio signal obtained from the signal transmission process, which can be the original volume value received by the recording device, or It is the volume value of the recording device after processing such as gain, filtering, modification or superposition, or the volume value after encoding and compression processing, or the volume value of the audio output device after processing such as gain, filtering, modification and/or superposition value, etc., which are not limited in this embodiment.
  • Step S20 acquiring a sample audio signal and an average volume value of the sample audio signal from the audio signal
  • the sample audio signal is obtained from the audio signal according to the preset number and time period of the sample audio signal, and the volume value of each sample data is obtained according to the sample data of the sample audio signal, and then Calculate the average volume value of all target audio signals, wherein the number and time period of the sample audio signal can be determined by experience, common sense and/or test results, etc., the number is at least one, and the length of the time period of each sample audio signal It can be the same or different. The earlier the start time of the time period, the earlier the receiver receives the volume gain, the shorter the delay during the call, the faster and timely the information transmission, and the specific time to obtain the sample audio signal.
  • the method can be to acquire a preset number of sample audio signals from a preset time period, or start to acquire a preset time length of sample audio signals from a certain moment, or start to acquire samples of the same or different preset time lengths from multiple times Audio signal (for example: acquire any three audio signals with a length of 1ms from 0ms to 5ms; 1s to 1s+1ms, 1s+1ms to 1s+4ms, 1s+6ms to 1s+8ms and 1s+9ms to 1s+ 11ms four-segment audio signal; start to acquire 8ms audio signal from the 2s, etc.).
  • Audio signal for example: acquire any three audio signals with a length of 1ms from 0ms to 5ms; 1s to 1s+1ms, 1s+1ms to 1s+4ms, 1s+6ms to 1s+8ms and 1s+9ms to 1s+ 11ms four-segment audio signal; start to
  • the overall volume value of the audio signal can be obtained from the samples effectively and quickly, so that the volume gain can be obtained quickly and accurately, and the speed and accuracy of volume equalization can be improved.
  • it can reduce the amount of computation and save costs, and is more suitable for the hardware conditions of a relatively small device such as a talking doorbell; compared to receiving the entire After a segment of audio signal, the volume interval is divided and the volume gain is adjusted according to the average volume value of the entire segment of the audio signal, which can determine the volume gain more quickly, thereby reducing the delay phenomenon, and is more suitable for communication products such as communication doorbells.
  • Requirements for immediacy of transmission compared to adjusting the volume values of all sizes to the same volume value, it can maintain the original volume difference of the audio, retain the dynamic characteristics of the sound, and is more suitable for talking doorbells
  • the owner can identify the visitor by voice.
  • the step of obtaining the sample audio signal and the average volume value of the sample audio signal from the audio signal it further includes:
  • Step A10 judging whether the average volume value belongs to the preset noise volume value range
  • the volume value range greater than the lower limit value or smaller than the upper limit value is used as the noise volume value range, If the average volume value is within any volume value range of the two volume value ranges, it belongs to the noise volume value range, wherein, it should be noted that the noise interferes with the expression of voice information in normal calls. Sounds include ambient sounds with low volume, interference sounds from irregular signals, and sounds that are too loud and endanger human health. Therefore, a lower upper limit can be set. If the volume is lower than this upper limit The value may cause inaudible specific voice information. At the same time, set a higher lower limit value. If the volume value is higher than this lower limit value, it may cause noise, distortion, etc.
  • the noise volume value range includes values below this upper limit.
  • the volume value range of the value and the volume value range higher than this lower limit (such as: less than 20 decibels or greater than or equal to 80 decibels)
  • the setting of the noise volume value range upper limit and lower limit value can be based on big data statistics, actual Tests, hardware loss, or user-defined data can be set.
  • Step A20 if the average volume value does not belong to the preset noise volume value range, perform the step of: determining the preset volume interval to which the average volume value belongs;
  • the average volume value does not belong to the preset noise volume value range, it means that the sample audio signal is not a noise signal and can be used to represent the audio signal, and then perform the next step: A preset volume interval to which the average volume value belongs is determined.
  • Step A30 if the average volume value belongs to the preset noise volume value range, then use the sample audio signal as a noise audio signal, and return to the step of: obtaining a sample audio signal from the audio signal and the sample audio signal the average volume value of .
  • the average volume value belongs to the preset noise volume value range, it means that the sample audio signal is a noise signal, and cannot be used to represent the audio signal, so it is necessary to return to the execution step: A sample audio signal and an average volume value of the sample audio signal are obtained from the audio signal.
  • the step of using the sample audio signal as a noise audio signal it further includes:
  • Step A31 obtaining the noise initial volume value of the noise audio signal
  • the initial noise volume value is the volume value obtained from the noise audio samples of the noise audio signal. It is easy to understand that a section of noise audio signal includes one or more noise audio samples, each Noise audio samples may have the same or different volume values.
  • Step A32 generating a first volume gain of the noise initial volume value
  • the adjusted volume value or volume value range (such as: the lower limit of the human hearing threshold, the range that does not affect other sound information accepted by the human ear, etc.)
  • the volume value or range of volume values can be set according to big data statistics, actual tests, hardware loss, or user-defined data. It is easy to understand that if the initial volume value of the noise is different, the first volume gain generated will be different. .
  • Step A33 adjusting the initial noise volume value according to the first volume gain to obtain a second balanced volume value corresponding to the noise audio signal.
  • each noise initial volume value is adjusted to a preset adjusted volume value or volume value range (such as: human hearing threshold The lower limit value of the noise audio signal, the range that does not affect the acceptance of other sound information by the human ear, etc.), to obtain the second equalized volume value corresponding to the noise audio signal.
  • a preset adjusted volume value or volume value range such as: human hearing threshold The lower limit value of the noise audio signal, the range that does not affect the acceptance of other sound information by the human ear, etc.
  • the audio signal to be processed is a noise signal, it has no practical meaning and will interfere with the expression of voice information in a normal conversation, so the initial volume values of noises of different sizes can be adjusted to the same volume value or volume Value range, in order to eliminate or reduce its interference to the expression of voice information in normal conversations.
  • the first sample audio signal of a preset time length is acquired, and the average volume value is calculated, when the average volume value belongs to the preset noise volume value range , reducing the ratio of the acquired initial noise volume value of the first sample audio signal to the lower limit of normal human hearing to the lower limit of normal human hearing, and Starting from a moment, the second sample audio signal with a preset time length is continuously intercepted, and the average volume value is calculated, and the noise volume value range is determined until the Nth sample audio signal whose average volume value is not within the preset noise volume value range is obtained , using the Nth sample audio signal as the final sample audio signal.
  • the range of the noise volume value by determining the range of the noise volume value, it is possible to eliminate the interference that the acquired sample audio signal is a noise signal, increase the representativeness of the sample audio signal, and reduce the influence of the noise signal on volume gain adjustment.
  • Step S30 determining the preset volume interval to which the average volume value belongs
  • the preset volume range is based on the division of volume values based on the human hearing range, comfortable hearing range, volume level, hardware loss and volume relationship, and/or distortion phenomena.
  • the obtained volume value range is used to formulate different targeted volume adjustment methods for different preset volume ranges, so that the volume can be balanced, for example: low volume range less than or equal to 40 decibels, medium volume greater than 40 decibels and less than 75 decibels interval, and a high volume interval greater than or equal to 75 decibels; an appropriate volume interval of greater than or equal to 40 decibels and less than or equal to 75 decibels, and an effective volume interval of less than 40 decibels or greater than 75 decibels, etc.
  • volume intervals and volume value ranges corresponding to each volume interval are preset, and the preset volume interval to which the average volume value belongs is determined according to the volume value range in which the average volume value is located.
  • the step of determining the preset volume interval to which the average volume value belongs includes:
  • Step B10 judging whether the average volume value is within a preset appropriate volume value range
  • the appropriate volume value range is a volume value range that can accurately convey the information expressed by the sound and is within the comfortable hearing range, which can be calculated according to big data statistics, actual Test, hardware loss, or user-defined data, etc., to pre-set the appropriate volume value range
  • the appropriate volume value range is a continuous volume value range, wherein the minimum volume value is the lower limit of the appropriate volume value range value, and the maximum volume value is the upper limit value of the appropriate volume value range.
  • a personalized suitable volume value range is generated according to the customized data
  • the personalized suitable volume value range is used as a preset suitable volume value range. Due to differences in the hearing of different people, the appropriate volume range for different users may be different, so you can obtain custom data set by the user according to your own situation, for example: by playing standard audio with different volume levels for the user to choose, and then obtain The volume of the standard audio selected by the user is taken as the custom data; after a section of standard audio is played according to the preset volume value, the user is allowed to adjust the volume, and then the volume finally adjusted by the user is used as the custom data; The upper limit value and the lower limit value (such as: decibel value, the percentage of the maximum volume value, etc.) of the personalized suitable volume value range input according to one's own needs, and the described personalized suitable volume value range is used as the self-defined data.
  • the upper limit value and the lower limit value such as: decibel value, the percentage of the maximum volume value, etc.
  • Step B20 if the average volume value is within the preset appropriate volume value range, then the average volume value belongs to the appropriate volume range;
  • Step B30 if the average volume value is not within the preset appropriate volume value range, then the average volume value belongs to the effective volume interval.
  • the average volume value is at If the appropriate volume value range is preset, it is judged that the average volume value belongs to the appropriate volume range; if the average volume value is less than or equal to the lower limit value of the appropriate volume value range, or greater than or equal to the lower limit value of the appropriate volume value range upper limit value, if the average volume value is not within the preset appropriate volume value range, then it is determined that the average volume value belongs to an effective volume interval, wherein the effective volume interval includes an upper limit greater than or equal to the appropriate volume value range.
  • the volume value range of the limit value and the volume value range less than or equal to the lower limit value of the appropriate volume value range are two discontinuous volume value ranges. It should be noted that the average volume value is equal to the appropriate volume value In the case of the upper limit or lower limit of the value range, it may be preset whether the average volume value belongs to or does not belong to the appropriate volume value range.
  • Step S40 determine the volume gain corresponding to the average volume value
  • the gain generally refers to the degree of increase in current, voltage or power of components, circuits, equipment or systems
  • the volume gain refers to the degree of increase in volume, including positive gain , negative gain and/or constant gain, etc.
  • the volume gain can be a numerical value, function or volume adjustment method, etc.
  • the volume gain corresponding to the average volume value is determined according to the preset volume interval to which the average volume value belongs and the mapping relationship between the preset volume interval and the volume gain.
  • Step S50 adjusting the initial volume value according to the volume gain to obtain a first balanced volume value corresponding to the audio signal.
  • the initial volume value is adjusted according to the volume gain to obtain the first balanced volume value after gain adjustment, and finally the uniform volume gain obtained by the average volume value of the sample audio signal is realized.
  • Adjusting all the initial volume values of the entire audio signal, compared to adjusting all initial volume values to the same volume value, can also maintain the original volume difference of the audio, retain the dynamic characteristics of the sound, and make the information transmission more acurrate.
  • the step of adjusting the initial volume value according to the volume gain to obtain a first balanced volume value corresponding to the audio signal includes:
  • Step C10 adjusting the initial volume value according to the volume gain to obtain an intermediate volume value
  • the initial volume value is adjusted according to the volume gain to obtain an intermediate volume value after gain adjustment.
  • Step C20 detecting whether the intermediate volume value belongs to the preset noise volume value range
  • the volume value range greater than the lower limit value or smaller than the upper limit value is used as the noise volume value range, If the average volume value is within any volume value range of the two volume value ranges, it belongs to the noise volume value range, wherein, it should be noted that the noise interferes with the expression of voice information in normal calls. Sounds include ambient sounds with low volume, interference sounds from irregular signals, and sounds that are too loud and endanger human health. Therefore, a lower upper limit can be set. If the volume is lower than this upper limit The value may cause inaudible specific voice information. At the same time, set a higher lower limit value.
  • the noise volume value range includes values below this upper limit.
  • the volume value range of the value and the volume value range higher than this lower limit (such as: less than 20 decibels or greater than or equal to 80 decibels, etc.), the setting of the noise volume value range upper limit and lower limit value can be based on big data statistics, Actual test, hardware loss, or user-defined data can be set.
  • Step C30 if the intermediate volume value belongs to the preset noise volume value range, generating a fourth volume gain for the intermediate volume value
  • the intermediate volume value belongs to the preset noise volume value range
  • the audio signal corresponding to the intermediate volume value is a noise signal
  • the adjusted volume value or volume value range is preset. (such as: the lower limit of the human hearing threshold, the range that does not affect the acceptance of other sound information by the human ear and/or the preset appropriate volume value range, etc.)
  • the ratio, difference or functional relationship of the value range (such as: upper limit value, lower limit value or intermediate value, etc.) is used as the fourth volume gain, wherein, the adjusted volume value or volume value range is based on big data statistics, The actual test, hardware loss, user-defined data or appropriate volume value range are pre-set.
  • the validity of the noise signal can be further judged. If the noise signal is invalid signal, the adjusted volume value or volume value range is pre-set according to big data statistics, actual tests, hardware loss, or user-defined data (such as: the lower limit of the human hearing threshold and/or does not affect other sounds information is accepted by the human ear, etc.), if the noise is an effective signal, the ratio, difference or functional relationship between the intermediate volume value and the preset appropriate volume value or the preset appropriate volume value range is used as the fourth Volume gain, wherein the validity judgment method includes voice recognition and preset effective volume value range, etc.
  • Step C40 adjusting the intermediate volume value according to the fourth volume gain to obtain a third balanced volume value
  • the intermediate volume value is adjusted to a preset adjusted volume value or volume value range (such as: the lower limit value of the human hearing threshold, which does not affect other The range that the sound information is accepted by the human ear and/or the preset appropriate volume value range, etc.) to obtain the third balanced volume value.
  • a preset adjusted volume value or volume value range such as: the lower limit value of the human hearing threshold, which does not affect other The range that the sound information is accepted by the human ear and/or the preset appropriate volume value range, etc.
  • Step D50 if the intermediate volume value does not belong to the preset noise volume value range, then use the intermediate volume value as the first equalized volume value.
  • the intermediate volume value does not belong to the preset noise volume value range, it means that the audio signal corresponding to the intermediate volume value is not a noise signal, and the intermediate volume value is taken as the first An equalization volume value.
  • the uniform volume gain obtained by the average volume value of the sample audio signal is used to adjust all the volume of the entire audio signal.
  • the initial volume value does not require volume gain calculation and special processing for the initial volume value of each frame, and the volume gain adjustment can be performed simply and quickly.
  • the adjusted volume value may not belong to the Special circumstances of the preset noise volume value range, for example, a sudden increase in sound due to shock during a call, a sudden honking of a nearby vehicle during a call, or an unrelated communication between the person on the call and other people around him during the call, etc. , so after the adjustment is completed, perform noise volume detection and noise volume adjustment again to ensure that the first equalized volume value after volume equalization is in a more appropriate volume range, improving the accuracy of voice information transmission and receiving voice information people's hearing comfort.
  • the present application provides a volume equalization method, device and readable storage medium of a call-type doorbell.
  • the initial volume value of the sample audio signal and the average volume value of the sample audio signal are obtained from the audio signal to realize the determination of a representative sample audio signal, by determining the preset volume to which the average volume value belongs interval, the volume gain corresponding to the average volume value is determined according to the preset volume interval, and different volume gains are determined for sample audio signals belonging to different volume intervals, and the initial volume value is adjusted according to the volume gain , the first equalized volume value corresponding to the audio signal is obtained, and the volume gain determined by the average volume value of the sample audio signal is used to balance and adjust the initial volume value of the entire audio signal, and then the volume gain of the sample audio signal is realized.
  • the volume value determines the volume gain required for the volume value of the overall audio signal, overcomes the technical problem that the volume value of the output audio may be unbalanced due to the difference in the source of the fixed gain, and realizes the automatic volume of audio signals with different volume values balanced.
  • the preset volume interval includes an effective volume interval and an appropriate volume interval
  • the appropriate volume range is a range of volume values that can more accurately convey the information expressed by the sound and is within the comfortable hearing range, which can be calculated according to large data statistics and actual tests. , hardware loss, or user-defined data to pre-set the appropriate volume value range, and the effective volume range is a volume value range outside the volume value range of the appropriate volume range.
  • the step of determining the volume gain corresponding to the average volume value according to the preset volume interval includes:
  • Step S41 if the preset volume range is the effective volume range, then determine a second volume gain corresponding to the average volume value according to the average volume value and the characteristic volume value belonging to the appropriate volume range;
  • the characteristic volume value is within the volume value range of the appropriate volume interval, and is used to represent certain characteristic numerical values of the appropriate volume interval, for example: the upper limit value , lower limit, median or third value, etc.
  • the preset volume interval includes an effective volume interval and an appropriate volume interval, and if the preset volume interval is the effective volume interval, according to the average volume value and the characteristic volume value of the appropriate volume interval Difference, ratio, sum value or functional relationship, etc., to determine the second volume gain corresponding to the average volume value, for example: the difference between the average volume value and the upper limit value of the appropriate volume range, the average volume value The ratio of the median value of the appropriate volume range or the half of the sum of the average volume value and the lower limit of the appropriate volume value, etc.
  • the step of determining the second volume gain corresponding to the average volume value according to the average volume value and the characteristic volume values belonging to the appropriate volume range includes:
  • Step S411 obtaining the median value of the appropriate volume range as the characteristic volume value
  • the median value of the appropriate volume range is a value obtained by dividing the sum of the upper limit value of the appropriate volume range plus the lower limit value of the appropriate volume range by 2.
  • the median value of the appropriate volume interval is calculated according to the upper limit value and the lower limit value of the appropriate volume interval, and the median value is used as the characteristic volume value, wherein, since the appropriate volume interval is previously Therefore, the median value of the appropriate volume interval can also be predetermined according to the preset volume value range of the appropriate volume interval.
  • Step S412 Determine a second volume gain corresponding to the average volume value according to the ratio of the average volume value to the median value.
  • the ratio of the average volume value to the median value is used as the multiple relationship between the volume after gain adjustment and the volume before gain adjustment, and the specific ratio obtained by converting the ratio or the ratio
  • the volume value is used as the second volume gain corresponding to the average volume value.
  • the median value of the appropriate volume range is used as the standard, which can ensure the equalized volume to the greatest extent.
  • the first balanced volume value is in an appropriate volume range, which improves the accuracy of voice information transmission and the auditory comfort of the person receiving the voice information.
  • Step S42 if the preset volume range is an appropriate volume range, then determine a third volume gain corresponding to the average volume value.
  • the preset volume interval is an appropriate volume interval, it means that the average volume value itself is an appropriate volume value, and there is no need to adjust the volume gain, then the average volume value is determined.
  • value corresponding to the third volume gain, the third volume gain is a constant gain, which is used to keep the volume value before and after the gain adjustment unchanged.
  • volume intervals through the setting of volume intervals, different gain adjustments are realized for volume values belonging to different volume intervals. It is not necessary to perform unnecessary gain adjustments for volume values that are appropriate in size.
  • the value is adaptively adjusted according to the difference between its volume value and the appropriate volume range, thereby solving the technical problem of unbalanced volume value of the output audio caused by the volume value of the fixed gain due to the difference in the volume value of the sound source, and realizing the control of all audio signals in the entire audio segment. Automatic equalization of the initial volume value.
  • An embodiment of the present application provides a volume equalization device for a talking doorbell, and the volume equalizing device for a talking doorbell includes:
  • An audio signal acquisition module configured to acquire an audio signal and an initial volume value of the audio signal
  • a sample audio signal acquisition module configured to acquire a sample audio signal and an average volume value of the sample audio signal from the audio signal
  • a volume interval determination module configured to determine the preset volume interval to which the average volume value belongs
  • a volume gain determination module configured to generate a volume gain corresponding to an average volume value according to the preset volume interval
  • a volume equalization module configured to adjust the initial volume value according to the volume gain to obtain a first equalized volume value corresponding to the audio signal.
  • sample audio signal acquisition module is also used for:
  • the average volume value belongs to the preset noise volume value range, then use the sample audio signal as a noise audio signal, and return to the execution step: obtain the sample audio signal and the average volume of the sample audio signal from the audio signal value.
  • sample audio signal acquisition module is also used for:
  • the initial noise volume value is adjusted according to the first volume gain to obtain a second balanced volume value corresponding to the noise audio signal.
  • the volume gain determination module is also used for:
  • the preset volume interval is an effective volume interval, determining a second volume gain corresponding to the average volume value according to the average volume value and the characteristic volume value belonging to the appropriate volume interval;
  • the preset volume range is an appropriate volume range, then determine a third volume gain corresponding to the average volume value.
  • the volume gain determination module is also used for:
  • a second volume gain corresponding to the average volume value is determined according to a ratio of the average volume value to the median value.
  • the volume range determination module is also used for:
  • the average volume value belongs to an appropriate volume range
  • the average volume value belongs to an effective volume interval.
  • volume equalization module is also used for:
  • the intermediate volume value does not belong to the preset noise volume value range, the intermediate volume value is used as the first balanced volume value.
  • An embodiment of the present application provides a volume equalization device for a talking doorbell.
  • the volume equalizing device for a talking doorbell includes a processor and a volume of the talking doorbell that is stored in the processor and can run on the processor.
  • An equalization program wherein, when the volume equalization program of the talking doorbell is executed by the processor, the method for volume equalization of the talking doorbell in the first embodiment is realized.
  • the volume equalization device of the talking doorbell of the present application may include: a processor 1001 such as a CPU, a speaker 1004 , a user interface 1003 , a memory 1005 , a microphone 1006 , and a communication bus 1002 .
  • the communication bus 1002 is used to realize connection and communication between these components.
  • the user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface.
  • the memory 1005 can be a high-speed RAM memory, or a stable memory (non-volatile memory), such as a disk memory.
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .
  • the speaker 1004 is used to play sound signals.
  • the microphone 1006 is used to receive signals from the speaker and voice signals triggered by the user.
  • the terminal may also include a camera, an RF (Radio Frequency, radio frequency) circuit, a sensor, an audio circuit, a WiFi module, and the like.
  • sensors such as light sensors, motion sensors and other sensors.
  • the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display screen according to the brightness of the ambient light, and the proximity sensor may turn off the display screen and/or backlight.
  • the gravitational acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes), and can detect the magnitude and direction of gravity when it is stationary, and can be used for applications that recognize the posture of mobile terminals (such as horizontal and vertical screen switching, Related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer, tap), etc.; of course, the mobile terminal can also be equipped with other sensors such as gyroscope, barometer, hygrometer, thermometer, infrared sensor, etc. No longer.
  • the volume equalization device of the talking doorbell provided by the present application adopts the volume equalization method of the talking doorbell in the first embodiment above, and solves the technical problem of unbalanced audio volume after gain adjustment in the prior art.
  • the beneficial effect of the volume equalization device of the talking doorbell provided by the embodiment of the present application is the same as that of the volume equalizing method of the talking doorbell provided in the first embodiment above, and the volume equalization of the talking doorbell
  • Other technical features in the device are the same as those disclosed by the method in the previous embodiment, and will not be repeated here.
  • terminal structure shown in FIG. 1 does not constitute a limitation on the terminal, and may include more or less components than those shown in the figure, or combine some components, or arrange different components.
  • This embodiment provides a computer-readable storage medium, which has computer-readable program instructions stored thereon, and the computer-readable program instructions are used to implement the volume equalization method for a talking doorbell in the first embodiment above.
  • the computer-readable storage medium provided in the embodiment of the present application may be, for example, a USB flash drive, but is not limited to an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, system, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer diskettes, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • RAM random access memory
  • ROM read-only memory
  • EPROM or flash memory erasable Programmable read-only memory
  • CD-ROM portable compact disk read-only memory
  • magnetic storage device magnetic storage device, or any suitable combination of the above.
  • the computer-readable storage medium may be any tangible medium containing or storing a program, and the program may be used by or in combination with an instruction execution system, system or device.
  • Program code embodied on a computer readable storage medium may be transmitted by any appropriate medium, including but not limited to: wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.
  • the above-mentioned computer-readable storage medium may be included in the electronic device, or may exist independently without being incorporated into the electronic device.
  • the above-mentioned computer-readable storage medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: acquires the audio signal and the initial volume value of the audio signal; from the audio signal Acquiring a sample audio signal and an average volume value of the sample audio signal; determining a preset volume interval to which the average volume value belongs; determining a volume gain corresponding to the average volume value according to the preset volume interval; according to the The volume gain adjusts the initial volume value to obtain a first balanced volume value corresponding to the audio signal.
  • Computer program code for carrying out the operations of the present disclosure can be written in one or more programming languages, or combinations thereof, including object-oriented programming languages—such as Java, Smalltalk, C++, and conventional Procedural Programming Language - such as "C" or a similar programming language.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as through an Internet service provider). Internet connection).
  • LAN local area network
  • WAN wide area network
  • Internet service provider such as AT&T, MCI, Sprint, EarthLink, MSN, GTE, etc.
  • each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
  • modules involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of the module does not constitute a limitation of the unit itself under certain circumstances.
  • the computer-readable storage medium provided by the present application stores computer-readable program instructions for implementing the volume equalization method of the above-mentioned talking doorbell, which solves the technical problem of unbalanced audio volume after gain adjustment in the prior art.
  • the beneficial effect of the computer-readable storage medium provided by the embodiment of the present application is the same as the beneficial effect of the volume equalization method for the talking doorbell provided by the first embodiment above, and will not be repeated here.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)

Abstract

L'invention concerne un procédé, un appareil et un dispositif d'équilibrage de volume pour une sonnette de porte conversationnelle, et un support de stockage lisible. Le procédé comprend les étapes consistant à : acquérir un signal audio et une valeur de volume initial du signal audio (S10) ; acquérir un signal audio d'échantillon et une valeur de volume moyenne du signal audio d'échantillon à partir du signal audio (S20) ; déterminer un intervalle de volume prédéfini auquel la valeur de volume moyen appartient (S30) ; déterminer, en fonction de l'intervalle de volume prédéfini, un gain de volume correspondant à la valeur de volume moyen (S40) ; et à ajuster la valeur de volume initiale en fonction du gain de volume pour obtenir une première valeur de volume équilibrée correspondant au signal audio (S50). Le procédé résout le problème technique de l'état de la technique de déséquilibre de volume après ajustement de gain.
PCT/CN2021/132597 2021-10-26 2021-11-24 Procédé et dispositif d'équilibrage de volume pour sonnette de porte conversationnelle, et support de stockage lisible WO2023070792A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111250855.6A CN113963716A (zh) 2021-10-26 2021-10-26 通话式门铃的音量均衡方法、装置、设备和可读存储介质
CN202111250855.6 2021-10-26

Publications (1)

Publication Number Publication Date
WO2023070792A1 true WO2023070792A1 (fr) 2023-05-04

Family

ID=79467337

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/132597 WO2023070792A1 (fr) 2021-10-26 2021-11-24 Procédé et dispositif d'équilibrage de volume pour sonnette de porte conversationnelle, et support de stockage lisible

Country Status (2)

Country Link
CN (1) CN113963716A (fr)
WO (1) WO2023070792A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114550393A (zh) * 2022-02-23 2022-05-27 潍坊歌尔电子有限公司 门铃控制方法、电子设备和可读存储介质
CN114708853A (zh) * 2022-03-18 2022-07-05 歌尔股份有限公司 交互系统
CN114679658A (zh) * 2022-03-23 2022-06-28 深圳市大十科技有限公司 一种音频装置的动态低音补偿系统及方法

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1885711A (zh) * 2005-06-24 2006-12-27 腾讯科技(深圳)有限公司 一种音量控制方法和系统
CN205541185U (zh) * 2016-04-28 2016-08-31 歌尔声学股份有限公司 一种自动调节音量的门铃
CN108986830A (zh) * 2018-08-28 2018-12-11 安徽淘云科技有限公司 一种音频语料筛选方法及装置
CN109217834A (zh) * 2018-10-19 2019-01-15 歌尔科技有限公司 增益调整方法、音频设备及可读存储介质
CN109996143A (zh) * 2019-03-07 2019-07-09 上海蔚来汽车有限公司 音量调节方法、装置、系统及音频播放设备和车辆
US10466959B1 (en) * 2018-03-20 2019-11-05 Amazon Technologies, Inc. Automatic volume leveler
CN111986696A (zh) * 2020-08-27 2020-11-24 湖南融视文化创意有限公司 一种高效处理歌曲音量均衡的方法

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1885711A (zh) * 2005-06-24 2006-12-27 腾讯科技(深圳)有限公司 一种音量控制方法和系统
CN205541185U (zh) * 2016-04-28 2016-08-31 歌尔声学股份有限公司 一种自动调节音量的门铃
US10466959B1 (en) * 2018-03-20 2019-11-05 Amazon Technologies, Inc. Automatic volume leveler
CN108986830A (zh) * 2018-08-28 2018-12-11 安徽淘云科技有限公司 一种音频语料筛选方法及装置
CN109217834A (zh) * 2018-10-19 2019-01-15 歌尔科技有限公司 增益调整方法、音频设备及可读存储介质
CN109996143A (zh) * 2019-03-07 2019-07-09 上海蔚来汽车有限公司 音量调节方法、装置、系统及音频播放设备和车辆
CN111986696A (zh) * 2020-08-27 2020-11-24 湖南融视文化创意有限公司 一种高效处理歌曲音量均衡的方法

Also Published As

Publication number Publication date
CN113963716A (zh) 2022-01-21

Similar Documents

Publication Publication Date Title
WO2023070792A1 (fr) Procédé et dispositif d'équilibrage de volume pour sonnette de porte conversationnelle, et support de stockage lisible
US9305568B2 (en) Active acoustic filter with socially determined location-based filter characteristics
WO2018059030A1 (fr) Procédé de réglage de volume et terminal
WO2017215657A1 (fr) Procédé de traitement d'effet sonore et dispositif terminal
US10461712B1 (en) Automatic volume leveling
CN112954115B (zh) 一种音量调节方法、装置、电子设备及存储介质
WO2019033438A1 (fr) Procédé et dispositif de réglage de signal audio, support de stockage et terminal
WO2014078096A1 (fr) Système de commande de correction physiologique audio
WO2019033987A1 (fr) Procédé et appareil d'invite, support d'informations et terminal
CN107465824B (zh) 音量调节方法、装置、移动终端及存储介质
CN107526570B (zh) 音量调节方法、装置、终端设备及存储介质
AU2020402822B2 (en) User hearing protection method, apparatus, and electronic device
US20160180863A1 (en) Intelligent volume control interface
WO2019033942A1 (fr) Procédé et appareil de réglage de volume, dispositif terminal et support d'informations
US20220391163A1 (en) Automatic volume adjustment method and apparatus, medium, and device
CN111508510A (zh) 音频处理方法、装置、存储介质及电子设备
CN111462764A (zh) 音频编码方法、装置、计算机可读存储介质及设备
CN107483734B (zh) 音量调节方法、装置、终端设备及存储介质
WO2019033941A1 (fr) Appareil et procédé de réglage de volume, dispositif terminal, et support d'informations
KR20190012003A (ko) 청각 인지 속성에 기반하여 디지털 오디오 신호의 이득을 조정하는 전자 장치 및 방법
TW202226230A (zh) 將麥克風信號靜音和取消靜音之方法
CN107506168B (zh) 音量调节方法、装置、终端设备及存储介质
CN116193311A (zh) 无线耳机的音质优化方法及相关设备、介质和程序产品
WO2022199354A1 (fr) Procédé et appareil de réglage de volume d'appel, terminal et support de stockage
CN111739496B (zh) 音频处理方法、装置及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21962136

Country of ref document: EP

Kind code of ref document: A1