CN115148203A - Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment - Google Patents

Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment Download PDF

Info

Publication number
CN115148203A
CN115148203A CN202210631086.2A CN202210631086A CN115148203A CN 115148203 A CN115148203 A CN 115148203A CN 202210631086 A CN202210631086 A CN 202210631086A CN 115148203 A CN115148203 A CN 115148203A
Authority
CN
China
Prior art keywords
audio
energy value
microphone
equipment
calibration coefficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210631086.2A
Other languages
Chinese (zh)
Inventor
王祖悦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Haier Technology Co Ltd
Haier Smart Home Co Ltd
Original Assignee
Qingdao Haier Technology Co Ltd
Haier Smart Home Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Haier Technology Co Ltd, Haier Smart Home Co Ltd filed Critical Qingdao Haier Technology Co Ltd
Priority to CN202210631086.2A priority Critical patent/CN115148203A/en
Publication of CN115148203A publication Critical patent/CN115148203A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/60Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/282Controlling appliance services of a home automation network by calling their functionalities based on user interaction within the home
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Automation & Control Theory (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The application discloses a pickup calibration method, a pickup calibration device, a nonvolatile storage medium and electronic equipment, which relate to the field of smart home, wherein the pickup calibration method comprises the following steps: acquiring a first audio and a second audio, wherein the first audio is an audio obtained by a first microphone of first equipment collecting wake-up voice sent by standard equipment, and the second audio is an audio obtained by a second microphone of second equipment collecting wake-up voice sent by standard equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out sound pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.

Description

Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment
Technical Field
The application relates to the field of smart home, in particular to a pickup calibration method and device, a nonvolatile storage medium and electronic equipment.
Background
In the intelligent house scene, can have a lot of smart machine in the user family, present most smart machine all support and carry out voice interaction with the user to realize the user to smart control of smart machine, reduce user's work load. However, when a user sends a voice command to an intelligent device that the user wants to control, other intelligent devices also receive the voice command sent by the user, so that the plurality of intelligent devices need to share information to determine which intelligent device the voice command sent by the user currently matches, and therefore, inconvenience is brought to life of the user due to the fact that the wrong device responds to the user command. However, the related art has the following problems: the method that the user calls the equipment name and adds the user instruction is adopted, and the user needs to memorize the accurate names of various equipment, so that the voice control behavior of the user is very complicated, and the user experience is not good; or the energy of the voice command received by different intelligent devices is used as the judgment amount, but the method is limited by the fact that the sound pickup capabilities of different intelligent devices are different, and accurate judgment is difficult to carry out.
In view of the above problems, no effective solution has been proposed.
Disclosure of Invention
The embodiment of the invention provides a pickup calibration method, a pickup calibration device, a computer-readable storage medium and electronic equipment, which are used for at least solving the technical problem that a convenient equipment pickup calibration method is lacked in an intelligent household scene.
According to an aspect of an embodiment of the present invention, there is provided a sound pickup calibration method including: acquiring a first audio and a second audio, wherein the first audio is an audio obtained by a first microphone of first equipment collecting a wake-up voice sent by standard equipment, and the second audio is an audio obtained by a second microphone of second equipment collecting the wake-up voice sent by the standard equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out sound pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
Optionally, the determining, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device includes: acquiring a third audio and a fourth audio, wherein the third audio is an audio obtained by acquiring the awakening voice sent by the first equipment through a standard microphone of the standard equipment, and the fourth audio is an audio obtained by acquiring the awakening voice sent by the second equipment through the standard microphone of the standard equipment; determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio; determining the first calibration coefficient based on the first energy value and the third energy value, and determining the second calibration coefficient based on the second energy value and the fourth energy value.
Optionally, the determining the first calibration coefficient based on the first energy value and the third energy value, and the determining the second calibration coefficient based on the second energy value and the fourth energy value, comprises: determining a ratio of the third energy value and the first energy value as the first calibration coefficient; and determining a ratio of the fourth energy value and the second energy value as the second calibration coefficient.
Optionally, the method further includes: acquiring an online equipment list, wherein the online equipment list comprises a plurality of pieces of equipment; selecting one calibrated device in the online device list as the standard device, and selecting the first device and the second device in the online device list as devices to be calibrated; and sending awakening word data to the standard equipment, wherein the awakening word data is used for the standard equipment to generate the awakening voice.
Optionally, the method further includes: and sending the awakening word data to the first equipment and the second equipment, wherein the first equipment and the second equipment respond to awakening voice sent by the standard equipment based on the awakening word data, and respectively calling the first microphone and the second microphone to collect the awakening voice sent by the standard equipment to obtain the first audio and the second audio.
Optionally, the calibrating the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient respectively includes: sending the first calibration coefficient to the first device, wherein the first device calibrates an energy value of the received audio according to the first calibration coefficient; transmitting the second calibration coefficient to the second device, wherein the second device calibrates an energy value of the received audio according to the second calibration coefficient.
Optionally, the method further includes: acquiring a fifth audio, wherein the fifth audio is an audio obtained by acquiring a wake-up voice sent by the standard equipment by a third microphone of third equipment; determining a fifth energy value corresponding to the fifth audio; determining a third correction coefficient corresponding to the third device based on the fifth energy value and any one of the first energy value and the second energy value; and carrying out pickup calibration on the third microphone according to the third correction coefficient.
According to another aspect of the embodiments of the present invention, there is also provided a pickup calibration apparatus, including: the device comprises an acquisition module, a processing module and a processing module, wherein the acquisition module is used for acquiring a first audio and a second audio, the first audio is an audio obtained by a first microphone of first equipment collecting a wake-up voice sent by standard equipment, and the second audio is an audio obtained by a second microphone of second equipment collecting the wake-up voice sent by the standard equipment; the first determining module is used for determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; a second determining module, configured to determine, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device; and the calibration module is used for respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
According to still another aspect of the embodiment of the present invention, there is further provided a computer-readable storage medium, which includes a stored program, wherein when the program runs, the apparatus where the computer-readable storage medium is located is controlled to execute any one of the above-mentioned sound pickup calibration methods.
According to still another aspect of the embodiments of the present invention, there is provided an electronic device, including a processor, configured to execute a program, where the program executes to perform any one of the above pickup calibration methods.
In the embodiment of the invention, the purpose of calibrating the sound pickup capability of the first equipment and the second equipment in the intelligent home scene is achieved by acquiring the first audio and the second audio collected by the respective microphones of the first equipment and the second equipment, determining the first energy value and the second energy value based on the first audio and the second audio, and determining the first calibration coefficient and the second calibration coefficient for carrying out sound pickup calibration on the first microphone of the first equipment and the second microphone of the second equipment based on the first energy value and the second energy value, so that the technical effect of providing a convenient and fast equipment sound pickup calibration method in the intelligent home scene is achieved, and the technical problem that the convenient equipment sound pickup calibration method is lacked in the intelligent home scene is solved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and, together with the description, serve to explain the principles of the application.
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.
Fig. 1 is a schematic diagram of a hardware environment of an interaction method of an intelligent device according to an embodiment of the present application;
FIG. 2 is a flowchart illustrating a calibration method according to an embodiment of the present invention;
fig. 3 is a block diagram of a pickup calibration apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the application described herein may be implemented in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
According to one aspect of the embodiment of the application, an interaction method of intelligent household equipment is provided. The interaction method of the intelligent Home equipment is widely applied to full-House intelligent digital control application scenes such as intelligent homes (Smart Home), intelligent homes, intelligent Home equipment ecology, intelligent House (Intelligent House) ecology and the like. Optionally, in this embodiment, the interaction method of the smart home device may be applied to a hardware environment formed by the terminal device 102 and the server 104 as shown in fig. 1. As shown in fig. 1, the server 104 is connected to the terminal device 102 through a network, and may be configured to provide a service (e.g., an application service) for the terminal or a client installed on the terminal, set a database on the server or independent of the server, and provide a data storage service for the server 104, and configure a cloud computing and/or edge computing service on the server or independent of the server, and provide a data operation service for the server 104.
The network may include, but is not limited to, at least one of: wired networks, wireless networks. The wired network may include, but is not limited to, at least one of: wide area networks, metropolitan area networks, local area networks, which may include, but are not limited to, at least one of the following: WIFI (Wireless Fidelity), bluetooth. Terminal equipment 102 can be but not limited to be PC, the cell-phone, the panel computer, intelligent air conditioner, intelligent cigarette machine, intelligent refrigerator, intelligent oven, intelligent kitchen range, intelligent washing machine, intelligent water heater, intelligent washing equipment, intelligent dish washer, intelligent projection equipment, intelligent TV, intelligent clothes hanger, intelligent (window) curtain, intelligence audio-visual, smart jack, intelligent stereo set, intelligent audio amplifier, intelligent new trend equipment, intelligent kitchen guarding equipment, intelligent bathroom equipment, intelligence robot of sweeping the floor, intelligence robot of wiping the window, intelligence robot of mopping the ground, intelligent air purification equipment, intelligent steam ager, intelligent microwave oven, intelligent kitchen is precious, intelligent clarifier, intelligent water dispenser, intelligent lock etc..
The application provides a method embodiment for accurately judging target equipment which is expected to be controlled by a user by carrying out pickup calibration on the equipment. In the related art, when sound pickup calibration is performed on different devices, calibration of sound pickup microphones of the respective devices is required, that is: all microphones are calibrated to the same standard microphone. This is also the current mainstream calibration scheme. For example, offline real-time manual calibration is used, i.e.: fixing the sounding position and the sound receiving position, playing audio by adopting a standard loudspeaker, placing equipment to be calibrated and a standard microphone at the sound receiving position to record the audio, and then comparing the audio recorded by the equipment to be calibrated with the audio recorded by the standard microphone to realize calibration; generally, the related technology can perform audio frequency comparison in a frequency domain, calibrate the amplitude of a key frequency point to the amplitude of a frequency point corresponding to standard equipment, and in addition, can also adopt simple time domain energy comparison. However, in practical applications, with the increase of smart devices, especially the increase of low-cost devices, however, due to the fact that hardware differences of household appliances are too large, microphone consistency is too poor, and with the increase of the number of products in a smart home environment, a user cannot perform offline manual debugging and calibration on each smart device. Therefore, the present embodiment proposes the following sound pickup calibration method to solve the technical problem that it is impossible to accurately determine which device among a plurality of devices the voice command issued by the user is intended to operate.
Fig. 2 is a schematic flow chart of a pickup calibration method according to an embodiment of the present invention, as shown in fig. 2, the method includes the following steps:
step S202, a first audio and a second audio are obtained, wherein the first audio is obtained by a first microphone of the first device collecting the wake-up voice sent by the standard device, and the second audio is obtained by a second microphone of the second device collecting the wake-up voice sent by the standard device.
In this step, the first device, the second device and the standard device may be smart devices in a smart home, and each of the first device and the second device has a function of responding to a voice of a user to perform an action. Optionally, the wake-up voice may be emitted through a speaker of the standard device and collected by respective microphones of the first device and the second device.
Step S204, a first energy value corresponding to the first audio and a second energy value corresponding to the second audio are determined.
Optionally, the first energy value and the second energy value in this step may be total energy of the first audio and the second audio, respectively, or may also be time domain energy in corresponding time domains of the first audio and the second audio. Since the first and second audio frequencies are obtained by the first and second microphones respectively collecting the wake-up voice of the standard device, the first and second audio frequencies should correspond in the time domain. Accordingly, a time domain range may be determined, and a first energy value and a second energy value of the first audio and the second audio, respectively, within the time domain range may be determined.
Step S206, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device are determined.
In this step, based on the energy value of the wake-up voice sent by the standard device and received by each of the first device and the second device, the difference in the sound-collecting capability of the microphones of each of the first device and the second device can be determined, and thus, the method can be used for accurately calibrating sound-collecting of the first microphone and the second microphone.
In step S208, sound pickup calibration is performed on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient, respectively.
The first microphone and the second microphone have different sound pickup capacities due to different product models and different use states, the first calibration coefficient is adopted to process the energy value of the audio collected by the microphones, and the capacity difference between the first microphone and the second microphone can be eliminated, so that multiple corrected microphones can pick up sound on the same sound pickup level, for example, awakening competition. Optionally, if the user utters a voice for waking up the device, the calibrated first device and the calibrated second device both receive the voice uttered by the user through the microphone. Since the first microphone and the second microphone are respectively calibrated by the first calibration coefficient and the second calibration coefficient, the relative position difference and the sound production orientation difference between the user and the first device and the second device can be determined based on the difference between the energy value of the audio collected by the first microphone and the energy value of the audio collected by the second microphone, and the device is helped to accurately judge which device the target device which the target object wants to manipulate is.
In the above steps, the first audio and the second audio collected by the respective microphones of the first device and the second device are obtained, the first energy value and the second energy value are determined based on the first audio and the second audio, and the first calibration coefficient and the second calibration coefficient for carrying out sound pickup calibration on the first microphone of the first device and the second microphone of the second device are determined based on the first energy value and the second energy value, so that the purpose of calibrating the sound pickup capability of the first device and the second device in the smart home scene is achieved, the technical effect of providing a convenient and fast device sound pickup calibration method in the smart home scene is achieved, and the technical problem that the convenient device sound pickup calibration method is lacked in the smart home scene is solved.
As an alternative embodiment, based on the first energy value and the second energy value, the following steps may be adopted when determining the first calibration coefficient corresponding to the first device and the second calibration coefficient corresponding to the second device: acquiring a third audio and a fourth audio, wherein the third audio is an audio obtained by acquiring a wake-up voice sent by first equipment by a standard microphone of standard equipment, and the fourth audio is an audio obtained by acquiring a wake-up voice sent by second equipment by the standard microphone of the standard equipment; determining a third energy value corresponding to a third audio and a fourth energy value corresponding to a fourth audio; a first calibration coefficient is determined based on the first energy value and the third energy value, and a second calibration coefficient is determined based on the second energy value and the fourth energy value.
In this optional embodiment, the first device and the second device may sequentially emit the wake-up voice, and the wake-up voice is received by the standard microphone of the standard device, and then the respective sound pickup capabilities of the microphones of the first device and the second device are determined based on the audio received by the standard microphone. In the method, the first microphone and the second microphone are calibrated without measuring the distance between the first device and the standard device respectively, the distance from the loudspeaker of the standard device to the microphone of the first device is the same as the distance from the loudspeaker of the first device to the microphone of the standard device, therefore, in the two processes, the interference and attenuation of sound in the path can be considered to be the same, and therefore, the first calibration coefficient and the second calibration coefficient for representing the difference between the sound pickup capacity of the first microphone and the sound pickup capacity of the standard microphone can be determined through the first energy value and the third energy value. After the difference of the sound pickup capacities of the first microphone and the second microphone relative to the standard microphone is determined, the sound pickup capacity can be calculated instead, and the difference of the sound pickup capacities between the first microphone and the second microphone can be determined.
As an alternative embodiment, the first calibration coefficient and the second calibration coefficient may be determined by: determining a ratio of the third energy value to the first energy value as a first calibration coefficient; and determining a ratio of the fourth energy value and the second energy value as a second calibration coefficient.
Alternatively, the first calibration coefficient may take the form:
Figure BDA0003679766180000081
wherein, the first and the second end of the pipe are connected with each other,
Figure BDA0003679766180000082
representing a first calibration factor, d, corresponding to the first device i Representing the distance between the first device and the standard device (i.e. the distance between the first microphone and the standard microphone),
Figure BDA0003679766180000083
representing a first energy value determined from first audio captured by a first microphone of a first device,
Figure BDA0003679766180000084
and a third energy value corresponding to a third audio collected by a standard microphone of the standard equipment is represented. In case the first microphone and the standard microphone have the same sound-collecting capability, the first energy value corresponding to the first audio and the third energy value corresponding to the third audio should be equivalent, i.e. the first energy value and the third energy value should be equivalent
Figure BDA0003679766180000085
However, when the first microphone has a weaker sound-pickup capability than the standard microphone, there is a possibility that
Figure BDA0003679766180000086
At this time
Figure BDA0003679766180000087
Less than 1. Therefore, when the first microphone of the first device receives the voice uttered by the user, the voice energy value of the voice received by the first microphone may be calibrated based on the first calibration coefficient as follows: calibrated speech energy value = speech energy value ≥
Figure BDA0003679766180000088
Similarly, the second microphone of the second device may be calibrated for its pick-up capability in the manner described above to obtain a second calibration factor
Figure BDA0003679766180000089
By the optional embodiment, a specific method for calibrating the sound pickup capability of the first microphone of the first device and the second microphone of the second device is provided, and by adopting the method, the sound pickup capability difference between the microphones of the first device and the second device can be correctly calibrated, so that the calibration effect is improved.
As an alternative embodiment, the method may further include: acquiring an online equipment list, wherein the online equipment list comprises a plurality of pieces of equipment; selecting one calibrated device in the online device list as a standard device, and selecting a first device and a second device in the online device list as devices to be calibrated; and sending awakening word data to the standard equipment, wherein the awakening word data is used for generating awakening voice by the standard equipment.
The optional embodiment can be applied to an intelligent gateway in an intelligent home scene, and the intelligent gateway acquires an online device list in a home through the internet or a local area network and acquires calibration information about whether each device in the list is calibrated by sound pickup. The intelligent gateway can then determine one calibrated device in the table as a standard device, and calibrate other devices in the online device list that are not calibrated for pickup, such as the first device and the second device, to achieve distributed pickup calibration for devices in the home without the aid of specialized pickup calibration tools.
As an optional embodiment, the intelligent gateway may send wake-up word data to the first device and the second device, where the first device and the second device respond to the wake-up voice sent by the standard device based on the wake-up word data, and call the first microphone and the second microphone respectively to collect the wake-up voice sent by the standard device to obtain the first audio and the second audio.
As an alternative embodiment, the first microphone and the second microphone may be calibrated according to the first calibration coefficient and the second calibration coefficient, respectively, in the following manner: sending a first calibration coefficient to the first device, wherein the first device calibrates the energy value of the received audio according to the first calibration coefficient; and transmitting the second calibration coefficient to the second device, wherein the second device calibrates the energy value of the received audio according to the second calibration coefficient. It should be noted that after the intelligent gateway calculates and obtains the first calibration coefficient and the second calibration coefficient, the first calibration coefficient and the second calibration coefficient are issued to the first device and the second device, and the first microphone and the second microphone are calibrated with respect to the standard microphone, which is equivalent to calibration performed between the first microphone and the second microphone, so that the fast calibration of the sound pickup capability between the devices in the home scene is realized.
As an optional embodiment, a fifth audio is obtained, where the fifth audio is an audio obtained by a third microphone of the third device collecting a wake-up voice sent by the standard device; determining a fifth energy value corresponding to the fifth audio; determining a third correction coefficient corresponding to a third device based on any one of the first energy value and the second energy value and the fifth energy value; and carrying out pickup calibration on the third microphone according to the third correction coefficient. Optionally, the third device may be another device in the online device list that is not calibrated for sound pickup, and the present embodiment may support calibrating the sound pickup capability of multiple devices in the home to the same level. At this time, when the user wakes up the devices in the family by sending the wake-up word, each device wakes up competition based on the microphones with the same sound pickup capability, and judges which device the user is facing to send the wake-up word, so that the accuracy of waking up the devices by the user is improved, and the device with wrong wake-up of the target object is avoided.
It should be noted that, when the energy value is used as a determination condition for determining a target object, for example, a device that a user wants to operate, the determination may be performed according to the magnitude of the energy value received by the device. Specifically, when the target object is located between the first device and the second device, if the target object is closer to the first device, the energy value corresponding to the first audio received by the first microphone of the first device should be greater than the energy value corresponding to the second audio received by the second microphone of the second device. However, since there may be a difference in the sound-collecting capability of the first microphone and the second microphone, directly comparing the energy values of the first audio and the second audio may cause a problem of misjudgment. By adopting the optional embodiment, the problem can be solved by calibrating the sound pickup capacity of the first microphone and the second microphone, and then the equipment corresponding to the larger energy value is determined to be the equipment to be awakened closer to the target object, so that the technical effect of improving the accuracy of judging the equipment to be awakened is achieved.
The above steps can be actively performed by a user in a home, one intelligent device is selected as a standard device for sound pickup calibration in a home environment according to a certain rule, for example, all the intelligent devices in the home are traversed, the device with the best sound pickup effect is used as the standard device, other intelligent devices in the home are respectively subjected to sound pickup calibration with the standard device, and a calibration process is executed to obtain a calibration coefficient corresponding to each device. By the self-calibration method between the household intelligent devices, the technical problem that the target device for judging the voice direction of the target object is inaccurate due to different sound pickup capabilities of different household intelligent devices can be solved, the calibration process of the sound pickup capabilities of the intelligent devices can be simplified, a method for calibrating the sound pickup capability of the household intelligent devices is provided for household users, and the efficiency of the sound pickup capability calibration process is greatly improved.
According to an embodiment of the present invention, there is also provided a sound pickup calibration apparatus for implementing the sound pickup calibration method, and fig. 3 is a block diagram of a structure of the sound pickup calibration apparatus according to the embodiment of the present invention, and as shown in fig. 3, the sound pickup calibration apparatus 30 includes: the calibration device 30 includes an acquisition module 32, a first determination module 34, a second determination module 36, and a calibration module 38, which are described below.
The acquiring module 32 is configured to acquire a first audio and a second audio, where the first audio is an audio obtained by a first microphone of a first device collecting a wake-up voice sent by a standard device, and the second audio is an audio obtained by a second microphone of a second device collecting the wake-up voice sent by the standard device;
a first determining module 34, connected to the obtaining module 32, for determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
a second determining module 36, connected to the first determining module 34, for determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value;
and a calibration module 38, connected to the second determining module 36, for calibrating the sound pickup of the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient, respectively.
It should be noted here that the acquiring module 32, the first determining module 34, the second determining module 36 and the calibrating module 38 correspond to steps S202 to S208 in the embodiment, and a plurality of modules are the same as the corresponding steps in the implementation example and the application scenario, but are not limited to the disclosure in the embodiment.
An embodiment of the present invention may provide a computer device, and optionally, in this embodiment, the computer device may be located in at least one network device of a plurality of network devices of a computer network. The computer device includes a memory and a processor.
The memory may be configured to store software programs and modules, such as program instructions/modules corresponding to the sound pickup calibration method and apparatus in the embodiments of the present invention, and the processor executes various functional applications and data processing by operating the software programs and modules stored in the memory, so as to implement the sound pickup calibration method. The memory may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory may further include memory located remotely from the processor, and these remote memories may be connected to the computer terminal through a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The processor can call the information and application program stored in the memory through the transmission device to execute the following steps: acquiring a first audio and a second audio, wherein the first audio is an audio obtained by acquiring a wake-up voice sent by standard equipment through a first microphone of first equipment, and the second audio is an audio obtained by acquiring the wake-up voice sent by the standard equipment through a second microphone of second equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out sound pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by a program instructing hardware associated with the terminal device, where the program may be stored in a computer-readable storage medium, and the storage medium may include: flash disks, read-Only memories (ROMs), random Access Memories (RAMs), magnetic or optical disks, and the like.
Embodiments of the present invention also provide a computer-readable storage medium. Alternatively, in this embodiment, the computer-readable storage medium may be configured to store program codes executed by the pickup calibration method provided in the above-described embodiment.
Optionally, in this embodiment, the computer-readable storage medium may be located in any one of a group of computer terminals in a computer network, or in any one of a group of mobile terminals.
Optionally, in this embodiment, the computer readable storage medium is configured to store program code for performing the following steps: acquiring a first audio and a second audio, wherein the first audio is an audio obtained by a first microphone of first equipment collecting a wake-up voice sent by standard equipment, and the second audio is an audio obtained by a second microphone of second equipment collecting the wake-up voice sent by the standard equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out sound pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
The above-mentioned serial numbers of the embodiments of the present invention are only for description, and do not represent the advantages and disadvantages of the embodiments.
In the above embodiments of the present invention, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, a division of a unit may be a division of a logic function, and an actual implementation may have another division, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted, or may not be executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a non-volatile memory storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The foregoing is only a preferred embodiment of the present application and it should be noted that, as will be apparent to those skilled in the art, numerous modifications and adaptations can be made without departing from the principles of the present application and such modifications and adaptations are intended to be considered within the scope of the present application.

Claims (10)

1. A method of calibrating a sound pickup, comprising:
acquiring a first audio and a second audio, wherein the first audio is an audio obtained by a first microphone of first equipment collecting a wake-up voice sent by standard equipment, and the second audio is an audio obtained by a second microphone of second equipment collecting the wake-up voice sent by the standard equipment;
determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value;
the first microphone and the second microphone are calibrated according to the first calibration coefficient and the second calibration coefficient respectively.
2. The method of claim 1, wherein determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value comprises:
acquiring a third audio and a fourth audio, wherein the third audio is an audio obtained by acquiring the awakening voice sent by the first equipment through a standard microphone of the standard equipment, and the fourth audio is an audio obtained by acquiring the awakening voice sent by the second equipment through the standard microphone of the standard equipment;
determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio;
determining the first calibration coefficient based on the first energy value and the third energy value, and determining the second calibration coefficient based on the second energy value and the fourth energy value.
3. The method of claim 2, wherein determining the first calibration coefficient based on the first energy value and the third energy value and determining the second calibration coefficient based on the second energy value and the fourth energy value comprises:
determining a ratio of the third energy value and the first energy value as the first calibration coefficient;
and determining a ratio of the fourth energy value and the second energy value as the second calibration coefficient.
4. The method of claim 1, further comprising:
acquiring an online equipment list, wherein the online equipment list comprises a plurality of pieces of equipment;
selecting one calibrated device in the online device list as the standard device, and selecting the first device and the second device in the online device list as devices to be calibrated;
and sending awakening word data to the standard equipment, wherein the awakening word data is used for the standard equipment to generate the awakening voice.
5. The method of claim 4, further comprising: and sending the awakening word data to the first equipment and the second equipment, wherein the first equipment and the second equipment respond to awakening voice sent by the standard equipment based on the awakening word data, and respectively calling the first microphone and the second microphone to collect the awakening voice sent by the standard equipment to obtain the first audio and the second audio.
6. The method of claim 4, wherein the calibrating the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient respectively comprises:
sending the first calibration coefficient to the first device, wherein the first device calibrates the energy value of the received audio according to the first calibration coefficient;
transmitting the second calibration coefficient to the second device, wherein the second device calibrates the energy value of the received audio according to the second calibration coefficient.
7. The method of claim 1, further comprising:
acquiring a fifth audio, wherein the fifth audio is an audio obtained by acquiring a wake-up voice sent by the standard equipment by a third microphone of third equipment;
determining a fifth energy value corresponding to the fifth audio;
determining a third correction coefficient corresponding to the third device based on the fifth energy value and any one of the first energy value and the second energy value;
and carrying out pickup calibration on the third microphone according to the third correction coefficient.
8. A pickup calibration device, comprising:
the device comprises an acquisition module, a processing module and a processing module, wherein the acquisition module is used for acquiring a first audio and a second audio, the first audio is an audio obtained by a first microphone of first equipment collecting a wake-up voice sent by standard equipment, and the second audio is an audio obtained by a second microphone of second equipment collecting the wake-up voice sent by the standard equipment;
the first determining module is used for determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
a second determining module, configured to determine, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device;
and the calibration module is used for respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
9. A computer-readable storage medium, comprising a stored program, wherein the program when executed performs the pickup calibration method of any one of claims 1 to 7.
10. An electronic device comprising a memory and a processor, wherein the memory stores a computer program, and the processor is configured to execute the pickup calibration method according to any one of claims 1 to 7 by the computer program.
CN202210631086.2A 2022-06-06 2022-06-06 Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment Pending CN115148203A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210631086.2A CN115148203A (en) 2022-06-06 2022-06-06 Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210631086.2A CN115148203A (en) 2022-06-06 2022-06-06 Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment

Publications (1)

Publication Number Publication Date
CN115148203A true CN115148203A (en) 2022-10-04

Family

ID=83406156

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210631086.2A Pending CN115148203A (en) 2022-06-06 2022-06-06 Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN115148203A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108122563A (en) * 2017-12-19 2018-06-05 北京声智科技有限公司 Improve voice wake-up rate and the method for correcting DOA
CN109275084A (en) * 2018-09-12 2019-01-25 北京小米智能科技有限公司 Test method, device, system, equipment and the storage medium of microphone array
CN109451415A (en) * 2018-12-17 2019-03-08 深圳Tcl新技术有限公司 Microphone array auto-collation, device, equipment and storage medium
WO2019098178A1 (en) * 2017-11-14 2019-05-23 日本電信電話株式会社 Voice communication device, voice communication method, and program
WO2019122412A1 (en) * 2017-12-22 2019-06-27 Resmed Sensor Technologies Limited Apparatus, system, and method for health and medical sensing
CN110223684A (en) * 2019-05-16 2019-09-10 华为技术有限公司 A kind of voice awakening method and equipment
JP6908142B1 (en) * 2020-01-27 2021-07-21 沖電気工業株式会社 Sound collecting device, sound collecting program, and sound collecting method
CN113314098A (en) * 2020-02-27 2021-08-27 青岛海尔科技有限公司 Device calibration method and apparatus, storage medium, and electronic apparatus
CN113808614A (en) * 2021-07-30 2021-12-17 北京声智科技有限公司 Sound energy value calibration and device wake-up method, device and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019098178A1 (en) * 2017-11-14 2019-05-23 日本電信電話株式会社 Voice communication device, voice communication method, and program
CN108122563A (en) * 2017-12-19 2018-06-05 北京声智科技有限公司 Improve voice wake-up rate and the method for correcting DOA
WO2019122412A1 (en) * 2017-12-22 2019-06-27 Resmed Sensor Technologies Limited Apparatus, system, and method for health and medical sensing
CN109275084A (en) * 2018-09-12 2019-01-25 北京小米智能科技有限公司 Test method, device, system, equipment and the storage medium of microphone array
CN109451415A (en) * 2018-12-17 2019-03-08 深圳Tcl新技术有限公司 Microphone array auto-collation, device, equipment and storage medium
CN110223684A (en) * 2019-05-16 2019-09-10 华为技术有限公司 A kind of voice awakening method and equipment
JP6908142B1 (en) * 2020-01-27 2021-07-21 沖電気工業株式会社 Sound collecting device, sound collecting program, and sound collecting method
CN113314098A (en) * 2020-02-27 2021-08-27 青岛海尔科技有限公司 Device calibration method and apparatus, storage medium, and electronic apparatus
CN113808614A (en) * 2021-07-30 2021-12-17 北京声智科技有限公司 Sound energy value calibration and device wake-up method, device and storage medium

Similar Documents

Publication Publication Date Title
CN110347366B (en) Volume adjusting method, terminal device, storage medium and electronic device
CN108919657B (en) Automatic configuration method and device of panel and panel
CN114697150A (en) Command issuing method and device, storage medium and electronic device
CN104900237B (en) A kind of methods, devices and systems for audio-frequency information progress noise reduction process
CN115148203A (en) Pickup calibration method, pickup calibration device, nonvolatile storage medium and electronic equipment
WO2023231552A1 (en) Distributed voice wake-up method and apparatus, storage medium, and electronic apparatus
CN113314098B (en) Device calibration method and apparatus, storage medium, and electronic apparatus
CN114676043A (en) Testing method and device of intelligent voice module, storage medium and electronic device
CN115734139A (en) Audio calibration method, receiving device, playing device and storage medium
CN111131353A (en) Collaboration method and intelligent device group
CN118118282A (en) Audio playing method and device, storage medium and electronic device
CN116206618A (en) Equipment awakening method, storage medium and electronic device
CN116072124A (en) User identity recognition method, storage medium and electronic device
CN115312049A (en) Command response method, storage medium and electronic device
CN117877474A (en) Response method and device for controlling voice, storage medium and electronic device
CN117008865A (en) Play setting method and device, storage medium and electronic device
CN115001885A (en) Device control method and apparatus, storage medium, and electronic apparatus
CN116302877A (en) Message queue performance test method and device
CN115312048A (en) Equipment awakening method and device, storage medium and electronic device
CN116155637A (en) Equipment control method, device, storage medium and electronic device
CN117524231A (en) Voice person identification method, voice interaction method and device
CN115171699A (en) Wake-up parameter adjusting method and device, storage medium and electronic device
CN114999494A (en) Voice interaction function testing method and device, storage medium and electronic device
CN115691552A (en) Method and device for determining response time, storage medium and electronic device
CN116110382A (en) Voice response method, device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination