CN110475181B - Equipment configuration method, device, equipment and storage medium - Google Patents

Equipment configuration method, device, equipment and storage medium Download PDF

Info

Publication number
CN110475181B
CN110475181B CN201910758095.6A CN201910758095A CN110475181B CN 110475181 B CN110475181 B CN 110475181B CN 201910758095 A CN201910758095 A CN 201910758095A CN 110475181 B CN110475181 B CN 110475181B
Authority
CN
China
Prior art keywords
candidate
voice
response rate
equalizer parameter
equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910758095.6A
Other languages
Chinese (zh)
Other versions
CN110475181A (en
Inventor
张在东
郑林
车婷婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201910758095.6A priority Critical patent/CN110475181B/en
Publication of CN110475181A publication Critical patent/CN110475181A/en
Application granted granted Critical
Publication of CN110475181B publication Critical patent/CN110475181B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/02Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The embodiment of the invention discloses a device configuration method, a device, equipment and a storage medium. The method comprises the following steps: determining a target equalizer parameter range according to a target use distance of the equipment to be configured and a mapping relation between the candidate use distance and the candidate equalizer parameter range; wherein the candidate equalizer parameter range is determined according to the voice response rate at the candidate use distance; and determining the target equalizer parameter value of the equipment to be configured within the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured. According to the embodiment of the invention, by evaluating the voice response rate of the intelligent equipment, the audio playing effect of the equipment is improved on the premise of meeting the voice response rate, the sound effect of the intelligent equipment is ensured, and the voice response rate and the audio playing effect under the condition of internal noise reach the optimal balance point.

Description

Equipment configuration method, device, equipment and storage medium
Technical Field
The embodiment of the invention relates to the technical field of intelligent equipment, in particular to an equipment configuration method, an equipment configuration device, equipment and a storage medium.
Background
Along with the rapid development of artificial intelligence, intelligent devices such as intelligent sound boxes generally have audio playing and voice recognition functions. For example, during the process of playing music by the intelligent device, the user can control the intelligent device through voice.
However, when the user performs voice control on the smart device during the process of playing the audio by the smart device, the audio played by the smart device may be input into the smart device along with the user voice, which affects the voice recognition function. At present, in the process of playing audio by a smart device, if the voice recognition capability is to be improved, an Acoustic Echo Cancellation (AEC) technology may be used to eliminate noise generated by the smart device itself, and an AEC processing effect is related to a Total Harmonic Distortion curve (THD) of the audio playing source of the smart device itself.
Therefore, in the prior art, the relationship between the audio playing function and the voice recognition function is difficult to balance, and the audio playing effect or the voice recognition efficiency of the intelligent device is reduced.
Disclosure of Invention
The embodiment of the invention provides a device configuration method, a device, equipment and a storage medium, which can balance an audio playing function and a voice recognition function.
In a first aspect, an embodiment of the present invention provides an apparatus configuration method, including:
determining a target equalizer parameter range according to a target use distance of the equipment to be configured and a mapping relation between the candidate use distance and the candidate equalizer parameter range; wherein the candidate equalizer parameter range is determined according to the voice response rate at the candidate use distance;
and determining the target equalizer parameter value of the equipment to be configured within the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured.
In a second aspect, an embodiment of the present invention provides an apparatus for configuring a device, including:
the parameter range determining module is used for determining the parameter range of the target equalizer according to the target use distance of the equipment to be configured and the mapping relation between the candidate use distance and the candidate equalizer parameter range; wherein the candidate equalizer parameter range is determined according to the voice response rate at the candidate use distance;
and the parameter configuration module is used for determining a target equalizer parameter value of the equipment to be configured in the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured.
In a third aspect, an embodiment of the present invention provides an apparatus, including:
one or more processors;
a memory for storing one or more programs;
when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the device configuration method according to any embodiment of the present invention.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, and the computer program, when executed by a processor, implements the device configuration method according to any embodiment of the present invention.
The method and the device determine the target equalizer parameter range of the equipment to be configured according to the target use distance of the equipment to be configured and the mapping relation between the candidate use distance and the candidate equalizer parameter range by taking the voice response rate of the equipment as a basis and determining the candidate equalizer parameter range associated with each candidate use distance in advance, thereby determining the target equalizer parameter range of the equipment to be configured according to the target use distance of the equipment to be configured and the mapping relation between the candidate use distance and the candidate equalizer parameter range, and determining the target equalizer parameter value of the equipment to be configured in the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured. According to the embodiment of the invention, by evaluating the voice response rate of the intelligent equipment, the audio playing effect of the equipment is improved on the premise of meeting the voice response rate, the sound effect of the intelligent equipment is ensured, and the voice response rate and the audio playing effect under the condition of internal noise reach the optimal balance point.
Drawings
Fig. 1 is a flowchart of an apparatus configuration method according to an embodiment of the present invention;
fig. 2 is a flowchart of a device configuration method according to a second embodiment of the present invention;
FIG. 3 is a diagram illustrating a parameter testing system according to a second embodiment of the present invention;
fig. 4 is a schematic structural diagram of an apparatus configuration device according to a third embodiment of the present invention;
fig. 5 is a schematic structural diagram of an apparatus according to a fourth embodiment of the present invention.
Detailed Description
The embodiments of the present invention will be described in further detail with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the embodiments of the invention and that no limitation of the invention is intended. It should be further noted that, for convenience of description, only some structures, not all structures, relating to the embodiments of the present invention are shown in the drawings.
It should be further noted that, for the convenience of description, only some but not all of the relevant portions of the present application are shown in the drawings. Before discussing exemplary embodiments in more detail, it should be noted that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although a flowchart may describe the operations (or steps) as a sequential process, many of the operations can be performed in parallel, concurrently or simultaneously. In addition, the order of the operations may be re-arranged. The process may be terminated when its operations are completed, but may have additional steps not included in the figure. The processes may correspond to methods, functions, procedures, subroutines, and the like.
Example one
Fig. 1 is a flowchart of an apparatus configuration method according to an embodiment of the present invention, where the present embodiment is applicable to a case of performing equalizer parameter configuration on an intelligent apparatus, and the method may be executed by an apparatus configuration device, where the apparatus may be implemented in a software and/or hardware manner, and is preferably configured in an intelligent apparatus having audio playing and voice recognition functions, such as an intelligent sound box. The method specifically comprises the following steps:
s110, determining a target equalizer parameter range according to a target use distance of the equipment to be configured and a mapping relation between the candidate use distance and the candidate equalizer parameter range; wherein, the candidate equalizer parameter range is determined according to the voice response rate under the candidate use distance.
In the specific embodiment of the present invention, the device to be configured refers to an intelligent device having both audio playing and voice recognition functions, such as an intelligent sound box and a smart phone. The use distance of the device to be configured is a distance between a user and the device to be configured when the user uses the device to be configured, for example, the user sits on one side of a writing desk with a width of 1m, and fixes and sets the smart sound box on the other side of the writing desk, so that the use distance of the smart sound box is about 1m currently. The target use distance refers to a use distance of the device to be configured in different use scenes, for example, 1m in the above-mentioned scenes. And the candidate use distance refers to an available distance of the device to be configured, for example, at least one of 0-5 m.
In this embodiment, the equalizer is a module for adjusting the amplification amount of each frequency component electrical signal in the intelligent device, and the equalizer realizes the effects of compensating the curves of the speaker and the sound field, compensating and modifying various sound sources, and the like by adjusting the electrical signals with different frequencies. In this embodiment, devices with the same model as various devices to be configured may be adopted as devices to be tested in advance, and by testing the values of the equalizer parameter ranges of the devices to be tested at different candidate use distances, the establishment of the mapping relationship between the candidate use distances and the candidate equalizer parameter ranges is realized, so that when the devices to be configured perform the configuration of the equalizer parameters, the target equalizer parameter ranges are determined directly according to the target use distances of the devices to be configured and the mapping relationship between the candidate use distances and the candidate equalizer parameter ranges.
Specifically, under each candidate use distance, the present embodiment determines the candidate equalizer parameter range according to the voice response rate of the device under test to the voice control instruction. The voice response rate is the ratio of the number of successful responses of the equipment to the total number of voice commands under the control of a certain number of voice commands. For example, the number of successful awakenings of the device under test on 100 voice awakening words is the awakening rate. The equalizer parameter range refers to the range of equalizer parameters that can be adjusted under the condition that the tested device meets a certain voice response rate, and the equalizer parameter range is different under different use distances. Before configuring the equalizer parameters of the device, the embodiment may use the voice response rate as a precondition, and construct a mapping relationship between the use distance and the equalizer parameter range by testing the equalizer parameter range of the device under test at different use distances.
Exemplarily, in a test scene, a dummy and the like can be selected in advance as a voice simulation sound source, an intelligent device with the same model as that of the device to be configured is selected as the device to be tested for testing, and the equalizer curve is burned into the device to be tested. In the available distance range of the equipment, a plurality of distances can be selected in advance to serve as candidate using distances respectively, and the distance between the equipment to be tested and the voice simulation sound source is fixed to serve as the candidate using distance.
In the testing process, the tested device is controlled to play white noise with the maximum volume, the voice simulation sound source is controlled to play voice commands with certain frequency and times, and the sound pressure level of the voice simulation sound source is adjusted to be standard sound pressure, such as 94 dB. And determining the response result of the tested device to each voice instruction, and calculating the voice response rate of the tested device at the current candidate using distance. The reverberation time of the device under test at the current candidate use distance may be recorded, and if the reverberation time exceeds the standard reverberation time range, for example, 0.3 to 0.6s, the test of the equalizer parameter at the candidate use distance is ended. And finally, determining the candidate equalizer parameter range of the tested device under the candidate use distance according to the voice response rate of the tested device.
For example, if the voice response rate of the device under test is less than the first response rate threshold, that is, the voice response rate condition is not satisfied, the gain of the equalizer parameter in the device under test at the low frequency component is reduced, for example, the gain of the frequency component of 50-300Hz is reduced, so as to reduce the THD of the loudspeaker at the low frequency band, and improve the echo cancellation processing effect of AEC, until the obtained voice response rate is equal to the first response rate threshold, that is, the voice response rate condition is satisfied, the current equalizer parameter is used as the lower limit value in the candidate equalizer parameter range. Similarly, if the voice response rate of the tested device is greater than the second response rate threshold, that is, the voice response rate condition is met, the gain of the equalizer parameter in the tested device at the low-frequency component is increased, so as to improve the THD of the loudspeaker at the low-frequency band, and reduce the echo cancellation processing effect of the AEC, until the obtained voice response is equal to the second response rate threshold, that is, when the voice response rate condition is about to be met and the voice response rate condition is not met, the current equalizer parameter is used as the upper limit value in the candidate equalizer parameter range. The first response rate threshold and the second response rate threshold may be the same, for example, both are 98%, in addition, the first response rate threshold and the second response rate threshold may also be different, and the first response rate threshold may be greater than the second response rate threshold. The lower and upper limits, in turn, constitute candidate equalizer parameter ranges and are associated with current candidate use distances.
In this embodiment, on the basis of the mapping relationship between the candidate use distance and the candidate equalizer parameter range, the target use distance of the device to be configured is compared with the candidate use distance, and the candidate equalizer parameter range associated with the candidate use distance closest to the target use distance is used as the target equalizer parameter range of the device to be configured. And then through choosing the parameter range of the goal equalizer from the mapping relation, can be under the prerequisite in order to meet the speech response rate of the intellectual device, namely guarantee the intellectual device can succeed the prerequisite of responding to user's speech control order, lock the adjustable range of the equalizer parameter in the parameter range of the goal equalizer, has reduced the adjustable range of the equalizer parameter in the intellectual device by a wide margin, raise parameter configuration efficiency and accuracy.
And S120, determining a target equalizer parameter value of the equipment to be configured in the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured.
In an embodiment of the present invention, the audio playing feature is used to describe an audio playing effect of the device to be configured. The audio playing characteristics may include subjective indicators of sound quality such as volume, pitch, and timbre, and objective indicators of sound quality such as Frequency Response (FR) and harmonic distortion (THD) curves. The frequency response curve is used for describing the difference of the processing capacity of the equipment for signals with different frequencies electronically, and is a curve of the variation of gain along with frequency; the harmonic distortion curve refers to the harmonic cost of the output signal more than the input signal, and the harmonic distortion is caused by the fact that the system is not completely linear. The sum of all the additional harmonic levels is called total harmonic distortion, the total harmonic distortion is related to frequency, and further, a total harmonic distortion curve refers to a curve of the total harmonic distortion changing along with the frequency. The present implementation preferably uses objective features such as FR and THD to measure the audio playing characteristics of the device to be configured.
In this embodiment, the device to be configured is adjusted within the range of the target equalizer parameter, so as to obtain the audio playing characteristics of the device to be configured under each equalizer parameter, and according to the audio playing characteristics under each equalizer parameter, the equalizer parameter value with the best audio playing effect is selected as the target equalizer parameter value of the device to be configured.
Exemplarily, a frequency response curve and a harmonic distortion curve of a device to be configured when the device to be configured plays audio are obtained within a parameter range of a target equalizer; and determining the target equalizer parameter value of the equipment to be configured from the target equalizer parameter range according to the frequency response curve and the harmonic distortion curve. For example, the frequency response range of the frequency response curve at least covers the sound range which can be heard by human ears from 20Hz to 20KHz, and the flatter the frequency response curve is, the better the frequency response curve is; harmonic distortion is difficult to inhibit, the damage to tone quality is large, particularly odd harmonics are not musical, the damage to tone quality is great, and therefore the smaller the THD value is, the better the THD value is.
According to the technical scheme of the embodiment, the candidate equalizer parameter range associated with each candidate use distance is predetermined according to the voice response rate of the equipment, so that when the equalizer parameter configuration is performed on the equipment to be configured, the target equalizer parameter range of the equipment to be configured is determined according to the target use distance of the equipment to be configured and the mapping relation between the candidate use distance and the candidate equalizer parameter range, and the target equalizer parameter value of the equipment to be configured is determined in the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured. According to the embodiment of the invention, by evaluating the voice response rate of the intelligent equipment, the audio playing effect of the equipment is improved on the premise of meeting the voice response rate, the sound effect of the intelligent equipment is ensured, and the voice response rate and the audio playing effect under the condition of internal noise reach the optimal balance point.
Example two
In this embodiment, on the basis of the first embodiment, a preferred implementation of the device configuration method is provided, and the candidate equalizer parameter range corresponding to the candidate use distance can be determined according to the voice response rate of the device in the candidate use distance. Fig. 2 is a flowchart of an apparatus configuration method according to a second embodiment of the present invention, and as shown in fig. 2, the method specifically includes the following steps:
and S210, if the reverberation time of the tested device under any candidate use distance exceeds the standard reverberation time range, ending the test of the equalizer parameter under the candidate use distance.
In the specific embodiment of the present invention, devices with the same model as various devices to be configured may be adopted as devices to be tested in advance, and the establishment of the mapping relationship between the candidate use distance and the candidate equalizer parameter range is realized by testing the value of the equalizer parameter range of the devices to be tested at different candidate use distances.
In this embodiment, the reverberation time is the time required for the sound pressure level to be reduced by 60dB through the attenuation process in the room after the sound source stops sounding. The reverberation time is too short, the sound is dry, and the sound is dry, tasteless and not close to nature; the reverberation time is too long, so that the sound is unclear; when the reverberation time is proper, the sound is mellow and smooth. Furthermore, when the mapping relationship between the candidate use distance and the candidate equalizer parameter range is established, the embodiment may select a reverberation room or other reverberation field condition for testing, so as to use the reverberation time as one of the conditions for adjusting the sound quality of the device. The reverberation time is an important evaluation index for the quantitative estimation of acoustic energy in acoustic design, and standard reverberation times applicable to different reverberation conditions are different. For example, the reverberation time suitable for movie presentations is typically no more than 0.8 seconds, and the reverberation time suitable for concert halls is typically 1.5 seconds.
In the available distance range of the equipment, a plurality of distances can be selected in advance to serve as candidate use distances respectively, and the distance between the tested equipment and the voice simulation sound source is fixed to serve as the candidate use distance. At each candidate use distance, the reverberation time of the reverberation chamber is recorded. And comparing the reverberation time under each candidate use distance with the standard reverberation time under the current reverberation condition, if the reverberation time under any candidate use distance exceeds the standard reverberation time range, rejecting the candidate use distance, and finishing the test of the equalizer parameters under the candidate use distance.
S220, in the process that the tested device plays the audio, determining the voice response rate of the tested device according to the response result of the tested device to the voice played by the voice simulation sound source and the voice playing times of the voice simulation sound source.
In the specific embodiment of the invention, the tested device is controlled to play white noise with the maximum volume, the voice simulation sound source is controlled to play voice commands with a certain frequency and times, the sound pressure level of the voice simulation sound source is adjusted to be standard sound pressure, such as 94dB, and the response result of the tested device to each voice command is determined. For example, assuming that a voice simulation sound source is controlled to play a voice control command 1000 times at a certain frequency, and the tested device is determined to successfully respond 980 times through detection of a response result, the voice response rate of the tested device under the current candidate use distance is calculated to be 98%.
And S230, determining the candidate equalizer parameter range of the tested device under the candidate using distance according to the voice response rate of the tested device.
In the specific embodiment of the invention, the candidate equalizer parameter range is determined according to the voice response rate of the tested device to the voice control instruction. The voice response rate is the ratio of the number of successful responses of the equipment to the total number of voice commands under the control of a certain number of voice commands. Based on the voice response rate condition, when the tested device crosses the critical point which meets and does not meet the voice response rate condition and the critical point which does not meet and meets the voice response rate condition, the upper and lower limit values of the equalizer parameter of the tested device are determined, so that the candidate equalizer parameter range of the tested device under the candidate use distance is obtained.
Optionally, if the voice response rate of the device under test is smaller than the first response rate threshold, the gain of the equalizer parameter in the low-frequency component is reduced until the obtained voice response rate is equal to the first response rate threshold, and the current equalizer parameter is used as a lower limit value in the candidate equalizer parameter range; if the voice response rate of the tested equipment is greater than the second response rate threshold value, the gain of the equalizer parameter in the low-frequency component is increased until the obtained voice response is equal to the second response rate threshold value, and the current equalizer parameter is used as an upper limit value in the candidate equalizer parameter range.
In this embodiment, the response rate threshold refers to a successful response rate condition required by the intelligent device of the tested device of the same type, for example, the response rate threshold of the intelligent sound box may be required to be 98%. The response rate may be a response success rate of the device for any voice control instruction, or may also be a response success rate of the device for a specific type or a certain voice control instruction, for example, an awakening rate of the device for a voice awakening word. In addition, the response rate threshold may be a single threshold, or may be a range of response rate thresholds, that is, the first response rate threshold and the second response rate threshold are included, and the first response rate threshold may be greater than the second response rate threshold.
In this embodiment, if the voice response rate of the device under test is less than the first response rate threshold, that is, the voice response rate condition is not satisfied, the gain of the equalizer parameter in the device under test at the low frequency component is reduced, for example, the gain of the frequency component of 50-300Hz is reduced, so as to reduce the THD of the loudspeaker at the low frequency band, and improve the echo cancellation processing effect of AEC, until the obtained voice response rate is equal to the first response rate threshold, that is, when the voice response rate condition is satisfied, the current equalizer parameter is used as the lower limit value in the candidate equalizer parameter range. Similarly, if the voice response rate of the tested device is greater than the second response rate threshold, that is, the voice response rate condition is met, the gain of the equalizer parameter in the tested device at the low-frequency component is increased, so as to improve the THD of the loudspeaker at the low-frequency band, and reduce the echo cancellation processing effect of the AEC, until the obtained voice response is equal to the second response rate threshold, that is, when the voice response rate condition is about to be met and the voice response rate condition is not met, the current equalizer parameter is used as the upper limit value in the candidate equalizer parameter range.
By way of example, FIG. 3 is an exemplary diagram of a parametric test system. As shown in fig. 3, under the condition of the reverberation field of the reverberation chamber, a human simulator and the like can be selected in advance as a voice simulation sound source, an intelligent device with the same model as that of the device to be configured is selected as a device to be tested for testing, and the distance between the device to be tested and the voice simulation sound source is fixedly set as a candidate use distance L. The sound pressure level of the voice analog sound source is tested by the PC and external test equipment such as a sound card, a power amplifier, an adapter and the like through a standard microphone, the adapter, the sound card and the PC, and the sound pressure level at the voice analog sound source is determined to be 94 dB. The equalizer curve is burned into the tested equipment through the PC, and the voice simulation sound source is controlled through the PC, the sound card and the power amplifier to play the voice command at a certain frequency and times, for example, 1000 times. And in the testing process, determining the response result of the tested device to each voice instruction, and calculating the voice response rate of the tested device under the current candidate using distance. And respectively determining the upper limit value and the lower limit value of the parameter under the current candidate use distance according to the voice response rate of the tested device under the current candidate use distance and the threshold value of the response rate as critical points, so as to obtain the parameter range of the candidate equalizer under the current candidate use distance.
S240, determining a target equalizer parameter range according to the target use distance of the equipment to be configured and the mapping relation between the candidate use distance and the candidate equalizer parameter range.
In the embodiment of the present invention, the target use distance of the device to be configured is compared with the candidate use distance, and the candidate equalizer parameter range associated with the candidate use distance closest to the target use distance may be used as the target equalizer parameter range of the device to be configured according to the comparison result.
And S250, acquiring a frequency response curve and a harmonic distortion curve when the equipment to be configured plays audio within the parameter range of the target equalizer.
In the specific embodiment of the invention, the device to be configured is adjusted within the parameter range of the target equalizer, and the device to be configured is controlled to play audio. FR and THD of the equipment to be configured during audio playing can be acquired through equipment such as an oscilloscope.
And S260, determining a target equalizer parameter value of the equipment to be configured from the target equalizer parameter range according to the frequency response curve and the harmonic distortion curve.
In the specific embodiment of the invention, the frequency response range of the frequency response curve at least covers the sound range which can be heard by human ears from 20Hz to 20KHz, and the flatter and straighter the frequency response curve is, the better; harmonic distortion is difficult to inhibit and has great harm to tone quality, particularly odd harmonics are non-musical and have great harm to tone quality, so that the smaller the value of THD is, the better the value is. Therefore, the target equalizer parameter value of the device to be configured can be determined from the target equalizer parameter range according to the shape of the curve based on FR and THD collected during the adjustment of the equalizer parameter of the device to be configured within the target equalizer parameter range. In addition, the tone quality of the voice signal heard manually can be obtained through artificial subjective listening, the person hearing the audio can input the heard tone quality into the PC, and the PC integrates objective indexes and subjective indexes to determine the target equalizer parameter value of the equipment to be configured from the target equalizer parameter range. Therefore, the best tone quality playing effect is obtained on the premise of ensuring the voice response rate.
According to the technical scheme of the embodiment, the reverberation time can be used as a standard, and candidate use distances which do not meet requirements can be removed. Under the scene of each candidate use distance, in the process of playing the audio by the tested device, determining the voice response rate of the tested device according to the response result of the tested device to the voice played by the voice simulation sound source and the voice playing times of the voice simulation sound source, determining the candidate equalizer parameter range of the tested device under the candidate use distance according to the voice response rate, and establishing the mapping relation between the candidate use distance and the candidate equalizer parameter range. Therefore, when the equalizer parameter configuration is carried out on the equipment to be configured, the target equalizer parameter range of the equipment to be configured is determined according to the target use distance of the equipment to be configured and the mapping relation, and the target equalizer parameter value of the equipment to be configured is determined according to the audio playing characteristic of the equipment to be configured. According to the embodiment of the invention, by evaluating the voice response rate of the intelligent equipment, the audio playing effect of the equipment is improved on the premise of meeting the voice response rate, the sound effect of the intelligent equipment is ensured, and the voice response rate and the audio playing effect under the condition of internal noise reach the optimal balance point.
EXAMPLE III
Fig. 4 is a schematic structural diagram of an apparatus configuration device according to a third embodiment of the present invention, where this embodiment is applicable to a case where an equalizer parameter configuration is performed on an intelligent device, and the apparatus can implement the apparatus configuration method according to any embodiment of the present invention. The device specifically comprises the following steps:
a parameter range determining module 410, configured to determine a target equalizer parameter range according to a target use distance of the device to be configured and a mapping relationship between the candidate use distance and the candidate equalizer parameter range; wherein the candidate equalizer parameter range is determined according to the voice response rate at the candidate use distance;
a parameter configuration module 420, configured to determine, according to the audio playing characteristic of the device to be configured, a target equalizer parameter value of the device to be configured within the target equalizer parameter range.
Optionally, the apparatus further comprises a candidate parameter range determining module 430; the candidate parameter range determination module 430 comprises:
the response rate determining unit 4301 is configured to determine, in a process that the device under test plays an audio, a voice response rate of the device under test according to a response result of the device under test to a voice played by a voice simulation sound source and the number of times of playing the voice of the voice simulation sound source;
a candidate equalizer parameter range determining unit 4302, configured to determine, according to a voice response rate of the device under test, a candidate equalizer parameter range of the device under test at the candidate using distance; wherein the candidate using distance is the distance between the tested device and the voice simulation sound source.
Optionally, the candidate equalizer parameter range determining unit 4302 is specifically configured to:
if the voice response rate of the tested equipment is smaller than a first response rate threshold value, the gain of the equalizer parameter in the low-frequency component is reduced until the obtained voice response rate is equal to the first response rate threshold value, and the current equalizer parameter is used as a lower limit value in the candidate equalizer parameter range;
if the voice response rate of the tested equipment is greater than a second response rate threshold value, the gain of the equalizer parameter in the low-frequency component is increased until the obtained voice response is equal to the second response rate threshold value, and the current equalizer parameter is used as an upper limit value in the candidate equalizer parameter range.
Further, the candidate parameter range determining module 430 further includes a distance filtering unit 4303, specifically configured to:
before determining the voice response rate of the tested device according to the response result of the tested device to the voice played by the voice simulation sound source and the voice playing times of the voice simulation sound source, if the reverberation time of the tested device at any candidate using distance exceeds the standard reverberation time range, ending the test of the equalizer parameters at the candidate using distance.
Optionally, the parameter configuration module 420 is specifically configured to:
acquiring a frequency response curve and a harmonic distortion curve when the device to be configured plays audio within the parameter range of the target equalizer;
and determining a target equalizer parameter value of the equipment to be configured from the target equalizer parameter range according to the frequency response curve and the harmonic distortion curve.
According to the technical scheme of the embodiment, through the mutual cooperation of all the functional modules, the functions of building a test environment, detecting reverberation time, screening candidate distances, playing a voice control instruction, calculating voice response rate, determining a parameter range of an equalizer, determining a target equalizer parameter range of equipment to be configured, detecting tone quality, configuring equalizer parameters and the like are achieved. According to the embodiment of the invention, by evaluating the voice response rate of the intelligent equipment, the audio playing effect of the equipment is improved on the premise of meeting the voice response rate, the sound effect of the intelligent equipment is ensured, and the voice response rate and the audio playing effect under the condition of internal noise reach the optimal balance point.
Example four
Fig. 5 is a schematic structural diagram of an apparatus according to a fourth embodiment of the present invention, and fig. 5 shows a block diagram of an exemplary apparatus suitable for implementing the embodiment of the present invention. The device shown in fig. 5 is only an example and should not bring any limitation to the function and the scope of use of the embodiments of the present invention.
The device 500 shown in fig. 5 is only an example and should not bring any limitations to the functionality and scope of use of the embodiments of the present invention. The device 500 is preferably a smart device having both audio playback and voice recognition capabilities.
As shown in fig. 5, device 500 is in the form of a general purpose computing device. The components of device 500 may include, but are not limited to: one or more processors 510, a system memory 520, and a bus 530 that couples the various system components (including the system memory 520 and the processors 510).
Bus 530 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, such architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, micro-channel architecture (MAC) bus, enhanced ISA bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus.
Device 500 typically includes a variety of computer system readable media. Such media can be any available media that is accessible by device 500 and includes both volatile and nonvolatile media, removable and non-removable media.
The system memory 520 may include computer system readable media in the form of volatile memory, such as Random Access Memory (RAM)521 and/or cache memory 522. The device 500 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 523 may be used to read from and write to non-removable, nonvolatile magnetic media (not shown in FIG. 5 and commonly referred to as a "hard disk drive"). Although not shown in FIG. 5, a magnetic disk drive for reading from and writing to a removable, nonvolatile magnetic disk (e.g., a "floppy disk") and an optical disk drive for reading from or writing to a removable, nonvolatile optical disk (e.g., a CD-ROM, DVD-ROM, or other optical media) may be provided. In these cases, each drive may be connected to bus 530 by one or more data media interfaces. System memory 520 may include at least one program product having a set (e.g., at least one) of program modules that are configured to carry out the functions of embodiments of the invention.
A program/utility 525 having a set (at least one) of program modules 524 may be stored, for example, in system memory 520, such program modules 524 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each of which examples or some combination thereof may include an implementation of a network environment. Program modules 524 generally perform the functions and/or methodologies of embodiments described herein.
Device 500 may also communicate with one or more external devices 540 (e.g., keyboard, pointing device, display 541, etc.), one or more devices that enable a user to interact with device 500, and/or any devices (e.g., network card, modem, etc.) that enable device 500 to communicate with one or more other computing devices. Such communication may occur via input/output (I/O) interfaces 550. Also, device 500 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the internet) via network adapter 560. As shown, the network adapter 560 communicates with the other modules of the device 500 over a bus 530. It should be understood that although not shown in the figures, other hardware and/or software modules may be used in conjunction with the device 500, including but not limited to: microcode, device drivers, redundant processors, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
The processor 510 executes various functional applications and data processing, for example, implementing a device configuration method provided by an embodiment of the present invention, by executing programs stored in the system memory 520.
EXAMPLE five
An embodiment of the present invention further provides a computer-readable storage medium, on which a computer program (or referred to as computer-executable instructions) is stored, where the computer program is used for executing, by a processor, a device configuration method, where the method includes:
determining a target equalizer parameter range according to a target use distance of the equipment to be configured and a mapping relation between the candidate use distance and the candidate equalizer parameter range; wherein the candidate equalizer parameter range is determined according to the voice response rate at the candidate use distance;
and determining the target equalizer parameter value of the equipment to be configured within the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured.
Computer storage media for embodiments of the invention may employ any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for embodiments of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or device. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the embodiments of the present invention have been described in more detail through the above embodiments, the embodiments of the present invention are not limited to the above embodiments, and many other equivalent embodiments may be included without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (8)

1. A device configuration method, comprising:
in the process of playing audio by the tested equipment, determining the voice response rate of the tested equipment according to the response result of the tested equipment to the voice played by the voice simulation sound source and the voice playing times of the voice simulation sound source; determining a candidate equalizer parameter range of the tested device under a candidate use distance according to the voice response rate of the tested device; wherein the candidate using distance is the distance between the tested device and the voice simulation sound source;
determining a target equalizer parameter range according to a target use distance of the equipment to be configured and a mapping relation between the candidate use distance and the candidate equalizer parameter range;
and determining the target equalizer parameter value of the equipment to be configured within the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured.
2. The method of claim 1, wherein the determining the candidate equalizer parameter range of the device under test at the candidate use distance according to the voice response rate of the device under test comprises:
if the voice response rate of the tested equipment is smaller than a first response rate threshold value, the gain of the equalizer parameter in the low-frequency component is reduced until the obtained voice response rate is equal to the first response rate threshold value, and the current equalizer parameter is used as a lower limit value in the candidate equalizer parameter range;
if the voice response rate of the tested equipment is greater than a second response rate threshold value, the gain of the equalizer parameter in the low-frequency component is increased until the obtained voice response is equal to the second response rate threshold value, and the current equalizer parameter is used as an upper limit value in the candidate equalizer parameter range.
3. The method according to claim 1, before determining the voice response rate of the device under test according to the response result of the device under test to the voice played by the voice simulation sound source and the number of times of playing the voice of the voice simulation sound source, further comprising:
and if the reverberation time of the tested device under any candidate use distance exceeds the standard reverberation time range, finishing the test of the equalizer parameters under the candidate use distance.
4. The method of claim 1, wherein determining the target equalizer parameter value of the device to be configured within the target equalizer parameter range according to the audio playing characteristics of the device to be configured comprises:
acquiring a frequency response curve and a harmonic distortion curve when the device to be configured plays audio within the parameter range of the target equalizer;
and determining a target equalizer parameter value of the equipment to be configured from the target equalizer parameter range according to the frequency response curve and the harmonic distortion curve.
5. An apparatus configuration device, comprising:
a candidate parameter range determination module comprising:
the response rate determining unit is used for determining the voice response rate of the tested equipment according to the response result of the tested equipment to the voice played by the voice simulation sound source and the voice playing times of the voice simulation sound source in the process of playing the audio by the tested equipment;
the candidate equalizer parameter range determining unit is used for determining the candidate equalizer parameter range of the tested device under the candidate using distance according to the voice response rate of the tested device; wherein the candidate using distance is the distance between the tested device and the voice simulation sound source;
the parameter range determining module is used for determining the parameter range of the target equalizer according to the target use distance of the equipment to be configured and the mapping relation between the candidate use distance and the candidate equalizer parameter range;
and the parameter configuration module is used for determining a target equalizer parameter value of the equipment to be configured in the target equalizer parameter range according to the audio playing characteristics of the equipment to be configured.
6. The apparatus according to claim 5, wherein the candidate equalizer parameter range determining unit is specifically configured to:
if the voice response rate of the tested equipment is smaller than a first response rate threshold value, the gain of the equalizer parameter in the low-frequency component is reduced until the obtained voice response rate is equal to the first response rate threshold value, and the current equalizer parameter is used as a lower limit value in the candidate equalizer parameter range;
if the voice response rate of the tested equipment is greater than a second response rate threshold value, the gain of the equalizer parameter in the low-frequency component is increased until the obtained voice response is equal to the second response rate threshold value, and the current equalizer parameter is used as an upper limit value in the candidate equalizer parameter range.
7. An electronic device, comprising:
one or more processors;
a memory for storing one or more programs;
the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the device configuration method of any of claims 1-4.
8. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out a method for configuring a device according to any one of claims 1 to 4.
CN201910758095.6A 2019-08-16 2019-08-16 Equipment configuration method, device, equipment and storage medium Active CN110475181B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910758095.6A CN110475181B (en) 2019-08-16 2019-08-16 Equipment configuration method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910758095.6A CN110475181B (en) 2019-08-16 2019-08-16 Equipment configuration method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN110475181A CN110475181A (en) 2019-11-19
CN110475181B true CN110475181B (en) 2021-04-30

Family

ID=68510154

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910758095.6A Active CN110475181B (en) 2019-08-16 2019-08-16 Equipment configuration method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN110475181B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111405111A (en) * 2020-03-11 2020-07-10 惠州Tcl移动通信有限公司 Audio power amplifier control method and device, storage medium and mobile terminal
CN112382305B (en) * 2020-10-30 2023-09-22 北京百度网讯科技有限公司 Method, apparatus, device and storage medium for adjusting audio signal
CN114697804A (en) * 2020-12-28 2022-07-01 深圳Tcl数字技术有限公司 Audio equalization method and device, intelligent terminal and computer readable storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20090083112A (en) * 2008-01-29 2009-08-03 한국과학기술원 Noise cancellation apparatus and method thereof
CN102947878A (en) * 2010-06-01 2013-02-27 高通股份有限公司 Systems, methods, devices, apparatus, and computer program products for audio equalization
CN105513592A (en) * 2014-10-13 2016-04-20 福特全球技术公司 Acoustic impulse response simulation
CN106898348A (en) * 2016-12-29 2017-06-27 北京第九实验室科技有限公司 It is a kind of go out acoustic equipment dereverberation control method and device
CN108600898A (en) * 2018-03-28 2018-09-28 深圳市冠旭电子股份有限公司 A kind of method, wireless sound box and the terminal device of configuration wireless sound box
CN109192193A (en) * 2018-08-14 2019-01-11 四川虹美智能科技有限公司 A kind of speech recognition product test method and test device
WO2019070328A1 (en) * 2017-10-04 2019-04-11 Google Llc Methods and systems for automatically equalizing audio output based on room characteristics
CN109905808A (en) * 2019-03-13 2019-06-18 北京百度网讯科技有限公司 Method and apparatus for adjusting intelligent sound equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4247002B2 (en) * 2003-01-22 2009-04-02 富士通株式会社 Speaker distance detection apparatus and method using microphone array, and voice input / output apparatus using the apparatus
GB201518004D0 (en) * 2015-10-12 2015-11-25 Microsoft Technology Licensing Llc Audio signal processing

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20090083112A (en) * 2008-01-29 2009-08-03 한국과학기술원 Noise cancellation apparatus and method thereof
CN102947878A (en) * 2010-06-01 2013-02-27 高通股份有限公司 Systems, methods, devices, apparatus, and computer program products for audio equalization
CN105513592A (en) * 2014-10-13 2016-04-20 福特全球技术公司 Acoustic impulse response simulation
CN106898348A (en) * 2016-12-29 2017-06-27 北京第九实验室科技有限公司 It is a kind of go out acoustic equipment dereverberation control method and device
WO2019070328A1 (en) * 2017-10-04 2019-04-11 Google Llc Methods and systems for automatically equalizing audio output based on room characteristics
CN108600898A (en) * 2018-03-28 2018-09-28 深圳市冠旭电子股份有限公司 A kind of method, wireless sound box and the terminal device of configuration wireless sound box
CN109192193A (en) * 2018-08-14 2019-01-11 四川虹美智能科技有限公司 A kind of speech recognition product test method and test device
CN109905808A (en) * 2019-03-13 2019-06-18 北京百度网讯科技有限公司 Method and apparatus for adjusting intelligent sound equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
智能家居语音控制系统的设计;赵亮;《中国优秀硕士学位论文全文数据库 工程科技II辑》;20180415;全文 *

Also Published As

Publication number Publication date
CN110475181A (en) 2019-11-19

Similar Documents

Publication Publication Date Title
CN110475181B (en) Equipment configuration method, device, equipment and storage medium
RU2467406C2 (en) Method and apparatus for supporting speech perceptibility in multichannel ambient sound with minimum effect on surround sound system
US20210256971A1 (en) Detection of replay attack
CN109831733A (en) Test method, device, equipment and the storage medium of audio broadcast performance
CN111798852B (en) Voice wakeup recognition performance test method, device, system and terminal equipment
US20140337021A1 (en) Systems and methods for noise characteristic dependent speech enhancement
KR20090123921A (en) Systems, methods, and apparatus for signal separation
CN109361995B (en) Volume adjusting method and device for electrical equipment, electrical equipment and medium
CN112185406A (en) Sound processing method, sound processing device, electronic equipment and readable storage medium
CN109905808B (en) Method and apparatus for adjusting intelligent voice device
CN113241085B (en) Echo cancellation method, device, equipment and readable storage medium
CN108461081A (en) Method, apparatus, equipment and the storage medium of voice control
CN112437391B (en) Microphone testing method and system for open environment
KR20050104976A (en) Method and apparatus for measuring the speech quality
CN113553022A (en) Equipment adjusting method and device, mobile terminal and storage medium
WO2018211908A1 (en) Playback system, control device, control method, and program
CN108900959B (en) Method, device, equipment and computer readable medium for testing voice interaction equipment
CN109600697A (en) The outer playback matter of terminal determines method and device
CN109741761B (en) Sound processing method and device
CN111785298A (en) Acoustic performance testing method and device, electronic equipment and computer readable medium
WO2023051622A1 (en) Method for improving far-field speech interaction performance, and far-field speech interaction system
US11579840B2 (en) Method for adjusting sound playback and portable device thereof
CN114302301B (en) Frequency response correction method and related product
CN110390954A (en) Method and device for evaluating quality of voice product
JP4173462B2 (en) Microphone position determination method, microphone position determination device, microphone position determination program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant