WO2021036970A1 - Loudspeaker box control method, loudspeaker box, and loudspeaker box system - Google Patents

Loudspeaker box control method, loudspeaker box, and loudspeaker box system

Info

Publication number
WO2021036970A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
sound box
speaker
distance
box
Prior art date
Application number
PCT/CN2020/110720
Other languages
French (fr)
Chinese (zh)
Inventor
蒋幼宇
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2021036970A1 publication Critical patent/WO2021036970A1/en

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/007Two-channel systems in which the audio signals are in digital form
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation

Definitions

  • This application relates to the technical field of terminals, and in particular to a speaker control method, a speaker, and a speaker system.
  • the main activity area (which can be referred to as the active area) of the people in a room is relatively fixed. For example, if a person spends a long time on the sofa (resting, watching TV, listening to music, reading, etc.), then the area where the sofa is located is the active area. Once the speakers are placed at fixed positions in the room, the position from which better sound can be heard is also determined. Take two speakers as an example. Suppose speaker 1 and speaker 2 are set on both sides of the TV cabinet. Only a fixed area opposite the TV cabinet (the fixed area forms an equilateral triangle with area 1, where speaker 1 is located, and area 2, where speaker 2 is located) obtains a better stereo sound effect, while in other areas, such as the dining area, the stereo sound effect of the speakers is weakened, which affects the user experience.
  • the purpose of this application is to provide a speaker control method, a speaker, and a speaker system, so that users can obtain better stereo sound effects at different positions in the room.
  • a speaker control method is provided, which can be applied to a speaker group; the speaker group includes a first speaker and a second speaker, and the first speaker and the second speaker are set at different positions.
  • the method includes: the first sound box collects a first sound signal, and the second sound box collects a second sound signal; the first sound box determines the position of the sound source according to the first sound signal and the second sound signal; the first sound box determines a first distance between the sound source and the first sound box and a second distance between the sound source and the second sound box; the first sound box determines a time delay difference according to the first distance and the second distance.
  • the first sound box sends a first instruction to the second sound box, and the first instruction is used to instruct the second sound box to emit a sound at a second time; the second time is determined based on a first time and the time delay difference, and the first time is the sounding time of the first speaker.
  • the first speaker emits a third sound signal at the first time, and the second sound box emits a fourth sound signal at the second time; the third sound signal and the fourth sound signal are signals of different channels of the same audio file.
  • the speaker group can adjust the sound parameters (for example, the sounding time) of the first speaker and the second speaker according to the user's position, so that the user can obtain a better stereo sound effect regardless of where the user is in the room.
  • the first sound box determines a first loudness gain according to the first distance; the first sound box determines a second loudness gain according to the second distance; the first sound box adjusts the loudness of the first sound box according to the first loudness gain; the first sound box sends a second instruction to the second sound box, and the second instruction is used to instruct the second sound box to adjust the loudness of the second sound box based on the second loudness gain.
  • the speaker group can adjust the sound parameters (for example, the sound loudness) of the first speaker and the second speaker according to the user's position, so that the user can obtain a better stereo sound effect regardless of where the user is in the room.
  • the first sound box determining the position of the sound source according to the first sound signal and the second sound signal includes: the first sound box determines, according to the first sound signal, a first angle of the sound source in a first coordinate system; the first sound box determines, according to the second sound signal, a second angle of the sound source in the first coordinate system; the first sound box determines the position of the sound source according to the first angle, the second angle, and the distance between the first sound box and the second sound box.
  • the speaker group can adjust the sound parameters of the first speaker and the second speaker (for example, sounding time, sound loudness, etc.) according to the user's position, so that the user can obtain a better stereo sound effect no matter where the user is in the room.
  • the first speaker in the speaker group can perform the calculation of the user's position. For example, the first speaker collects the user's first sound signal, the second speaker collects the user's second sound signal, and the second speaker sends the second sound signal to the first speaker.
  • the first sound box can determine the first angle of the sound source in the first coordinate system according to the first sound signal, determine the second angle of the sound source in the first coordinate system according to the second sound signal, and then determine the position of the sound source according to the first angle, the second angle, and the distance between the first sound box and the second sound box.
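  • the position determination described above can be sketched as a two-bearing triangulation. The source does not define the first coordinate system, so this sketch assumes the first sound box sits at the origin, the second sound box sits on the positive x-axis at the known spacing, and both angles are measured counter-clockwise from the positive x-axis. The function name `locate_source` and its parameters are hypothetical.

```python
import math


def locate_source(angle1_rad, angle2_rad, spacing_m):
    """Triangulate a sound source from two bearings and the box spacing.

    Assumed geometry (not specified by the source): box 1 at (0, 0),
    box 2 at (spacing_m, 0), angles measured from the positive x-axis.
    Returns ((x, y), distance_to_box1, distance_to_box2).
    """
    # The angle at the source vertex of the triangle is angle2 - angle1.
    denom = math.sin(angle2_rad - angle1_rad)
    if abs(denom) < 1e-9:
        raise ValueError("bearings are parallel; position is undetermined")
    # Law of sines: each side is proportional to the sine of the opposite angle.
    dist1 = spacing_m * math.sin(angle2_rad) / denom
    dist2 = spacing_m * math.sin(angle1_rad) / denom
    x = dist1 * math.cos(angle1_rad)
    y = dist1 * math.sin(angle1_rad)
    return (x, y), dist1, dist2


if __name__ == "__main__":
    position, d1, d2 = locate_source(math.radians(45), math.radians(135), 2.0)
    print(position, d1, d2)
```

  • the two returned distances correspond to the first distance and the second distance used later for the time delay difference and the loudness gains.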
  • the calculation process of the user position performed by the first speaker can also be performed by the second speaker, that is, the first speaker sends the first sound signal to the second speaker, and the second speaker executes the foregoing process.
  • alternatively, the first speaker performs the process of calculating the first angle, the second speaker performs the process of calculating the second angle, and the second speaker sends the calculated second angle to the first speaker; the first sound box then determines the position of the sound source according to the received second angle, the first angle calculated by itself, and the distance between the first sound box and the second sound box.
  • before the first sound box determines the position of the sound source according to the first sound signal and the second sound signal, the first sound box determines that the first sound signal and the second sound signal include a wake-up word.
  • the user can wake up the speaker group (one or more speakers in the speaker group can be awakened); when the speaker group detects that the sound signal sent by the user includes a wake-up word, it determines the user's position and then adjusts the sound parameters of the first and second speakers (for example, sounding time, loudness, etc.) according to the user's position.
  • when the speaker group is not awakened, it may not perform the above speaker control method; when the speaker group is awakened, it can give the user a better stereo sound effect no matter where the user is in the room.
  • after the first sound box determines the position of the sound source according to the first sound signal and the second sound signal, and before the first sound box emits the third sound signal at the first time, the first sound box determines that the sound source is located in an active area; the active area is an area in which the number of times the user uses the sound box group is greater than a first preset threshold, or in which the frequency of using the speaker group is greater than a second preset threshold.
  • the user can wake up the speaker group. When the speaker group detects that the sound signal sent by the user includes a wake-up word, it determines the user's location. If the user is in the active area, the above speaker control method is executed; if the user is not in the active area, the above speaker control method is not executed.
  • when the speaker group implements the speaker control method, the sound parameters of the first speaker and the second speaker can be adjusted according to the user's position (for example, sounding time, sound loudness, etc.), so that the user can obtain a better stereo sound effect no matter where the user is in the room.
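  • the wake-word and active-area gating described above can be sketched as follows. The source only states that an active area is one whose usage count (or usage frequency) exceeds a preset threshold; dividing the room into square grid cells, the class name `ActiveAreaTracker`, the cell size, and the threshold value are all assumptions made for illustration.

```python
import math
from collections import Counter


class ActiveAreaTracker:
    """Counts uses of the speaker group per grid cell (hypothetical scheme)."""

    def __init__(self, cell_size_m=1.0, count_threshold=5):
        self.cell_size_m = cell_size_m
        self.count_threshold = count_threshold  # the "first preset threshold"
        self.counts = Counter()

    def _cell(self, position):
        # Map an (x, y) position in metres to an integer grid cell.
        x, y = position
        return (math.floor(x / self.cell_size_m),
                math.floor(y / self.cell_size_m))

    def record_use(self, position):
        self.counts[self._cell(position)] += 1

    def is_active(self, position):
        # An area is active once its usage count exceeds the threshold.
        return self.counts[self._cell(position)] > self.count_threshold


def should_run_control(wake_word_detected, tracker, position):
    """Run the control method only after wake-up and inside an active area."""
    return wake_word_detected and tracker.is_active(position)


if __name__ == "__main__":
    tracker = ActiveAreaTracker(cell_size_m=1.0, count_threshold=2)
    for _ in range(3):
        tracker.record_use((0.5, 0.5))
    print(should_run_control(True, tracker, (0.2, 0.9)))
    print(should_run_control(True, tracker, (3.0, 3.0)))
```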
  • the third sound signal is the left channel signal of the audio file, and the fourth sound signal is the right channel signal of the audio file; or, the fourth sound signal is the left channel signal of the audio file, and the third sound signal is the right channel signal of the audio file.
  • the first speaker in the speaker group can send out the left channel signal, and the second speaker can send out the right channel signal; or the first speaker can send out the right channel signal, and the second speaker can send out the left channel signal.
  • the first sound box determining the time delay difference based on the first distance and the second distance includes: the first sound box determines the sound propagation speed according to the current temperature; the first sound box determines a first time length according to the first distance and the sound propagation speed; the first sound box determines a second time length according to the second distance and the sound propagation speed; the first sound box determines the time delay difference according to the first time length and the second time length.
  • the first speaker can determine the time delay difference according to the first distance between the user and the first speaker and the second distance between the user and the second speaker, and then adjust the sounding times of the first speaker and the second speaker according to the time delay difference, so that the user can obtain a better stereo sound effect no matter where the user is in the room.
  • the current temperature value is taken into account, so that the calculated sound propagation speed is more accurate; the time delay between the first speaker and the second speaker is therefore more accurate, and the stereo sound effect of the speaker group can be realized more accurately.
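  • the steps above can be sketched as follows. The source does not give the formula relating temperature to sound speed; this sketch assumes the common linear approximation for dry air, c ≈ 331.3 + 0.606·T m/s with T in degrees Celsius. All function names are hypothetical.

```python
def sound_speed_m_per_s(temperature_c):
    """Approximate speed of sound in dry air (assumed linear model)."""
    return 331.3 + 0.606 * temperature_c


def propagation_durations(first_distance_m, second_distance_m, temperature_c):
    """Return the first and second time lengths (seconds)."""
    speed = sound_speed_m_per_s(temperature_c)
    return first_distance_m / speed, second_distance_m / speed


def time_delay_difference(first_distance_m, second_distance_m, temperature_c):
    """Return the non-negative difference between the two time lengths."""
    first_duration, second_duration = propagation_durations(
        first_distance_m, second_distance_m, temperature_c)
    return abs(first_duration - second_duration)


if __name__ == "__main__":
    # User 4 m from the first sound box and 2 m from the second, at 20 degrees C.
    print(time_delay_difference(4.0, 2.0, 20.0))
```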
  • the first sound box determining the time delay difference according to the first time length and the second time length includes: when the first distance is greater than the second distance, the first time length is greater than the second time length, and the first sound box determines that the difference between the first time length and the second time length is the time delay difference; the second time is the first time delayed by the time delay difference; when the first distance is less than the second distance, the first time length is less than the second time length, and the first sound box determines that the difference between the second time length and the first time length is the time delay difference; the second time is the first time advanced by the time delay difference.
  • when the user is far from the first speaker and closer to the second speaker, the first speaker can emit sound first, and the second speaker can delay its sound, so that the user can obtain a better stereo sound effect from any position in the room.
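  • the rule for deriving the second time from the first time can be sketched as below. The function name `second_emit_time` and the convention of expressing times in seconds on a clock shared by both sound boxes are assumptions; the source does not describe how the two sound boxes synchronize their clocks.

```python
def second_emit_time(first_time_s, first_distance_m, second_distance_m,
                     sound_speed_m_per_s):
    """Return the moment at which the second sound box should emit sound.

    If the user is farther from the first sound box, the second sound box
    is delayed; if the user is closer to the first, it is advanced.
    """
    first_duration = first_distance_m / sound_speed_m_per_s
    second_duration = second_distance_m / sound_speed_m_per_s
    if first_distance_m > second_distance_m:
        # Delay the second sound box by the time delay difference.
        return first_time_s + (first_duration - second_duration)
    if first_distance_m < second_distance_m:
        # Advance the second sound box by the time delay difference.
        return first_time_s - (second_duration - first_duration)
    return first_time_s


if __name__ == "__main__":
    print(second_emit_time(10.0, 3.43, 1.715, 343.0))
```

  • with both rules applied, the two channel signals arrive at the user's position at the same moment.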
  • the first sound box adjusting the loudness of the first sound box according to the first loudness gain includes: the first sound box increases the current loudness of the first sound box according to the first loudness gain; the second instruction is used to instruct the second sound box to decrease the current loudness of the second sound box according to the second loudness gain;
  • or, the first sound box adjusting the loudness of the first sound box according to the first loudness gain includes: the first sound box reduces the current loudness of the first sound box according to the first loudness gain; the second instruction is used to instruct the second sound box to increase the current loudness of the second sound box according to the second loudness gain.
  • when the user is far from the first speaker and closer to the second speaker, the first speaker can increase its sound loudness and the second speaker can reduce its sound loudness, so that the user can obtain a better stereo sound effect no matter where the user is in the room.
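  • the source does not state how a loudness gain is derived from a distance. As one possible illustration, the sketch below assumes free-field inverse-distance attenuation (about 6 dB per doubling of distance) and uses the mean of the two distances as a hypothetical reference, so that the farther sound box receives a positive gain and the nearer one a negative gain. The function name `loudness_gains_db` is hypothetical.

```python
import math


def loudness_gains_db(first_distance_m, second_distance_m):
    """Return (first_gain_db, second_gain_db) under an assumed 1/r model."""
    if first_distance_m <= 0 or second_distance_m <= 0:
        raise ValueError("distances must be positive")
    # Hypothetical reference: the mean of the two distances.
    reference_m = (first_distance_m + second_distance_m) / 2.0
    first_gain = 20.0 * math.log10(first_distance_m / reference_m)
    second_gain = 20.0 * math.log10(second_distance_m / reference_m)
    return first_gain, second_gain


if __name__ == "__main__":
    # User 3 m from the first sound box and 1 m from the second.
    print(loudness_gains_db(3.0, 1.0))
```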
  • the embodiments of the present application also provide a sound box, which includes modules/units that execute the method of the first aspect or any one of the possible designs of the first aspect; these modules/units can be implemented by hardware, or by hardware executing corresponding software.
  • an embodiment of the present application also provides a sound box, which includes: one or more processors, one or more memories, one or more speakers, one or more microphones, and a communication module; the one or more microphones are used to collect sound signals; the communication module is used to communicate with other sound boxes; the one or more speakers are used to emit sound signals; the one or more processors are coupled to the one or more memories; the one or more memories are used to store computer-executable program code; the program code includes instructions, and when the one or more processors execute the instructions, the sound box executes the technical solution of the first aspect and any possible design of the first aspect.
  • an embodiment of the present application provides a chip, which is coupled with a memory in a sound box to implement the technical solution of the first aspect of the embodiments of the present application and any possible design of the first aspect; in the embodiments of the present application, "coupled" means that two components are directly or indirectly connected to each other.
  • a sound box system provided by an embodiment of the present application includes one or more sound boxes, wherein at least one sound box is the sound box described in the second and third aspects above, and that sound box can perform all or part of the steps of the first speaker in the first aspect.
  • an embodiment of the present application provides a sound box system; the sound box system includes a first sound box and a second sound box, and the first sound box and the second sound box are set at different positions; the first sound box and the second sound box can communicate; the first sound box is the sound box described in the second and third aspects above (that is, the first speaker in the technical solution of the first aspect or any one of the possible designs of the first aspect).
  • a computer-readable storage medium includes a computer program; when the computer program runs on a computer, the computer executes the technical solution of the first aspect and any possible design of the first aspect.
  • a program product is provided in the embodiments of the present application; when the computer program product runs on a computer, it causes the computer to execute the technical solution of the first aspect of the embodiments of the present application and any possible design of the first aspect.
  • FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of this application.
  • FIG. 2 is a schematic diagram of the structure of a sound box provided by an embodiment of the application.
  • FIG. 3 is a schematic diagram of the structure of a sound box provided by an embodiment of the application.
  • FIG. 4A is a schematic diagram of the structure of a sound box provided by an embodiment of the application.
  • FIG. 4B is a schematic diagram of a sound box determining the user's position, provided by an embodiment of the application.
  • FIG. 5 is a schematic flowchart of a method for controlling a speaker provided by an embodiment of the application.
  • FIG. 6 is a schematic diagram of establishing a coordinate system between a master speaker and a slave speaker provided by an embodiment of the application;
  • FIG. 7A is a schematic diagram of determining the position of the sound source of the main speaker provided by an embodiment of the application.
  • FIG. 7B is a schematic diagram of determining the position of the sound source of the main speaker provided by an embodiment of the application.
  • FIG. 8 is a schematic flowchart of another method for controlling a speaker provided by an embodiment of the application.
  • FIG. 9 is a schematic diagram of determining the position of the sound source of the main speaker provided by an embodiment of the application.
  • FIG. 10 is a schematic diagram of determining an active area of a speaker provided by an embodiment of the application.
  • references in this specification to "one embodiment" or "some embodiments", etc. mean that one or more embodiments of the present application include a specific feature, structure, or characteristic described in conjunction with the embodiment. Therefore, the phrases "in one embodiment", "in some embodiments", "in some other embodiments", etc. appearing in different places in this specification do not necessarily all refer to the same embodiment, but mean "one or more but not all embodiments" unless specifically emphasized otherwise.
  • the terms "including", "comprising", "having" and their variations all mean "including but not limited to", unless otherwise specifically emphasized.
  • Fig. 1 shows a schematic diagram of an application scenario provided by an embodiment of the present application.
  • as shown in FIG. 1, which is a schematic diagram of a living room in a home, there are multiple speakers in the living room (for example, speakers on the left and right sides of a TV cabinet, such as speaker 1 and speaker 2).
  • the two speakers adjust the time delay, gain and other parameters of the sound in real time, so that the user hears the sound in different positions in a synchronized and three-dimensional manner.
  • the position of the speakers is fixed, and the position where a better stereo sound effect can be obtained is also fixed.
  • Fig. 2 shows a functional block diagram of a speaker provided by an embodiment of the present application.
  • the speaker 100 may include one or more input devices (input devices) 101, one or more output devices (output devices) 102, and one or more processors (processors) 103.
  • the input device 101 can detect various types of input signals (may be abbreviated as: input), and the output device 102 can provide various types of output information (may be abbreviated as: output).
  • the processor 103 may receive input signals from one or more input devices 101, generate output information in response to the input signals, and output through one or more output devices 102.
  • one or more input devices 101 can detect various types of inputs and provide signals (for example, input signals) corresponding to the detected inputs; the one or more input devices 101 can then provide the input signals to one or more processors 103.
  • the one or more input devices 101 may include any component or assembly capable of detecting input signals.
  • the input device 101 may include audio sensors (such as one or more microphones), distance sensors, optical or visual sensors (such as cameras, visible light sensors or invisible light sensors), proximity light sensors, touch sensors, pressure sensors, mechanical devices (for example, a crown, a switch, a button or a key, etc.), a temperature sensor, a communication device (for example, a wired or wireless communication device), etc., or the input device 101 may be some combination of the above-mentioned components.
  • one or more output devices 102 may provide various types of output.
  • one or more output devices 102 may receive one or more signals (for example, an output signal provided by one or more processors 103), and provide an output corresponding to the signal.
  • the output device 102 may include any suitable component or assembly for providing output.
  • the output device 102 may include an audio output device (for example, one or more speakers), a visual output device (for example, one or more lights or displays), a tactile output device, a communication device (for example, a wired or wireless communication device), etc., or the output device 102 may be some combination of the above-mentioned components.
  • one or more processors 103 may be coupled to the input device 101 and the output device 102.
  • the processor 103 can communicate with the input device 101 and the output device 102.
  • one or more processors 103 may receive input signals from the input device 101 (for example, input signals corresponding to the input detected by the input device 101).
  • the one or more processors 103 may parse the received input signal to determine whether to provide one or more corresponding outputs in response to the input signal. If so, one or more processors 103 may send output signals to the output device 102 to provide output.
  • FIG. 3 shows a functional block diagram of a speaker 300 provided by another embodiment of the present application.
  • the sound box 300 may be an example of the sound box 100 described in FIG. 2.
  • the speaker 300 includes a microphone 301, a speaker 302, a processor 303, a memory 304, and a sensor module 306. It is understandable that the components shown in FIG. 3 do not constitute a specific limitation on the speaker 300; the speaker 300 may include more or fewer components than those shown in the figure, combine certain components, split certain components, or use a different arrangement of components.
  • the processor 303 may include one or more processing units.
  • the processor 303 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU), etc.
  • the different processing units may be independent devices or integrated in one or more processors.
  • the controller may be the nerve center and command center of the speaker 300. The controller can generate operation control signals according to the instruction operation code and timing signals to control instruction fetching and execution.
  • the processor 303 may also be provided with a memory for storing instructions and data.
  • the memory in the processor 303 is a cache memory.
  • the memory can store instructions or data that the processor 303 has just used or uses cyclically. If the processor 303 needs to use the instructions or data again, it can call them directly from the memory, which avoids repeated access and reduces the waiting time of the processor 303, thereby improving the efficiency of the system.
  • the processor 303 may run the software code/module of the speaker control method provided in some embodiments of the present application to realize the function of controlling the speaker.
  • the microphone 301, also known as a "mic", is used to collect sound signals (for example, sounds made by users) and convert the sound signals into electrical signals.
  • one or more microphones 301 may be provided on the speaker 300, such as a microphone array.
  • the microphone 301 in addition to collecting sound signals, can also realize the function of noise reduction on the sound signals, or can also identify the source of the sound signals, realize the function of directional recording, and so on.
  • the speaker 302, also called a "loudspeaker", is used to convert audio electrical signals into sound signals.
  • the speaker 300 can play sound signals such as music through the speaker 302.
  • the microphone 301 and the speaker 302 are coupled to the processor 303.
  • after the microphone 301 receives the sound signal, it sends the sound signal, or the audio electrical signal converted from the sound signal, to the processor 303.
  • the processor 303 determines whether to respond to the sound signal or audio electrical signal and, if so, outputs a corresponding output signal, such as playing music through the speaker 302.
  • the memory 304 may be used to store computer executable program code, where the executable program code includes instructions.
  • the processor 303 executes various functional applications and data processing of the speaker 300 by running instructions stored in the memory.
  • the memory may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, a universal flash storage (UFS), etc., which are not limited in the embodiment of the present application.
  • the memory 304 may store information such as "wake-up words".
  • the memory 304 may also store audio information (for example, songs, cross talk, storytelling, etc.).
  • the sensor module 306 may include an air pressure sensor 306A, a temperature sensor 306B, and so on. It should be understood that FIG. 3 only lists several examples of sensors. In practical applications, the speaker 300 may include more or fewer sensors, or use other sensors with the same or similar functions in place of the sensors listed above, which is not limited in the embodiments of this application.
  • the air pressure sensor 306A is used to measure air pressure.
  • the processor 303 may be coupled with the air pressure sensor 306A, and use the air pressure value measured by the air pressure sensor 306A to assist in the calculation, such as calculating the attenuation coefficient of the sound.
  • the temperature sensor 306B is used to detect temperature.
  • the processor 303 may be coupled with the temperature sensor 306B, and use the temperature value measured by the temperature sensor 306B to assist in the calculation, such as calculating the attenuation coefficient of the sound.
  • the communication module 305 may be a wireless communication module (such as Bluetooth, wireless).
  • the speaker 300 is connected to other devices, such as another speaker, mobile phone, TV, etc., through the communication module 305.
  • the speaker 300 may include a display (or display screen), or may not include a display.
  • the display can be used to display the display interface of the application, such as the currently playing song.
  • the display includes a display panel.
  • the display panel can adopt a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), Mini-LED, Micro-LED, Micro-OLED, quantum dot light-emitting diodes (QLED), etc.
  • a touch sensor may be provided in the display to form a touch screen, which is not limited in the embodiment of the present application.
  • the touch sensor is used to detect touch operations acting on or near it.
  • the touch sensor may transmit the detected touch operation to the processor 303 to determine the type of the touch event.
  • the processor 303 may provide visual output related to the touch operation through the display.
  • the speaker 300 shown in FIG. 3 may also include more devices, such as batteries, USB ports, etc., which are not described in detail in the embodiments of the present application.
  • FIG. 4A shows a schematic structural diagram of a sound box provided by an embodiment of the present application.
  • the sound box 400 may be an example of the sound box described in FIG. 2 or FIG. 3.
  • the sound box 400 may include a base 401 and a housing 402.
  • the base 401 can function as a support.
  • the base 401 can support the housing 402 and the components enclosed in the housing 402 (for example, a processor, a microphone, a speaker, etc.).
  • the base 401 may be made of any other supporting material such as metal, plastic, ceramic, etc., or a combination of these materials.
  • one or more speakers 406 may be supported on the base 401.
  • the base 401 may support a fixing member 404, and one or more speakers 406 may be provided on the fixing member 404.
  • the base 401 may support the fixing member 404 through a supporting column 405 or other methods.
  • the fixing member 404 can be any shape, such as a circle, a square, and so on.
  • one or more speakers 406 may be arranged on the fixing member 404 in a certain arrangement. For example, one or more speakers 406 may be evenly distributed on the edge of the fixing member 404, for example, the distance between each speaker is the same.
  • one or more speakers 406 may be coupled with the processor 403.
  • the processor 403 may output audio signals through one or more speakers 406.
  • the housing 402 may be any three-dimensional shape such as a cylinder, a cuboid, or a cube.
  • the housing 402 may enclose components such as the processor 403, the fixing member 404, and one or more speakers 406.
  • the housing 402 may be a single housing member or composed of more than two housing members.
  • the housing 402 may include an upper housing 402a and a side housing 402b.
  • the one or more shell members may be metal, plastic, ceramic, crystal, or a combination of these materials, or any other shell members suitable to be arranged on the sound box, and so on.
  • the side shell 402b may be a shell with a mesh structure, for example, the mesh may be a round hole, a square hole, a hexagonal hole, or the like.
  • the shell of the mesh structure can play the role of decoration, dustproof, and protection of the internal components of the shell (such as speakers, microphones, etc.), and the shell of the mesh structure can reduce the blocking of the sound output by the speaker.
  • the upper casing 402a may be a mesh structure or a casing that is not a mesh structure.
  • the upper housing 402a can be provided with input devices, such as switches, buttons or keys.
  • the switch is used to turn the speaker on or off.
  • Buttons or keys can be used to adjust functions such as volume.
  • a display screen 409 (such as a touch screen) can be provided on the upper housing 402a, which can be used to receive input, provide visual output, and so on.
  • the name of the song currently playing, the name of the singer, etc. can be displayed on the display screen 409.
  • the display screen may not be provided on the speaker, which is not limited in the embodiment of the present application.
  • the upper housing 402a may be connected to the fixing member 407.
  • One or more microphones 408 can be provided on the fixing member 407.
  • the fixing member 407 can be any shape, such as a circle, a square, and so on.
  • one or more microphones 408 may be arranged on the fixing member 407 in a certain arrangement.
  • one or more microphones 408 may be evenly distributed on the edge of the fixing member 407, for example, the distance between each microphone is the same.
  • the center angle a corresponding to each two adjacent microphones (for example, the angle formed by the straight line connecting the two microphones to the center point of the fixing member 407) may be fixed, such as 30 degrees, 60 degrees, and so on.
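  • The even arrangement described above can be sketched as follows. This is a minimal illustration, assuming the microphones sit on a circle around the center point of the fixing member; the function names and the 0.05 m radius are hypothetical and not taken from the application.

```python
import math


def microphone_positions(count, radius):
    """Return (x, y) positions of `count` microphones evenly spaced on a circle."""
    if count < 1:
        raise ValueError("count must be at least 1")
    step = 2.0 * math.pi / count  # center angle between adjacent microphones, radians
    return [(radius * math.cos(i * step), radius * math.sin(i * step))
            for i in range(count)]


def center_angle_degrees(count):
    """Center angle between two adjacent microphones, in degrees."""
    return 360.0 / count


# Six microphones on a ring of radius 0.05 m (illustrative values only).
positions = microphone_positions(6, 0.05)
```

  • With six microphones the center angle is 60 degrees, and with twelve it is 30 degrees, which matches the example angles mentioned above.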
  • one or more microphones 408 may be coupled with the processor 403.
  • the processor 403 may obtain an input signal (such as a sound signal sent by a user) through one or more microphones 408.
  • the application scenario of FIG. 1 is taken as an example, in which the speaker 1 and/or the speaker 2 in FIG. 1 is the speaker 400 shown in FIG. 4A.
  • one of the speaker 1 and the speaker 2 is called the main speaker, and the other is called the slave speaker.
  • the main sound box and the slave sound box may be used in conjunction.
  • for example, the main speaker is used to play the left channel and the slave speaker is used to play the right channel; alternatively, the main speaker is used to play the right channel and the slave speaker is used to play the left channel.
  • the cooperation of the main speaker and the slave speaker can achieve the stereo sound effect of the audio.
  • whether a speaker is a master speaker or a slave speaker can be set before the speaker leaves the factory, or it can be user-defined (for example, the speaker receives an input operation through the touch screen, and the input operation is used to select whether the speaker is the main speaker or the slave speaker).
  • the structure of the main sound box and the slave sound box may be the same.
  • the main sound box and the slave sound box both have the structure shown in FIG. 4A.
  • the structure of the main speaker and the slave speaker may not be exactly the same.
  • the master speaker may be provided with a display screen, but the slave speaker may not be provided with a display screen.
  • the functions of some parts of the main speaker and the slave speaker may not be exactly the same.
  • the processor in the main speaker can be used to calculate the delay difference (for example, the time difference between a first duration and a second duration, where the first duration can be the time required for sound to travel from the main speaker to the user and the second duration can be the time required for sound to travel from the slave speaker to the user), the loudness gain, etc., and the processor in the slave speaker does not have this function.
  • audio files can be stored in the memory in the master speaker and/or the slave speaker, and the master speaker and the slave speaker can play the stored audio files.
  • the main speaker can receive input (for example, receiving input operations through the touch screen, or receiving voice input through the microphone), and the input can be used to activate the main speaker and/or the slave speaker, or to control the playback of the main speaker and the slave speaker, switch songs, etc.
  • one or more microphones in the main speaker collect a sound signal (for example, a sound signal sent by a user), and the processor recognizes that the sound signal contains "wake-up word + play song". When the processor determines that the song does not exist in the memory, the song can be downloaded from the network side, or prompt information (such as voice information) can be output to prompt the user that the song does not exist.
  • the master speaker and/or the slave speaker can be connected to other electronic devices (such as mobile phones, televisions), and can be connected in a wired or wireless manner.
  • for example, after the main speaker is connected to the mobile phone (for example, over a Bluetooth connection), the mobile phone can send the audio signal to the main speaker, so that the main speaker and the slave speaker can play the audio signal (for example, after the main speaker receives the audio signal, it can send the audio signal to the slave speaker).
  • for example, the mobile phone is running a music playing application (for example, Kugou Music) and is playing the song "All the Way North"; the mobile phone can send the audio signal of the song to the main speaker, so that the main speaker and the slave speaker can play the audio signal.
  • the user can control the mobile phone to perform corresponding operations through the main speaker.
  • for example, the user says "Xiaobai, play the song Listen to Mom's Words" in the room.
  • the main speaker collects the sound signal and can pause the playback of "All the Way North" and instead output the prompt message "Searching for Listen to Mom's Words for you". For example, the main speaker can check whether the song "Listen to Mom's Words" exists in local storage. If it does not exist, the main speaker can download it from the network side, or the main speaker can send an instruction to the mobile phone.
  • the instruction is used to instruct the mobile phone to play "Listen to Mom's Words". After receiving the instruction, the mobile phone downloads the song or plays it online, and sends the audio signal of the song to the main speaker, so that the main speaker and the slave speaker play the audio signal of the song (that is, "Listen to Mom's Words").
  • both the master speaker and the slave speaker can activate the function of automatically recognizing the "wake word”.
  • Take the main speaker as an example. After the main speaker activates the function of automatically recognizing the "wake-up word", all or part of the components (for example, one or more microphones, processors, etc.) in the main speaker are in an enabled state.
  • the sound signal sent by the user in the room is received by one or more microphones in the main speaker.
  • One or more microphones send the received sound signal to the processor, and when the processor determines that the sound signal contains the "wake-up word", it activates other components (for example, one or more speakers).
  • the "wake-up word” can be set by default when the speaker is shipped from the factory, or it can be user-defined.
  • the "wake-up word” can be "Xiaobai", “Xiaoyin", “Xiaoyi” and so on.
  • both the master speaker and the slave speaker can activate the function of automatically recognizing the "wake-up word + play song".
  • after the main speaker activates the function of automatically recognizing "wake-up word + play song", all or part of the components (for example, one or more microphones, processors, etc.) in the main speaker are in an enabled state.
  • the sound signal sent by the user in the room is received by one or more microphones in the main speaker.
  • One or more microphones send the received sound signal to the processor, and when the processor determines that the sound signal contains "wake-up word + play song", it activates other components (for example, one or more speakers). For example, the user says "Xiaobai, play All the Way North" in the room.
  • the microphone in the main speaker collects the sound signal and sends it to the processor.
  • the processor recognizes that the sound signal includes the wake-up word "Xiaobai" and also includes a request to play a song, and the processor activates other components (such as one or more speakers).
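  • The recognition of "wake-up word + play song" can be illustrated at the text level. The sketch below assumes the sound signal has already been converted to text by some speech recognizer, which is outside its scope; the function name `parse_command` and the lower-cased wake-word tuple are hypothetical.

```python
# Wake-up words taken from the examples in the description, lower-cased for matching.
WAKE_WORDS = ("xiaobai", "xiaoyin", "xiaoyi")


def parse_command(text, wake_words=WAKE_WORDS):
    """Return (wake_detected, song_title_or_None) for already-recognized text."""
    words = text.strip().split()
    if not words or words[0].lower().strip(",") not in wake_words:
        return False, None  # no wake-up word: other components stay inactive
    rest = words[1:]
    if len(rest) > 1 and rest[0].lower() == "play":
        return True, " ".join(rest[1:])  # "wake-up word + play song"
    return True, None  # wake-up word only


result = parse_command("Xiaobai play All the Way North")
```

  • Here `result` carries both the activation decision and the requested song title, so the caller can decide whether to power other components and what to play.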
  • the main speaker can receive an input operation through an input device (such as a touch screen on the main speaker) or through another device connected to the main speaker (such as a mobile phone), where the input operation is used to activate the function of automatically recognizing the "wake-up word" or "wake-up word + play song". The main speaker can then send an instruction to the slave speaker, and this instruction is used to instruct the slave speaker to activate the function of automatically recognizing the "wake-up word" or "wake-up word + play song".
  • the main speakers and the slave speakers are placed in different positions in the room.
  • the main sound box can detect the distance D between the main sound box and the slave sound box for use.
  • the distance can be the straight line distance between the main sound box and the slave sound box.
  • the slave speaker can also detect the distance D to the main speaker for its own use.
  • alternatively, the slave speaker can detect the distance D to the main speaker and then send it to the main speaker, in which case the main speaker does not need to detect the distance D itself.
  • the main speaker can detect the distance between the main speaker and the slave speaker through a distance sensor.
  • the distance sensor may be a laser distance sensor, an infrared distance sensor, or the like.
  • the distance sensor on the main sound box emits infrared light of a specific frequency, which is reflected by the slave sound box, and the main sound box receives the light reflected by the slave sound box.
  • the main sound box can calculate the distance between the main sound box and the slave sound box according to the first time when the infrared light is emitted and the second time when the reflected light is received.
  • the master speaker can also communicate with the slave speaker to achieve the purpose of measuring the distance between the master speaker and the slave speaker.
  • the main speaker transmits a detection signal to the slave speaker, and after receiving the detection signal, the slave speaker sends a feedback signal to the main speaker, and the main speaker receives the feedback signal.
  • the main speaker can determine the distance between the main speaker and the slave speaker according to the second time when the feedback signal is received and the first time when the detection signal is sent.
  • the main speaker may also receive an input operation through an input device (such as a touch screen on the main speaker), and the input operation is used to input the distance between the main speaker and the slave speaker.
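  • Both measurement approaches above reduce to a round-trip time calculation. The sketch below is one possible reading, assuming the signal travels at the speed of light and that any processing time in the slave speaker before it replies (`turnaround`, a hypothetical parameter) is known and subtracted; for reflected infrared light that value is zero.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def round_trip_distance(t_sent, t_received, turnaround=0.0,
                        speed=SPEED_OF_LIGHT_M_PER_S):
    """Estimate the one-way distance from a round-trip time measurement.

    t_sent:      first time, when the light or detection signal is emitted (s)
    t_received:  second time, when the reflection or feedback signal arrives (s)
    turnaround:  assumed delay inside the slave speaker before it replies (s)
    """
    flight = (t_received - t_sent) - turnaround
    if flight < 0:
        raise ValueError("turnaround exceeds the measured round-trip time")
    return speed * flight / 2.0  # the signal covers the distance twice


distance_d = round_trip_distance(0.0, 20e-9)  # a 20 ns round trip
```

  • A 20 ns round trip corresponds to roughly 3 m, which shows why this approach needs nanosecond-level timing for room-scale distances.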
  • the user may be anywhere in the room, and the distance between the main speaker and the slave speaker and the user may be different.
  • after the main speaker and the slave speaker activate the function of automatically recognizing "wake-up word" or "wake-up word + play song", the microphones in the main speaker and the slave speaker collect sound signals. When the main speaker and the slave speaker determine that the sound signal contains "wake-up word" or "wake-up word + play song", the user's position can be determined, and the sound parameters of the main speaker and the slave speaker can then be controlled according to the user's position.
  • the sound parameters may include the time delay difference between the main speaker and the slave speaker, loudness gain, and so on. Therefore, in this embodiment, when the master speaker and the slave speaker recognize that the collected sound signal contains "wake word” or "wake word + play song", the sound parameters of the master speaker and slave speaker are adjusted according to the user's position .
  • the process of determining the position of the user by the main speaker and the slave speaker may include: the main speaker collects sound signal 1, and the slave speaker collects sound signal 2.
  • the main speaker determines that the sound signal 1 includes the "wake-up word", and the slave speaker determines that the sound signal 2 includes the "wake-up word”.
  • the slave speaker can also send sound signal 2, or the "wake-up word" included in sound signal 2, to the main speaker, and the main speaker determines whether the "wake-up word" in sound signal 1 and the "wake-up word" in sound signal 2 are the same wake-up word.
  • the main speaker can determine the first direction/azimuth of the user relative to the main speaker according to the sound signal 1. The first direction/azimuth can be expressed as the first angle between the user and the x-axis in the coordinate system constructed by the main speaker.
  • the slave speaker can determine the second direction/azimuth of the user relative to the slave speaker according to the sound signal 2. The second direction/azimuth can be expressed as the second angle between the user and the x-axis in the coordinate system constructed by the slave speaker.
  • the slave speaker can send the second angle to the master speaker, and the master speaker determines the user's position according to the first angle and the second angle, and the distance D between the master speaker and the slave speaker.
  • the construction of the coordinate system of the main speaker and the slave speaker, and the process of determining the position of the user by the master speaker and the slave speaker will be described in detail later.
  • the main speaker can determine the first direction/azimuth of the user relative to the main speaker according to the sound signal 1 in many ways, for example: microphone array positioning technology (for example, estimating the sound source position based on the time difference between the sound signals received by at least two microphones in the microphone array on the main speaker); steered-beamformer positioning methods; positioning methods based on high-resolution spectral analysis; sound source positioning techniques based on time-delay estimation (TDE); and so on. This is not limited in the embodiment of the present application.
  • the process of determining the first direction/azimuth of the user relative to the main speaker by the main speaker according to the sound signal 1 may include: the microphone array 408 in the main speaker collects the sound signal, and it is assumed that the strength of the sound signal collected by the microphone 408-1 and the microphone 408-2 is relatively high.
  • the main speaker can use the first time t1 when the sound signal is collected by the microphone 408-1, the second time t2 when the sound signal is collected by the microphone 408-2, and the distance L1 between the microphone 408-1 and the microphone 408-2 (the distance can be stored in the main speaker when it leaves the factory) to calculate the first position of the sound source, that is, the user, relative to the main speaker.
  • the main speaker can determine the angle A of the user relative to the microphone 408-1 according to (t1-t2)*c, L1, and trigonometric relationships, where c is the sound propagation speed.
  • This angle A can be used as the user's first orientation relative to the main speaker.
  • alternatively, the main speaker can transform the included angle A into the coordinate system constructed by the main speaker to obtain the included angle B, and the included angle B can also be used as the user's first orientation relative to the main speaker.
  • the structure of the slave speaker and the main speaker can be the same, so the process of determining the second orientation of the user relative to the slave speaker can be similar to the above process.
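  • The trigonometric step can be made concrete under a far-field assumption (the user is far enough away that the wavefront is roughly planar across the two microphones). In that case the path difference (t1-t2)*c equals L1*cos(A), where A is measured at microphone 408-1 between the line toward microphone 408-2 and the direction of the user. The far-field model, the 343 m/s default, and the function name are assumptions of this sketch rather than details given in the application.

```python
import math

SPEED_OF_SOUND_M_PER_S = 343.0  # assumed value near 20 degrees Celsius


def arrival_angle_degrees(t1, t2, mic_spacing, c=SPEED_OF_SOUND_M_PER_S):
    """Angle A between the baseline (mic 1 toward mic 2) and the sound source.

    t1, t2:       arrival times at microphone 1 and microphone 2 (s)
    mic_spacing:  distance L1 between the two microphones (m)
    """
    ratio = (t1 - t2) * c / mic_spacing
    # Timing noise can push the ratio slightly outside [-1, 1]; clamp it.
    ratio = max(-1.0, min(1.0, ratio))
    return math.degrees(math.acos(ratio))


broadside = arrival_angle_degrees(0.0, 0.0, 0.1)  # simultaneous arrival
```

  • Simultaneous arrival gives 90 degrees (the user is broadside to the microphone pair). A single pair cannot tell which side of the baseline the user is on, which is one reason an array with more than two microphones is useful.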
  • while the user is continuously emitting sound signals, the main speaker and the slave speaker can collect the sound signals continuously in real time (the sound signals need not include "wake-up word" or "wake-up word + play song"), determine the user's location, and adjust the sound parameters of the main speaker and the slave speaker according to the user's location. When a sound signal containing "wake-up word" or "wake-up word + play song" is detected, the main speaker and the slave speaker are controlled to play audio signals with the adjusted sound parameters (for example, the delay difference between the main speaker and the slave speaker, loudness gain, etc.).
  • the following embodiment introduces possible implementations of controlling the sound parameters of the main speaker and the slave speaker according to the user's position.
  • FIG. 5 shows a schematic flowchart of a speaker control method provided by an embodiment of the present application. As shown in Figure 5, the process can include:
  • the main speaker collects sound signals.
  • both the master speaker and the slave speaker are the speakers 400 shown in FIG. 4A.
  • One or more microphones in the main speaker and the slave speaker can always be enabled. Therefore, the microphones in the main speaker and the slave speaker can collect sound signals (for example, the sound signals sent by the user).
  • Taking the main sound box as an example, one or more microphones 408 in the main sound box collect sound signals sent by the user.
  • the one or more microphones 408 send the collected sound signals to the processor 403.
  • the processor 403 recognizes the sound signal. For example, the processor 403 can determine whether the sound signal contains a "wake-up word". If so, the processor 403 activates the main speaker, for example, by supplying power to other components in the main speaker (such as one or more speakers 406, a display screen 409, etc.) to enable them.
  • the main speaker calculates the first angle of the sound source relative to the main speaker.
  • 504: The slave speaker calculates the second angle of the sound source relative to the slave speaker.
  • the main sound box can establish a first coordinate system.
  • For example, the x1-y1-z1 coordinate system is established by taking the edge of the display screen on the main speaker as the coordinate origin, the direction of gravity as the z-axis direction, the short side of the display screen as the x-axis direction, and the long side as the y-axis direction.
  • the main speaker can determine the first position of the sound source, that is, the user, in the coordinate system, for example, the first angle between the user and the x1 axis, or the functional relationship, in the x1-y1-z1 coordinate system, of the ray formed by the position of the user and the origin of the first coordinate system.
  • the coordinate system can also be constructed in other ways, for example, with the center point of the main sound box as the coordinate origin, the direction of gravity as the z-axis direction, and the direction from the center point to a microphone in the main sound box as the x-axis direction.
  • the slave sound box can also establish a second coordinate system (such as the x2-y2-z2 coordinate system in the figure).
  • the slave speaker and the main speaker can construct their coordinate systems in the same way.
  • the slave speaker can determine the second position of the user in the coordinate system constructed by the slave speaker, such as the second angle l between the user and the x2 axis.
  • the slave speaker sends the calculation result to the master speaker.
  • the calculation result of the slave sound box may be the second angle l. Therefore, the slave speaker can send the second angle l to the main speaker.
  • the main speaker can determine the position of the user according to the first angle and the second angle, and the distance D between the main speaker and the slave speaker.
  • the calculation result of the slave speaker can also be the functional relationship of the second ray formed by the second angle l in the second coordinate system (the x2-y2-z2 coordinate system), that is, the functional relationship, in the second coordinate system, of the ray formed by the position of the user and the origin of the second coordinate system. The slave speaker can send the functional relationship of the second ray to the main speaker.
  • the main speaker calculates the position of the sound source.
  • Example 1: if the slave speaker sends the second angle l to the main speaker, the main speaker can determine the second ray in the x1-y1-z1 coordinate system according to the second angle l, such as the second ray shown in FIG. 7A.
  • the main speaker can also determine the first ray in the x1-y1-z1 coordinate system according to the first angle.
  • the main speaker determines that the intersection Q between the first ray and the second ray is the position of the user.
  • when the second ray is to be expressed in the x1-y1-z1 coordinate system, because the second ray is not defined in the x1-y1-z1 coordinate system, the second ray can be translated by the distance D along the z1 direction to obtain the third mathematical relationship.
  • the third ray corresponding to the third mathematical relationship is the representation of the second ray in the x1-y1-z1 coordinate system of the main speaker.
  • the intersection point between the first ray and the third ray can be determined (for example, by solving the system of equations formed by the first mathematical relationship corresponding to the first ray and the third mathematical relationship corresponding to the third ray); the intersection is the location of the sound source (for example, the user).
  • the main speaker determines the first distance d1 from the sound source to the main speaker, and the second distance d2 from the sound source to the slave speaker.
  • the distance d1 between the intersection point and the main sound box (such as the origin of the first coordinate system) can be determined according to trigonometric relations, and the distance d2 between the intersection point and the slave speaker (for example, the distance between the intersection point and the point A where the third ray meets the -z1 axis of the first coordinate system) can be determined in the same way.
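  • The intersection and the two distances can be computed together with the law of sines. The sketch below works in a simplified planar frame that is an assumption of this example, not the coordinate layout of the figures: the main speaker is at the origin, the slave speaker is at distance D along the shared x-axis, and both angles are measured counterclockwise from that axis. The function name is hypothetical.

```python
import math


def locate_source(angle1_deg, angle2_deg, baseline):
    """Intersect two bearing rays and return (position, d1, d2).

    angle1_deg: bearing of the source seen from the main speaker at (0, 0)
    angle2_deg: bearing of the source seen from the slave speaker at (baseline, 0)
    baseline:   distance D between the two speakers (m)
    """
    a1 = math.radians(angle1_deg)
    a2 = math.radians(angle2_deg)
    denom = math.sin(a2 - a1)
    if abs(denom) < 1e-9:
        raise ValueError("rays are parallel; no unique intersection")
    d1 = baseline * math.sin(a2) / denom  # law of sines: main speaker to source
    d2 = baseline * math.sin(a1) / denom  # law of sines: slave speaker to source
    if d1 <= 0 or d2 <= 0:
        raise ValueError("rays do not meet in front of both speakers")
    position = (d1 * math.cos(a1), d1 * math.sin(a1))
    return position, d1, d2


position, d1, d2 = locate_source(45.0, 135.0, 2.0)
```

  • With the speakers 2 m apart and bearings of 45 and 135 degrees, the source lies midway between them, 1 m in front of the baseline, and both distances equal the square root of 2.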
  • the main speaker calculates the time delay difference according to the first distance d1 and the second distance d2.
  • the main speaker can determine the time delay difference between the first distance d1 and the second distance d2 according to the following formula:
  • c is the sound propagation speed
  • t1 is the time required for the sound signal to travel from the main speaker to the user
  • t2 is the time required for the sound signal to travel from the slave speaker to the user
  • Δt is the delay difference between the sound signal propagating over the first distance d1 and over the second distance d2.
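  • A form consistent with these symbol definitions, offered as a reconstruction from the definitions rather than as the exact expression of the application, is:

```latex
t_1 = \frac{d_1}{c}, \qquad
t_2 = \frac{d_2}{c}, \qquad
\Delta t = \lvert t_1 - t_2 \rvert = \frac{\lvert d_1 - d_2 \rvert}{c}
```

  • For instance, with d1 = 4 m, d2 = 2 m and c = 343 m/s, the delay difference is about 5.8 ms.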
  • the sound propagation speed may be affected by various factors, such as temperature and air pressure. The following embodiments introduce several possible ways of determining c.
  • Method 1 c can be obtained based on the following formula:
  • R is the gas constant, 287 J/(kg·K), and T is the temperature value detected by the temperature sensor in the main speaker (for example, absolute temperature);
  • M is the molar mass of the gas, and the value of M can be fixed, such as 22.4L/mol.
  • T is the temperature value detected by the temperature sensor in the main speaker
  • P is the pressure value detected by the air pressure sensor in the main speaker.
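  • As an illustration of how c depends on temperature, the sketch below uses the standard ideal-gas relation c = sqrt(γ·R·T) for dry air, with R = 287 J/(kg·K) as given above. The adiabatic index γ = 1.4 and the function name are assumptions of this sketch, and the relation may differ from the formulas used in the two methods above.

```python
import math

GAMMA_DRY_AIR = 1.4              # assumed adiabatic index of dry air
R_SPECIFIC_AIR_J_PER_KG_K = 287.0  # specific gas constant quoted in the text


def speed_of_sound(temperature_k):
    """Ideal-gas estimate of the speed of sound in dry air (m/s)."""
    if temperature_k <= 0:
        raise ValueError("temperature must be an absolute temperature above 0 K")
    return math.sqrt(GAMMA_DRY_AIR * R_SPECIFIC_AIR_J_PER_KG_K * temperature_k)


c_room = speed_of_sound(293.15)  # about 20 degrees Celsius
```

  • Between 0 and 30 degrees Celsius this estimate changes by roughly 18 m/s, so a temperature reading noticeably refines the delay calculation.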
  • the time delay of the sound of the main speaker or the slave speaker can be adjusted based on the time delay difference Δt, so that the sound from the main speaker and the sound from the slave speaker reach the user at the same time.
  • one or more speakers in the slave speaker can be controlled to delay sound by Δt.
  • the master speaker can send a command to the slave speaker, and the command can carry the sounding time point of the master speaker and the length of time the slave speaker needs to delay.
  • the slave speaker can determine the sounding time point of the main speaker and the length of time by which it needs to delay. In this case, delaying the sound of the slave speaker can make the sound from the main speaker and the slave speaker reach the user at the same time.
  • the master speaker when the master speaker determines that d1 ⁇ d2, the master speaker can issue an instruction to the slave speaker, and the instruction is used to indicate the sounding time point of the slave speaker.
  • the main speaker controls its own one or more speakers to delay a certain period of time based on the sounding time point before sounding. In this case, the sound of the main speaker is delayed so that the sound from the main speaker and the slave speaker can reach the user at the same time.
  • the main speaker controls its own one or more speakers to sound at the sounding time point.
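  • The delay decision described above can be summarized in a few lines. This is a minimal sketch, assuming a fixed sound speed and that the nearer speaker is the one delayed; the function name and return convention are hypothetical.

```python
def playback_delays(d1, d2, c=343.0):
    """Return (main_delay, slave_delay) in seconds so both sounds arrive together.

    d1: distance from the main speaker to the user (m)
    d2: distance from the slave speaker to the user (m)
    c:  assumed sound propagation speed (m/s)
    """
    dt = abs(d1 - d2) / c
    if d1 > d2:
        return 0.0, dt  # slave speaker is nearer, so it waits
    if d1 < d2:
        return dt, 0.0  # main speaker is nearer, so it waits
    return 0.0, 0.0     # equal distances: no delay needed


main_delay, slave_delay = playback_delays(6.86, 3.43)
```

  • In this example the slave speaker is 3.43 m nearer to the user, so it starts about 10 ms later than the main speaker.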
  • the main speaker calculates the loudness gain according to the first distance d1 and the second distance d2.
  • the main speaker can determine the loudness gain by the following formula:
  • e1 and e2 are loudness gains
  • d1 is the distance between the main speaker and the sound source
  • d2 is the distance between the slave speaker and the sound source
  • α1 and α2 are the sound attenuation coefficients. Because the sound attenuation coefficient is affected by many factors, such as air pressure and temperature, the following embodiments introduce several possible ways of determining the sound attenuation coefficient.
  • the sound attenuation coefficient can be obtained by the following formula:
  • P0 is the standard atmospheric pressure (1013.25 hPa)
  • P is the pressure value detected by the pressure sensor
  • T0 is 293K
  • T is the temperature detected by the temperature sensor
  • f is the sound frequency of the speaker.
  • the value of f may be preset by the sound box; for example, f may be related to the performance specifications of the speaker in the sound box, and the sound frequency of the sound box is set before the sound box leaves the factory.
  • α_cl is the classical part caused by viscosity and thermal conductivity
  • α_rot is the rotational relaxation part caused by the relaxation process of rotationally excited molecules
  • α_vib is the vibrational relaxation part caused by the relaxation process of vibrationally excited molecules.
  • α_cl, α_rot and α_vib can be obtained by the following formula:
  • P0 is the standard atmospheric pressure (1013.25 hPa)
  • P is the pressure value detected by the pressure sensor
  • T0 is 293K
  • T is the temperature detected by the temperature sensor.
  • f is the sound frequency of the speaker.
  • f_r,O represents the vibrational relaxation frequency of oxygen molecules
  • f_r,N represents the vibrational relaxation frequency of nitrogen molecules
  • f_r,N can be obtained by the following formula:
  • q is the specific humidity
  • e is the pressure detected by the air pressure sensor
  • p0 is 101325N/m2
  • T is the temperature detected by the temperature sensor
  • T0 is 293K.
  • the water vapor pressure e can be used to indicate the vapor pressure in the air.
  • for example, the sound box includes a sensor that can be used to detect the water vapor pressure in the air; for another example, the sound box can query the current water vapor pressure in the air from the network side (for example, a weather service).
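  • One standard way to compute an atmospheric absorption coefficient from the same inputs (frequency, temperature, air pressure, and water vapor pressure) is the pure-tone formula of ISO 9613-1. The sketch below uses that formula as a stand-in; it groups the classical and rotational parts into a single term and is not necessarily the expression used in this application. The function name and the use of the vapor-pressure ratio 100·e/p as the molar concentration of water vapor are assumptions of this sketch.

```python
import math

T0_K = 293.15     # ISO 9613-1 reference temperature (K)
P0_PA = 101325.0  # ISO 9613-1 reference pressure (Pa)


def absorption_db_per_m(freq_hz, temp_k, pressure_pa, vapor_pressure_pa):
    """Pure-tone atmospheric absorption coefficient in dB per metre (ISO 9613-1)."""
    h = 100.0 * vapor_pressure_pa / pressure_pa  # molar water-vapor concentration, %
    p_rel = pressure_pa / P0_PA
    t_rel = temp_k / T0_K
    # Relaxation frequencies of oxygen and nitrogen (Hz).
    fr_o = p_rel * (24.0 + 4.04e4 * h * (0.02 + h) / (0.391 + h))
    fr_n = p_rel * t_rel ** -0.5 * (
        9.0 + 280.0 * h * math.exp(-4.170 * (t_rel ** (-1.0 / 3.0) - 1.0)))
    # Classical plus rotational contribution.
    classical_rotational = 1.84e-11 / p_rel * math.sqrt(t_rel)
    # Vibrational relaxation contributions of oxygen and nitrogen.
    vib_o = 0.01275 * math.exp(-2239.1 / temp_k) / (fr_o + freq_hz ** 2 / fr_o)
    vib_n = 0.1068 * math.exp(-3352.0 / temp_k) / (fr_n + freq_hz ** 2 / fr_n)
    return 8.686 * freq_hz ** 2 * (
        classical_rotational + t_rel ** -2.5 * (vib_o + vib_n))


# 1 kHz at 20 degrees Celsius, standard pressure, about 70 % relative humidity.
alpha_1khz = absorption_db_per_m(1000.0, 293.15, 101325.0, 1641.5)
```

  • At room scale this absorption is small (a few thousandths of a decibel per metre at 1 kHz), so the distance itself usually dominates the loudness difference between the two speakers.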
  • the loudness gain of one or more speakers in the master speaker and the slave speaker can be adjusted based on e1 and e2.
  • the main speaker can control the loudness of its one or more speakers to increase by e1.
  • the master speaker can also send an instruction to the slave speaker, which is used to instruct the loudness of one or more speakers in the slave speaker to increase by e2.
  • when the main speaker determines that d1>d2, the loudness of one or more speakers in the main speaker can be controlled to increase by e1, and the loudness of the slave speaker may not increase.
  • the main speaker may send an instruction to the slave speaker, which is used to instruct the loudness of one or more speakers in the slave speaker to increase by e2, and the loudness of the main speaker may not increase.
  • both the main speaker and the slave speaker may not increase the loudness, or the loudness of the main speaker increases by e1, and the loudness of the slave speaker increases by e2.
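  • A distance-dependent gain of this kind can be illustrated with a common free-field model: spherical spreading loss plus atmospheric absorption, both measured relative to a reference distance. This model, the 1 m reference, and the function name are assumptions of the sketch and are not the loudness-gain formula of this application.

```python
import math


def loudness_gain_db(distance_m, alpha_db_per_m, reference_m=1.0):
    """Gain (dB) that offsets the level drop between reference_m and distance_m.

    distance_m:      distance from the speaker to the user (m)
    alpha_db_per_m:  sound attenuation coefficient (dB/m)
    reference_m:     assumed distance at which no extra gain is applied (m)
    """
    if distance_m <= 0 or reference_m <= 0:
        raise ValueError("distances must be positive")
    spreading = 20.0 * math.log10(distance_m / reference_m)
    absorption = alpha_db_per_m * (distance_m - reference_m)
    return spreading + absorption


e1 = loudness_gain_db(4.0, 0.005)  # main speaker, 4 m from the user
e2 = loudness_gain_db(2.0, 0.005)  # slave speaker, 2 m from the user
```

  • Under this model the farther speaker always receives the larger gain, which agrees with the case above in which only the main speaker's loudness is raised when d1>d2.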
  • the main speaker controls the sound playback of the main speaker and the slave speaker according to the delay difference and loudness gain.
  • 510 includes 510a and 510b, where 510a controls the playback of the main speaker, and 510b controls the playback of the slave speaker.
  • the main speaker may calculate the delay difference and the loudness gain simultaneously or at different times, which is not limited in the embodiment of the present application.
  • the above process of calculating the position of the sound source based on the first ray and the second ray may also be performed by the slave speaker. For example, the main speaker sends the related information of the first angle (such as the first angle itself, or the first mathematical relationship of the first ray formed by the first angle in the first coordinate system) to the slave speaker, and the slave speaker calculates the position of the sound source, the time delay difference, and the loudness gain.
  • the slave speaker may not have the computing capability (for example, it does not have the computing capability of calculating the second orientation of the sound source relative to the slave speaker).
  • FIG. 8 is a schematic flowchart of a speaker control method provided by another embodiment of this application. As shown in Figure 8, the process may include:
  • the main speaker collects sound signals.
  • the description of 801 and 802 can refer to the description of 501 and 502 in the embodiment shown in FIG. 5, which is not repeated here.
  • the main speaker calculates the first angle of the sound source relative to the main speaker, and the second angle of the sound source relative to the slave speaker.
  • the slave speaker does not have the calculation ability (for example, it does not have the calculation ability to calculate the second orientation of the sound source relative to the slave speaker), so the slave speaker cannot calculate the second angle of the sound source relative to the slave speaker.
  • the slave speaker can send the collected sound signal to the main speaker.
  • the slave sound box includes one or more microphones, and the sound signals collected by the individual microphones differ. The slave speaker can send the sound signal collected by each microphone to the main speaker. In some examples, if the microphones in the microphone arrays of the main speaker and the slave speaker are distributed in the same way, then when the main speaker receives the per-microphone sound signals sent by the slave speaker, it can map them to the corresponding microphones in the main speaker.
  • each microphone in the main speaker and the slave speaker is set with a number.
  • the sound signals sent by the slave sound box to the main sound box include: sound signal 1 corresponding to microphone 1, sound signal 2 corresponding to microphone 2, and sound signal 3 corresponding to microphone 3.
  • after the main speaker receives the sound signals (for example, the sound signal group composed of sound signal 1, sound signal 2 and sound signal 3), it can map them to the microphones in the main speaker, that is, treat sound signal 1 as collected by microphone 1 in the main speaker, sound signal 2 as collected by microphone 2, and sound signal 3 as collected by microphone 3. Therefore, the main speaker holds two sound signals (or two groups of sound signals): one that it collected itself from the user, and another that it received from the slave speaker.
  • the two sound signals (or groups) can differ, for example, in loudness.
  • the main speaker can calculate one angle from each sound signal (or group), obtaining two angles, such as the first angle and the second angle l.
  • the main speaker calculates the position of the sound source.
  • the main speaker can determine the first ray based on the first angle, and determine the second ray according to the second angle and the distance D.
  • the main sound box determines the intersection of the first ray and the second ray, and the intersection is the location of the user.
  • the main speaker calculates the first distance between the sound source and the main speaker, and the second distance between the sound source and the slave speaker.
  • the first distance d1 between the sound source and the main speaker and the second distance d2 between the sound source and the slave speaker can be determined according to the triangle law.
  • the main speaker calculates the delay difference and/or loudness gain based on the first distance and the second distance.
  • the main speaker controls the sound playback of the main speaker and the slave speaker according to the delay difference and loudness gain.
  • 808 includes 808a and 808b, where 808a controls the playback of the main speaker, and 808b controls the playback of the slave speaker.
  • the main speaker may not have the computing capability (for example, it does not have the computing capability to calculate the first orientation of the sound source relative to the main speaker). In that case the roles are swapped: the steps performed by the main speaker in Figure 8 are performed by the slave speaker, and the steps performed by the slave speaker are performed by the main speaker.
  • after the main speaker and/or the slave speaker are turned on (every time they are turned on, or the first time after purchase), they can record the locations where the user often listens to music.
  • the main speaker and/or the slave speaker receive a voice command, the voice command is used to activate the speaker, and the master speaker and/or the slave speaker determine the user's first position based on the voice command.
  • the master speaker and/or the slave speaker store the first position.
  • the main speaker and/or the slave speaker detect a voice command again, the voice command is used to start the speaker playing a song, and the main speaker and/or the slave speaker determine the user's second position based on the voice command and store the second position.
  • the main speaker and/or the slave speaker can determine several positions of the user. For example, all the black solid points in Figure 10 are the user's position determined by the main speaker and/or the slave speaker.
  • the main speaker and/or the slave speaker can select all the stored coordinate points whose distance from one another is less than a preset distance, and then determine the area covered by these points as their minimum enclosing circle. This area, where the user listens to music for a long time, can be called the active area, such as the area formed by densely distributed points in FIG.
  • the main sound box and the slave sound box collect the sound signals emitted by the sound source.
  • the main speaker determines that the sound signal includes "wake-up word” or "wake-up word + play song"
  • the location of the sound source is determined.
  • the main speaker can determine whether the sound source position is in the active area. If so, it controls the sound of the main speaker and the slave speaker based on the process shown in Figure 5 or Figure 7 above. If not, the process shown in Figure 5 or Figure 7 above does not need to be used to control the sound of the main speaker and the slave speaker.
  • the "active area" may be a location where users often listen to music.
  • the user says "Xiaobai, play All the Way North" at the door.
  • the main speaker receives the sound signal, determines that the sound signal includes "wake-up word + play song", and determines the user's location. If the main speaker determines that the location is not in the active area, it does not adjust the sound parameters of the main speaker and the slave speaker (such as loudness gain and delay difference) according to the user's location. For example, the main speaker and the slave speaker emit sound at the same time, or the last used sound parameters are applied, or the default sound parameters (for example, the factory settings) are applied. Skipping the adjustment in this case helps to save power.
  • the sound parameters of the main speaker and the slave speaker such as loudness gain, delay difference, etc.
  • the terminal device may include a hardware structure and/or a software module, and implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module. Whether a certain function of the above-mentioned functions is executed by a hardware structure, a software module, or a hardware structure plus a software module depends on the specific application and design constraint conditions of the technical solution.
  • the term "when" or "after" can be interpreted as meaning "if", "after", "in response to determining", or "in response to detecting".
  • the phrase "when determining" or "if (a stated condition or event) is detected" can be interpreted as meaning "if determined", "in response to determining", "when (the stated condition or event) is detected", or "in response to detecting (the stated condition or event)".
  • relationship terms such as first and second are used to distinguish one entity from another entity, and any actual relationship and order between these entities are not limited.
  • the above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
  • when implemented in software, they can be implemented in whole or in part in the form of a computer program product.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
  • the computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
  • the computer instructions may be transmitted from a website, computer, server, or data center.
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server or a data center integrated with one or more available media.
  • the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic System (AREA)

Abstract

A loudspeaker box control method, a loudspeaker box, and a loudspeaker box system. The method pertains to the fields of smart terminals, human-machine interaction, and the like. The method comprises: a first loudspeaker box acquiring a first sound signal, and a second loudspeaker box acquiring a second sound signal; the first loudspeaker box determining a position of a sound source according to the first sound signal and the second sound signal, and determining a first distance of the sound source to the first loudspeaker box and a second distance of the sound source to the second loudspeaker box; the first loudspeaker box determining a delay difference on the basis of the first distance and the second distance; the first loudspeaker box instructing the second loudspeaker box to emit sound at a second time, the second time being determined according to a first time and the delay difference, and the first time being a sound emission time of the first loudspeaker box; and the first loudspeaker box emitting a third sound signal at the first time, and the second loudspeaker box emitting a fourth sound signal at the second time, wherein the third sound signal and the fourth sound signal are signals of different sound channels of the same audio file. This manner provides users with an improved stereo sound effect for different locations in a room.

Description

Speaker control method, speaker, and speaker system
Cross-reference to related applications
This application claims priority to Chinese patent application No. 201910785595.9, filed with the Chinese Patent Office on August 23, 2019 and entitled "Speaker control method, speaker, and speaker system", the entire content of which is incorporated herein by reference.
Technical field
This application relates to the technical field of terminals, and in particular to a speaker control method, a speaker, and a speaker system.
Background
At present, more and more households install speakers to improve audio playback. Generally, the main activity area of the people in a room (referred to as the active area for short) is relatively fixed. For example, if a person spends a long time on the sofa (resting, watching TV, listening to music, reading, and so on), then the area where the sofa is located is the active area. Once the speakers are placed at fixed positions in the room, the position at which good sound can be heard is also fixed. Take two speakers as an example. Suppose speaker 1 and speaker 2 are placed on the two sides of the TV cabinet. Only a fixed area opposite the TV cabinet (the area that forms an equilateral triangle with area 1, where speaker 1 is located, and area 2, where speaker 2 is located) obtains a good stereo effect, while in other areas, such as the dining area, the stereo effect is weakened, which degrades the user experience.
Summary of the invention
The purpose of this application is to provide a speaker control method, a speaker, and a speaker system, so that users can obtain a good stereo effect at different positions in the room.
The above and other objectives are achieved by the features of the independent claims. Further implementations are embodied in the dependent claims, the description, and the drawings.
In a first aspect, a speaker control method is provided. The method can be applied to a speaker group that includes a first speaker and a second speaker placed at different positions. The method includes: the first speaker collects a first sound signal, and the second speaker collects a second sound signal; the first speaker determines the position of a sound source according to the first sound signal and the second sound signal; the first speaker determines a first distance between the sound source and the first speaker and a second distance between the sound source and the second speaker; the first speaker determines a delay difference based on the first distance and the second distance; the first speaker sends a first instruction to the second speaker, where the first instruction instructs the second speaker to emit sound at a second time, the second time is determined according to a first time and the delay difference, and the first time is the time at which the first speaker emits sound; the first speaker emits a third sound signal at the first time, and the second speaker emits a fourth sound signal at the second time; where the third sound signal and the fourth sound signal are signals of different channels of the same audio file.
It should be understood that, in the speaker control method provided by this application, the speaker group can adjust the sound parameters of the first speaker and the second speaker (for example, the sounding time) according to the user's position, so that the user obtains a good stereo effect wherever the user is in the room.
In a possible design, the first speaker determines a first loudness gain according to the first distance; the first speaker determines a second loudness gain according to the second distance; the first speaker adjusts its own loudness according to the first loudness gain; the first speaker sends a second instruction to the second speaker, where the second instruction instructs the second speaker to adjust its loudness based on the second loudness gain.
It should be understood that, in the speaker control method provided by this application, the speaker group can adjust the sound parameters of the first speaker and the second speaker (for example, the loudness) according to the user's position, so that the user obtains a good stereo effect wherever the user is in the room.
In a possible design, the first speaker determining the position of the sound source according to the first sound signal and the second sound signal includes: the first speaker determines a first angle of the sound source in a first coordinate system according to the first sound signal; the first speaker determines a second angle of the sound source in the first coordinate system according to the second sound signal; the first speaker determines the position of the sound source according to the first angle, the second angle, and the distance between the first speaker and the second speaker.
It should be understood that, in the speaker control method provided by this application, the speaker group can adjust the sound parameters of the first speaker and the second speaker (for example, sounding time and loudness) according to the user's position, so that the user obtains a good stereo effect wherever the user is in the room. Specifically, the first speaker in the speaker group can perform the calculation of the user's position. For example, the first speaker collects the user's first sound signal, the second speaker collects the user's second sound signal, and the second speaker sends the second sound signal to the first speaker. The first speaker can then determine the first angle of the sound source in the first coordinate system according to the first sound signal, determine the second angle of the sound source in the first coordinate system according to the second sound signal, and determine the position of the sound source according to the first angle, the second angle, and the distance between the first speaker and the second speaker. The calculation can also be performed by the second speaker, that is, the first speaker sends the first sound signal to the second speaker, and the second speaker performs the above process. In another possible implementation, the first speaker calculates the first angle, the second speaker calculates the second angle and sends it to the first speaker, and the first speaker determines the position of the sound source according to the received second angle, the first angle that it calculated itself, and the distance between the first speaker and the second speaker.
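The position calculation described above can be illustrated with a minimal sketch. It is not taken from the application: it assumes the first speaker sits at the origin of the first coordinate system, the second speaker sits at (spacing, 0), and both angles are measured counterclockwise from the line joining the two speakers. The function name `locate_source` and its parameters are hypothetical.

```python
import math

def locate_source(angle1, angle2, spacing):
    """Intersect two rays to estimate the sound source position.

    Assumed geometry (not specified in the source text): the first speaker
    is at (0, 0), the second speaker is at (spacing, 0), and both angles are
    in radians, measured counterclockwise from the positive x-axis.
    Returns ((x, y), d1, d2), or None if the rays do not intersect.
    """
    denom = math.sin(angle2 - angle1)
    if abs(denom) < 1e-9:
        return None  # rays are parallel, no unique intersection
    # Distances along each ray to the intersection point.
    d1 = spacing * math.sin(angle2) / denom
    d2 = spacing * math.sin(angle1) / denom
    if d1 <= 0 or d2 <= 0:
        return None  # intersection lies behind one of the speakers
    position = (d1 * math.cos(angle1), d1 * math.sin(angle1))
    return position, d1, d2
```

Because each ray parameter is the distance from the corresponding speaker, the same intersection step also yields the first distance and the second distance used in the later steps.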
In a possible design, before the first speaker determines the position of the sound source according to the first sound signal and the second sound signal, the first speaker determines that the first sound signal and the second sound signal include a wake-up word.
It should be understood that, in the speaker control method provided by this application, the user can wake up the speaker group (one or more speakers in the group can be woken up). When the speaker group detects that the sound signal uttered by the user includes the wake-up word, it determines the user's position and then adjusts the sound parameters of the first speaker and the second speaker (for example, sounding time and loudness) according to that position. In this way, when the user speaks without intending to wake up the speaker group, the speaker group need not perform the above method; once the speaker group is woken up, the user obtains a good stereo effect wherever the user is in the room.
In a possible design, after the first speaker determines the position of the sound source according to the first sound signal and the second sound signal, and before the first speaker emits the third sound signal at the first time and the second speaker emits the fourth sound signal at the second time, it is determined that the position of the sound source is in an active area; where the active area is an area in which the number of times the user uses the speaker group is greater than a first preset threshold, or in which the frequency of using the speaker group is greater than a second preset threshold.
It should be understood that, in the speaker control method provided by the embodiments of this application, the user can wake up the speaker group. When the speaker group detects that the sound signal uttered by the user includes the wake-up word, it determines the user's position. If the user is in the active area, the above speaker control method is performed; if the user is not in the active area, it is not performed. When the speaker group performs the method, it can adjust the sound parameters of the first speaker and the second speaker (for example, sounding time and loudness) according to the user's position, so that the user obtains a good stereo effect wherever the user is in the room.
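One way to picture the active-area check is the sketch below. It is an assumption made for illustration, not the application's procedure: it treats the active area as any spot where more than `min_count` previously stored user positions lie within `radius` of the current position. The names `in_active_area`, `history`, `radius`, and `min_count` are hypothetical.

```python
import math

def in_active_area(position, history, radius, min_count):
    """Return True if the current position falls in a frequently used area.

    Assumption: usage is approximated by counting stored positions
    (x, y tuples in `history`) within `radius` of the current position,
    and comparing that count with the threshold `min_count`.
    """
    nearby = sum(1 for p in history if math.dist(p, position) <= radius)
    return nearby > min_count
```

With such a check, the delay and loudness adjustment would run only when the function returns True, and the default or last used sound parameters would be kept otherwise.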
In a possible design, the third sound signal is the left channel signal of the audio file and the fourth sound signal is the right channel signal of the audio file; or, the fourth sound signal is the left channel signal of the audio file and the third sound signal is the right channel signal of the audio file.
It should be understood that, to improve the stereo effect, the first speaker in the speaker group can emit the left channel signal and the second speaker can emit the right channel signal, or the first speaker can emit the right channel signal and the second speaker can emit the left channel signal.
In a possible design, the first speaker determining the delay difference based on the first distance and the second distance includes: the first speaker determines the sound propagation speed according to the current temperature; the first speaker determines a first duration according to the first distance and the sound propagation speed; the first speaker determines a second duration according to the second distance and the sound propagation speed; the first speaker determines the delay difference according to the first duration and the second duration.
It should be understood that the user's distances to the first speaker and to the second speaker of the speaker group differ. In the speaker control method provided by this application, the first speaker can determine the delay difference according to the first distance between the user and the first speaker and the second distance between the user and the second speaker, and then adjust the sounding times of the first speaker and the second speaker according to the delay difference, so that the user obtains a good stereo effect wherever the user is in the room.
In a possible design, the first speaker determining the sound propagation speed according to the current temperature includes: the first speaker detects the current temperature value; the first speaker determines the sound propagation speed according to the temperature value and the formula c=331.3+0.606T, where T is the temperature value and c is the sound propagation speed; or, the first speaker determines the sound propagation speed according to the temperature value and the formula
Figure PCTCN2020110720-appb-000001
where γ is the ratio of the specific heat at constant pressure to the specific heat at constant volume, R is the gas constant, T is the temperature value, M is the molar mass of the gas, and c is the sound propagation speed; or, the first speaker determines the sound propagation speed according to the temperature value and the formula
Figure PCTCN2020110720-appb-000002
where Pw is the partial pressure of water vapor in the air, T is the temperature value, P is the pressure value detected by the first speaker, and c is the sound propagation speed.
It should be understood that, in the speaker control method provided by the embodiments of this application, the first speaker takes the current temperature value into account when calculating the sound propagation speed. The calculated speed is therefore more accurate, which makes the delay difference between the first speaker and the second speaker more accurate and the stereo effect of the speaker group more precise.
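The temperature dependence can be sketched as follows. The linear formula is the one given in the text, with T read as degrees Celsius (an assumption, since the text does not state the unit). The second function assumes that the formula shown as an image is the standard ideal-gas expression c = sqrt(γRT/M), which matches the symbols defined in the text but cannot be confirmed from it, and which takes T in kelvin. The default constants for dry air and both function names are illustrative.

```python
import math

def sound_speed_linear(temp_c):
    """Speed of sound in m/s from c = 331.3 + 0.606 * T, T assumed in Celsius."""
    return 331.3 + 0.606 * temp_c

def sound_speed_ideal_gas(temp_k, gamma=1.4, molar_mass=0.02896, gas_constant=8.314):
    """Speed of sound in m/s from the ideal-gas relation sqrt(gamma * R * T / M).

    Assumed form and assumed constants for dry air: gamma is the heat
    capacity ratio, molar_mass is in kg/mol, gas_constant is in J/(mol*K),
    and temp_k is the absolute temperature in kelvin.
    """
    return math.sqrt(gamma * gas_constant * temp_k / molar_mass)
```

The two expressions agree to within about 1 m/s around room temperature, so the simpler linear form may be sufficient when only a temperature sensor is available.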
In a possible design, the first speaker determining the delay difference according to the first duration and the second duration includes: when the first distance is greater than the second distance, the first duration is greater than the second duration, the first speaker determines that the difference between the first duration and the second duration is the delay difference, and the second time is the first time delayed by the delay difference; when the first distance is less than the second distance, the first duration is less than the second duration, the first speaker determines that the difference between the second duration and the first duration is the delay difference, and the second time is the first time advanced by the delay difference.
It should be understood that, in the speaker control method provided by the embodiments of this application, when the user is farther from the first speaker and closer to the second speaker, the first speaker can emit sound first and the second speaker can emit sound later, so that the user obtains a good stereo effect wherever the user is in the room.
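The rule for the delay difference and the second time can be written out as a short sketch. The function name `playback_schedule` and its parameters are hypothetical; times are in seconds, distances in meters, and the equal-distance case (not discussed in the text) is handled by starting both speakers together.

```python
def playback_schedule(d1, d2, speed, first_time):
    """Return (delay_difference, second_time) for the second speaker.

    d1, d2: distances from the sound source to the first and second speaker.
    speed: sound propagation speed in m/s.
    first_time: the time at which the first speaker starts emitting sound.
    """
    t1 = d1 / speed  # first duration: travel time from the first speaker
    t2 = d2 / speed  # second duration: travel time from the second speaker
    delay = abs(t1 - t2)
    if d1 > d2:
        # The first speaker is farther away, so the second speaker starts later.
        second_time = first_time + delay
    elif d1 < d2:
        # The first speaker is closer, so the second speaker starts earlier.
        second_time = first_time - delay
    else:
        second_time = first_time  # assumption: equal distances, same start time
    return delay, second_time
```

In either case the two channel signals reach the listener at the same moment, which is the purpose of the delay difference.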
In a possible design, when the first distance is greater than the second distance, the first speaker adjusting its loudness according to the first loudness gain includes: the first speaker increases its current loudness according to the first loudness gain, and the second instruction instructs the second speaker to decrease its current loudness according to the second loudness gain; when the first distance is less than the second distance, the first speaker adjusting its loudness according to the first loudness gain includes: the first speaker decreases its current loudness according to the first loudness gain, and the second instruction instructs the second speaker to increase its current loudness according to the second loudness gain.
It should be understood that, in the speaker control method provided by the embodiments of this application, when the user is farther from the first speaker and closer to the second speaker, the first speaker can increase its loudness and the second speaker can decrease its loudness, so that the user obtains a good stereo effect wherever the user is in the room.
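The text states only the direction of each loudness adjustment, not how the gains are computed. The sketch below assumes a free-field inverse-distance model, in which level drops by 20·log10 of the distance ratio, and uses the geometric mean of the two distances as the reference. Both the model and the name `loudness_gains_db` are assumptions for illustration.

```python
import math

def loudness_gains_db(d1, d2):
    """Return (gain1_db, gain2_db) that equalize level at the listener.

    Assumption: sound level falls by 20*log10(distance ratio) in a free
    field, and gains are expressed relative to the geometric mean distance.
    A positive gain means increase loudness, a negative gain means decrease.
    """
    reference = math.sqrt(d1 * d2)
    gain1 = 20.0 * math.log10(d1 / reference)
    gain2 = 20.0 * math.log10(d2 / reference)
    return gain1, gain2
```

Under this model the farther speaker receives a positive gain and the nearer speaker an equal negative gain, which matches the direction of adjustment described above; a real room with reflections would likely need a measured or tuned curve instead.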
In a second aspect, an embodiment of this application further provides a speaker that includes modules/units for performing the method of the first aspect or of any possible design of the first aspect. These modules/units can be implemented by hardware, or by hardware executing corresponding software.
In a third aspect, an embodiment of this application further provides a speaker that includes: one or more processors, one or more memories, one or more loudspeakers, one or more microphones, and a communication module; where the one or more microphones are used to collect sound signals; the communication module is used to communicate with other speakers; the one or more loudspeakers are used to emit sound signals; the one or more processors are coupled to the one or more memories; the one or more memories are used to store computer-executable program code; and the program code includes instructions that, when executed by the one or more processors, cause the speaker to perform the technical solution of the first aspect or of any possible design of the first aspect.
In a fourth aspect, an embodiment of this application provides a chip that is coupled to a memory in a speaker and performs the technical solution of the first aspect or of any possible design of the first aspect of the embodiments of this application. In the embodiments of this application, "coupled" means that two components are joined to each other directly or indirectly.
In a fifth aspect, an embodiment of this application provides a speaker system that includes one or more speakers, at least one of which is the speaker described in the second aspect or the third aspect above. That speaker can perform all or some of the steps of the first speaker in the first aspect.
In a sixth aspect, an embodiment of this application provides a speaker system that includes a first speaker and a second speaker placed at different positions; the first speaker and the second speaker can communicate with each other; the first speaker is the speaker described in the second aspect or the third aspect above (it can be the first speaker in the technical solution of the first aspect or of any possible design of the first aspect).
In a seventh aspect, an embodiment of this application provides a computer-readable storage medium that includes a computer program. When the computer program runs on a computer, the computer is caused to perform the technical solution of the first aspect or of any possible design of the first aspect of the embodiments of this application.
In an eighth aspect, an embodiment of this application provides a computer program product. When the computer program product runs on a computer, the computer is caused to perform the technical solution of the first aspect or of any possible design of the first aspect of the embodiments of this application.
Description of the drawings
FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of this application;
FIG. 2 is a schematic diagram of the structure of a speaker provided by an embodiment of this application;
FIG. 3 is a schematic diagram of the structure of a speaker provided by an embodiment of this application;
FIG. 4A is a schematic diagram of the structure of a speaker provided by an embodiment of this application;
FIG. 4B is a schematic diagram of a speaker determining the user's direction, provided by an embodiment of this application;
FIG. 5 is a schematic flowchart of a speaker control method provided by an embodiment of this application;
FIG. 6 is a schematic diagram of a master speaker and a slave speaker establishing a coordinate system, provided by an embodiment of this application;
FIG. 7A is a schematic diagram of a master speaker determining the position of a sound source, provided by an embodiment of this application;
FIG. 7B is a schematic diagram of a master speaker determining the position of a sound source, provided by an embodiment of this application;
FIG. 8 is a schematic flowchart of another speaker control method provided by an embodiment of this application;
FIG. 9 is a schematic diagram of a master speaker determining the position of a sound source, provided by an embodiment of this application;
FIG. 10 is a schematic diagram of a speaker determining an active area, provided by an embodiment of this application.
Detailed description
The technical solutions in the embodiments of this application are described clearly and completely below with reference to the drawings in the embodiments of this application.
Some terms used in the embodiments of this application are explained below to help those skilled in the art understand them.
In the embodiments of this application, "multiple" means two or more. Note that in the description of the embodiments of this application, words such as "first" and "second" are used only to distinguish the items described; they should not be understood as indicating or implying relative importance, nor as indicating or implying an order.
The terms used in the following embodiments serve only to describe specific embodiments and are not intended to limit this application. As used in the specification and the appended claims of this application, the singular forms "a", "an", "said", "the above", "the", and "this" are intended to also cover expressions such as "one or more", unless the context clearly indicates otherwise. It should also be understood that, in the embodiments of this application, "one or more" means one, two, or more than two. "And/or" describes an association between associated objects and indicates that three relationships are possible; for example, "A and/or B" can mean that A exists alone, that A and B both exist, or that B exists alone, where A and B may each be singular or plural. The character "/" generally indicates an "or" relationship between the objects before and after it.
A reference in this specification to "one embodiment" or "some embodiments" means that a specific feature, structure, or characteristic described in connection with that embodiment is included in one or more embodiments of this application. Therefore, the phrases "in one embodiment", "in some embodiments", "in some other embodiments", "in still other embodiments", and the like, appearing in different places in this specification, do not necessarily all refer to the same embodiment; rather, they mean "one or more but not all embodiments", unless specifically emphasized otherwise. The terms "including", "comprising", "having", and their variants all mean "including but not limited to", unless specifically emphasized otherwise.
FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of this application, showing a living room in a home. As shown in FIG. 1, speakers are placed in the living room (for example, speaker 1 and speaker 2 on the left and right sides of a TV cabinet). As the user moves from one position to another, the two speakers adjust playback parameters such as delay and gain in real time, so that the sound the user hears at each position is synchronized and stereo. The positions at which a good stereo effect can be obtained are therefore no longer fixed merely because the positions of the speakers are fixed.
FIG. 2 shows a functional block diagram of a speaker provided by an embodiment of this application. In some embodiments, the speaker 100 may include one or more input devices 101, one or more output devices 102, and one or more processors 103. The input device 101 can detect various types of input signals (referred to as inputs for short), and the output device 102 can provide various types of output information (referred to as outputs for short). The processor 103 may receive an input signal from the one or more input devices 101, generate output information in response to the input signal, and output it through the one or more output devices 102.
In some embodiments, the one or more input devices 101 can detect various types of input and provide signals corresponding to the detected input (for example, input signals) to the one or more processors 103. In some examples, the one or more input devices 101 may include any component or assembly capable of detecting an input signal. For example, the input device 101 may include an audio sensor (for example, one or more microphones), a distance sensor, an optical or visual sensor (for example, a camera, a visible light sensor, or an invisible light sensor), a proximity light sensor, a touch sensor, a pressure sensor, a mechanical device (for example, a crown, a switch, a button, or a key), a temperature sensor, a communication device (for example, a wired or wireless communication device), and so on, or the input device 101 may be some combination of these components.
In some embodiments, the one or more output devices 102 may provide various types of output. For example, the one or more output devices 102 may receive one or more signals (for example, output signals provided by the one or more processors 103) and provide an output corresponding to those signals. In some examples, the output device 102 may include any suitable component or assembly for providing output. For example, the output device 102 may include an audio output device (for example, one or more loudspeakers), a visual output device (for example, one or more lights or displays), a tactile output device, a communication device (for example, a wired or wireless communication device), and so on, or the output device 102 may be some combination of these components.
In some embodiments, the one or more processors 103 may be coupled to the input device 101 and the output device 102 and may communicate with both. For example, the one or more processors 103 may receive an input signal from the input device 101 (for example, an input signal corresponding to an input detected by the input device 101). The one or more processors 103 may parse the received input signal to determine whether to provide one or more corresponding outputs in response to it. If so, the one or more processors 103 may send an output signal to the output device 102 to provide the output.
FIG. 3 shows a functional block diagram of a speaker 300 provided by another embodiment of this application. The speaker 300 may be an example of the speaker 100 described in FIG. 2. As shown in FIG. 3, the speaker 300 includes a microphone 301, a loudspeaker 302, a processor 303, a memory 304, a communication module 305, and a sensor module 306. It can be understood that the components shown in FIG. 3 do not constitute a specific limitation on the speaker 300; the speaker 300 may include more or fewer components than shown, combine some components, split some components, or arrange the components differently.
The processor 303 may include one or more processing units. For example, the processor 303 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). The different processing units may be independent devices or may be integrated into one or more processors. The controller may be the nerve center and command center of the speaker 300. The controller can generate operation control signals according to instruction operation codes and timing signals, and thereby control instruction fetching and instruction execution. In other embodiments, a memory may also be provided in the processor 303 to store instructions and data. In some embodiments, the memory in the processor 303 is a cache. This memory can hold instructions or data that the processor 303 has just used or uses repeatedly. If the processor 303 needs the instructions or data again, it can fetch them directly from this memory, which avoids repeated accesses, reduces the waiting time of the processor 303, and thus improves system efficiency. The processor 303 may run the software code or modules of the speaker control method provided in some embodiments of this application to implement the function of controlling the speaker.
The microphone 301, also called a "mic" or "sound transducer", is used to collect sound signals (for example, sounds made by the user) and convert them into electrical signals. In some embodiments, one or more microphones 301, such as a microphone array, may be provided on the speaker 300. In other embodiments, in addition to collecting sound signals, the microphone 301 may also perform noise reduction on the sound signals, identify the source of a sound signal, implement directional recording, and so on.
The loudspeaker 302, also called a "horn", is used to convert audio electrical signals into sound signals. The speaker 300 can play sound signals such as music through the loudspeaker 302.
In some embodiments, the microphone 301 and the loudspeaker 302 are coupled to the processor 303. For example, after receiving a sound signal, the microphone 301 sends the sound signal, or the audio electrical signal converted from it, to the processor 303. The processor 303 determines whether to respond to the sound signal or audio electrical signal, and if so, outputs a corresponding output signal, for example by playing music through the loudspeaker 302.
The memory 304 may be used to store computer-executable program code, which includes instructions. The processor 303 executes the various functional applications and data processing of the speaker 300 by running the instructions stored in the memory. The memory may include high-speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or universal flash storage (UFS); this is not limited in the embodiments of this application. In some embodiments, the memory 304 may store information such as a "wake-up word". In other embodiments, the memory 304 may also store audio content (for example, songs, crosstalk comedy, or storytelling).
The sensor module 306 may include an air pressure sensor 306A, a temperature sensor 306B, and so on. It should be understood that FIG. 3 lists only a few example sensors. In practice, the speaker 300 may include more or fewer sensors, or may replace the sensors listed above with other sensors that have the same or similar functions; this is not limited in the embodiments of this application.
The air pressure sensor 306A is used to measure air pressure. In some embodiments, the processor 303 may be coupled to the air pressure sensor 306A and use the measured air pressure value to assist in calculations, such as calculating the attenuation coefficient of sound.
The temperature sensor 306B is used to detect temperature. In some embodiments, the processor 303 may be coupled to the temperature sensor 306B and use the measured temperature value to assist in calculations, such as calculating the attenuation coefficient of sound.
The communication module 305 may be a wireless communication module (for example, Bluetooth or another wireless technology). The speaker 300 connects to other devices, such as another speaker, a mobile phone, or a television, through the communication module 305.
In some embodiments, the speaker 300 may or may not include a display (or display screen). The display can be used to show the interface of an application, such as the song currently playing. The display includes a display panel. The display panel may be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), Mini-LED, Micro-LED, Micro-OLED, a quantum dot light-emitting diode (QLED), and so on. In some embodiments, a touch sensor may be provided in the display to form a touch screen; this is not limited in the embodiments of this application. The touch sensor is used to detect touch operations on or near it. The touch sensor may pass the detected touch operation to the processor 303 to determine the type of touch event. The processor 303 may provide visual output related to the touch operation through the display.
In some embodiments, the speaker 300 shown in FIG. 3 may also include further components, such as a battery and a USB interface, which are not described in detail in the embodiments of this application.
FIG. 4A shows a schematic structural diagram of a speaker provided by an embodiment of this application. The speaker 400 may be an example of the speaker described in FIG. 2 or FIG. 3. As shown in FIG. 4A, the speaker 400 may include a base 401 and a housing 402.
In some embodiments, the base 401 provides support. For example, the base 401 can support the housing 402 and the components enclosed by the housing 402 (for example, a processor, microphones, and loudspeakers). In some examples, the base 401 may be made of metal, plastic, ceramic, or any other material capable of providing support, or of a combination of these materials.
In some embodiments, the base 401 may support one or more loudspeakers 406. For example, the base 401 may support a fixing member 404 on which the one or more loudspeakers 406 are mounted. In some examples, the base 401 may support the fixing member 404 through a supporting column 405 or in another way. The fixing member 404 can have any shape, such as a circle or a square. In some embodiments, the one or more loudspeakers 406 may be arranged on the fixing member 404 in a certain pattern. For example, the loudspeakers 406 may be evenly distributed along the edge of the fixing member 404, with the same spacing between adjacent loudspeakers. In some embodiments, the one or more loudspeakers 406 may be coupled to the processor 403. The processor 403 may output audio signals through the one or more loudspeakers 406.
In some embodiments, the housing 402 may have any three-dimensional shape, such as a cylinder, a cuboid, or a cube. The housing 402 may enclose components such as the processor 403, the fixing member 404, and the one or more loudspeakers 406. The housing 402 may be a single housing member or may be composed of two or more housing members. For example, the housing 402 may include an upper housing 402a and a side housing 402b. The one or more housing members may be made of metal, plastic, ceramic, crystal, a combination of these materials, or any other material suitable for a speaker housing. In some embodiments, the side housing 402b may have a mesh structure; for example, the mesh openings may be round, square, or hexagonal holes. A mesh housing can serve as decoration, keep out dust, and protect the components inside the housing (such as loudspeakers and microphones), while reducing the obstruction of the sound output by the loudspeakers.
In some embodiments, the upper housing 402a may or may not have a mesh structure. Input devices such as switches, buttons, or keys can be provided on the upper housing 402a. For example, a switch is used to turn the speaker on or off, and buttons or keys can be used for functions such as adjusting the volume. In other embodiments, a display screen 409 (such as a touch display screen) may be provided on the upper housing 402a to receive input, provide visual output, and so on. For example, the display screen 409 can show the name of the song currently playing, the name of the singer, and the like. The speaker may also have no display screen; this is not limited in the embodiments of this application.
In some embodiments, the upper housing 402a may be connected to a fixing member 407, on which one or more microphones 408 can be mounted. The fixing member 407 can have any shape, such as a circle or a square. In some embodiments, the one or more microphones 408 may be arranged on the fixing member 407 in a certain pattern. For example, the microphones 408 may be evenly distributed along the edge of the fixing member 407, with the same spacing between adjacent microphones. As another example, the central angle a between every two adjacent microphones (that is, the angle formed by the straight lines connecting each of the two microphones to the center point of the fixing member 407) may be fixed, for example at 30 degrees or 60 degrees.
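The even microphone spacing described above can be illustrated with a short sketch. It assumes a circular fixing member and places the microphones at equal central angles along its edge; the function names and the 5 cm radius used in the example are hypothetical and do not come from the application.

```python
import math


def central_angle_deg(num_mics):
    """Central angle a (in degrees) between adjacent, evenly spaced microphones."""
    if num_mics < 1:
        raise ValueError("num_mics must be at least 1")
    return 360.0 / num_mics


def mic_positions(num_mics, radius_m):
    """Return (x, y) coordinates of microphones spaced evenly on a circle.

    The circle is centered on the center point of the fixing member
    (an assumed geometry; the fixing member may have any shape).
    """
    step = math.radians(central_angle_deg(num_mics))
    return [
        (radius_m * math.cos(i * step), radius_m * math.sin(i * step))
        for i in range(num_mics)
    ]


# Example: six microphones on an assumed 5 cm radius give a = 60 degrees.
positions = mic_positions(6, 0.05)
```

With six microphones the central angle is 60 degrees, one of the values mentioned in the text; twelve microphones would give 30 degrees.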
In some embodiments, the one or more microphones 408 may be coupled to the processor 403. The processor 403 may obtain an input signal (such as a sound signal produced by the user) through the one or more microphones 408.
In the following embodiments of this application, the application scenario of FIG. 1 is used as an example, and speaker 1 and/or speaker 2 in FIG. 1 is taken to be the speaker 400 shown in FIG. 4A. For ease of description, one of speaker 1 and speaker 2 is called the master speaker below, and the other is called the slave speaker. In some embodiments, the master speaker and the slave speaker are used as a pair. For example, the master speaker plays the left channel and the slave speaker plays the right channel, or the master speaker plays the right channel and the slave speaker plays the left channel. In other words, the master speaker and the slave speaker work together to produce a stereo effect. In some embodiments, whether a speaker is the master speaker or the slave speaker may be set before the speaker leaves the factory, or may be defined by the user (for example, the speaker receives an input operation through its touch display screen, and the input operation selects whether the speaker is the master speaker or the slave speaker).
In some embodiments, the master speaker and the slave speaker may have the same structure; for example, both have the structure shown in FIG. 4A. In other embodiments, their structures may differ; for example, the master speaker may have a display screen while the slave speaker does not. In still other embodiments, some components of the master speaker and the slave speaker may differ in function. For example, the processor in the master speaker may be used to calculate the delay difference (for example, the difference between a first duration and a second duration, where the first duration may be the time the sound takes to travel from the master speaker to the user and the second duration may be the time the sound takes to travel from the slave speaker to the user), the loudness gain, and so on, while the processor in the slave speaker does not have this function.
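The delay difference mentioned above can be sketched as follows. This is a minimal illustration, not the method claimed in the application: it assumes the distances from each speaker to the user are already known, uses an assumed speed of sound of 343 m/s (dry air at about 20 °C), and assumes that the nearer speaker is delayed so that both sounds arrive together. All names are hypothetical.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed value for dry air at about 20 °C


def delay_difference_s(dist_master_m, dist_slave_m, speed=SPEED_OF_SOUND_M_S):
    """First duration minus second duration, in seconds.

    first duration  = travel time from the master speaker to the user
    second duration = travel time from the slave speaker to the user
    """
    first_duration = dist_master_m / speed
    second_duration = dist_slave_m / speed
    return first_duration - second_duration


def playback_delays_s(dist_master_m, dist_slave_m, speed=SPEED_OF_SOUND_M_S):
    """Return (master_delay, slave_delay) that aligns arrival times.

    Assumed compensation scheme: the speaker closer to the user waits
    by the delay difference; the farther speaker plays immediately.
    """
    diff = delay_difference_s(dist_master_m, dist_slave_m, speed)
    if diff >= 0:
        # Master is farther away (or equidistant): delay the slave.
        return 0.0, diff
    # Slave is farther away: delay the master.
    return -diff, 0.0


# Example: user is 3.43 m from the master and 1.715 m from the slave.
master_delay, slave_delay = playback_delays_s(3.43, 1.715)
```

In this example the slave speaker would wait roughly 5 ms, since its sound has half the distance to travel.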
In some embodiments, audio files (for example, songs, crosstalk comedy, or storytelling) can be stored in the memory of the master speaker and/or the slave speaker, and the master speaker and the slave speaker can play the stored audio files. For example, the master speaker can receive an input (for example, an input operation through the touch display screen, or a voice input through a microphone), and the input can be used to start the master speaker and/or the slave speaker, or to control the master speaker and the slave speaker to play or switch songs. In some embodiments, one or more microphones in the master speaker collect a sound signal (for example, a sound signal produced by the user), and the processor recognizes that the sound signal contains "wake-up word + play song". If the processor determines that the song is not in the memory, it can download the song from the network side, or output prompt information (such as a voice prompt) telling the user that the song is not available.
In other embodiments, the master speaker and/or the slave speaker can be connected to other electronic devices (such as a mobile phone or a television) in a wired or wireless manner. Take a connection between the master speaker and a mobile phone (for example, a Bluetooth connection) as an example. The mobile phone can send an audio signal to the master speaker so that the master speaker and the slave speaker play it (for example, after receiving the audio signal, the master speaker can forward it to the slave speaker). For example, if the mobile phone is running a music playing application (for example, Kugou Music) and is playing the song "All the Way North", the mobile phone can send the audio signal of the song to the master speaker so that the master speaker and the slave speaker play it. In other embodiments, after the master speaker is connected to the mobile phone, the user can control the mobile phone through the master speaker to perform corresponding operations. Continuing the previous example, the user says "Xiaobai, play the song Listen to Mother's Words" in the room. The master speaker collects this sound signal, can pause "All the Way North", and outputs the prompt "Looking for Listen to Mother's Words for you". For example, the master speaker can check whether the song "Listen to Mother's Words" exists in its local memory. If it does not, the master speaker can download it from the network side, or the master speaker can send an instruction to the mobile phone, where the instruction is used to instruct the mobile phone to play "Listen to Mother's Words". After receiving the instruction, the mobile phone downloads the song or plays it online and sends the audio signal of the song to the master speaker, so that the master speaker and the slave speaker play the audio signal of the song (that is, "Listen to Mother's Words").
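The lookup flow described above (local memory first, then the network side or the connected phone, otherwise a prompt) might be sketched like this. The function name, the returned action labels, and the order in which the fallbacks are tried are assumptions made for illustration; the application lists these options as alternatives and does not fix a priority.

```python
def resolve_song(title, local_library, can_download, phone_connected):
    """Decide how the master speaker could obtain a requested song.

    Returns one of four hypothetical action labels. The fallback order
    (download before phone) is an assumption, not taken from the source.
    """
    if title in local_library:
        return "play_local"
    if can_download:
        return "download_from_network"
    if phone_connected:
        return "instruct_phone_to_play"
    return "prompt_song_unavailable"


# Example: the requested song is not stored locally, but a phone is connected.
library = {"All the Way North"}
action = resolve_song("Listen to Mother's Words", library,
                      can_download=False, phone_connected=True)
```

Here `action` is the label for instructing the phone, which then streams the audio signal back to the master speaker as described in the text.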
在一些实施例中,主音箱和从音箱均可以启动自动识别“唤醒词”的功能。以主音箱为例,主音箱启动自动识别“唤醒词”的功能之后,主音箱中的全部或部分部件(比如,一个或多个麦克风、处理器等)处于使能状态。用户在房间内发出的声音信号被主音箱中的一个或多个麦克风接收。一个或多个麦克风将接收到的声音信号发送给处理器,处理器判断声音信号中包含“唤醒词”时,启动其它部件(比如,一个或多个扬声器)。在一些实施例中“唤醒词”可以是音箱出厂时默认设置好的,也可以是用户自定义的,比如“唤醒词”可以是“小白”、“小音”、“小艺”等。In some embodiments, both the master speaker and the slave speaker can activate the function of automatically recognizing the "wake word". Take the main speaker as an example. After the main speaker activates the function of automatically recognizing the "wake word", all or part of the components (for example, one or more microphones, processors, etc.) in the main speaker are in an enabled state. The sound signal sent by the user in the room is received by one or more microphones in the main speaker. One or more microphones send the received sound signal to the processor, and when the processor determines that the sound signal contains the "wake-up word", it activates other components (for example, one or more speakers). In some embodiments, the "wake-up word" can be set by default when the speaker is shipped from the factory, or it can be user-defined. For example, the "wake-up word" can be "Xiaobai", "Xiaoyin", "Xiaoyi" and so on.
In other embodiments, both the master speaker and the slave speaker can enable a function of automatically recognizing "wake-up word + play song". Take the master speaker as an example. After the master speaker enables this function, all or some of its components (for example, one or more microphones and the processor) are in an enabled state. A sound signal uttered by the user in the room is received by one or more microphones in the master speaker. The microphones send the received sound signal to the processor, and when the processor determines that the sound signal contains "wake-up word + play song", it starts other components (for example, one or more loudspeakers). For example, the user says "Xiaobai, play All the Way North" in the room. The microphone in the master speaker collects the sound signal and sends it to the processor. The processor recognizes that the sound signal includes the wake-up word "Xiaobai" and also includes a play-song request, and the processor starts other components (for example, one or more loudspeakers).
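The two activation modes described above ("wake-up word" alone, and "wake-up word + play song") can be sketched as a check on an utterance that has already been transcribed to text. This is only an illustrative sketch: the speech recognizer is out of scope, and the function name `parse_utterance`, the `PLAY_VERB` keyword, and the word-based matching are assumptions that do not come from the source.

```python
WAKE_WORDS = ("Xiaobai", "Xiaoyin", "Xiaoyi")  # example wake-up words from the text
PLAY_VERB = "play"  # assumed keyword that marks a play-song request


def parse_utterance(text, require_play=False):
    """Return (activate, song) for an already-transcribed utterance.

    require_play=False models the plain "wake-up word" mode;
    require_play=True models the "wake-up word + play song" mode.
    """
    words = text.replace(",", " ").split()
    if not words or words[0] not in WAKE_WORDS:
        return False, None  # no wake-up word: other components stay off
    rest = words[1:]
    if rest and rest[0].lower() == PLAY_VERB:
        song = " ".join(rest[1:]) or None
        return True, song
    # A bare wake-up word only activates the speaker in plain mode.
    return (not require_play), None
```

For instance, `parse_utterance("Xiaobai, play All the Way North", require_play=True)` activates the speaker and extracts the song title, while a bare "Xiaobai" activates it only in the plain wake-up-word mode.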
In some embodiments, the master speaker can receive an input operation through an input device (such as a touch screen on the master speaker) or through another device connected to the master speaker, such as a mobile phone. When the master speaker enables the function of automatically recognizing the "wake-up word" or "wake-up word + play song" in response to this input operation, it can send an instruction to the slave speaker. The instruction instructs the slave speaker to enable the same function.
In some embodiments, the master speaker and the slave speaker are placed at different positions in the room. The master speaker can detect the distance D between itself and the slave speaker for later use. This distance can be the straight-line distance between the master speaker and the slave speaker. After detecting the distance D, the master speaker can send it to the slave speaker, so that the slave speaker does not need to detect it. Alternatively, the slave speaker can itself detect the distance D to the master speaker for later use. The slave speaker can also detect the distance D and send it to the master speaker, in which case the master speaker does not need to detect it.
Take the master speaker as an example. In one example, the master speaker can detect the distance to the slave speaker through a distance sensor, such as a laser distance sensor or an infrared distance sensor. For example, the distance sensor on the master speaker emits infrared light of a specific frequency, the light is reflected by the slave speaker, and the master speaker receives the reflected light. The master speaker can calculate the distance between the master speaker and the slave speaker from the first time, at which the infrared light is emitted, and the second time, at which the reflected light is received. In another example, the master speaker can measure the distance by communicating with the slave speaker. For example, the master speaker transmits a probe signal to the slave speaker, the slave speaker sends a feedback signal to the master speaker after receiving the probe signal, and the master speaker receives the feedback signal. The master speaker can determine the distance between the master speaker and the slave speaker from the first time, at which the probe signal is sent, and the second time, at which the feedback signal is received. In yet another example, the master speaker can receive an input operation through an input device (such as a touch screen on the master speaker), and the input operation is used to enter the distance between the master speaker and the slave speaker.
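Both timing-based measurements above reduce to a round-trip time-of-flight calculation. The following is a minimal sketch, assuming the propagation speed of the signal is known and that any processing delay at the slave speaker can be subtracted. The function name and the `turnaround_s` parameter are illustrative and are not taken from the source.

```python
SPEED_OF_LIGHT_M_S = 299_792_458.0  # speed of infrared light or radio waves in vacuum


def round_trip_distance(t_send_s, t_receive_s,
                        speed_m_s=SPEED_OF_LIGHT_M_S, turnaround_s=0.0):
    """Estimate the master-to-slave distance D from a round-trip measurement.

    t_send_s     -- first time: the probe signal or infrared pulse is emitted
    t_receive_s  -- second time: the reflection or feedback signal arrives
    turnaround_s -- assumed processing delay at the slave before it replies
    """
    flight_s = (t_receive_s - t_send_s) - turnaround_s
    if flight_s < 0:
        raise ValueError("receive time precedes send time plus turnaround")
    # The signal travels the distance twice (out and back), hence the factor 1/2.
    return speed_m_s * flight_s / 2.0
```

With light-speed signals, a few metres correspond to tens of nanoseconds of round-trip time, so the timestamps would need correspondingly fine resolution; an acoustic probe signal could be handled by passing the speed of sound as `speed_m_s`.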
In some embodiments, the user may be anywhere in the room, and the distances from the master speaker and from the slave speaker to the user may differ. The master speaker and the slave speaker enable the function of automatically recognizing the "wake-up word" or "wake-up word + play song". The microphones in the master speaker and the slave speaker collect a sound signal. When the master speaker and the slave speaker determine that the sound signal contains the "wake-up word" or "wake-up word + play song", they can determine the position of the user and then control the sound parameters of the master speaker and the slave speaker according to that position. For example, the sound parameters may include the delay difference between the master speaker and the slave speaker, the loudness gain, and so on. In this embodiment, therefore, the sound parameters of the master speaker and the slave speaker are adjusted according to the user's position only when the two speakers recognize that the collected sound signal contains the "wake-up word" or "wake-up word + play song".
Taking the structure shown in FIG. 4A as an example, the process by which the master speaker and the slave speaker determine the position of the user may include the following. The master speaker collects sound signal 1, and the slave speaker collects sound signal 2. The master speaker determines that sound signal 1 includes the "wake-up word", and the slave speaker determines that sound signal 2 includes the "wake-up word". To improve accuracy, the slave speaker can also send sound signal 2, or the "wake-up word" contained in sound signal 2, to the master speaker, and the master speaker determines that the "wake-up word" in sound signal 1 and the one in sound signal 2 are the same wake-up word. The master speaker can determine a first direction/orientation of the user relative to the master speaker from sound signal 1. For example, the first direction/orientation can be expressed as a first angle between the user and the x-axis in the coordinate system constructed by the master speaker. The slave speaker can determine a second direction/orientation of the user relative to the slave speaker from sound signal 2. For example, the second direction/orientation can be expressed as a second angle between the user and the x-axis in the coordinate system constructed by the slave speaker. The slave speaker can send the second angle to the master speaker, and the master speaker determines the position of the user from the first angle, the second angle, and the distance D between the master speaker and the slave speaker. How the master speaker and the slave speaker construct their coordinate systems and determine the user's position is described in detail later.
Continuing with the structure shown in FIG. 4A as an example, the master speaker can determine the first direction/orientation of the user relative to the master speaker from sound signal 1 in several ways. Examples include microphone array localization (for example, estimating the direction of the sound source from the time difference between the sound signals received by at least two microphones in the microphone array on the master speaker), steered-beamformer localization, localization based on high-resolution spectral analysis, and sound source localization based on time-delay estimation (TDE). The embodiments of this application do not limit the method. Taking microphone array localization as an example, the process may include the following. The microphone array 408 in the master speaker collects the sound signal. Suppose the sound signals collected by microphone 408-1 and microphone 408-2 are relatively strong. The master speaker can calculate the first orientation of the sound source (that is, the user) relative to the master speaker from the first time t1 at which microphone 408-1 collects the sound signal, the second time t2 at which microphone 408-2 collects the sound signal, and the distance L1 between microphone 408-1 and microphone 408-2 (this distance can be stored in the master speaker at the factory). Referring to FIG. 4B, the master speaker can determine the angle A of the user relative to microphone 408-1 from (t1-t2)*c, L1, and trigonometric relations. The angle A can be used as the first orientation of the user relative to the master speaker. Alternatively, because the angle A is the angle of the user relative to microphone 408-1, the master speaker can transform the angle A into the coordinate system constructed by the master speaker to obtain an angle B, and the angle B can also be used as the first orientation of the user relative to the master speaker. The slave speaker can have the same structure as the master speaker, so the process by which the slave speaker determines the second orientation of the user relative to the slave speaker can be similar to the process above.
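The two-microphone estimate above can be illustrated with a far-field sketch, in which the path difference (t1-t2)*c is the projection of the microphone spacing L1 onto the direction of arrival. FIG. 4B is not reproduced here, so the exact trigonometric relation used in the source is an assumption; the function name and the convention that the angle is measured at microphone 408-1 from the line pointing toward microphone 408-2 are also illustrative.

```python
import math


def arrival_angle_deg(t1_s, t2_s, mic_spacing_m, speed_of_sound_m_s=343.0):
    """Far-field direction of arrival from two microphone timestamps.

    t1_s, t2_s    -- arrival times at microphone 408-1 and microphone 408-2
    mic_spacing_m -- distance L1 between the two microphones
    Returns the angle in degrees between the 408-1 -> 408-2 axis and the
    direction toward the sound source (0 = along the axis, 90 = broadside).
    """
    if mic_spacing_m <= 0:
        raise ValueError("microphone spacing must be positive")
    path_diff_m = (t1_s - t2_s) * speed_of_sound_m_s
    # Clamp against timing noise so acos() always receives a valid argument.
    ratio = max(-1.0, min(1.0, path_diff_m / mic_spacing_m))
    return math.degrees(math.acos(ratio))
```

Equal arrival times give 90 degrees (the user is broadside to the microphone pair), and a time difference of L1/c gives an angle of approximately 0 degrees (the user lies on the extension of the microphone axis).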
In other embodiments, while the user keeps uttering sound signals, the master speaker and the slave speaker can continuously collect the sound signals in real time (these sound signals may not contain the "wake-up word" or "wake-up word + play song"), determine the user's position, and adjust the sound parameters of the master speaker and the slave speaker according to that position. When a sound signal containing the "wake-up word" or "wake-up word + play song" is detected, the master speaker and the slave speaker are controlled to play the audio signal with the adjusted sound parameters (for example, the delay difference between the master speaker and the slave speaker and the loudness gain).
The following embodiments describe possible implementations in which the master speaker and the slave speaker control their sound parameters according to the position of the user.
FIG. 5 is a schematic flowchart of a speaker control method provided by an embodiment of this application. As shown in FIG. 5, the process can include:
501: The master speaker collects a sound signal.
502: The slave speaker collects a sound signal.
In some embodiments, both the master speaker and the slave speaker are the speaker 400 shown in FIG. 4A. One or more microphones in the master speaker and the slave speaker can always be in an enabled state, so the microphones in both speakers can collect sound signals (for example, sound signals uttered by the user). Taking the master speaker as an example, one or more microphones 408 in the master speaker collect the sound signal uttered by the user and send it to the processor 403. The processor 403 recognizes the sound signal. For example, the processor 403 can determine whether the sound signal contains the "wake-up word". If so, the processor 403 starts the master speaker, for example by supplying power to other components in the master speaker (such as one or more loudspeakers 406 and the display screen 409) so that they are in an enabled state.
503: The master speaker calculates a first angle of the sound source relative to the master speaker.
504: The slave speaker calculates a second angle of the sound source relative to the slave speaker.
Take the master speaker as an example, and suppose it has the structure shown in FIG. 4A. Referring to FIG. 6, the master speaker can establish a first coordinate system. For example, the x1-y1-z1 coordinate system is established with an edge of the display screen on the master speaker as the coordinate origin, the direction of gravity as the z-axis direction, the short side of the display screen as the x-axis direction, and the long side as the y-axis direction. The master speaker can determine a first orientation of the sound source (that is, the user) in this coordinate system, for example the first included angle between the user and the x1 axis (denoted by the symbol in Figure PCTCN2020110720-appb-000003), or the functional relationship of the ray formed in the x1-y1-z1 coordinate system by the position of the user and the origin of the first coordinate system. In other embodiments, if the master speaker has no display screen, the coordinate system can be constructed in other ways. For example, the center point of the master speaker is used as the coordinate origin, the direction of gravity as the z-axis direction, and the direction from the center point to one of the microphones in the master speaker as the x-axis direction. The embodiments of this application do not limit how the coordinate system is constructed.
In some embodiments, still referring to FIG. 6, the slave speaker can also establish a second coordinate system (such as the x2-y2-z2 coordinate system in the figure). The slave speaker and the master speaker can construct their coordinate systems in the same way. The slave speaker can determine a second orientation of the user in the coordinate system it has constructed, for example the second included angle l between the user and the x2 axis.
The process by which the master speaker determines the first orientation of the user relative to the master speaker, and the process by which the slave speaker determines the second orientation of the user relative to the slave speaker, have been described above and are not repeated here.
505: The slave speaker sends its calculation result to the master speaker.
In some embodiments, after the slave speaker constructs the second coordinate system, that is, the x2-y2-z2 coordinate system, the calculation result of the slave speaker can be the second angle l. The slave speaker can therefore send the second angle l to the master speaker. The master speaker can determine the position of the user from the first angle, the second angle, and the distance D between the master speaker and the slave speaker. In other embodiments, the calculation result of the slave speaker can instead be the functional relationship of the second ray formed by the second angle l in the second coordinate system, that is, the functional relationship of the ray formed in the second coordinate system by the position of the user and the origin of the second coordinate system. The slave speaker can send the functional relationship of the second ray to the master speaker.
506: The master speaker calculates the position of the sound source.
Example 1: the slave speaker sends the second angle l to the master speaker. Referring to FIG. 7A, after receiving the second angle l from the slave speaker, the master speaker can determine a second ray in the x1-y1-z1 coordinate system according to the second angle l, such as the second ray shown in FIG. 7A. The master speaker can also determine a first ray in the x1-y1-z1 coordinate system according to the first angle (denoted by the symbol in Figure PCTCN2020110720-appb-000004). The master speaker determines that the intersection point Q of the first ray and the second ray is the position of the user.
Example 2: the slave speaker sends the master speaker the functional relationship of the second ray formed by the second angle l in the second coordinate system, that is, the x2-y2-z2 coordinate system. What the master speaker receives is then the functional relationship of the second ray (for example, y=ax+b). This functional relationship is determined in the coordinate system constructed by the slave speaker, which is not the same coordinate system as the one constructed by the master speaker. The master speaker can therefore apply a coordinate transformation to the functional relationship of the second ray and convert it into the functional relationship of a third ray in the x1-y1-z1 coordinate system constructed by the master speaker. The master speaker can then determine that the intersection point of the first ray and the third ray is the position of the user. For example, referring to FIG. 7B, after receiving the functional relationship of the second ray corresponding to the second angle l, the master speaker marks the second ray in the x1-y1-z1 coordinate system. Because the second ray was not determined in the x1-y1-z1 coordinate system, it can be translated by the distance D along the z1 direction to obtain a third mathematical relationship. The third ray corresponding to the third mathematical relationship is the representation of the second ray in the x1-y1-z1 coordinate system of the master speaker. After transforming the second ray, the master speaker can determine the intersection point of the first ray and the third ray (for example, by solving the system of equations formed by the first mathematical relationship, which corresponds to the first ray, and the third mathematical relationship, which corresponds to the third ray). This intersection point is the position of the sound source (for example, the user).
507: The master speaker determines a first distance d1 from the sound source to the master speaker and a second distance d2 from the sound source to the slave speaker.
In some embodiments, after determining the intersection point of the first ray and the third ray, the master speaker can use trigonometric relations to determine the distance d1 between the intersection point and the master speaker (for example, the origin of the first coordinate system). It can also determine the distance d2 between the intersection point and the slave speaker (for example, the distance between the intersection point and point A, where the third ray meets the -z1 axis in the first coordinate system).
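Steps 506 and 507 amount to intersecting two rays and measuring the distance from the intersection to each ray origin. The sketch below works in a single plane and assumes that the two speakers' axes are parallel and that the position of the slave speaker in the master speaker's frame is known (for example, at distance D along one axis). The function name `locate_source` and the tuple-based interface are illustrative, and the choice of plane is a simplification of FIGS. 7A and 7B, which are not reproduced here.

```python
import math


def locate_source(angle1_deg, angle2_deg, slave_xy):
    """Intersect the ray from the master (at the origin) with the ray from the slave.

    angle1_deg -- first angle, measured at the master speaker from its x-axis
    angle2_deg -- second angle, measured at the slave speaker from its x-axis
    slave_xy   -- slave position in the master's frame; its norm is the distance D
    Returns ((x, y), d1, d2): the source position and its distance to each speaker.
    """
    a1, a2 = math.radians(angle1_deg), math.radians(angle2_deg)
    u1 = (math.cos(a1), math.sin(a1))  # unit direction of the first ray
    u2 = (math.cos(a2), math.sin(a2))  # unit direction of the translated second ray
    sx, sy = slave_xy
    # Solve t*u1 = slave_xy + s*u2 for t and s with Cramer's rule.
    det = u2[0] * u1[1] - u1[0] * u2[1]
    if abs(det) < 1e-12:
        raise ValueError("rays are parallel; no unique intersection")
    t = (u2[0] * sy - sx * u2[1]) / det
    s = (u1[0] * sy - u1[1] * sx) / det
    if t < 0 or s < 0:
        raise ValueError("rays diverge; intersection lies behind a speaker")
    # Because u1 and u2 are unit vectors, t and s are the distances d1 and d2.
    return (t * u1[0], t * u1[1]), t, s
```

For a slave speaker 2 m away along the x-axis and angles of 45 and 135 degrees, the rays meet at (1, 1), and both distances equal the square root of 2.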
508: The master speaker calculates the delay difference from the first distance d1 and the second distance d2.
In some embodiments, the master speaker can determine the delay difference between the first distance d1 and the second distance d2 with the following formulas:
d1/c = t1;
d2/c = t2;
t1 - t2 = Δt;
Here, c is the speed of sound, t1 is the time the sound signal takes to travel from the master speaker to the user, t2 is the time the sound signal takes to travel from the slave speaker to the user, and Δt is the difference between the propagation delays over the first distance d1 and the second distance d2. In some embodiments, the speed of sound is affected by several factors, such as temperature and air pressure. The following describes several possible ways of determining c.
Method 1: c can be obtained from the following formula:
c = 331.3 + 0.606T
where T is the temperature detected by the temperature sensor in the master speaker (in degrees Celsius for this linear approximation, with c in m/s).
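The delay formulas of step 508 combined with Method 1 can be written out as a short sketch. The function names are illustrative, and the temperature is assumed to be in degrees Celsius, which is the unit for which the coefficients 331.3 and 0.606 hold.

```python
def speed_of_sound_linear(temp_c):
    """Method 1: c = 331.3 + 0.606*T, with T in degrees Celsius and c in m/s."""
    return 331.3 + 0.606 * temp_c


def delay_difference(d1_m, d2_m, temp_c=20.0):
    """Return the delay difference t1 - t2, where t1 = d1/c and t2 = d2/c."""
    c = speed_of_sound_linear(temp_c)
    t1 = d1_m / c  # master speaker -> user
    t2 = d2_m / c  # slave speaker -> user
    return t1 - t2
```

At 20 degrees Celsius the sketch gives c = 343.42 m/s, so a path difference of one metre corresponds to a delay difference of roughly 2.9 ms.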
Method 2: c can also be obtained from the following formula:
$$c = \sqrt{\frac{\gamma R T}{M}}$$ (Figure PCTCN2020110720-appb-000005)
Here, γ is the ratio of the specific heat at constant pressure to the specific heat at constant volume, and its value can be fixed, for example γ=1.4. R is the gas constant, 287 J/(kg·K). T is the temperature detected by the temperature sensor (for example, the absolute temperature). M is the molar mass of the gas, and its value can be fixed, for example 22.4 L/mol.
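As a rough numerical check of Method 2, the sketch below assumes that the formula is the usual ideal-gas relation and reads R = 287 J/(kg·K) as the specific gas constant of dry air (per unit mass). Under that reading the molar mass is already folded into R, so the expression reduces to c = sqrt(γ·R·T). Both the reading and the function name are assumptions, not statements from the source.

```python
import math

GAMMA_AIR = 1.4          # ratio of specific heats, value given in the text
R_SPECIFIC_AIR = 287.0   # J/(kg*K), value given in the text, read here as per unit mass


def speed_of_sound_ideal_gas(temp_k, gamma=GAMMA_AIR, r_specific=R_SPECIFIC_AIR):
    """Ideal-gas speed of sound, c = sqrt(gamma * R_specific * T), T in kelvin."""
    if temp_k <= 0:
        raise ValueError("absolute temperature must be positive")
    return math.sqrt(gamma * r_specific * temp_k)
```

At 293.15 K this gives about 343.2 m/s, which is close to the value that the linear formula of Method 1 yields at 20 degrees Celsius.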
Method 3: c can also be obtained from the following formula:
[Formula image: Figure PCTCN2020110720-appb-000006]
Here, Pw is the partial pressure of water vapor in the air (for example, Pw = saturated vapor pressure of water × relative humidity). Pw can be a fixed value, for example a value stored in advance in the master speaker. T is the temperature detected by the temperature sensor in the master speaker, and P is the pressure detected by the air pressure sensor in the master speaker.
In some embodiments, after determining the delay difference Δt, the master speaker can adjust the playback delay of the master speaker or the slave speaker based on Δt, so that the sounds emitted by the master speaker and the slave speaker reach the user at the same time.
For example, when the master speaker determines that d1>d2, it can control one or more loudspeakers in the slave speaker to delay playback by Δt. For example, the master speaker can send an instruction to the slave speaker, and the instruction can carry the playback start time of the master speaker and the duration by which the slave speaker needs to delay. After receiving the instruction, the slave speaker can determine the playback start time of the master speaker and the duration of its own delay. Because the slave speaker delays its playback, the sounds emitted by the master speaker and the slave speaker reach the user at the same time.
As another example, when the master speaker determines that d1<d2, it can send an instruction to the slave speaker that indicates the playback start time of the slave speaker. The master speaker controls one or more of its own loudspeakers to start playback a certain duration after that start time. Because the master speaker delays its playback, the sounds emitted by the master speaker and the slave speaker reach the user at the same time.
As yet another example, when the master speaker determines that d1=d2, it can send an instruction to the slave speaker that indicates the playback start time of the slave speaker. The master speaker controls one or more of its own loudspeakers to start playback at that same start time.
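The three cases above (d1>d2, d1<d2, and d1=d2) can be condensed into one scheduling rule: the speaker nearer to the user starts later by |Δt|. The following is a minimal sketch. The function name, the notion of a shared `base_time_s`, and the default speed of sound are assumptions, and clock synchronization between the two speakers is not modeled.

```python
def playback_start_times(d1_m, d2_m, base_time_s, speed_of_sound_m_s=343.0):
    """Return (master_start, slave_start) so that both sounds reach the user together.

    d1_m, d2_m  -- distances from the user to the master and slave speakers
    base_time_s -- start time of whichever speaker is farther from the user
    """
    dt = (d1_m - d2_m) / speed_of_sound_m_s  # delay difference t1 - t2
    if dt > 0:
        # Master is farther away: the slave delays its playback by dt.
        return base_time_s, base_time_s + dt
    if dt < 0:
        # Slave is farther away: the master delays its playback by |dt|.
        return base_time_s - dt, base_time_s
    # Equal distances: both start at the same time.
    return base_time_s, base_time_s
```

In every case the arrival times `master_start + d1/c` and `slave_start + d2/c` coincide, which is the condition the text aims for.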
509: The master speaker calculates the loudness gain from the first distance d1 and the second distance d2.
In some embodiments, the master speaker can determine the loudness gain with the following formulas:
e1 = d1 × α1;
e2 = d2 × α2;
Here, e1 and e2 are the loudness gains, d1 is the distance between the master speaker and the sound source, d2 is the distance between the slave speaker and the sound source, and α1 and α2 are sound attenuation coefficients. The sound attenuation coefficient is affected by several factors, such as air pressure and temperature. The following describes several possible ways of determining the sound attenuation coefficient.
Method 1: the sound attenuation coefficient can be obtained from the following formula:
[Formula image: Figure PCTCN2020110720-appb-000007]
Here, P0 is the standard atmospheric pressure (1013.25 hPa), P is the air pressure detected by the air pressure sensor, T0 is 293 K, T is the temperature detected by the temperature sensor, and f is the sound frequency of the speaker. In some embodiments, the value of f can be preset in the speaker. For example, f can be related to the performance specifications of the loudspeaker in the speaker, and the sound frequency of the speaker is set before the speaker leaves the factory.
Method 2: the sound attenuation coefficient can be obtained from the following formula:
$$\alpha = \alpha_{cl} + \alpha_{rot} + \alpha_{vib}(\mathrm{O_2}) + \alpha_{vib}(\mathrm{N_2})$$
Here, $\alpha_{cl}$ is the classical part caused by viscosity and thermal conductivity, $\alpha_{rot}$ is the rotational relaxation part caused by the relaxation of rotationally excited molecules, and $\alpha_{vib}$ is the vibrational relaxation part caused by the relaxation of vibrationally excited molecules. In some embodiments, $\alpha_{cl}$, $\alpha_{rot}$, and $\alpha_{vib}$ can be obtained from the following formulas:
[Formula image: Figure PCTCN2020110720-appb-000008]
[Formula image: Figure PCTCN2020110720-appb-000009]
[Formula image: Figure PCTCN2020110720-appb-000010]
Here, P0 is the standard atmospheric pressure (1013.25 hPa), P is the air pressure detected by the air pressure sensor, T0 is 293 K, T is the temperature detected by the temperature sensor, and f is the sound frequency of the speaker. $f_{r,O}$ is the vibrational relaxation frequency of oxygen molecules, and $f_{r,N}$ is the vibrational relaxation frequency of nitrogen molecules. $f_{r,O}$ and $f_{r,N}$ can be obtained from the following formulas:
[Formula image: Figure PCTCN2020110720-appb-000011]
[Formula image: Figure PCTCN2020110720-appb-000012]
Here, q is the specific humidity. For a given wet pressure e, q=100ep, where p is the pressure detected by the air pressure sensor and p0 is 101325 N/m². T is the temperature detected by the temperature sensor, and T0 is 293 K. The wet pressure e indicates the water vapor pressure in the air. For example, the speaker can include a sensor that detects the water vapor pressure in the air. As another example, the speaker can query the network side (for example, a weather service) for the water vapor pressure of the current environment.
In some embodiments, after determining e1 and e2, the master speaker can adjust the loudness gain of one or more loudspeakers in the master speaker and the slave speaker based on e1 and e2.
In some embodiments, the master speaker can increase the loudness of one or more of its own loudspeakers by e1. The master speaker can also send an instruction to the slave speaker, and the instruction instructs the slave speaker to increase the loudness of one or more of its loudspeakers by e2.
In other embodiments, when the master speaker determines that d1>d2, it can increase the loudness of one or more loudspeakers in the master speaker by e1, and the loudness of the slave speaker need not be increased. Alternatively, when the master speaker determines that d1<d2, it can send an instruction to the slave speaker that instructs the slave speaker to increase the loudness of one or more of its loudspeakers by e2, and the loudness of the master speaker need not be increased. Alternatively, when the master speaker determines that d1=d2, neither speaker needs to increase its loudness, or the loudness of the master speaker is increased by e1 and the loudness of the slave speaker is increased by e2.
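The gain formulas of step 509 and the two adjustment policies described above can be sketched together as follows. The attenuation coefficients are taken as inputs because the formulas that produce them are given only as images in the source. The function name, the `boost_farther_only` flag, and the choice of the no-boost option when d1=d2 are illustrative assumptions.

```python
def loudness_gains(d1_m, d2_m, alpha1, alpha2, boost_farther_only=False):
    """Return (master_gain, slave_gain) from e1 = d1*alpha1 and e2 = d2*alpha2.

    alpha1, alpha2     -- sound attenuation coefficients (gain per unit distance)
    boost_farther_only -- if True, only the speaker farther from the user is
                          boosted, and neither is boosted when d1 == d2
    """
    e1 = d1_m * alpha1
    e2 = d2_m * alpha2
    if not boost_farther_only:
        return e1, e2          # both speakers compensate their own path loss
    if d1_m > d2_m:
        return e1, 0.0         # master is farther: boost the master only
    if d1_m < d2_m:
        return 0.0, e2         # slave is farther: boost the slave only
    return 0.0, 0.0            # equal distances: leave both unchanged
```

With equal coefficients, a user 4 m from the master speaker and 2 m from the slave speaker yields twice the gain for the master speaker under the first policy and a boost for the master speaker alone under the second.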
510: The main speaker controls the sound playback of the main speaker and the slave speaker according to the delay difference and the loudness gain. Specifically, 510 includes 510a and 510b: in 510a the main speaker controls its own playback, and in 510b the main speaker controls the playback of the slave speaker.
In some embodiments, the main speaker may calculate the delay difference and the loudness gain simultaneously or at different times, which is not limited in the embodiments of this application.
In some embodiments, the above process of calculating the position of the sound source from the first ray and the second ray may also be performed by the slave speaker. For example, after determining the first angle, the main speaker sends information about the first angle (such as the first angle itself, or the first mathematical relationship of the first ray formed by the first angle in the first coordinate system) to the slave speaker, and the slave speaker calculates the position of the sound source, the delay difference, the loudness gain, and so on.
In other embodiments, the slave speaker may lack computing capability (for example, the capability to calculate the second orientation of the sound source relative to the slave speaker). FIG. 8 is a schematic flowchart of a speaker control method provided by another embodiment of this application. As shown in FIG. 8, the process may include:
801: The main speaker collects a sound signal.
802: The slave speaker collects a sound signal.
For 801 and 802, refer to the description of 501 and 502 in the embodiment shown in FIG. 5; details are not repeated here.
803: The slave speaker sends the collected sound signal to the main speaker.
804: The main speaker calculates the first angle of the sound source relative to the main speaker and the second angle of the sound source relative to the slave speaker.
In some embodiments, the slave speaker lacks computing capability (for example, the capability to calculate the second orientation of the sound source relative to the slave speaker), so it cannot calculate the second angle of the sound source relative to itself. Instead, the slave speaker can send the collected sound signal to the main speaker. In some embodiments, the slave speaker includes one or more microphones, and each microphone collects a different sound signal; the slave speaker can send the sound signal collected by every microphone to the main speaker. In some examples, if the microphones in the microphone arrays of the main speaker and the slave speaker are distributed identically, the main speaker can map each received per-microphone signal to the corresponding microphone of the main speaker. For example, each microphone in the main speaker and in the slave speaker has a number, and the sound signal sent by the slave speaker to the main speaker includes sound signal 1 corresponding to microphone 1, sound signal 2 corresponding to microphone 2, and sound signal 3 corresponding to microphone 3. After receiving these signals (for example, as a sound signal group consisting of sound signals 1, 2, and 3), the main speaker can map them to its own microphones, that is, treat sound signal 1 as collected by its microphone 1, sound signal 2 as collected by its microphone 2, and sound signal 3 as collected by its microphone 3. The main speaker therefore holds two sound signals (or signal groups): one that it collected itself from the user's utterance, and one that it received from the slave speaker. The two may differ, for example in loudness. The main speaker can calculate one angle from each sound signal (or group), obtaining two angles: the first angle (the symbol shown in Figure PCTCN2020110720-appb-000013) and the second angle l.
805: The main speaker calculates the position of the sound source.
Referring to FIG. 9, the main speaker can determine the first ray according to the first angle (the symbol shown in Figure PCTCN2020110720-appb-000014), and determine the second ray according to the second angle l and the distance D. The main speaker then determines the intersection of the first ray and the second ray; this intersection is the position of the user.
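The ray intersection can be illustrated with a short sketch. The coordinate convention here is an assumption made for the example, not something stated in the text: the main speaker sits at the origin, the slave speaker at (D, 0), and both angles are measured counterclockwise from the positive x-axis, in radians. The function name `locate_source` is hypothetical.

```python
import math

def locate_source(angle1, angle2, D, eps=1e-9):
    """Intersect the ray from (0, 0) at angle1 with the ray from (D, 0) at angle2.

    Returns (x, y), or None when the rays are parallel or do not meet
    in front of both speakers.
    """
    # Cross product of the two unit direction vectors equals sin(angle2 - angle1).
    denom = math.sin(angle2 - angle1)
    if abs(denom) < eps:
        return None  # parallel rays never intersect
    # Distances along each ray to the intersection point.
    t1 = D * math.sin(angle2) / denom
    t2 = D * math.sin(angle1) / denom
    if t1 < 0 or t2 < 0:
        return None  # the lines cross behind one of the speakers
    return (t1 * math.cos(angle1), t1 * math.sin(angle1))
```

With speakers 2 m apart and angles of 45° and 135°, the sketch places the source at (1, 1), directly in front of the midpoint between the speakers.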
806: The main speaker calculates the first distance between the sound source and the main speaker and the second distance between the sound source and the slave speaker.
In some embodiments, after determining the intersection of the first ray and the second ray, the main speaker can use trigonometric relations to determine the first distance d1 between the sound source and the main speaker and the second distance d2 between the sound source and the slave speaker.
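Once the intersection point is known, the two distances follow directly from the coordinates. The sketch below assumes, purely for illustration, that the main speaker is at the origin and the slave speaker at (D, 0); the name `speaker_distances` is hypothetical.

```python
import math

def speaker_distances(source, D):
    """Return (d1, d2): distance from the source to the main speaker at (0, 0)
    and to the slave speaker at (D, 0)."""
    x, y = source
    d1 = math.hypot(x, y)       # source to main speaker
    d2 = math.hypot(x - D, y)   # source to slave speaker
    return d1, d2
```

For a source at (0, 3) and speakers 4 m apart, this gives d1 = 3 and d2 = 5.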
807: The main speaker calculates the delay difference and/or the loudness gain according to the first distance and the second distance.
For the process of determining sound parameters such as the delay difference and the loudness gain, refer to the embodiment shown in FIG. 5, for example steps 508 and 509 in FIG. 5.
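As a rough illustration of the delay-difference calculation, the sketch below uses the formula c=331.3+0.606T that appears in claim 8 of this application. Treating T as a temperature in degrees Celsius is an assumption of this example, and the function names are hypothetical.

```python
def sound_speed(temp_c):
    """Approximate speed of sound in m/s using c = 331.3 + 0.606*T.

    temp_c is assumed to be in degrees Celsius.
    """
    return 331.3 + 0.606 * temp_c

def delay_difference(d1, d2, temp_c):
    """Difference, in seconds, between the travel times over d1 and d2 (metres)."""
    c = sound_speed(temp_c)
    t1 = d1 / c  # first duration: main speaker to listener
    t2 = d2 / c  # second duration: slave speaker to listener
    return abs(t1 - t2)
```

At 20 °C the sketch gives c ≈ 343.42 m/s, so a 1 m difference in distance corresponds to a delay difference of roughly 2.9 ms.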
808: The main speaker controls the sound playback of the main speaker and the slave speaker according to the delay difference and the loudness gain. Specifically, 808 includes 808a and 808b: in 808a the main speaker controls its own playback, and in 808b the main speaker controls the playback of the slave speaker.
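The way the delay difference feeds into playback timing can be sketched as follows, based on the rule in claim 9: the slave speaker starts later than the main speaker when the listener is farther from the main speaker, and earlier in the opposite case. The function name `slave_start_time` and the use of plain seconds for the moments are assumptions of this example.

```python
def slave_start_time(first_moment, d1, d2, delay_diff):
    """Return the second moment, at which the slave speaker should start playing.

    first_moment: start time of the main speaker, in seconds.
    d1, d2: distances from the listener to the main and slave speaker.
    delay_diff: non-negative delay difference, in seconds.
    """
    if d1 > d2:
        # Sound from the main speaker needs longer to arrive: delay the slave.
        return first_moment + delay_diff
    if d1 < d2:
        # Sound from the slave speaker needs longer to arrive: start it earlier.
        return first_moment - delay_diff
    return first_moment  # equal distances: both start together
```

The returned value would be carried in the instruction that the main speaker sends to the slave speaker.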
In other embodiments, the main speaker may lack computing capability (for example, the capability to calculate the first orientation of the sound source relative to the main speaker). In that case, the steps performed by the slave speaker in FIG. 8 are performed by the main speaker, and the steps performed by the main speaker are performed by the slave speaker.
In some embodiments, after the main speaker and/or the slave speaker is powered on (each time, or for the first time after purchase), it can collect statistics on the positions where the user often listens to music. In some examples, referring to FIG. 10, the main speaker and/or the slave speaker receives a voice command that is used to start the speaker, determines the user's first position based on that voice command, and stores the first position. When the main speaker and/or the slave speaker again detects a voice command, this time used to start playing a song, it determines the user's second position based on that voice command and stores the second position. In this way the main speaker and/or the slave speaker can determine several positions of the user; for example, all the solid black points in FIG. 10 are user positions determined by the main speaker and/or the slave speaker. The main speaker and/or the slave speaker can then select, from all the position coordinates, the points whose pairwise distance is less than a preset distance, and determine the area formed by these points using the minimum enclosing circle. This area, where the user often listens to music, can be called the active area, such as the area formed by the densely distributed points in FIG. 10.
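One possible reading of the active-area construction is sketched below: keep the stored positions that have at least one neighbor closer than the preset distance, then compute the minimum enclosing circle of those points. The neighbor rule is an interpretation of the text, the brute-force circle search is chosen only for brevity (it suits the small point sets a speaker would store), and all names are hypothetical.

```python
import math
from itertools import combinations

def dense_points(points, preset_distance):
    """Keep points that have at least one other point closer than preset_distance."""
    return [p for i, p in enumerate(points)
            if any(i != j and math.dist(p, q) < preset_distance
                   for j, q in enumerate(points))]

def _circle_from_two(a, b):
    """Circle whose diameter is the segment a-b."""
    center = ((a[0] + b[0]) / 2, (a[1] + b[1]) / 2)
    return center, math.dist(a, b) / 2

def _circle_from_three(a, b, c):
    """Circumcircle of a, b, c, or None if the points are collinear."""
    d = 2 * (a[0] * (b[1] - c[1]) + b[0] * (c[1] - a[1]) + c[0] * (a[1] - b[1]))
    if abs(d) < 1e-12:
        return None
    a2, b2, c2 = (p[0] ** 2 + p[1] ** 2 for p in (a, b, c))
    ux = (a2 * (b[1] - c[1]) + b2 * (c[1] - a[1]) + c2 * (a[1] - b[1])) / d
    uy = (a2 * (c[0] - b[0]) + b2 * (a[0] - c[0]) + c2 * (b[0] - a[0])) / d
    return (ux, uy), math.dist((ux, uy), a)

def min_enclosing_circle(points, eps=1e-9):
    """Brute-force minimum enclosing circle; returns (center, radius) or None."""
    if not points:
        return None
    if len(points) == 1:
        return points[0], 0.0
    # The optimal circle is defined by two or three of the input points.
    candidates = [_circle_from_two(a, b) for a, b in combinations(points, 2)]
    for triple in combinations(points, 3):
        circle = _circle_from_three(*triple)
        if circle is not None:
            candidates.append(circle)
    best = None
    for center, radius in candidates:
        if all(math.dist(center, p) <= radius + eps for p in points):
            if best is None or radius < best[1]:
                best = (center, radius)
    return best
```

For stored positions (0, 0), (2, 0), (1, 1) and an outlier at (10, 10) with a preset distance of 3, the outlier is dropped and the active area becomes the circle centered at (1, 0) with radius 1.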
In some embodiments, the main speaker and the slave speaker collect the sound signal emitted by the sound source. When the main speaker determines that the sound signal includes a "wake-up word" or "wake-up word + play song", it determines the position of the sound source. The main speaker can then judge whether the position of the sound source is in the active area. If so, it controls the sound of the main speaker and the slave speaker based on the process shown in FIG. 5 or FIG. 7; if not, it does not need to use that process. In some embodiments, the "active area" may be a position where the user often listens to music.
For example, the user says "Xiaobai, play All the Way North" at the doorway. The main speaker receives the sound signal, determines that it includes "wake-up word + play song", and determines the user's position. If the main speaker determines that this position is not in the active area, it does not need to adjust the sound parameters (such as the loudness gain and the delay difference) of the main speaker and the slave speaker; for example, the two speakers sound at the same time, or the sound is controlled with the most recently used sound parameters, or with default sound parameters (for example, the factory defaults). Because the main speaker judges that the user is not in the active area, it does not adjust the sound parameters according to the user's position, which helps save power.
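The active-area check in this example can be sketched as follows, assuming for illustration that the active area is stored as a circle (center and radius). The names `in_active_area`, `select_sound_params`, `compute_params`, and `fallback_params` are hypothetical; `compute_params` stands in for the position-based calculation of delay difference and loudness gain.

```python
import math

def in_active_area(position, center, radius):
    """True if the position lies inside or on the circular active area."""
    return math.dist(position, center) <= radius

def select_sound_params(position, center, radius, compute_params, fallback_params):
    """Compute position-specific parameters only inside the active area.

    compute_params: callable taking the position and returning sound parameters.
    fallback_params: last-used or default parameters, returned unchanged otherwise.
    """
    if in_active_area(position, center, radius):
        return compute_params(position)
    # Outside the active area the calculation is skipped, which saves work.
    return fallback_params
```

A user at the doorway, outside the circle, would simply get the fallback parameters.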
The embodiments of this application can be combined arbitrarily to achieve different technical effects.
In the embodiments provided above, the method provided by the embodiments of this application is described from the perspective of a speaker (the main speaker and/or the slave speaker) as the execution subject. To implement the functions in the method provided by the embodiments of this application, the terminal device may include a hardware structure and/or a software module, and implement these functions in the form of a hardware structure, a software module, or a hardware structure plus a software module. Whether a given function is performed by a hardware structure, a software module, or a hardware structure plus a software module depends on the specific application and design constraints of the technical solution.
As used in the above embodiments, depending on the context, the term "when..." or "after..." can be interpreted as meaning "if...", "after...", "in response to determining...", or "in response to detecting...". Similarly, depending on the context, the phrase "when determining..." or "if (a stated condition or event) is detected" can be interpreted as meaning "if it is determined...", "in response to determining...", "when (the stated condition or event) is detected", or "in response to detecting (the stated condition or event)". In addition, in the above embodiments, relational terms such as first and second are used to distinguish one entity from another, and do not limit any actual relationship or order between these entities.
The above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented by software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in the embodiments of the present invention are generated in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable device. The computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another; for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center in a wired manner (for example, coaxial cable, optical fiber, or digital subscriber line (DSL)) or a wireless manner (for example, infrared, radio, or microwave). The computer-readable storage medium may be any available medium that a computer can access, or a data storage device, such as a server or data center, that integrates one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
It should be noted that part of this patent application document contains content protected by copyright. The copyright owner reserves the copyright, except for the making of copies of the patent documents or the recorded contents of the patent documents of the Patent Office.

Claims (14)

  1. A sound box control method, applied to a sound box group comprising a first sound box and a second sound box arranged at different positions, characterized in that the method comprises:
    the first sound box collects a first sound signal, and the second sound box collects a second sound signal;
    the first sound box determines the position of a sound source according to the first sound signal and the second sound signal;
    the first sound box determines a first distance between the sound source and the first sound box and a second distance between the sound source and the second sound box;
    the first sound box determines a delay difference based on the first distance and the second distance;
    the first sound box sends a first instruction to the second sound box, the first instruction instructing the second sound box to emit sound at a second moment, wherein the second moment is determined according to a first moment and the delay difference, and the first moment is the sounding time of the first sound box;
    the first sound box emits a third sound signal at the first moment, and the second sound box emits a fourth sound signal at the second moment; wherein the third sound signal and the fourth sound signal are signals of different channels of the same audio file.
  2. The method according to claim 1, characterized in that the method further comprises:
    the first sound box determines a first loudness gain according to the first distance;
    the first sound box determines a second loudness gain according to the second distance;
    the first sound box adjusts the loudness of the first sound box according to the first loudness gain;
    the first sound box sends a second instruction to the second sound box, the second instruction instructing the second sound box to adjust the loudness of the second sound box based on the second loudness gain.
  3. The method according to claim 1 or 2, characterized in that the first sound box determining the position of the sound source according to the first sound signal and the second sound signal comprises:
    the first sound box determines a first angle of the sound source in a first coordinate system according to the first sound signal;
    the first sound box determines a second angle of the sound source in the first coordinate system according to the second sound signal;
    the first sound box determines the position of the sound source according to the first angle, the second angle, and the distance between the first sound box and the second sound box.
  4. The method according to any one of claims 1-3, characterized in that, before the first sound box determines the position of the sound source according to the first sound signal and the second sound signal, the method further comprises:
    the first sound box determines that the first sound signal and the second sound signal include a wake-up word.
  5. The method according to any one of claims 1-4, characterized in that, after the first sound box determines the position of the sound source according to the first sound signal and the second sound signal, and before the first sound box emits the third sound signal at the first moment and the second sound box emits the fourth sound signal at the second moment, the method further comprises:
    determining that the position of the sound source is within an active area;
    wherein the active area is an area in which the number of times the user uses the sound box group is greater than a first preset threshold, or the frequency with which the user uses the sound box group is greater than a second preset threshold.
  6. The method according to any one of claims 1-5, characterized in that the third sound signal is the left channel signal of the audio file and the fourth sound information is the right channel signal of the audio file; or the fourth sound signal is the left channel signal of the audio file and the third sound information is the right channel signal of the audio file.
  7. The method according to any one of claims 1-6, characterized in that the first sound box determining the delay difference based on the first distance and the second distance comprises:
    the first sound box determines the sound propagation speed according to the current temperature;
    the first sound box determines a first duration according to the first distance and the sound propagation speed;
    the first sound box determines a second duration according to the second distance and the sound propagation speed;
    the first sound box determines the delay difference according to the first duration and the second duration.
  8. The method according to claim 7, characterized in that the first sound box determining the sound propagation speed according to the current temperature comprises:
    the first sound box detects a current temperature value;
    the first sound box determines the sound propagation speed according to the temperature value and the formula c=331.3+0.606T, where T is the temperature value and c is the sound propagation speed;
    or,
    the first sound box determines the sound propagation speed according to the temperature value and the formula
    Figure PCTCN2020110720-appb-100001
    where γ is the ratio of the specific heat at constant pressure to the specific heat at constant volume, R is the gas constant, T is the temperature value, M is the molar mass of the gas, and c is the sound propagation speed;
    or,
    the first sound box determines the sound propagation speed according to the temperature value and the formula
    Figure PCTCN2020110720-appb-100002
    where Pw is the partial pressure of water vapor in the air, T is the temperature value, P is the pressure value detected by the first sound box, and c is the sound propagation speed.
  9. The method according to claim 7 or 8, characterized in that the first sound box determining the delay difference according to the first duration and the second duration comprises:
    when the first distance is greater than the second distance, the first duration is greater than the second duration, and the first sound box determines that the difference between the first duration and the second duration is the delay difference; the second moment is the first moment delayed by the delay difference;
    when the first distance is less than the second distance, the first duration is less than the second duration, and the first sound box determines that the difference between the second duration and the first duration is the delay difference; the second moment is the first moment advanced by the delay difference.
  10. The method according to any one of claims 2-9, characterized in that the method further comprises:
    when the first distance is greater than the second distance, the first sound box adjusting the loudness of the first sound box according to the first loudness gain comprises: the first sound box increases the current loudness of the first sound box according to the first loudness gain; and the second instruction instructs the second sound box to reduce the current loudness of the second sound box according to the second loudness gain;
    when the first distance is less than the second distance, the first sound box adjusting the loudness of the first sound box according to the first loudness gain comprises: the first sound box reduces the current loudness of the first sound box according to the first loudness gain; and the second instruction instructs the second sound box to increase the current loudness of the second sound box according to the second loudness gain.
  11. A sound box, characterized by comprising: one or more processors, one or more memories, one or more microphones, one or more loudspeakers, and a communication module;
    the one or more microphones are configured to collect sound signals;
    the communication module is configured to communicate with other sound boxes;
    the one or more loudspeakers are configured to emit sound signals;
    the one or more memories are configured to store program instructions which, when executed by the one or more processors, cause the sound box to perform the method according to any one of claims 1-10.
  12. A sound box system, comprising a first sound box and a second sound box, wherein the first sound box and the second sound box are arranged at different positions and are able to communicate with each other; characterized in that the first sound box is the sound box according to claim 11.
  13. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program comprising program instructions which, when executed by a computer, cause the computer to perform the method according to any one of claims 1-10.
  14. A program product, characterized in that the program product stores a computer program comprising program instructions which, when executed by a computer, cause the computer to perform the method according to any one of claims 1-10.
PCT/CN2020/110720 2019-08-23 2020-08-24 Loudspeaker box control method, loudspeaker box, and loudspeaker box system WO2021036970A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910785595.9 2019-08-23
CN201910785595.9A CN110677801B (en) 2019-08-23 2019-08-23 Sound box control method, sound box and sound box system

Publications (1)

Publication Number Publication Date
WO2021036970A1 true WO2021036970A1 (en) 2021-03-04

Family

ID=69076453

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/110720 WO2021036970A1 (en) 2019-08-23 2020-08-24 Loudspeaker box control method, loudspeaker box, and loudspeaker box system

Country Status (2)

Country Link
CN (1) CN110677801B (en)
WO (1) WO2021036970A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110677801B (en) * 2019-08-23 2021-02-23 华为技术有限公司 Sound box control method, sound box and sound box system
CN112105129B (en) * 2020-04-09 2023-11-17 苏州触达信息技术有限公司 Intelligent lamp, intelligent lighting method and computer readable storage medium
CN112083379B (en) * 2020-09-09 2023-10-20 极米科技股份有限公司 Audio playing method and device based on sound source localization, projection equipment and medium
CN114257924A (en) * 2020-09-24 2022-03-29 华为技术有限公司 Method for distributing sound channels and related equipment
CN112188368A (en) * 2020-09-29 2021-01-05 深圳创维-Rgb电子有限公司 Method and system for directionally enhancing sound
CN112327253A (en) * 2020-10-28 2021-02-05 苏州触达信息技术有限公司 Method and device for positioning personnel in water
US20240114309A1 (en) * 2020-12-03 2024-04-04 Dolby Laboratories Licensing Corporation Progressive calculation and application of rendering configurations for dynamic applications
CN116320902B (en) * 2023-05-19 2023-08-25 南昌航天广信科技有限责任公司 Sound box synchronous playing method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104954930A (en) * 2015-06-03 2015-09-30 冠捷显示科技(厦门)有限公司 Method for automatically adjusting sound direction and time delay of audible device and achieving best sound effects
CN108966077A (en) * 2018-06-19 2018-12-07 四川斐讯信息技术有限公司 A kind of control method and system of speaker volume
CN109910771A (en) * 2019-02-22 2019-06-21 广州小鹏汽车科技有限公司 A kind of audio frequency playing method and vehicle audio system of stereophonic field
US10332538B1 (en) * 2018-08-17 2019-06-25 Apple Inc. Method and system for speech enhancement using a remote microphone
CN110677801A (en) * 2019-08-23 2020-01-10 华为技术有限公司 Sound box control method, sound box and sound box system

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001238298A (en) * 2000-02-25 2001-08-31 Matsushita Electric Ind Co Ltd Sound image localization device
EP2899997A1 (en) * 2014-01-22 2015-07-29 Thomson Licensing Sound system calibration
CN103945301B (en) * 2014-04-24 2018-04-17 Tcl集团股份有限公司 A kind of sound system balance adjusting method and device
CN104637505B (en) * 2014-12-31 2017-06-16 小米科技有限责任公司 Audio frequency playing method and device
CN109309888A (en) * 2017-07-27 2019-02-05 深圳市冠旭电子股份有限公司 Voice information processing method, playback equipment and computer readable storage medium
CN108762104A (en) * 2018-05-17 2018-11-06 江西午诺科技有限公司 Speaker control method, device, readable storage medium storing program for executing and mobile terminal
CN108966112B (en) * 2018-06-29 2020-10-13 北京橙鑫数据科技有限公司 Time delay parameter adjusting method, system and device
CN109831735B (en) * 2019-01-11 2022-10-11 歌尔科技有限公司 Audio playing method, device, system and storage medium suitable for indoor environment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104954930A (en) * 2015-06-03 2015-09-30 冠捷显示科技(厦门)有限公司 Method for automatically adjusting sound direction and time delay of audible device and achieving best sound effects
CN108966077A (en) * 2018-06-19 2018-12-07 四川斐讯信息技术有限公司 Control method and system for speaker volume
US10332538B1 (en) * 2018-08-17 2019-06-25 Apple Inc. Method and system for speech enhancement using a remote microphone
CN109910771A (en) * 2019-02-22 2019-06-21 广州小鹏汽车科技有限公司 Audio playing method and vehicle audio system for a stereophonic sound field
CN110677801A (en) * 2019-08-23 2020-01-10 华为技术有限公司 Sound box control method, sound box and sound box system

Also Published As

Publication number Publication date
CN110677801B (en) 2021-02-23
CN110677801A (en) 2020-01-10

Similar Documents

Publication Publication Date Title
WO2021036970A1 (en) Loudspeaker box control method, loudspeaker box, and loudspeaker box system
TWI578288B (en) Method and device for generating multimedia poster
KR102067019B1 (en) Apparatus and method for controlling charging path of mobile terminal
US9575155B2 (en) Ultrasonic location determination
US10817255B2 (en) Scene sound effect control method, and electronic device
CN108922537B (en) Audio recognition method, device, terminal, earphone and readable storage medium
CN108319445B (en) Audio playing method and mobile terminal
WO2018223837A1 (en) Music playing method and related product
WO2017215652A1 (en) Sound effect parameter adjustment method, and mobile terminal
WO2017215635A1 (en) Sound effect processing method and mobile terminal
CN104035877B (en) Device and method for managing the memory of a mobile terminal
CN111757241B (en) Sound effect control method and device, sound box array and wearable device
WO2021244057A1 (en) Interaction method and apparatus, earphone, and earphone accommodation apparatus
WO2017215511A1 (en) Control method of scene sound effect and related products
US8861310B1 (en) Surface-based sonic location determination
WO2019165999A1 (en) Ultrasonic fingerprint collection precision control processing method, storage medium and mobile terminal
US20210159867A1 (en) Context based volume adaptation by voice assistant devices
WO2017101260A1 (en) Method, device, and storage medium for audio switching
US9578086B2 (en) Method and apparatus of setting data transmission and reception period
WO2021244059A1 (en) Interaction method and device, earphone, and server
CN110297543B (en) Audio playing method and terminal equipment
US9654891B2 (en) System and method for determining proximity of a controller to a media rendering device
WO2021147583A1 (en) Method, apparatus and system for determining relative angle between smart devices, and smart device
WO2020253377A1 (en) Terminal positioning method and mobile terminal
CN111028867B (en) Audio playing method and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 20856187; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 Ep: PCT application non-entry in European phase. Ref document number: 20856187; Country of ref document: EP; Kind code of ref document: A1