CN109831735B - Audio playing method, device, system and storage medium suitable for indoor environment - Google Patents

Audio playing method, device, system and storage medium suitable for indoor environment Download PDF

Info

Publication number
CN109831735B
CN109831735B CN201910026267.0A CN201910026267A CN109831735B CN 109831735 B CN109831735 B CN 109831735B CN 201910026267 A CN201910026267 A CN 201910026267A CN 109831735 B CN109831735 B CN 109831735B
Authority
CN
China
Prior art keywords
sound source
playing
audio
intelligent audio
intelligent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910026267.0A
Other languages
Chinese (zh)
Other versions
CN109831735A (en
Inventor
崔文秋
杨小平
朱志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Goertek Techology Co Ltd
Original Assignee
Goertek Techology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Goertek Techology Co Ltd filed Critical Goertek Techology Co Ltd
Priority to CN201910026267.0A priority Critical patent/CN109831735B/en
Publication of CN109831735A publication Critical patent/CN109831735A/en
Application granted granted Critical
Publication of CN109831735B publication Critical patent/CN109831735B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Stereophonic System (AREA)

Abstract

The embodiment of the application provides an audio playing method, equipment, a system and a storage medium suitable for an indoor environment. In the embodiment of the application, a plurality of intelligent audio devices are dispersively deployed around a listening area in an indoor environment, and a sound source position-playing parameter mapping relation corresponding to each intelligent audio device is determined by combining environmental characteristics of the indoor environment and relative positions of the intelligent audio devices and the listening area in advance, so that each intelligent audio device can determine playing parameters corresponding to a target sound source position contained in surround sound according to the corresponding mapping relation, and then respectively output sound signals in each target sound source position according to the playing parameters corresponding to each target sound source position, and the purpose of generating surround sound effect in the listening area is achieved. In this embodiment, a surround sound field is constructed by using the intelligent audio device, which not only can exert the intelligent advantages of the intelligent audio device, but also can improve the sound effect of the intelligent audio device.

Description

Audio playing method, device, system and storage medium suitable for indoor environment
Technical Field
The present application relates to the field of signal processing technologies, and in particular, to an audio playing method, device, system, and storage medium suitable for an indoor environment.
Background
With the popularization of consumer electronics, the intelligent man-machine interaction mode of products brings great convenience to users, for example, intelligent audio is already present in more and more families. However, the audio products on the market are basically mono or 1.1 channel, and the sound effect thereof cannot be compared with the traditional 2.1 channel or 5.1 or even 7.1 channel sound system.
In practical applications, if a user chooses to use an intelligent audio product, how to improve the sound effect is a big problem to be solved urgently.
Disclosure of Invention
Various aspects of the application provide an audio playing method, device, system and storage medium suitable for indoor environment, so as to exert the intelligent advantages of intelligent audio equipment, improve the sound effect of the intelligent audio equipment, and meet the dual requirements of users on intelligence and sound effect.
The embodiment of the application provides an audio playing method suitable for an indoor environment, wherein a plurality of intelligent audio devices are dispersedly deployed around a listening area of the indoor environment; the method comprises the following steps: the method comprises the steps that first intelligent audio equipment obtains a target audio signal needing to be played according to a received playing instruction, wherein the target audio signal comprises a plurality of target sound source positions; the first intelligent audio equipment determines playing parameters corresponding to the target sound source directions according to the corresponding sound source direction-playing parameter mapping relation; the first intelligent audio equipment respectively outputs sound signals in the target sound source directions in the target audio signals according to the playing parameters corresponding to the target sound source directions so as to generate surround sound effects in the listening area by matching with other intelligent audio equipment; wherein the first smart audio device is any one of the plurality of smart audio devices, and the sound source bearing-play parameter mapping relationship is predetermined according to the environmental characteristics of the indoor environment and the relative position of the first smart audio device and the listening area.
An embodiment of the present application further provides an audio playing system suitable for an indoor environment, including: a plurality of intelligent audio devices; the intelligent audio devices are dispersedly deployed around a listening area in the indoor environment; the plurality of intelligent audio devices are respectively used for acquiring target audio signals to be played according to the received playing instructions, and the target audio signals comprise a plurality of target sound source positions; determining playing parameters corresponding to the target sound source directions according to the sound source direction-playing parameter mapping relations corresponding to the target sound source directions; outputting sound signals in the target sound source directions in the target audio signals according to the playing parameters corresponding to the target sound source directions so as to generate surround sound effect in the listening area; wherein, the sound source position-playing parameter mapping relation corresponding to each of the plurality of intelligent audio devices is predetermined according to the environmental characteristics of the indoor environment and the relative positions of the plurality of intelligent audio devices and the listening zone.
The embodiment of the present application further provides an intelligent audio device, including: a communication component, a memory, a processor, and an audio component; the communication component is used for receiving a playing instruction; the memory is used for storing a computer program and a sound source direction-playing parameter mapping relation corresponding to the intelligent audio equipment; the processor to execute the computer program to: acquiring a target audio signal to be played according to a playing instruction received by the communication assembly, wherein the target audio signal comprises a plurality of target sound source positions; determining playing parameters corresponding to the intelligent audio equipment and each target sound source position according to the sound source position-playing parameter mapping relation stored in the memory, and outputting the playing parameters to the audio component; the audio component is used for outputting sound signals in the target sound source directions in the target audio signals according to the playing parameters corresponding to the intelligent audio equipment and the target sound source directions so as to generate surround sound effects in a listening area of the indoor environment by matching with other intelligent audio equipment in the indoor environment where the intelligent audio equipment is located; wherein the sound source bearing-playing parameter mapping relationship is predetermined according to the environmental characteristics of the indoor environment and the relative position of the intelligent audio equipment and the listening area.
Embodiments of the present application also provide a computer-readable storage medium storing a computer program, which, when executed by one or more processors, causes the one or more processors to perform the steps of embodiments of the method of the present application.
In the embodiment of the application, a plurality of intelligent audio devices are dispersively deployed around a listening area in an indoor environment, and a sound source position-playing parameter mapping relation corresponding to each intelligent audio device is determined by combining environmental characteristics of the indoor environment and relative positions of the intelligent audio devices and the listening area in advance, so that when surround sound needs to be played, each intelligent audio device can determine playing parameters corresponding to a target sound source position contained in the surround sound according to the corresponding mapping relation, and then sound signals in each target sound source position are respectively output according to the playing parameters corresponding to each target sound source position, and the purpose of generating a sound effect of the surround sound in the listening area is achieved. In the embodiment, the surround stereo field is constructed by the intelligent audio equipment, so that the intelligent advantage of the intelligent audio equipment can be exerted, and the sound effect of the intelligent audio equipment is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
fig. 1a is a schematic structural diagram of an audio playing system suitable for an indoor environment according to an exemplary embodiment of the present application;
fig. 1b is a schematic structural diagram of another audio playing system suitable for indoor environment according to an exemplary embodiment of the present application;
fig. 1c is a schematic diagram of a process of determining playing parameters of each intelligent audio device in a candidate sound source orientation by taking a rectangular area as an example in the exemplary embodiment of the present application;
fig. 1d is a schematic structural diagram of another audio playing system suitable for indoor environment according to an exemplary embodiment of the present application;
FIG. 1e is a schematic diagram illustrating a microphone of an exemplary embodiment of the present application;
fig. 2a is a schematic flowchart of an audio playing method suitable for an indoor environment according to an exemplary embodiment of the present application;
fig. 2b is a schematic flowchart of another audio playing method suitable for indoor environment according to an exemplary embodiment of the present application;
fig. 3 is a schematic structural diagram of an intelligent audio device according to an exemplary embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be described in detail and completely with reference to the following specific embodiments of the present application and the accompanying drawings. It should be apparent that the described embodiments are only a few embodiments of the present application, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
In the prior art, if a user chooses to use an intelligent audio device, the user will face a technical problem of how to improve the sound effect of the intelligent audio device. In some embodiments of the present application, a plurality of intelligent audio devices are dispersively disposed around a listening area in an indoor environment, and a sound source position-playing parameter mapping relationship corresponding to each intelligent audio device is determined in advance by combining environmental characteristics of the indoor environment and relative positions of the plurality of intelligent audio devices and the listening area, so that when surround sound needs to be played, each intelligent audio device can determine playing parameters corresponding to a target sound source position included in the surround sound according to the corresponding mapping relationship, and further output sound signals in each target sound source position according to the playing parameters corresponding to each target sound source position, thereby achieving a purpose of generating a surround sound effect in the listening area. In this embodiment, a surround sound field is constructed by using the intelligent audio device, which not only can exert the intelligent advantages of the intelligent audio device, but also can improve the sound effect of the intelligent audio device.
The technical solutions provided by the embodiments of the present application are described in detail below with reference to the accompanying drawings.
Fig. 1a is a schematic structural diagram of an audio playing system suitable for an indoor environment according to an exemplary embodiment of the present application. As shown in fig. 1a, the audio playing system 100 is deployed in an indoor environment, and specifically includes: a plurality of intelligent audio devices 101, the plurality of intelligent audio devices 101 being deployed dispersed around a listening area 102 in an indoor environment. The smart audio device 101 refers to a device that can play audio signals, and may be, for example, a smart speaker, a smart phone, or the like.
The listening zone 102 in the indoor environment refers to a location area where the user is accustomed to listening to the audio signal played by the intelligent audio device 101. The listening zone may vary according to the indoor environment. For example, assuming that the indoor environment is a living room, and the user is habitually sitting on a sofa to listen to audio or watch video, the listening area may be a location area in the living room where the sofa is located. For another example, assuming that the indoor environment is a master bedroom, and the user is habitually lying in bed to listen to audio or watch video, the listening zone may be a bedside position in the master bedroom. For another example, assuming that the indoor environment is a multimedia conference room, the listening zone may be a location area where a conference table in the multimedia conference room is located.
In the embodiment of the present application, the manner in which the plurality of smart audio devices 101 are disposed around the listening area 102 in a distributed manner is not limited, and the smart audio devices need to be disposed flexibly according to the position of the listening area 102, the shape of the indoor environment, and the like. For example, assuming that the indoor environment is a relatively regular rectangular, square, or circular area and the listening area 102 is a central area of the rectangular, square, or circular area, the plurality of smart audio devices 101 may be evenly disposed around the central area. Of course, if the indoor environment is an irregular area, and the listening area 102 may be located on a certain side or corner of the irregular area, then the plurality of smart audio devices 101 may be dispersed around the listening area 102 to form a semi-enclosed state for the listening area 102. Since the indoor environment is an irregular area, and the distances from different locations to the listening area 102 may be different, the plurality of smart audio devices 101 may be unevenly disposed in order to achieve the desired surround sound effect.
In addition, the number of the smart audio devices 101 is not limited in the embodiment of the present application, and the number can be flexibly set according to the desired surround sound effect, the size of the indoor environment, and the like. For example, 3, 4, 6, or 8 smart audio devices 101 may be disposed around the listening area 102. In this embodiment, a plurality of smart audio devices 101 are deployed around the listening zone 102 in advance, and after the deployment of the plurality of smart audio devices 101, the mapping relationship between the sound source direction and the playing parameter corresponding to each smart audio device 101 may be determined according to the environmental characteristics of the indoor environment and the relative positions of the plurality of smart audio devices 101 and the listening zone 102. The sound source azimuth-play parameter mapping relationship corresponding to each intelligent audio device 101 stores play parameters corresponding to each candidate sound source azimuth that may possibly occur to the intelligent audio device 101. For the same candidate sound source direction, if each smart audio device 101 plays according to the playing parameters corresponding to the candidate sound source direction, the sound effect of the surround sound from the candidate sound source direction will be heard at the listening area 102.
In this explanation, in the embodiments of the present application, sound source orientations that may occur in audio signals (audio signals are broadly referred to herein) are referred to as candidate sound source orientations for convenience of description and distinction. In this embodiment, the playing parameters of the intelligent audio device 101 at each possible candidate sound source position are established in advance. In addition, in the embodiment of the present application, a sound source azimuth existing in a specific audio signal played by the smart audio device 101 during a certain playing process is referred to as a target sound source azimuth. The candidate sound source locations may be regarded as a full set of sound source locations, and the target sound source location is a subset thereof.
Based on the above, when an audio signal needs to be played by using the audio playing system 100, a playing instruction may be issued to a plurality of intelligent audio devices 101. For each intelligent audio device 101, the audio signal to be played may be obtained according to the received play instruction. For convenience of description and distinction, an audio signal to be played is referred to as a target audio signal, which includes a plurality of target sound source orientations and belongs to surround sound. Then, each intelligent audio device 101 may determine, according to its corresponding sound source bearing-playing parameter mapping relationship, a playing parameter corresponding to each target sound source bearing contained in the target audio signal. Then, each smart audio device 101 can output the sound signal of each target sound source position in the target audio signal according to its own playing parameter corresponding to each target sound source position, so as to generate the surround sound effect at the listening area 102.
Optionally, the form of the target audio signal includes, but is not limited to: music, video dubbing, celebrity speech, voice, audio books, english audio material, etc.
In this embodiment, a surround sound field is constructed by using the intelligent audio device, so that the intelligent advantage of the intelligent audio device can be exerted, and playing audio signals based on the surround sound field is also beneficial to improving the playing effect of the intelligent audio device.
In an application scenario, the multiple intelligent audio devices 101 support a voice interaction manner and have a function of internet surfing, so that a user may send a voice playing instruction to the multiple intelligent audio devices 101 to request the multiple intelligent audio devices 101 to play a surround sound on the internet. The voice playing instruction may carry an identifier of the surround sound to be played by the smart audio device 101, for example, information such as a name and an album of music to be played. For a plurality of intelligent audio devices 101, a target audio signal to be played may be acquired from the internet according to a voice playing instruction issued by a user.
Further optionally, the plurality of intelligent audio devices 101 have an equal status, and each of the plurality of intelligent audio devices 101 may receive a voice playing instruction sent by a user, and obtain a target audio signal to be played from the internet according to the received voice playing instruction.
Further alternatively, the plurality of smart audio devices 101 are divided into a master speaker and a slave speaker, the slave speaker establishes a communication connection, such as a bluetooth connection, a WiFi connection, or an infrared connection, with the master speaker, and the slave speaker receives control of the master speaker. Based on the above, the user can send a voice playing instruction to the master sound box, the master sound box receives the voice playing instruction sent by the user, on one hand, a target audio signal to be played is obtained from the internet according to the voice playing instruction, and on the other hand, the voice playing instruction is forwarded to the slave sound box; for the slave loudspeaker boxes, the voice playing instruction forwarded by the master loudspeaker box can be received, and then the target audio signal to be played is obtained from the internet according to the voice playing instruction.
In another application scenario, a user uses other audio/video playing devices to play audio/video signals, and in order to improve sound effects, the audio signals played by the audio/video playing devices can be transmitted to the plurality of intelligent audio devices 101, so that surround sound effects are generated by the plurality of intelligent audio devices 101. The audio/video playing device may be any device having an audio/video playing function, and may be a television, a mobile phone, a notebook computer, a tablet computer (ipad), an AR device, a VR device, or a tablet computer. In the application scenario, the multiple intelligent audio devices 101 may establish wireless connection with the audio/video playing device, for example, wiFi connection, bluetooth connection, infrared connection, or the like; when the audio/video playing device needs to transmit an audio signal to multiple intelligent audio devices 101, a playing instruction may be issued in a wireless manner, and for convenience of distinguishing and describing, the playing instruction issued in a wireless manner is referred to as a wireless playing instruction. For the plurality of intelligent audio devices 101, the target audio signal transmitted by the audio and video playing device may be received according to a wireless playing instruction sent by the audio and video playing device.
Further optionally, the plurality of intelligent audio devices 101 have an equal status, and the plurality of intelligent audio devices 101 establish wireless connection with the audio and video playing device, so that wireless playing instructions sent by the audio and video playing device can be received respectively, and further target audio signals transmitted by the audio and video playing device are received according to the received wireless playing instructions.
Further optionally, the plurality of smart audio devices 101 are divided into a master speaker and a slave speaker, the slave speaker establishes a communication connection with the master speaker, for example, a bluetooth connection, a WiFi connection, or an infrared connection, and the slave speaker accepts control of the master speaker. Based on the method, the master sound box can establish wireless connection with the audio and video playing equipment, such as Bluetooth connection, wiFi connection or infrared connection and the like, then receive a wireless playing instruction sent by the audio and video playing equipment based on the wireless connection, receive a target audio signal transmitted by the audio and video playing equipment according to the wireless playing instruction, and forward the target audio signal to the slave sound box; and receiving the target audio signal forwarded by the main loudspeaker box by the auxiliary loudspeaker box.
In any application scenario, after the target audio signal is acquired, the plurality of intelligent audio devices 101 may determine, according to the mapping relationship between the sound source position and the playing parameter corresponding to each intelligent audio device, a playing parameter corresponding to each target sound source position included in the target audio signal; then, each smart audio device may output the sound signal at the position of each target sound source in the target audio signal according to the playing parameter corresponding to each target sound source position, so as to generate the surround sound effect at the listening area 102.
In the embodiment of the present application, the manner of obtaining the mapping relationship between the sound source direction and the playing parameter corresponding to each of the plurality of intelligent audio devices 101 is not limited. For example, the sound source bearing-playback parameter mapping relationship may be individually established for itself by the intelligent audio devices 101. For another example, the main speakers in the multiple smart audio devices 101 may be responsible for establishing a mapping relationship between the sound source direction and the playing parameter for each smart audio device 101. For another example, the server may be responsible for establishing a sound source direction-playing parameter mapping relationship for each intelligent audio device 101.
As shown in fig. 1b, a server 103 is added to the audio playing system 100 shown in fig. 1 a. The server 103 may be deployed in the cloud, or may be deployed locally, which is not limited to this.
In this embodiment, considering that the intelligent audio device 101 has limited computing capability, the server 103 is deployed in the audio playing system 100, and the server 103 is responsible for establishing respective mapping relationships between sound source orientations and playing parameters for the multiple intelligent audio devices 101.
In the present embodiment, the plurality of intelligent audio devices 101 establish communication connections with the server 103, respectively. The communication connection may be wired or wireless. The server 103 may establish a mapping relationship between the sound source position and the playing parameter corresponding to each of the plurality of intelligent audio devices 101 according to the three-dimensional spatial information of the indoor environment and the relative position information between the plurality of intelligent audio devices 101 and the listening area 102, and issue the mapping relationship between the sound source position and the playing parameter corresponding to each of the plurality of intelligent audio devices 101 to the corresponding intelligent audio device 101, so that the intelligent audio device 101 may store the mapping relationship locally.
In an optional embodiment, a user may obtain three-dimensional spatial information of an indoor environment in advance, such as a house-type drawing, a decoration drawing, and the like; in addition, after the plurality of intelligent audio devices 101 are deployed in the indoor environment, the user may manually measure the relative position information of the plurality of intelligent audio devices 101 and the listening area 102, and upload the three-dimensional spatial information of the indoor environment and the relative position information of the plurality of intelligent audio devices 101 and the listening area 102 to the server 103.
In another alternative embodiment, the plurality of intelligent audio devices 101 may detect the distance information between the intelligent audio devices and the surrounding environment and between the intelligent audio devices and other intelligent audio devices, and report the distance information to the server 103; the server 103 constructs three-dimensional spatial information of the indoor environment according to the distance information reported by the intelligent audio devices 101, and determines the relative position information of each intelligent audio device 101 and the listening zone 102.
In this embodiment, the manner in which the plurality of intelligent audio devices 101 detect the distance information between the plurality of intelligent audio devices and the ambient environment and between the plurality of intelligent audio devices and other intelligent audio devices is not limited, and any manner in which the plurality of intelligent audio devices can detect the distance information between the plurality of intelligent audio devices and the ambient environment and between the plurality of intelligent audio devices and other intelligent audio devices is applicable to the embodiment of the present application. Several detection modes are listed below:
in this embodiment, the smart audio device 101 includes an ultrasonic module, which can transmit and receive ultrasonic signals. Based on this, the smart audio device 101 can detect its distance information from the surrounding environment using the ultrasonic wave signal. For example, an ultrasonic module is arranged on the top of the intelligent audio device 101, and is used to send an ultrasonic signal upwards, record the time T1 for sending the ultrasonic signal, wait for the ultrasonic signal to be reflected back after hitting the ceiling above, and record the time T2 for receiving the reflected ultrasonic signal, and the distance from the intelligent audio device 101 to the ceiling can be calculated according to the transmission rate of the ultrasonic wave in the air and the time difference (T2-T1). Similarly, an ultrasonic module may be disposed in front of the intelligent audio device 101, the ultrasonic module is used to send an ultrasonic signal to the front of the intelligent audio device 101, and the distance from the intelligent audio device 101 to an obstacle in front of the intelligent audio device, such as a wall or a bookshelf, may be calculated by using the sending and returning time of the ultrasonic signal and combining the transmission rate of the ultrasonic wave in the air. By analogy, the ultrasonic modules may be disposed at different positions of the intelligent audio device 101, so as to detect the distance between the intelligent audio device 101 and the corresponding obstacle in each direction in the environment region. Therefore, the plurality of intelligent audio devices 101 may respectively detect the distance information between the intelligent audio devices and the surrounding environment by using the ultrasonic signals and report the distance information to the server 103.
In this embodiment, the smart audio device 101 includes a bluetooth module, and the bluetooth module can transmit and receive bluetooth signals. The two smart audio devices 101 may establish a bluetooth connection through their respective bluetooth modules and may transmit bluetooth signals based on the bluetooth connection. In addition, the smart audio device 101 also includes an audio module that can emit audio signals. Two smart audio devices 101 may transmit audio signals emitted by their audio modules based on a bluetooth connection between the two. Based on this, for two intelligent audio devices 101, one intelligent audio device 101 can simultaneously transmit one bluetooth signal and one audio signal to the other intelligent audio device 101 based on the bluetooth connection between the two; the other intelligent audio device 101 may receive the bluetooth signal and the audio signal, and since the transmission of the bluetooth signal is fast, the transmission time of the bluetooth signal may be ignored, so that the other intelligent audio device 101 may calculate the distance information between the other intelligent audio device 101 and the other intelligent audio device 101 according to the difference between the time of receiving the audio signal and the time of receiving the bluetooth signal, and by combining the transmission rate of the audio signal. For any intelligent audio device 101 in the plurality of intelligent audio devices 101, the distance information between the intelligent audio device 101 and other intelligent audio devices can be detected by using the bluetooth signal and the audio signal, and reported to the server 103.
The server 103 receives the distance information between the intelligent audio equipment 101 and the surrounding environment and the distance information between the intelligent audio equipment 101 and other intelligent audio equipment, and can construct three-dimensional space information of the indoor environment by combining the distance information and also can calculate the relative position information between the intelligent audio equipment 101 and the listening area 102; furthermore, a corresponding mapping relationship between the sound source direction and the playing parameter may be established for each intelligent audio device 101 according to the three-dimensional spatial information of the indoor environment and the relative position information of each intelligent audio device 101 and the listening area 102.
Alternatively, taking the regular rectangular area shown in fig. 1c as an example, assuming that the listening area 102 is located at the center position O of the rectangular area, the smart audio devices S1, S2, S3 and S4 may be respectively disposed at four corners of the rectangular area. It should be noted that, for the convenience of calculation, 4 intelligent audio devices are taken as an example, but not limited to 4. For any candidate sound source position, the server 103 may calculate the playing parameters of each intelligent audio device in the candidate sound source position in the same manner.
Taking the sound source direction from the angle δ in front of the center position O on the right as an example, the server 103 may calculate the angles d1 and d2 and the angles a and b according to the previously established three-dimensional spatial information of the rectangular area and the relative position information of the smart audio devices S1, S2, S3, and S4 and the center position O (i.e., the listening zone); further, according to the vector solving method and the trigonometric function, the ratio of the loudness value of the smart audio device S1 to the loudness value of the smart audio device S2 is (SPL (S1))/(SPL (S2)) = (Sin (b))/(Sin (a)) = (Sin (d 1- δ))/(Sin (d 1+ δ)). By analogy, for other candidate sound source directions, the ratio of the loudness values of the corresponding intelligent audio equipment can be calculated in the manner, and then the initial playing parameters of the corresponding intelligent audio equipment are determined according to the ratio of the loudness values of the corresponding intelligent audio equipment. The initial playing parameter herein mainly refers to the overall playing volume of the intelligent audio device. The English spelling of SPL is Sound Pressure Level, which means Sound Pressure value and is the traditional expression mode of loudness value. SPL (S1) represents the loudness value of the smart audio device S1, and SPL (S2) represents the loudness value of the smart audio device S2, similarly.
Furthermore, the server 103 may establish a sound source position-playback parameter mapping relationship corresponding to each of the plurality of intelligent audio devices according to the initial playback parameters of each of the plurality of intelligent audio devices in each of the candidate sound source positions. For example, for each intelligent audio device, the server 103 may store the playing parameters of the intelligent audio device in each candidate sound source position and each candidate sound source position correspondingly, so as to form a sound source position-playing parameter mapping relationship corresponding to the intelligent audio device.
It should be noted that the server 103 may directly calculate the initial playing parameter of each intelligent audio device in each candidate sound source position according to the vector solving method and the trigonometric function, and further directly establish a respective corresponding sound source position-playing parameter mapping relationship for each intelligent audio device based on the initial playing parameter. However, considering that the environmental characteristics of the indoor environment are random, dynamic, and the like, there may be some deviation from the theoretical calculation. In order to further improve the accuracy of the mapping relationship between the sound source position and the playing parameter corresponding to each intelligent audio device, as shown in fig. 1d, a recording device 104 is added on the basis of the audio playing system 100 shown in fig. 1b, and the recording device 104 is arranged in the listening area 102 and is responsible for simulating human ears, and is mainly used for testing the initial playing parameter of each intelligent audio device in each candidate sound source position, which is directly calculated by the server 103 according to a vector solving method and a trigonometric function, so as to improve the accuracy of the playing parameter, and further improve the accuracy of the mapping relationship between the sound source position and the playing parameter corresponding to each intelligent audio device.
It should be noted that the embodiment of the present application does not limit the device form of the recording device 104, and any device with a recording function is suitable for the embodiment of the present application. For example, recording device 104 may be an additional smart audio device, and may also be a cell phone, a tablet computer, a voice pen, a voice recorder, a video recorder, and so on.
Optionally, in order to facilitate the recording device 104 to accurately collect the sound emitted by each smart audio device 101, a plurality of microphones (mic) may be disposed on the recording device 104, as shown in fig. 1e, 6 microphones may be uniformly disposed on the recording device 104, and the 6 microphones surround the recording device 104. Of course, the number of microphones on the recording device 104 is not limited to 6, and may be 3, 4, 8, etc.
On the basis of adding the recording device 104, the server 103 calculates initial playing parameters of a plurality of intelligent audio devices in each candidate sound source position according to three-dimensional space information of an indoor environment and relative position information of the intelligent audio devices and a listening area; the method can control a plurality of intelligent audio devices to output test audio signals according to the initial playing parameters under each candidate sound source position; and correcting initial playing parameters of the plurality of intelligent audio devices in each candidate sound source position according to the difference between the actual sound source position of the surround sound with the stereo effect corresponding to each candidate sound source position reported by the recording device 104 and the corresponding candidate sound source position, and further establishing a sound source position-playing parameter mapping relation corresponding to each of the plurality of intelligent audio devices based on the corrected playing parameters.
For the recording device 104, surround sound with a stereo effect and actual sound source directions generated by the test audio signals output by the multiple intelligent audio devices 101 may be collected in the listening area 102, and surround sound with a stereo effect and actual sound source directions of surround sound with a stereo effect corresponding to the candidate sound source directions may be reported to the server 103.
It should be noted that the process of correcting the initial playing parameters of each intelligent audio device in each candidate sound source position by the server 103 is the same or similar, and the first candidate sound source position is taken as an example for description below. The first candidate sound source position is any one of the candidate sound source positions.
Taking the first candidate sound source position as an example, the server 103 may calculate initial playing parameters of each intelligent audio device in the first candidate sound source position according to a vector solution method and a trigonometric function, and then may control the multiple intelligent audio devices to output a test audio signal according to the initial playing parameters (mainly referring to overall playing volume) in the first candidate sound source position, where the test audio signal refers to surround sound used in testing the playing parameters in the first candidate sound source position. For example, the server 103 may request each smart audio device to play the 20 to 20KHZ audio signal according to the calculated initial playing parameters (mainly referring to the overall playing volume), and ideally, the 20 to 20KHZ audio signal should be mixed to generate the sound effect from the first candidate sound source direction. The sound signals output by the intelligent audio devices are mixed and then reach the recording device 104 in the listening area 102, and surround sound from a certain sound source direction can be heard for the recording device 104. The recording device 104 reports the acquired surround sound and the actual sound source location of the surround sound to the server 103.
The server 103 compares the actual sound source position of the surround sound acquired by the recording device 104 with the first candidate sound source position; if the error of the two is within the set error range, the initial playing parameters of the intelligent audio equipment are considered to be accurate, and the mapping relation between the first candidate sound source position and the initial playing parameters of the intelligent audio equipment can be established; if the error between the actual sound source position and the first candidate sound source position acquired by the recording device 104 is within the set error range, the server 103 determines that the initial playing parameter of each intelligent audio device is inaccurate, corrects the initial playing parameters of the multiple intelligent audio devices, re-issues the corrected playing parameters to each intelligent audio device, controls each intelligent audio device to re-output the test audio signal according to the corrected playing parameters, and establishes the mapping relationship between the first candidate sound source position and the corrected playing parameters of the multiple intelligent audio devices until the error between the actual sound source position of the surround sound acquired by the recording device 104 and the first candidate sound source position is within the set error range.
Optionally, the server 103 may determine, according to an error condition between the actual sound source position of the surround sound with the stereo effect reported by the sound recording device 104 and the first candidate sound source position, a target intelligent audio device that needs to correct the playing parameter and a correction direction of the playing parameter, and then correct, according to a set correction step length, a current playing parameter (the current playing parameter may be an initial playing parameter, or a playing parameter corrected last time) that affects the target intelligence along the determined correction direction. The correction direction here refers to whether the correction is directed to a large correction or a small correction.
It should be noted that the playing parameters of the intelligent audio device may further include volume equalization parameters in a certain frequency band, in addition to the overall playing volume of the intelligent audio device. The volume balance parameter is mainly used for limiting or compensating the playing volume on a certain frequency band on the basis of the overall playing volume. For example, the volume equalization parameter may be a volume value that needs to be decreased for a particular high frequency signal, and for example, an increased volume value for a portion of a low frequency signal.
Optionally, for the server 103, the frequency response analysis may be performed on the surround sound with the stereo effect corresponding to each candidate sound source position reported by the sound recording device 104 to obtain a Frequency Response (FR) curve of the surround sound with the stereo effect corresponding to each candidate sound source position, and then the frequency band that needs to be equalized and the volume equalization parameter corresponding to the frequency band are determined by combining the FR curve and the expected sound effect. Similarly, when the volume equalization parameters corresponding to the frequency bands needing equalization processing are determined, the volume equalization parameters can also be adjusted for multiple times and are sent to each intelligent audio device again for testing until the expected sound effect is achieved, and the volume equalization parameters corresponding to each intelligent terminal at the moment are sent to the corresponding intelligent terminal for being solidified in the local memory.
In the embodiment shown in fig. 1b or fig. 1d, the intelligent audio device is combined with technologies such as ultrasonic waves and bluetooth, so that the establishment of a surrounding stereo field can be conveniently realized by the intelligent audio device, the difficulty and cost of establishing a professional stereo field in an indoor environment are greatly reduced, artificial intelligence and professional stereo can simultaneously enter thousands of households, and the dual requirements of users on intellectualization and high sound effect are met.
Based on the foregoing system embodiment, an audio playing method suitable for an indoor environment is further provided in this application exemplary embodiment, where the method is applied to a first intelligent audio device, and the first intelligent audio device is any one of a plurality of intelligent audio devices dispersedly deployed around a listening area of the indoor environment in the foregoing system embodiment. As shown in fig. 2a, the method comprises the steps of:
201. the first intelligent audio equipment acquires a target audio signal to be played according to the received playing instruction, wherein the target audio signal comprises a plurality of target sound source positions.
202. And the first intelligent audio equipment determines the playing parameters corresponding to the target sound source directions according to the corresponding sound source direction-playing parameter mapping relation.
203. The first intelligent audio equipment outputs sound signals in the target sound source directions in the target audio signals according to the playing parameters corresponding to the target sound source directions so as to generate surround sound effects in a listening area by matching with other intelligent audio equipment; the mapping relation between the sound source direction and the playing parameter corresponding to the first intelligent audio device is predetermined according to the environmental characteristics of the indoor environment and the relative position of the first intelligent audio device and the listening area.
In an application scenario, the first intelligent audio device supports a voice interaction mode and has an internet access function, and a target audio signal to be played can be acquired from the internet according to a voice playing instruction sent by a user.
Further optionally, if the plurality of intelligent audio devices have equal status, the first intelligent audio device may receive a voice playing instruction sent by the user by itself, and obtain a target audio signal to be played from the internet according to the received voice playing instruction.
Further optionally, the plurality of smart audio devices are divided into a master speaker and a slave speaker, the slave speaker establishes a communication connection, such as a bluetooth connection, a WiFi connection, or an infrared connection, with the master speaker, and the slave speaker receives control of the master speaker. Based on this, if the first intelligent audio device is a master loudspeaker box, a voice playing instruction sent by a user can be received, on one hand, a target audio signal to be played is obtained from the internet according to the voice playing instruction, and on the other hand, the voice playing instruction is forwarded to a slave loudspeaker box. If the first intelligent audio device is a slave loudspeaker box, the voice playing instruction forwarded by the master loudspeaker box can be received, and then the target audio signal to be played is obtained from the internet according to the voice playing instruction.
In another application scenario, a user uses other audio/video playing devices to play audio/video signals, and in order to improve the sound effect, the audio signals played by the audio/video playing devices can be transmitted to a plurality of intelligent audio devices, so that the plurality of intelligent audio devices are utilized to generate surround sound effect. Based on this, the first intelligent audio device can receive the target audio signal transmitted by the audio and video playing device according to the wireless playing instruction sent by the audio and video playing device.
Further optionally, the plurality of intelligent audio devices have equal status, and the first intelligent audio device may establish wireless connection with the audio/video playing device, so as to directly receive a wireless playing instruction sent by the audio/video playing device, and further receive a target audio signal transmitted by the audio/video playing device according to the received wireless playing instruction.
Further optionally, the plurality of smart audio devices are divided into a master speaker and a slave speaker, the slave speaker establishes a communication connection, such as a bluetooth connection, a WiFi connection, or an infrared connection, with the master speaker, and the slave speaker receives control of the master speaker. Based on this, if the first intelligent audio device is a master sound box, a wireless connection can be established with the audio and video playing device, a wireless playing instruction sent by the audio and video playing device is received based on the wireless connection, a target audio signal transmitted by the audio and video playing device is received according to the wireless playing instruction, and the target audio signal is forwarded to the slave sound box. If the first intelligent audio device is a slave loudspeaker, the target audio signal forwarded by the master loudspeaker can be received.
Optionally, the first intelligent audio device may further detect distance information between the first intelligent audio device and the ambient environment by using the ultrasonic signal and report the distance information to the server, and detect distance information between the first intelligent audio device and another intelligent audio device by combining the bluetooth signal and the audio signal and report the distance information to the server, so that the server determines three-dimensional space information of the indoor environment and relative position information of the first intelligent audio device and the listening area according to the distance information.
For the server, a sound source direction-playing parameter mapping relation corresponding to the first intelligent audio equipment can be established according to the three-dimensional space information of the indoor environment and the relative position information of the first intelligent audio equipment and the listening area, and the sound source direction-playing parameter mapping relation is issued to the first intelligent audio equipment. The first intelligent audio equipment receives the mapping relation between the sound source direction and the playing parameter, which is sent by the server and corresponds to the first intelligent audio equipment, and stores the mapping relation locally so as to be used in playing the target audio signal. The sound source position-playing parameter mapping relation corresponding to the first intelligent audio equipment comprises the playing parameters of the first intelligent audio equipment in the possible candidate sound source positions, and therefore when the intelligent audio equipment is used, table lookup can be directly carried out.
Further, as shown in fig. 2b, another audio playing method suitable for an indoor environment provided by an exemplary embodiment of the present application includes the following steps:
20a, the plurality of intelligent audio devices respectively detect the distance information between the intelligent audio devices and the surrounding environment by using the ultrasonic signals.
20b, the plurality of intelligent audio devices respectively report the information of the detected distance between the intelligent audio devices and the surrounding environment to the server.
20c, the plurality of intelligent audio devices detect the distance information between the intelligent audio devices and other intelligent audio devices by combining the Bluetooth signals and the audio signals.
And 20d, the plurality of intelligent audio devices respectively report the detected distance information between the intelligent audio devices and other intelligent audio devices to the server.
And 20e, the server determines the three-dimensional space information of the indoor environment where the intelligent audio devices are located and the relative position information of the intelligent audio devices and the listening area in the indoor environment according to the distance information reported by the intelligent audio devices.
And 20f, the server establishes a sound source direction-playing parameter mapping relation corresponding to each of the plurality of intelligent audio devices according to the three-dimensional space information of the indoor environment and the relative position information of the plurality of intelligent audio devices and the listening area.
And 20g, the server respectively issues the mapping relation between the sound source direction and the playing parameter corresponding to each of the plurality of intelligent audio devices to the corresponding intelligent audio device.
Correspondingly, the plurality of intelligent audio devices receive the mapping relation between the sound source position and the playing parameter, which is sent by the server and corresponds to each other, and store the mapping relation locally.
20h, the plurality of intelligent audio devices respectively receive a voice playing instruction sent by a user, and a target audio signal needing to be played is obtained according to the voice playing instruction, wherein the target audio signal comprises a plurality of target sound source positions.
And 20i, determining the playing parameters corresponding to the target sound source directions by the plurality of intelligent audio devices according to the sound source direction-playing parameter mapping relations corresponding to the intelligent audio devices.
20j, the plurality of intelligent audio devices respectively output sound signals in the target audio signals at the positions of the target sound sources according to the playing parameters corresponding to the positions of the target sound sources, so that a user can hear the sound effect of surround sound in a listening area.
In this embodiment, combine together techniques such as intelligent audio equipment and ultrasonic wave, bluetooth, can be very convenient must utilize intelligent audio equipment to realize encircling the establishment of stereo field, greatly reduced under the indoor environment professional stereo field's the degree of difficulty and cost of establishing to let artificial intelligence and professional stereo can walk into thousands of households simultaneously, satisfy the dual demand of user to intellectuality and high audio.
It should be noted that in some of the flows described in the above embodiments and the drawings, a plurality of operations are included in a specific order, but it should be clearly understood that the operations may be executed out of the order presented herein or in parallel, and the sequence numbers of the operations, such as 201, 202, etc., are merely used for distinguishing different operations, and the sequence numbers do not represent any execution order per se. Additionally, the flows may include more or fewer operations, and the operations may be performed sequentially or in parallel. It should be noted that, the descriptions of "first", "second", etc. in this document are used for distinguishing different messages, devices, modules, etc., and do not represent a sequential order, nor limit the types of "first" and "second" to be different.
Fig. 3 is a schematic structural diagram of an intelligent audio device according to an exemplary embodiment of the present application. As shown in fig. 3, the intelligent audio device includes: a communication component 31, a memory 32, a processor 33, and an audio component 34. The intelligent audio device can be realized as any device capable of playing audio signals, such as an intelligent sound box, a smart phone and the like.
In the present embodiment, the communication component 31 is configured to facilitate communication between the intelligent audio device and other devices (e.g., other intelligent audio devices or servers) in a wired or wireless manner. The intelligent audio device may access a wireless network based on a communication standard, such as WiFi,2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component may support Near Field Communication (NFC) technology, radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, ultra Wideband (UWB) technology, bluetooth (BT) technology, and other technologies. As shown in fig. 3, the communication module 31 includes a bluetooth module, an infrared module, a WiFi module, and the like.
In the present embodiment, the memory 32 may be implemented by any type or combination of volatile or non-volatile memory devices, such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disks, optical disks, or SSD.
Memory 32 is used to store computer programs and may be configured to store other various data to support operations on the intelligent audio device. Examples of such data include instructions for any application, OS or method operating on the intelligent audio device, the intelligent audio device's corresponding sound source bearing-playback parameter mapping relationship, and the like.
In the present embodiment, the communication component 31 may receive a play instruction. The processor 33 is coupled to the memory 32 and may execute computer programs stored in the memory 32 for: acquiring a target audio signal to be played according to a playing instruction received by the communication component 31, wherein the target audio signal comprises a plurality of target sound source positions; according to the mapping relationship between the sound source position and the playing parameter stored in the memory 32, the playing parameter corresponding to each target sound source position of the intelligent audio device is determined and output to the audio component 34.
And the audio component 34 is configured to output a sound signal in each target sound source position in the target audio signal according to the playing parameters corresponding to each target sound source position of the intelligent audio device, so as to cooperate with other intelligent audio devices in the indoor environment where the intelligent audio device is located to generate a surround sound effect in a listening area of the indoor environment.
The mapping relation of the sound source direction and the playing parameter is predetermined according to the environmental characteristics of the indoor environment and the relative position of the intelligent audio equipment and the listening area.
In an optional embodiment, the intelligent audio device of this embodiment supports a voice interaction manner and has an internet access function. Based on this, the processor 33 can specifically obtain the target audio signal to be played from the internet according to the voice playing instruction sent by the user.
Further alternatively, the audio component 34 may receive a voice playing instruction from a user, and transmit the voice playing instruction to the processor 33, and the processor 33 obtains a target audio signal to be played from the internet through the communication component 31 according to the received voice playing instruction.
Further optionally, if the intelligent audio device of this embodiment is a master speaker, the audio component 34 may receive a voice playing instruction sent by a user, and transmit the voice playing instruction to the processor 33, where on one hand, the processor 33 obtains a target audio signal to be played from the internet through the communication component 31 according to the voice playing instruction, and on the other hand, forwards the voice playing instruction to the slave speaker through the communication component 31. If the intelligent audio device of this embodiment is a slave speaker, the audio component 34 may receive a voice playing instruction forwarded by the master speaker, and transmit the voice playing instruction to the processor 33, and the processor 33 obtains a target audio signal to be played from the internet through the communication component 31 according to the voice playing instruction.
In an optional embodiment, the processor 33 may specifically receive a target audio signal transmitted by the audio/video playing device according to a wireless playing instruction sent by the audio/video playing device.
Further optionally, the communication component 31 may receive a wireless playing instruction sent by the audio/video playing device, and transmit the wireless playing instruction to the processor 33, and the processor 33 receives a target audio signal transmitted by the audio/video playing device through the communication component 31 according to the wireless playing instruction.
Further optionally, if the intelligent audio device of this embodiment is a master speaker, the communication component 31 may receive a wireless playing instruction sent by the audio/video playing device, transmit the wireless playing instruction to the processor 33, and the processor 33 receives a target audio signal transmitted by the audio/video playing device through the communication component 31 according to the wireless playing instruction, and forwards the target audio signal to the slave speaker through the communication component 31. If the smart audio device of this embodiment is a slave speaker, the communication component 31 may receive a target audio signal forwarded by a master speaker.
Further optionally, as shown in fig. 3, the intelligent audio device of this embodiment further includes an ultrasonic module 35, and the ultrasonic module 35 may receive and transmit ultrasonic signals. Based on this, the processor 33 may also use the ultrasonic module 35 to receive and transmit ultrasonic signals, detect distance information between the intelligent audio device and the surrounding environment based on the ultrasonic signals, and report the distance information to the server through the communication component 31.
Further optionally, the processor 33 may further utilize the bluetooth module and the audio component to simultaneously send a bluetooth signal and an audio signal to other intelligent audio devices, so that the other intelligent audio devices determine distance information with the intelligent audio device according to the difference between the receiving times of the bluetooth signal and the audio signal, and report the distance information to the server. Correspondingly, the processor 33 may also receive a bluetooth signal and an audio signal simultaneously sent by other intelligent audio devices, determine distance information between the intelligent audio device and other intelligent audio devices according to a difference between receiving times of the bluetooth signal and the audio signal, and report the distance information to the server.
Further, as shown in fig. 3, the intelligent audio device further includes: a display 36, a power supply assembly 37, and the like. Only some of the components are schematically shown in fig. 3, and it is not meant that the intelligent audio device includes only the components shown in fig. 3. In addition, the components shown in dashed boxes in fig. 3 are optional components, not required components.
By adopting the intelligent audio equipment provided by the embodiment, a surrounding stereo sound field can be constructed, the intelligent advantage of the intelligent audio equipment can be exerted, and the playing of audio signals based on the surrounding stereo sound field is also beneficial to improving the playing effect of the intelligent audio equipment.
Accordingly, the present application further provides a computer readable storage medium storing a computer program, where the computer program is capable of implementing the steps that can be executed by the intelligent audio device in the foregoing method embodiments when executed.
Accordingly, the present application further provides a computer-readable storage medium storing a computer program, where the computer program can implement the steps that can be executed by the server in the foregoing method embodiments when executed.
The display in the above embodiments includes a screen, which may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation.
The power supply assembly of the above embodiments provides power to various components of the device in which the power supply assembly is located. The power components may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the device in which the power component is located.
The audio component of the above embodiments may be configured to output and/or input an audio signal. For example, the audio component includes a Microphone (MIC) configured to receive an external audio signal when the device in which the audio component is located is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signal may further be stored in a memory or transmitted via a communication component. In some embodiments, the audio assembly further comprises a speaker for outputting audio signals.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrases "comprising a," "8230," "8230," or "comprising" does not exclude the presence of other like elements in a process, method, article, or apparatus comprising the element.
The above description is only an example of the present application and is not intended to limit the present application. Various modifications and changes may occur to those skilled in the art to which the present application pertains. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims (10)

1. An audio playing method suitable for an indoor environment is characterized in that a plurality of intelligent audio devices are dispersedly deployed around a listening area of the indoor environment; the method comprises the following steps:
the method comprises the steps that first intelligent audio equipment obtains a target audio signal needing to be played according to a received playing instruction, wherein the target audio signal comprises a plurality of target sound source positions;
the first intelligent audio equipment determines playing parameters corresponding to the target sound source directions according to the corresponding sound source direction-playing parameter mapping relation;
the first intelligent audio equipment respectively outputs sound signals in the target sound source directions in the target audio signals according to playing parameters corresponding to the target sound source directions so as to generate surround sound effects in the listening area by matching with other intelligent audio equipment;
the first intelligent audio device is further configured to output a test audio signal according to an initial playing parameter in the first candidate sound source orientation according to a control instruction issued by the server, so that the server corrects the initial playing parameters of the plurality of intelligent audio devices according to the test audio signal; the first intelligent audio device is further configured to re-output a test audio signal according to the modified playing parameters, so that the server establishes a mapping relationship between the first candidate sound source bearing and the modified playing parameters of the plurality of intelligent audio devices according to the re-output test audio signal;
the first intelligent audio device is any one of the multiple intelligent audio devices, the sound source position-playing parameter mapping relationship is predetermined according to the environmental characteristics of the indoor environment and the relative position of the first intelligent audio device and the listening area, and the first candidate sound source position is any one of candidate sound source positions which may occur to the intelligent audio device and is included in the sound source position-playing parameter mapping relationship corresponding to each intelligent audio device.
2. The method of claim 1, wherein the plurality of smart audio devices comprises a master audio enclosure and a slave audio enclosure;
the first intelligent audio device acquires a target audio signal to be played from the internet according to a voice playing instruction sent by a user, and the method comprises the following steps:
if the first intelligent audio equipment is a master sound box, receiving a voice playing instruction sent by a user, acquiring a target audio signal to be played from the Internet according to the voice playing instruction, and forwarding the voice playing instruction to a slave sound box;
if the first intelligent audio equipment is a slave loudspeaker box, receiving the voice playing instruction forwarded by the master loudspeaker box, and acquiring a target audio signal to be played from the Internet;
the first intelligent audio device receives a target audio signal transmitted by the audio and video playing device according to a wireless playing instruction sent by the audio and video playing device, and the method comprises the following steps:
if the first intelligent audio device is a master loudspeaker box, receiving a wireless playing instruction sent by audio and video playing equipment, receiving a target audio signal transmitted by the audio and video playing equipment according to the wireless playing instruction, and forwarding the target audio signal to a slave loudspeaker box;
and if the first intelligent audio equipment is a slave loudspeaker box, receiving the target audio signal forwarded by the master loudspeaker box.
3. The method of claim 1 or 2, further comprising:
the first intelligent audio device detects the distance information between the first intelligent audio device and the surrounding environment by using ultrasonic signals and reports the distance information to the server, and detects the distance information between the first intelligent audio device and other intelligent audio devices by combining Bluetooth signals and audio signals and reports the distance information to the server, so that the server can determine the three-dimensional space information of the indoor environment and the relative position information of the first intelligent audio device and the listening area.
4. The method of claim 3, further comprising:
the first intelligent audio equipment receives the mapping relation between the sound source direction and the playing parameter, which is sent by the server and corresponds to the first intelligent audio equipment; the sound source direction-playing parameter mapping relation is established by the server according to the three-dimensional space information of the indoor environment and the relative position information of the first intelligent audio equipment and the listening area;
wherein, the sound source direction-playing parameter mapping relation includes the playing parameters of the first intelligent audio device in each possible candidate sound source direction.
5. An audio playback system adapted for an indoor environment, comprising: a plurality of intelligent audio devices and a server; the intelligent audio devices are dispersedly deployed around a listening area in the indoor environment;
the plurality of intelligent audio devices are respectively used for acquiring target audio signals to be played according to the received playing instructions, and the target audio signals comprise a plurality of target sound source positions; determining playing parameters corresponding to the target sound source directions according to the sound source direction-playing parameter mapping relations corresponding to the target sound source directions; outputting sound signals in the target sound source directions in the target audio signals according to playing parameters corresponding to the target sound source directions so as to generate a surround sound effect in the listening area;
the server is used for recording equipment arranged in the listening area and controlling the plurality of intelligent audio equipment to output test audio signals according to initial playing parameters in the first candidate sound source direction for the first candidate sound source direction; correcting the initial playing parameters of the plurality of intelligent audio devices according to the test audio signals, and controlling the plurality of intelligent audio devices to output the test audio signals again according to the corrected playing parameters; according to the re-output test audio signal, establishing a mapping relation between the first candidate sound source direction and the corrected playing parameters of the plurality of intelligent audio devices;
the recording equipment is used for collecting surround sound with a stereo effect and the actual sound source position thereof generated by the test audio signals output by the intelligent audio equipment in the listening area and reporting the actual sound source position of the surround sound with the stereo effect to the server;
the mapping relationship between the sound source position and the playing parameter corresponding to each of the plurality of intelligent audio devices is predetermined according to the environmental characteristics of the indoor environment and the relative positions of the plurality of intelligent audio devices and the listening area, and the first candidate sound source position is any sound source position in the sound source position-playing parameter mapping relationship corresponding to each intelligent audio device, wherein the sound source position-playing parameter mapping relationship comprises the candidate sound source positions which may appear in the intelligent audio device.
6. The system of claim 5, wherein prior to controlling the plurality of intelligent audio devices to output the test audio signals according to the initial playback parameters at the first candidate sound source location, the server is further configured to:
establishing a sound source direction-playing parameter mapping relation corresponding to each of the plurality of intelligent audio devices according to the three-dimensional space information of the indoor environment and the relative position information of the plurality of intelligent audio devices and the listening area;
the sound source position-playing parameter mapping relation corresponding to each intelligent audio device comprises playing parameters of the intelligent audio device in each possible candidate sound source position.
7. The system of claim 6, wherein the server is specifically configured to:
calculating initial playing parameters of the intelligent audio equipment under the position of each candidate sound source respectively according to the three-dimensional space information of the indoor environment and the relative position information of the intelligent audio equipment and the listening area;
and establishing a sound source position-playing parameter mapping relation corresponding to each intelligent audio device according to the initial playing parameters of the intelligent audio devices in the candidate sound source positions.
8. The system according to claim 7, wherein the server modifies initial playing parameters of the plurality of intelligent audio devices according to the test audio signal, specifically to:
when the error between the actual sound source position acquired by the sound recording equipment and the first candidate sound source position is not within a set error range, correcting the initial playing parameters of the plurality of intelligent audio equipment;
correspondingly, the server establishes a mapping relationship between the first candidate sound source position and the modified playing parameters of the plurality of intelligent audio devices according to the re-output test audio signal, and is used for:
and establishing a mapping relation between the first candidate sound source position and the corrected playing parameters of the plurality of intelligent audio devices until the error between the actual sound source position acquired by the recording device and the first candidate sound source position is within a set error range.
9. An intelligent audio device, comprising: a communication component, a memory, a processor, and an audio component;
the communication component is used for receiving a playing instruction;
the memory is used for storing a computer program and a sound source direction-playing parameter mapping relation corresponding to the intelligent audio equipment;
the processor to execute the computer program to: acquiring a target audio signal to be played according to a playing instruction received by the communication assembly, wherein the target audio signal comprises a plurality of target sound source positions; determining playing parameters corresponding to the intelligent audio equipment and each target sound source position according to the sound source position-playing parameter mapping relation stored in the memory, and outputting the playing parameters to the audio component;
the audio component is used for outputting sound signals in the target sound source directions in the target audio signals according to the playing parameters corresponding to the intelligent audio equipment and the target sound source directions so as to generate surround sound effects in a listening area of an indoor environment by matching with other intelligent audio equipment in the indoor environment where the intelligent audio equipment is located;
the audio component is further configured to output a test audio signal according to the control instruction received by the communication component and the initial playing parameter in the first candidate sound source direction, so that the server corrects the initial playing parameter of the intelligent audio device according to the test audio signal; the audio component is further configured to re-output a test audio signal according to the modified playing parameter according to the control instruction received by the communication component, so that the server establishes a mapping relationship between the first candidate sound source position and the modified playing parameter of the intelligent audio device according to the re-output test audio signal;
the sound source position-playing parameter mapping relationship is predetermined according to the environmental characteristics of the indoor environment and the relative position of the intelligent audio device and the listening area, and the first candidate sound source position is any one of the candidate sound source positions which may occur to the intelligent audio device and is included in the sound source position-playing parameter mapping relationship corresponding to the intelligent audio device.
10. A computer-readable storage medium storing a computer program, wherein the computer program, when executed by one or more processors, causes the one or more processors to perform the steps of the method of any one of claims 1-4.
CN201910026267.0A 2019-01-11 2019-01-11 Audio playing method, device, system and storage medium suitable for indoor environment Active CN109831735B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910026267.0A CN109831735B (en) 2019-01-11 2019-01-11 Audio playing method, device, system and storage medium suitable for indoor environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910026267.0A CN109831735B (en) 2019-01-11 2019-01-11 Audio playing method, device, system and storage medium suitable for indoor environment

Publications (2)

Publication Number Publication Date
CN109831735A CN109831735A (en) 2019-05-31
CN109831735B true CN109831735B (en) 2022-10-11

Family

ID=66860939

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910026267.0A Active CN109831735B (en) 2019-01-11 2019-01-11 Audio playing method, device, system and storage medium suitable for indoor environment

Country Status (1)

Country Link
CN (1) CN109831735B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110677801B (en) * 2019-08-23 2021-02-23 华为技术有限公司 Sound box control method, sound box and sound box system
CN112492506A (en) * 2019-09-11 2021-03-12 深圳市优必选科技股份有限公司 Audio playing method and device, computer readable storage medium and robot
CN111258530B (en) * 2020-01-09 2022-01-07 珠海格力电器股份有限公司 Audio playing control method, server and audio playing system
CN111586553B (en) * 2020-05-27 2022-06-03 京东方科技集团股份有限公司 Display device and working method thereof
CN112153525B (en) * 2020-08-11 2022-09-16 广东声音科技有限公司 Positioning method and system for multi-loudspeaker panoramic sound effect
CN112083379B (en) * 2020-09-09 2023-10-20 极米科技股份有限公司 Audio playing method and device based on sound source localization, projection equipment and medium
US11895466B2 (en) 2020-12-28 2024-02-06 Hansong (Nanjing) Technology Ltd. Methods and systems for determining parameters of audio devices
CN112312278B (en) * 2020-12-28 2021-03-23 汉桑(南京)科技有限公司 Sound parameter determination method and system
CN113411725B (en) * 2021-06-25 2022-09-02 Oppo广东移动通信有限公司 Audio playing method and device, mobile terminal and storage medium
CN113676720B (en) * 2021-08-04 2023-11-10 Oppo广东移动通信有限公司 Multimedia resource playing method and device, computer equipment and storage medium
CN113660513A (en) * 2021-08-17 2021-11-16 北京小米移动软件有限公司 Method, device and storage medium for synchronizing playing time
CN113889140A (en) * 2021-09-24 2022-01-04 北京有竹居网络技术有限公司 Audio signal playing method and device and electronic equipment
CN114245267B (en) * 2022-02-27 2022-07-08 北京荣耀终端有限公司 Method and system for multi-device cooperative work and electronic device
CN117193702A (en) * 2023-08-10 2023-12-08 深圳市昂晖电子科技有限公司 Playing control method based on embedded player and related device
CN117812504A (en) * 2023-12-29 2024-04-02 恩平市金马士音频设备有限公司 Audio equipment volume data management system and method based on Internet of things

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104869523B (en) * 2014-02-26 2018-03-16 北京三星通信技术研究有限公司 Virtual multiple sound channel plays method, terminal and the system of audio file
CN104185132B (en) * 2014-09-02 2019-02-01 Oppo广东移动通信有限公司 Channel configuration method, intelligent terminal and corresponding system
CN105163241B (en) * 2015-09-14 2018-04-13 小米科技有限责任公司 Audio frequency playing method and device, electronic equipment
CN105959841A (en) * 2016-04-28 2016-09-21 乐视控股(北京)有限公司 Mobile terminal audio playing method, device and headset
CN106303836B (en) * 2016-11-15 2019-10-01 广东小天才科技有限公司 A kind of method and device adjusting played in stereo
CN106817654B (en) * 2017-03-22 2019-10-22 微鲸科技有限公司 Sound effect treatment method, device and multimedia equipment
EP3627852A4 (en) * 2017-05-17 2020-04-29 Sony Corporation Sound output control device, sound output control method, and program

Also Published As

Publication number Publication date
CN109831735A (en) 2019-05-31

Similar Documents

Publication Publication Date Title
CN109831735B (en) Audio playing method, device, system and storage medium suitable for indoor environment
US20230379644A1 (en) Audio Playback Settings for Voice Interaction
US11979960B2 (en) Contextualization of voice inputs
US11877139B2 (en) Playback device calibration
US11816393B2 (en) Dynamic computation of system response volume
US10880644B1 (en) Three-dimensional beam forming with a microphone array
US10750304B2 (en) Calibration of audio playback devices
US10034116B2 (en) Acoustic position measurement
US11184702B2 (en) Systems and methods of user localization
US11728780B2 (en) Audio calibration of a portable playback device
US20240080637A1 (en) Calibration of Audio Playback Devices

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant