CN111182382A

CN111182382A - Audio playing control method based on sound box and sound box

Info

Publication number: CN111182382A
Application number: CN201911003429.5A
Authority: CN
Inventors: 易发
Original assignee: Guangdong Genius Technology Co Ltd
Current assignee: Guangdong Genius Technology Co Ltd
Priority date: 2019-10-22
Filing date: 2019-10-22
Publication date: 2020-05-19

Abstract

The embodiment of the application discloses an audio playing control method based on a sound box and the sound box, wherein the method comprises the following steps: when an audio playing instruction is received, acquiring a distance value and a direction angle of the sound box relative to a user; when the distance value of the sound box relative to the user is not a preset distance value and the direction angle of the sound box relative to the user is not a preset direction angle, controlling the sound box to move to a preset position, wherein the sound effect of the sound box at the preset position is optimal; and playing preset audio. By implementing the embodiment of the application, the use experience of the user can be improved.

Description

Audio playing control method based on sound box and sound box

Technical Field

The application relates to the technical field of sound boxes, in particular to an audio playing control method based on a sound box and the sound box.

Background

Most of the sound boxes on the market are responsible for converting audio signals into sound signals, so with the prosperity of the entertainment market, the first function carried by the sound boxes is music playing. In practice, it is found that when a user enjoys music by using a sound box, the sound box often needs to be moved manually to adjust the relative position of the sound box and the user in order to obtain a good sound effect. Under this condition, if the position that the user listened to the music at every turn is not fixed, in order to guarantee good audio experience, required the user to move the audio amplifier at every turn by hand, seriously influenced user's use experience.

Disclosure of Invention

The embodiment of the application discloses an audio playing control method based on a sound box and the sound box, which can improve the use experience of a user.

The first aspect of the embodiment of the application discloses an audio playing control method based on a sound box, which comprises the following steps:

when an audio playing instruction is received, acquiring a distance value and a direction angle of the sound box relative to a user;

when the distance value is not a preset distance value and the direction angle is not a preset direction angle, controlling the sound box to move to a preset position; the sound effect of the sound box at the preset position is optimal;

and playing preset audio.

As an optional implementation manner, in the first aspect of this embodiment of the present application, before the playing the preset audio, the method further includes:

collecting a user image;

obtaining a user identity by identifying the user image;

determining a playlist matching the user identity;

obtaining an emotion identifier of the user by analyzing the user image;

and taking the audio corresponding to the emotion identification in the play list as preset audio.

As an optional implementation manner, in the first aspect of this embodiment of the present application, after the obtaining, when the audio playing instruction is received, a distance value and a direction angle of the sound box with respect to the user, the method further includes:

scanning the environment of the sound box to construct a three-dimensional model;

marking a user coordinate point in the three-dimensional model;

determining a preset coordinate point of the sound box in the three-dimensional model according to a preset rule and the user coordinate point;

determining a preset distance value and a preset direction angle according to the preset coordinate point and the user coordinate point;

when the distance value is not for predetermineeing the distance value, just when the direction angle is not for predetermineeing the direction angle, control the audio amplifier removes to preset position, include:

when the distance value is not a preset distance value and the direction angle is not a preset direction angle, determining a moving route according to the preset coordinate point and the current coordinate point of the sound box;

and controlling the sound box to move to a preset position according to the indication of the moving route.

As an optional implementation manner, in the first aspect of the embodiment of the present application, after controlling the sound box to move to the preset position when the distance value is not the preset distance value and the direction angle is not the preset direction angle, the method further includes:

detecting whether a projection instruction sent by VR glasses is received;

if the projection instruction is received, obtaining a VR video associated with a preset audio, wherein a corresponding relation exists between a video frame of the VR video and an audio frame of the preset audio;

and according to the corresponding relation, projecting and displaying the VR video while playing the preset audio.

As an optional implementation manner, in the first aspect of the embodiment of the present application, if the preset audio is a story before the child falls asleep, when the distance value is not the preset distance value and the direction angle is not the preset direction angle, the method further includes controlling the sound box to move to a preset position and before playing the preset audio:

adjusting the lighting equipment of the environment where the sound box is located to preset brightness;

after the preset audio is played, the method further comprises:

detecting whether a sleep instruction sent by wrist wearable equipment worn by a user is received;

and if the sleep instruction is received, stopping playing the preset audio and controlling the lighting equipment to be closed.

A second aspect of the embodiments of the present application discloses a sound box, including:

the acquisition unit is used for acquiring a distance value and a direction angle of the sound box relative to a user when an audio playing instruction is received;

the first control unit is used for controlling the sound box to move to a preset position when the distance value is not a preset distance value and the direction angle is not a preset direction angle; the sound effect of the sound box at the preset position is optimal;

and the playing unit is used for playing the preset audio.

As an optional implementation manner, in the second aspect of the embodiments of the present application, the sound box further includes:

the identification unit is used for acquiring a user image before the playing unit plays the preset audio and acquiring the user identity by identifying the user image;

and the analysis unit is used for determining a play list matched with the user identity, obtaining an emotion identifier of the user by analyzing the user image, and taking the audio corresponding to the emotion identifier in the play list as preset audio.

the model building unit is used for scanning the environment where the sound box is located to build a three-dimensional model after the distance value and the direction angle of the sound box relative to the user are obtained when the obtaining unit receives the audio playing instruction;

the marking unit is used for marking a user coordinate point in the three-dimensional model and determining a preset coordinate point of the sound box in the three-dimensional model according to a preset rule and the user coordinate point;

the determining unit is used for determining a preset distance value and a preset direction angle according to the preset coordinate point and the user coordinate point;

the first control unit is used for controlling the loudspeaker box to move to a preset position in a mode that when the distance value is not a preset distance value and the direction angle is not a preset direction angle:

the first control unit is used for determining a moving route according to the preset coordinate point and the current coordinate point of the sound box when the distance value is not the preset distance value and the direction angle is not the preset direction angle, and controlling the sound box to move to the preset position according to the indication of the moving route.

the first detection unit is used for detecting whether a projection instruction sent by VR glasses is received or not after the sound box is controlled to move to a preset position when the distance value is not a preset distance value and the direction angle is not a preset direction angle by the first control unit;

the projection unit is used for acquiring a VR video associated with a preset audio when the projection instruction is received, and the video frame of the VR video and the audio frame of the preset audio have a corresponding relation; and according to the corresponding relation, projecting and displaying the VR video while playing the preset audio.

As an optional implementation manner, in the second aspect of the embodiment of the present application, if the preset audio is a pre-sleep story of a child, the sound box further includes:

the second control unit is used for controlling the lighting equipment of the environment where the sound box is located to be adjusted to preset brightness after the sound box is moved to the preset position and before the preset audio is played by the playing unit when the distance value is not the preset distance value and the direction angle is not the preset direction angle by the first control unit;

the second detection unit is used for detecting whether a sleep instruction sent by wrist wearable equipment worn by a user is received or not after the preset audio is played by the playing unit;

the second control unit is further configured to terminate playing the preset audio and control the lighting device to turn off when the sleep instruction is received.

A third aspect of the embodiments of the present application discloses an electronic device, including:

a memory storing executable program code;

a processor coupled with the memory;

the processor calls the executable program code stored in the memory to perform part or all of the steps of any one of the methods of the first aspect of the present application.

A fourth aspect of embodiments of the present application discloses a computer-readable storage medium storing a computer program comprising a program code for performing some or all of the steps of any one of the methods of the first aspect of the present application.

A fifth aspect of embodiments of the present application discloses a computer program product, which, when run on a computer, causes the computer to perform part or all of the steps of any one of the methods of the first aspect.

A sixth aspect of embodiments of the present application discloses an application issuing system, configured to issue a computer program product, where the computer program product is configured to, when run on a computer, cause the computer to perform part or all of the steps of any one of the methods of the first aspect.

Compared with the prior art, the embodiment of the application has the following beneficial effects:

in the embodiment of the application, when an audio playing instruction is received, the distance value and the direction angle of the sound box relative to a user are obtained; when the distance value of the sound box relative to the user is not a preset distance value and the direction angle of the sound box relative to the user is not a preset direction angle, controlling the sound box to move to a preset position, wherein the sound effect of the sound box at the preset position is optimal; and playing preset audio. According to the embodiment of the application, the sound box is automatically adjusted to the optimal preset position of the sound effect before the audio is played based on the preset direction angle and the preset distance value, the sound box does not need to be manually moved by a user, and the use experience of the user is favorably improved.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without making a creative effort.

Fig. 1 is a schematic flowchart of an audio playback control method based on a sound box disclosed in an embodiment of the present application;

fig. 2 is a schematic flowchart of another audio playback control method based on a sound box disclosed in the embodiment of the present application;

fig. 3 is a schematic flowchart of another audio playing control method based on a sound box disclosed in the embodiment of the present application;

fig. 4 is a schematic structural diagram of a sound box disclosed in the embodiment of the present application;

FIG. 5 is a schematic structural diagram of another loudspeaker disclosed in the embodiments of the present application;

FIG. 6 is a schematic structural diagram of another speaker disclosed in the embodiments of the present application;

fig. 7 is a schematic structural diagram of another sound box disclosed in the embodiment of the present application.

Detailed Description

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

It should be noted that the terms "comprises," "comprising," and any variations thereof in the embodiments and drawings of the present application are intended to cover non-exclusive inclusions. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements listed, but may alternatively include other steps or elements not listed, or inherent to such process, method, article, or apparatus.

The embodiment of the application discloses an audio playing control method based on a sound box and the sound box, which can improve the use experience of a user. The detailed description is as follows.

Example one

Referring to fig. 1, fig. 1 is a schematic flowchart illustrating an audio playing control method based on a speaker according to an embodiment of the present application. The audio playing control method based on the sound box shown in fig. 1 may specifically include the following steps:

101. and when an audio playing instruction is received, acquiring a distance value and a direction angle of the sound box relative to a user.

In this embodiment of the application, the audio playing instruction may be input by a voice of a user, or may be input by a gesture through a mobile terminal by the user, which is not limited in this embodiment of the application.

If the audio playing command is input by the user through voice, before step 101, the following steps may be further performed: the method comprises the steps of detecting whether user voice is collected or not by utilizing microphone equipment of a sound box, extracting voiceprint characteristics of the user voice when the user voice is collected, extracting keywords of the user voice when the voiceprint characteristics are matched with preset legal voiceprint characteristics of the sound box, and determining that an audio playing instruction is received when the keywords are preset keywords for indicating audio playing.

Optionally, the sound box disclosed in the embodiment of the present application may include a microphone array, and when the user inputs the audio playing instruction by voice, the user position may be located by the microphone array included in the sound box, so as to obtain a distance value and a direction angle of the sound box relative to the user. The method for locating the position of the user by using the microphone array can comprise the following steps:

acquiring a vector position relation of microphone equipment contained in a microphone array;

recording the time point of each microphone device in the microphone array receiving the user voice;

setting any microphone device in a microphone array as a reference microphone device, and obtaining sound collection time delay of a non-reference microphone device relative to the reference microphone device according to the time point of each microphone device in the microphone array for receiving user voice;

and analyzing the vector position relation of the microphone equipment contained in the microphone array and the sound acquisition time delay of the non-reference microphone equipment relative to the reference microphone equipment to obtain the user position.

By implementing the method, when the user inputs the audio playing instruction through voice, the user position is obtained based on the microphone array, and the positioning precision of the user position can be ensured.

If the audio playing instruction is input by the user through a gesture by using the mobile terminal, the specific implementation manner may be: the mobile terminal detects a user gesture by using a built-in sensor device, when the user gesture is a preset gesture, an audio playing instruction carrying a device identifier is input to the sound box according to a preset mode, and when the sound box judges that the device identifier is a legal device identifier, the sound box determines that the audio playing instruction is received. The preset mode may be WLAN or WIFI, and based on the description, the mode of obtaining the distance value and the direction angle of the sound box relative to the user may be obtained by scanning and modeling through a camera module of the sound box, specifically, please refer to the description below in step 102. It should be noted that the mobile terminal may be an electronic device such as a mobile phone, a tablet computer, or a smart watch, and the operating systems of various electronic devices may include, but are not limited to, an Android operating system, an IOS operating system, a Symbian operating system, a Black Berry operating system, a windows phone8 operating system, and the like.

Based on the above description, when the user inputs the audio playing instruction by voice, the user legality is verified by the identification of the voiceprint features, and when the user inputs the audio playing instruction by using the mobile terminal through gestures, the user legality is verified by the identification device identifier, so that the loudspeaker box can be effectively prevented from being illegally used.

102. When the distance value of the sound box relative to the user is not a preset distance value and the direction angle of the sound box relative to the user is not a preset direction angle, controlling the sound box to move to a preset position; wherein, the sound effect of the sound box at the preset position is optimal.

As an optional implementation manner, in this embodiment, after step 101, the following steps may also be performed:

and scanning the environment of the sound box to construct a three-dimensional model.

And marking a user coordinate point in the three-dimensional model.

and determining a preset distance value and a preset direction angle according to the preset coordinate point and the user coordinate point.

Based on the above description, when the distance value of the sound box relative to the user is not the preset distance value, and the direction angle of the sound box relative to the user is not the preset direction angle, controlling the sound box to move to the preset position may include:

and when the distance value of the sound box relative to the user is not the preset distance value and the direction angle of the sound box relative to the user is not the preset direction angle, determining the moving route according to the preset coordinate point and the current coordinate point of the sound box.

In this application embodiment, above-mentioned rule of predetermineeing can put the rule for the audio amplifier, puts rule, audio amplifier for user's distance value and audio amplifier for user's direction angle according to this audio amplifier, can obtain the best audio position of audio amplifier in arbitrary enclosure space, predetermines the position promptly, and when the audio amplifier was predetermineeing position department, the user can obtain the best audio and experience.

As an optional implementation manner, in the application example, after controlling the sound box to move to the preset position according to the instruction of the moving route, the following steps may be further performed:

acquiring a distance value from the ear of a user to the ground;

and adjusting the height of the sound box to a preset height according to the distance value between the ears of the user and the ground.

By implementing the method, when the sound box moves to the preset position with the best sound effect, the height of the sound box is adjusted according to the distance value between the ears of the user and the ground, and the sound effect experience of the user can be further improved.

103. And playing preset audio.

By implementing the method, the sound box can be automatically adjusted to the preset position with the best sound effect before the audio is played, the sound box does not need to be manually moved by a user, the use experience feeling of the user is favorably improved, the height of the sound box can be adjusted according to the distance value between the ear of the user and the ground when the sound box moves to the preset position with the best sound effect, the sound effect experience feeling of the user is further improved, the sound box can be effectively prevented from being illegally used based on the recognition of voiceprint features or equipment identification, and the accurate positioning of the position of the user can be realized through the microphone array.

Example two

Referring to fig. 2, fig. 2 is a schematic flowchart illustrating another audio playback control method based on a speaker according to an embodiment of the present application. The audio playing control method based on the sound box shown in fig. 2 may specifically include the following steps:

201. and when an audio playing instruction is received, acquiring a distance value and a direction angle of the sound box relative to a user.

202. When the distance value of the sound box relative to the user is not a preset distance value and the direction angle of the sound box relative to the user is not a preset direction angle, controlling the sound box to move to a preset position; wherein, the sound effect of the sound box at the preset position is optimal.

For detailed descriptions of step 201 to step 202, please refer to the description of step 101 to step 102 in the first embodiment, which is not described again in this embodiment.

203. A user image is acquired.

204. And obtaining the user identity by identifying the user image.

205. And determining a playlist matched with the user identity.

206. And analyzing the user image to obtain the emotion identification of the user.

In the first embodiment, the user identity may be obtained by using the voiceprint feature and the device identifier in the same way as the voiceprint feature and the device identifier. In the embodiment of the application, the expression features can be extracted from the user image, and the extracted expression features are analyzed by using a pre-designed convolutional neural network to obtain the emotion identification of the user.

207. And taking the audio corresponding to the emotion identification in the playlist as preset audio.

208. And playing preset audio.

In the embodiment of the application, the step 203 to the step 208 are executed, the personalized recommendation of the preset audio is realized through the user identity recognition and the user emotion recognition, and the user emotion can be improved under the condition that the user emotion is not good.

As an optional implementation manner, in the embodiment of the present application, after the step 207 is executed, the following steps may also be executed:

detecting whether a projection instruction sent by VR glasses is received;

if the projection instruction is received, obtaining a VR video associated with the preset audio, wherein a corresponding relation exists between a video frame of the VR video and an audio frame of the preset audio;

and according to the corresponding relation, projecting and displaying the VR video while playing preset audio.

In this application embodiment, the VR glasses can send a projection instruction to the speaker when detecting that the VR glasses are in a wearing state. By implementing the method, when the preset audio is played, the user can watch the VR video associated with the preset audio through the VR glasses, and the improvement effect of the emotion of the user is improved.

By implementing the method, the sound box can be automatically adjusted to the preset position with the best sound effect before the audio is played, the sound box does not need to be manually moved by a user, the use experience of the user can be improved, the height of the sound box can be adjusted according to the distance value between the ear of the user and the ground when the sound box moves to the preset position with the best sound effect, the sound effect experience of the user is further improved, the sound box can be effectively prevented from being illegally used based on the recognition of voiceprint features or equipment identification, the accurate positioning of the position of the user can be realized through the microphone array, the emotion of the user can be improved based on the personalized recommendation of the preset audio, and the emotion improvement effect of the user can be improved.

EXAMPLE III

Referring to fig. 3, fig. 3 is a schematic flowchart of another audio playback control method based on a speaker disclosed in the embodiment of the present application. The preset audio in the audio playing control method based on the sound box shown in fig. 3 may be a pre-sleep story of a child, and specifically may include the following steps:

for detailed descriptions of steps 301 to 307, please refer to steps 201 to 207 in embodiment two, which is not described in detail in this embodiment.

308. And adjusting the lighting equipment of the environment where the sound box is located to preset brightness.

309. And playing preset audio.

Optionally, in this embodiment of the application, before step 309 is executed, a target tone selected by the user may be determined, and a preset audio is played according to the target tone, where the target tone may be a tone of a family member such as dad or mom, so that the child has a sense of intimacy when listening to a pre-sleep story.

310. Detecting whether a sleep instruction sent by wrist wearable equipment worn by a user is received, if so, executing step 311; if not, the flow is ended.

311. And stopping playing the preset audio and controlling the lighting equipment of the environment where the sound box is positioned to be closed.

The preset brightness in the embodiment of the application is suitable for children to sleep, the wrist wearable device can judge whether the user is in a sleep state through obtaining the motion frequency of the arm of the user, the heart rate and other factors, and when the user is judged to be in the sleep state, the sleep instruction is sent to the sound box, the lighting device of the environment where the sound box is located is controlled to be turned off, the sound box and the lighting device are turned off in time, and energy saving is facilitated. In addition, in the second embodiment, it is mentioned that the VR video associated with the preset audio can be projected and displayed while the preset audio is played, and then, the projection of the VR video is terminated while the playing of the preset audio is terminated.

By implementing the method, the sound box can be automatically adjusted to the preset position with the best sound effect before the audio is played, the sound box does not need to be manually moved by a user, the use experience of the user can be improved, the height of the sound box can be adjusted according to the distance value between the ear of the user and the ground when the sound box moves to the preset position with the best sound effect, the sound effect experience of the user is further improved, the sound box can be effectively prevented from being illegally used based on the recognition of voiceprint features or equipment identification, the accurate positioning of the position of the user can be realized through the microphone array, the emotion of the user can be improved based on the personalized recommendation of the preset audio, the emotion improvement effect of the user can be improved, and the energy saving is facilitated.

Example four

Referring to fig. 4, fig. 4 is a schematic structural diagram of a sound box disclosed in the embodiment of the present application. As shown in fig. 4, the sound box may include:

the obtaining unit 401 is configured to obtain a distance value and a direction angle of the sound box relative to the user when receiving the audio playing instruction.

In this embodiment of the application, if the audio playing instruction is input by a user voice, the obtaining unit 401 may be further configured to, when the audio playing instruction is received, detect whether the user voice is collected by using a microphone device of the sound box before obtaining a distance value and a direction angle of the sound box relative to the user, extract a voiceprint feature of the user voice when the user voice is collected, extract a keyword of the user voice when the voiceprint feature matches a legal voiceprint feature preset by the sound box, and determine that the audio playing instruction is received when the keyword is a preset keyword for instructing to play audio.

Optionally, the sound box disclosed in the embodiment of the present application may include a microphone array, and the manner that the obtaining unit 401 is configured to obtain the distance value and the direction angle of the sound box relative to the user when receiving the audio playing instruction may specifically be: the obtaining unit 401 is configured to, when receiving an audio playing instruction, locate a user position through a microphone array of the sound box, and further obtain a distance value and a direction angle of the sound box relative to the user. The manner in which the obtaining unit 401 is configured to locate the position of the user by using the microphone array is specifically: an obtaining unit 401, configured to obtain a vector position relationship of microphone devices included in a microphone array; recording the time point of each microphone device in the microphone array receiving the user voice; setting any one microphone device in the microphone array as a reference microphone device, and obtaining the sound collection time delay of the non-reference microphone device relative to the reference microphone device according to the time point of each microphone device in the microphone array for receiving the user voice; and analyzing the vector position relation of the microphone equipment contained in the microphone array and the sound collection time delay of the non-reference microphone equipment relative to the reference microphone equipment to obtain the user position. By implementing the method, when the user inputs the audio playing instruction by voice, the user position is obtained based on the microphone array, and the positioning precision of the user position can be ensured.

In this embodiment of the application, if the audio playing instruction is input by the user through a gesture by using the mobile terminal, the specific implementation manner may be: the mobile terminal detects a user gesture by using a built-in sensor device, when the user gesture is a preset gesture, an audio playing instruction carrying a device identifier is input to the sound box according to a preset mode, and when the sound box judges that the device identifier is a legal device identifier, the sound box determines that the audio playing instruction is received. The preset mode may be WLAN or WIFI, and based on the description, the mode of obtaining the distance value and the direction angle of the sound box relative to the user may be obtained by scanning and modeling through a camera module of the sound box, specifically, please refer to the description below in step 102.

A first control unit 402, configured to control the sound box to move to a preset position when the distance value of the sound box with respect to the user is not the preset distance value and the direction angle of the sound box with respect to the user is not the preset direction angle; wherein, the sound effect of the sound box at the preset position is optimal.

In the embodiment of the present application, please refer to the description in embodiment two for the description of the preset rule, which is not repeated herein.

A playing unit 403, configured to play preset audio.

As an optional implementation manner, in this application embodiment, the sound box may further include:

and a model building unit, configured to, after the obtaining unit 401 obtains the distance value and the direction angle of the sound box relative to the user when receiving the audio playing instruction, scan an environment where the sound box is located, and build a three-dimensional model.

And the marking unit is used for marking the user coordinate point in the three-dimensional model and determining the preset coordinate point of the sound box in the three-dimensional model according to a preset rule and the user coordinate point.

And the determining unit is used for determining a preset distance value and a preset direction angle according to the preset coordinate point and the user coordinate point.

The first control unit 402 is configured to, when the distance value of the sound box relative to the user is not the preset distance value, and the direction angle of the sound box relative to the user is not the preset direction angle, control the sound box to move to the preset position in a specific manner:

the first control unit 402 is configured to determine a moving route according to the preset coordinate point and the current coordinate point of the sound box when the distance value of the sound box relative to the user is not the preset distance value and the direction angle of the sound box relative to the user is not the preset direction angle, and control the sound box to move to the preset position according to an instruction of the moving route.

As an optional implementation manner, in the application example, the first control unit 402 may be further configured to obtain a distance value between the ear of the user and the ground after controlling the sound box to move to the preset position according to the instruction of the moving route; and adjusting the height of the sound box to a preset height according to the distance value between the ears of the user and the ground. Through implementing this mode, when the audio amplifier moves the best preset position of audio, adjust the audio amplifier height according to the distance value of user's ear apart from ground, can further improve user's audio experience and feel.

Through implementing above-mentioned audio amplifier, can be before the broadcast audio frequency, automatic adjustment audio amplifier to the best preset position department of audio, need not the manual audio amplifier of moving of user, be favorable to improving user's use and experience the sense, can also move when the audio amplifier reaches the best preset position of audio, adjust the audio amplifier height according to the distance value of user's ear distance ground, further improve user's audio and experience the sense, can also be based on the discernment of voiceprint characteristic or equipment sign, effectively avoid the audio amplifier by illegal use, can also realize the accurate positioning of user's position through microphone array.

EXAMPLE five

Referring to fig. 5, fig. 5 is a schematic structural diagram of another sound box disclosed in the embodiment of the present application. The sound box shown in fig. 5 is optimized from the sound box shown in fig. 4, and as shown in fig. 5, the sound box may further include:

the recognizing unit 404 is configured to capture a user image before the playing unit 403 plays the preset audio, and obtain a user identity by recognizing the user image.

In this embodiment of the application, the first control unit 402 is further configured to, when the distance value of the sound box relative to the user is not the preset distance value and the direction angle of the sound box relative to the user is not the preset direction angle, control the sound box to move to the preset position and then send a start instruction to the recognition unit 404, so as to trigger the recognition unit 404 to execute the above-mentioned user image acquisition, and obtain the user identity by recognizing the user image.

The analyzing unit 405 is configured to determine a playlist matching the user identity, obtain an emotion identifier of the user by analyzing the user image, and use an audio corresponding to the emotion identifier in the playlist as a preset audio.

In the embodiment of the application, the preset audio is personalized and recommended through user identity recognition and user emotion recognition, and the emotion of the user can be improved under the condition that the emotion of the user is not good.

the first detection unit is used for detecting whether a projection instruction sent by the VR glasses is received or not after the first control unit 402 controls the sound box to move to a preset position when the distance value of the sound box relative to the user is not a preset distance value and the direction angle of the sound box relative to the user is not a preset direction angle;

the projection unit is used for acquiring a VR video associated with a preset audio when the projection instruction is received, and the video frame of the VR video and the audio frame of the preset audio have a corresponding relation; and according to the corresponding relation, projecting and displaying the VR video while playing preset audio.

In this application embodiment, the VR glasses can send a projection instruction to the speaker when detecting that the VR glasses are in a wearing state. Through implementing the above mode, when the audio frequency is predetermine in the broadcast, the user can also watch the VR video of predetermineeing audio frequency relevance through VR glasses, is favorable to improving the improvement effect of user's mood.

Through implementing above-mentioned audio amplifier, can be before playing the audio frequency, automatic adjustment audio amplifier to the best preset position department of audio, need not user's manual sound box of moving, be favorable to improving user's use and experience the sense, can also when the audio amplifier moves to the best preset position of audio, adjust the audio amplifier height according to the distance value of user's ear distance ground, further improve user's audio experience sense, can also be based on the discernment of voiceprint characteristic or equipment sign, effectively avoid the audio amplifier by illegal use, can also realize the accurate location of user's position through the microphone array, still can be based on the individualized recommendation improvement user's mood of presetting the audio frequency, still be favorable to improving user's mood's improvement effect.

EXAMPLE six

Referring to fig. 6, fig. 6 is a schematic structural diagram of another sound box disclosed in the embodiment of the present application. The sound box shown in fig. 6 is obtained by optimizing the sound box shown in fig. 5, and the sound box shown in fig. 6 may further include:

and a second control unit 406, configured to, when the distance value of the sound box with respect to the user is not the preset distance value and the direction angle of the sound box with respect to the user is not the preset direction angle, control the sound box to move to the preset position and adjust the lighting device in the environment where the sound box is located to the preset brightness after the sound box is moved to the preset position and before the preset audio is played by the playing unit 403, by the first control unit 402.

Specifically, in this embodiment of the application, the analysis unit 405 is further configured to send a start instruction to the second control unit 406 after the audio corresponding to the emotion identifier in the playlist is used as a preset audio, so as to trigger the second control unit 406 to adjust the lighting device of the environment where the sound box is located to a preset brightness; the playing unit 403 is specifically configured to play a preset audio after the second control unit 406 adjusts the lighting device of the environment where the sound box is located to a preset brightness.

Optionally, the playing unit 403 may be further configured to determine a target tone color selected by the user before playing the preset audio, and play the preset audio according to the target tone color. The target tone can be the tone of family members such as dad or mom, so that children can feel more intimate before sleeping.

The second detecting unit 407 is configured to detect whether a sleep instruction sent by the wrist wearable device worn by the user is received after the playing unit 403 plays the preset audio.

The second control unit 406 is further configured to terminate playing the preset audio when receiving the sleep instruction, and control the lighting device in the environment where the sound box is located to turn off.

Through implementing above-mentioned audio amplifier, can be before broadcasting the audio frequency, automatic adjustment audio amplifier to the best preset position department of audio, need not user's manual sound box of moving, be favorable to improving user's use and experience the sense, can also when the audio amplifier moves to the best preset position of audio, adjust the audio amplifier height according to the distance value of user's ear distance ground, further improve user's audio experience sense, can also be based on the discernment of voiceprint characteristic or equipment sign, effectively avoid the audio amplifier by illegal use, can also realize the accurate location of user's position through the microphone array, still can be based on the individualized recommendation improvement user's mood of presetting the audio frequency, still be favorable to improving user's mood's improvement effect, be favorable to the energy can be.

Referring to fig. 7, fig. 7 is a schematic structural diagram of a sound box disclosed in the embodiment of the present application. As shown in fig. 7, the sound box may include:

a memory 701 in which executable program code is stored;

a processor 702 coupled to the memory 701;

the processor 702 calls the executable program code stored in the memory 701 to execute any one of the sound box-based audio playing control methods shown in fig. 1 to 3.

The embodiment of the application discloses a computer-readable storage medium, which stores a computer program, wherein the computer program enables a computer to execute any one of the audio playing control methods based on the sound box in fig. 1 to 3.

The embodiment of the application discloses a computer program product, which enables a computer to execute any one of the audio playing control methods based on the sound box in fig. 1-3 when the computer program product runs on the computer.

The embodiment of the application publishing system is used for publishing a computer program product, wherein when the computer program product runs on a computer, the computer is enabled to execute any one of the audio playing control methods based on the sound box in the figures 1 to 3.

It will be understood by those skilled in the art that all or part of the steps in the methods of the embodiments described above may be implemented by instructions associated with a program, which may be stored in a computer-readable storage medium, where the storage medium includes Read-Only Memory (ROM), Random Access Memory (RAM), Programmable Read-Only Memory (PROM), Erasable Programmable Read-Only Memory (EPROM), One-time Programmable Read-Only Memory (OTPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), compact disc-Read-Only Memory (CD-ROM), or other Memory, magnetic disk, magnetic tape, or magnetic tape, Or any other medium which can be used to carry or store data and which can be read by a computer.

The foregoing describes in detail an audio playing control method based on a sound box and the sound box disclosed in the embodiments of the present application, and specific examples are applied herein to explain the principle and implementation of the present application, and the size of the step number in the specific examples does not mean that the execution sequence is necessarily sequential, and the execution sequence of each process should be determined by the function and the inherent logic of the process, but should not limit the implementation process of the embodiments of the present application. The units described as separate parts may or may not be physically separate, and some or all of the units may be selected according to actual needs to achieve the purpose of the embodiment.

The character "/" herein generally indicates that the former and latter associated objects are in an "or" relationship. In the embodiments provided herein, it should be understood that "B corresponding to a" means that B is associated with a from which B can be determined. It should also be understood, however, that determining B from a does not mean determining B from a alone, but may also be determined from a and/or other information. In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit. If the integrated unit is implemented as a software functional unit and sold or used as a stand-alone product, it may be stored in a memory accessible to a computer. Based on such understanding, the technical solution of the present application, which is a part of or contributes to the prior art in essence, or all or part of the technical solution, may be embodied in the form of a software product, stored in a memory, including several requests for causing a computer device (which may be a personal computer, a server, a network device, or the like, and may specifically be a processor in the computer device) to execute part or all of the steps of the above-described method of the embodiments of the present application.

The above description of the embodiments is only for the purpose of helping to understand the method of the present application and its core ideas; meanwhile, for a person skilled in the art, according to the idea of the present application, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present application.

Claims

1. A sound box-based audio playing control method is characterized by comprising the following steps:

and playing preset audio.

2. The method of claim 1, wherein before playing the preset audio, the method further comprises:

collecting a user image;

obtaining a user identity by identifying the user image;

determining a playlist matching the user identity;

obtaining an emotion identifier of the user by analyzing the user image;

3. The method according to claim 1 or 2, wherein after obtaining the distance value and the direction angle of the sound box relative to the user when receiving the audio playing instruction, the method further comprises:

marking a user coordinate point in the three-dimensional model;

4. The method according to claim 1, wherein when the distance value is not a preset distance value and the direction angle is not a preset direction angle, after controlling the sound box to move to a preset position, the method further comprises:

detecting whether a projection instruction sent by VR glasses is received;

5. The method of claim 4, wherein if the predetermined audio is a pre-sleep story of the child, when the distance value is not a predetermined distance value and the direction angle is not a predetermined direction angle, the method further comprises controlling the speaker to move to a predetermined position after the speaker is moved to the predetermined position and before the predetermined audio is played:

after the preset audio is played, the method further comprises:

6. An acoustic enclosure, comprising:

and the playing unit is used for playing the preset audio.

7. An acoustic enclosure according to claim 6, further comprising:

8. An acoustic enclosure according to claim 6 or 7, further comprising:

9. An acoustic enclosure according to claim 6, further comprising:

10. The audio enclosure of claim 9, wherein if the predetermined audio is a pre-sleep story of a child, the audio enclosure further comprises:

11. An acoustic enclosure, comprising:

a memory storing executable program code;

a processor coupled with the memory;

the processor calls the executable program code stored in the memory to execute the steps of the audio playing control method based on the loudspeaker box of any one of claims 1 to 5.

12. A computer readable storage medium having stored thereon computer instructions, which when executed, cause a computer to execute the steps of the speaker-based audio playback control method according to any one of claims 1 to 5.