WO2023061145A1 - 立体声音箱系统的组建方法及相关装置 - Google Patents
立体声音箱系统的组建方法及相关装置 Download PDFInfo
- Publication number
- WO2023061145A1 WO2023061145A1 PCT/CN2022/119134 CN2022119134W WO2023061145A1 WO 2023061145 A1 WO2023061145 A1 WO 2023061145A1 CN 2022119134 W CN2022119134 W CN 2022119134W WO 2023061145 A1 WO2023061145 A1 WO 2023061145A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speaker
- sound box
- acceleration
- sound
- sequence
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 73
- 230000033001 locomotion Effects 0.000 claims abstract description 118
- 230000009471 action Effects 0.000 claims abstract description 117
- 230000001133 acceleration Effects 0.000 claims description 189
- 230000005236 sound signal Effects 0.000 claims description 62
- 230000015654 memory Effects 0.000 claims description 25
- 230000000694 effects Effects 0.000 claims description 18
- 238000012216 screening Methods 0.000 claims description 16
- 238000004891 communication Methods 0.000 claims description 13
- 230000001795 light effect Effects 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 8
- 238000010276 construction Methods 0.000 abstract description 7
- 230000001960 triggered effect Effects 0.000 abstract description 6
- 230000003993 interaction Effects 0.000 abstract 1
- 238000013461 design Methods 0.000 description 39
- 230000000875 corresponding effect Effects 0.000 description 23
- 238000010586 diagram Methods 0.000 description 23
- 230000006870 function Effects 0.000 description 20
- 230000008569 process Effects 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 8
- 101100544813 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) ypt-1 gene Proteins 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 230000004044 response Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 5
- 238000005070 sampling Methods 0.000 description 5
- 229920001621 AMOLED Polymers 0.000 description 4
- 238000010183 spectrum analysis Methods 0.000 description 4
- 230000000007 visual effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000000919 ceramic Substances 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000005034 decoration Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 239000009509 xiaoyin Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2227/00—Details of public address [PA] systems covered by H04R27/00 but not provided for in any of its subgroups
- H04R2227/005—Audio distribution systems for home, i.e. multi-room use
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/12—Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/04—Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
Definitions
- the present application relates to the technical field of terminals, and in particular to a method for building a stereo speaker system and related devices.
- Speakers have become ubiquitous electronic products in people's lives. In order to be able to enjoy high-quality audio anytime and anywhere, portable speakers have also become a favorite product for electronic enthusiasts.
- a stereo effect can be created by building a stereo speaker system.
- two speakers can be used as the left channel and the right channel respectively, which can provide a more three-dimensional sound effect than one speaker.
- a conventional way to build a stereo speaker system is to first use a corresponding application (APP) to scan nearby speakers, and if multiple similar speakers are found, a stereo speaker system can be built. As shown in Figure 1, taking two speakers scanned as an example, the user can manually select which speaker is set as the left channel and which speaker is set as the right channel in the interface, and then completes the formation of the stereo speaker system.
- APP corresponding application
- Another implementation is to press the buttons on the speakers at the same time to trigger the formation of a stereo speaker system.
- the purpose of the present application is to provide a method for building a stereo speaker system and a related device, which are used to solve the problem in the related art that there is a single operation mode for building a stereo speaker system.
- a method for building a stereo speaker system which is applied to a first speaker, and the method includes:
- the user causes the speaker to produce a certain motion state.
- a stereo speaker system can be established. Therefore, for the user, holding the speaker to perform the corresponding action is A stereo speaker system can be set up, which is easy to operate.
- the method before acquiring the first motion state of the first sound box, the method further includes:
- the user is prompted for the preset actions to be executed for building the stereo speaker system.
- the obtaining the first motion state of the first sound box includes:
- a first acceleration sequence of the first sound box is generated, and first indication information arranged in time sequence is stored in the first acceleration sequence of the first sound box, and the first indication information Used to express the correspondence between acceleration and the duration of said acceleration;
- the characteristics of the preset action include a first sequence template, and the method further includes:
- the method of template matching is adopted to determine whether the collected first operating state matches the characteristics of the preset actions.
- the template matching method is simple and easy to implement, and the template of the characteristics of the preset actions uses time-domain information To express that different actions can be applied, and the scope of application is not limited.
- the obtaining the first motion state of the first sound box includes:
- a second acceleration sequence of the first sound box is generated, and second indication information is stored in the second acceleration sequence of the first sound box, and the second indication information is used to express acceleration and collecting the frequency domain information of the acceleration;
- the characteristics of the preset action include a second sequence template, and the method further includes:
- the method of template matching is adopted to determine whether the collected first operating state matches the characteristics of the preset actions.
- the template matching method is simple and easy to implement, and the template of the characteristics of the preset actions uses frequency domain information. To express that different actions can be applied, and the scope of application is not limited.
- the method also includes:
- the method Before the searching for the second speaker, the method also includes:
- the formation of the stereo speaker system will be triggered, which can improve the accuracy of triggering the establishment of the stereo speaker system and avoid false triggering as much as possible.
- the formation of a stereo speaker system with the second speaker includes:
- the main speaker can be independently screened out, and the sound channels can be independently configured for each speaker.
- the main speaker screening rules include at least one of the following rules:
- the screening rule is simple and easy to implement, and can screen out the main speakers as unambiguously as possible.
- the configuring the sound channel for the first speaker includes:
- the user can use certain actions to configure the sound channels of each speaker, for example, shake it once to configure it as one sound channel, and shake it twice to configure it as another sound channel, which increases the flexibility of sound channel configuration and ease of operation.
- the prompting the user to configure a sound channel for the first speaker includes:
- the user may be prompted in easy-to-implement ways such as effect, light effect, and screen display, so as to guide the user to complete the operation of setting up the stereo speaker system.
- the second motion state includes at least one of the following parameters:
- the number of times the first sound box is shaken the speed of being shaken, the acceleration of being shaken, the direction of being shaken, the moving distance, the number of times the first sound box is collided, and the moving track of the first sound box.
- the user can be supported to manually control the sound box in an easy-to-operate manner to complete the channel configuration.
- the method also includes:
- the specified position relationship includes: a distance between the first sound box and the second sound box is smaller than a distance threshold.
- speakers with a relatively short distance are used to form a speaker system, which is simple and easy to implement and can ensure that speakers within a certain distance range are used to form a speaker system to ensure a stereo effect.
- the embodiment of the present application also provides a first speaker, the speaker includes:
- An acquisition module configured to acquire the first motion state of the first speaker
- a search module configured to search for the second sound box if the first motion state matches the characteristics of the preset action
- the system building module is configured to form a stereo speaker system with the second speaker if the second speaker is found.
- the speaker further includes:
- the prompting module is configured to prompt the user with the preset action to be executed for building a stereo speaker system before acquiring the first motion state of the first speaker.
- the acquiring module is specifically used for:
- a first acceleration sequence of the first sound box is generated, and first indication information arranged in time sequence is stored in the first acceleration sequence of the first sound box, and the first indication information Used to express the correspondence between acceleration and the duration of said acceleration;
- the characteristics of the preset action include a first sequence template, and the speaker also includes:
- a first matching module configured to match the first sequence template with the first acceleration sequence of the first sound box
- the acquiring module is specifically used for:
- a second acceleration sequence of the first sound box is generated, and second indication information is stored in the second acceleration sequence of the first sound box, and the second indication information is used to express acceleration and collecting the frequency domain information of the acceleration;
- the characteristics of the preset action include a second sequence template, and the speaker also includes:
- a second matching module configured to match the second sequence template with the second acceleration sequence of the first sound box
- the speaker further includes:
- a distance determination module configured to determine the moving distance of the first sound box based on the first motion state of the first sound box
- the speaker also includes:
- the distance determination module is configured to determine that the moving distance of the first sound box is greater than a specified distance before the search for the second sound box.
- the system builds modules specifically for:
- the main speaker screening rules include at least one of the following rules:
- system building modules are specifically used for:
- system building modules are specifically used for:
- the second motion state includes at least one of the following parameters:
- the number of times the first sound box is shaken the speed of being shaken, the acceleration of being shaken, the direction of being shaken, the moving distance, the number of times the first sound box is collided, and the moving track of the first sound box.
- system building blocks are also used for:
- the specified position relationship includes: a distance between the first sound box and the second sound box is smaller than a distance threshold.
- the embodiment of the present application also provides a sound box
- the sound box includes: one or more processors, one or more memories, one or more speakers, one or more microphones, and a communication module; wherein, one or multiple microphones for collecting sound signals; a communication module for communicating with other speakers; one or more speakers for sending out sound signals; one or more processors coupled to the one or more memories;
- one or more memories are used to store computer-executable program codes; wherein, the program codes include instructions, and when the one or more processors execute the instructions, the sound box is made to perform the above-mentioned first aspect and Any possible technical solution of its first aspect.
- an embodiment of the present application provides a chip, the chip includes a processor and an interface; the interface is used to receive code instructions, and transmit the received code instructions to the processor; the chip and The memory coupling in the sound box enables the processor to execute the technical solution of the first aspect of the embodiment of the present application and any possible design of the first aspect; "coupling" in the embodiment of the present application means that two components are directly or indirectly combined with each other.
- the embodiment of the present application provides a speaker system, the speaker system includes one or more speakers, wherein at least one speaker is the speaker described in the second aspect and the third aspect above, and the speaker system can perform the above-mentioned All or part of the steps of the first sound box in the first aspect.
- the embodiment of the present application provides a speaker system, the speaker system includes a first speaker and at least one second speaker, and the first speaker and the second speaker are arranged in different positions; Communication is possible between the first sound box and the second sound box; the first sound box is the sound box as described in the above-mentioned second aspect and the third aspect (the sound box can be any one of the above-mentioned first aspect and the first aspect thereof) The first sound box in the technical scheme of design).
- a computer-readable storage medium includes a computer program, and when the computer program runs on the computer, the computer executes the first aspect and the first aspect of the embodiment of the present application. Any possible technical solution of its first aspect.
- a program product in the embodiment of the present application when the computer program product is run on a computer, causes the computer to execute the technology of the first aspect of the embodiment of the present application and any possible design of the first aspect thereof plan.
- FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of the present application
- Fig. 2 is one of the schematic diagrams of the structure of the sound box provided by an embodiment of the present application.
- Fig. 3 is the second schematic diagram of the structure of the sound box provided by an embodiment of the present application.
- Fig. 4A is the third schematic diagram of the structure of the sound box provided by an embodiment of the present application.
- FIG. 4B is a schematic diagram of determining the orientation of a user by a speaker provided by an embodiment of the present application.
- FIG. 5 is a schematic diagram of operations performed by a user with a hand-held speaker provided by an embodiment of the present application
- FIG. 6 is a schematic diagram of a custom preset action provided by an embodiment of the present application.
- FIG. 7 is a schematic flowchart of a method for constructing a stereo speaker system according to an embodiment of the present application.
- Fig. 8 is a schematic diagram of the first sequence sub-template provided by an embodiment of the present application.
- Fig. 9 is a schematic diagram of the process of matching with the characteristics of preset actions provided by an embodiment of the present application.
- FIG. 10 is a schematic diagram of channel configuration when there are two speakers provided by an embodiment of the present application.
- Fig. 11 is a schematic diagram of channel configuration provided by an embodiment of the present application when there are three speakers;
- Fig. 12 is a schematic diagram of channel configuration provided by an embodiment of the present application when there are six sound boxes;
- Fig. 13 is a schematic diagram of channel configuration provided by an embodiment of the present application when there are twelve sound boxes;
- FIG. 14 is another schematic flowchart of a method for building a stereo speaker system according to an embodiment of the present application.
- FIG. 15 is another schematic flowchart of a method for building a stereo speaker system provided by an embodiment of the present application.
- FIG. 16 is a schematic diagram of an application scenario for acquiring network resources of a stereo speaker system provided by an embodiment of the present application
- FIG. 17 is a schematic diagram of another application scenario for acquiring network resources of a stereo speaker system provided by an embodiment of the present application.
- Fig. 18 is another structural schematic diagram of a sound box provided by an embodiment of the present application.
- a plurality referred to in the embodiment of the present application means greater than or equal to two. It should be noted that in the description of the embodiments of the present application, words such as “first” and “second” are only used to distinguish the purpose of description, and cannot be understood as indicating or implying relative importance, nor can they be understood as indicating or imply order.
- references to "one embodiment” or “some embodiments” or the like in this specification means that a particular feature, structure, or characteristic described in connection with the embodiment is included in one or more embodiments of the present application.
- appearances of the phrases “in one embodiment,” “in some embodiments,” “in other embodiments,” “in other embodiments,” etc. in various places in this specification are not necessarily All refer to the same embodiment, but mean “one or more but not all embodiments” unless specifically stated otherwise.
- the terms “including”, “comprising”, “having” and variations thereof mean “including but not limited to”, unless specifically stated otherwise.
- FIG. 1 shows a schematic diagram of an application scenario provided by an embodiment of the present application.
- FIG. 1 it is a schematic diagram of multiple people using their own speakers to build a stereo speaker system during a party.
- User A holds speaker 1
- user B holds speaker 2.
- User A and user B can shake their respective speakers at the same time, and speaker 1 and speaker 2 can be realized to form a stereo speaker system.
- a stereo speaker system can be built by shaking, which increases the fun and convenience of building.
- the speaker 100 may include one or more input devices (input device) 101 , one or more output devices (output device) 102 and one or more processors (processor) 103 .
- the input device 101 can detect various types of input signals (may be referred to as: input), and the output device 102 can provide various types of output information (may be referred to as: output).
- the processor 103 may receive input signals from one or more input devices 101 , generate output information in response to the input signals, and output through one or more output devices 102 .
- one or more input devices 101 may detect various types of inputs and provide signals (eg, input signals) corresponding to the detected inputs, and then the one or more input devices 101 may transmit the input The signals are provided to one or more processors 103 .
- one or more input devices 101 may include any component or component capable of detecting an input signal.
- the input device 101 may include an audio sensor (such as one or more microphones), an acceleration sensor, a distance sensor, an optical or visual sensor (such as a camera, a visible light sensor or an invisible light sensor), a proximity light sensor, a touch sensor, a pressure Sensors, mechanical devices (such as crowns, switches, buttons or keys, etc.), temperature sensors, communication devices (such as wired or wireless communication devices), etc., or, the input device 101 can also be some combination of the above various components.
- the data of the acceleration sensor can be used to determine whether the user shakes or touches the speaker, so as to facilitate the construction of a stereo speaker.
- one or more output devices 102 may provide various types of output.
- one or more output devices 102 may receive one or more signals (eg, output signals provided by one or more processors 103 ) and provide outputs corresponding to the signals.
- output device 102 may include any suitable components or components for providing output.
- output devices 102 may include audio output devices (e.g., one or more speakers), visual output devices (e.g., one or more lights or displays), tactile output devices, communication devices (e.g., wired or wireless communication devices) etc., or, the output device 102 may also be some combination of the above-mentioned various components.
- one or more processors 103 may be coupled to the input device 101 and the output device 102 .
- the processor 103 can communicate with the input device 101 and the output device 102 .
- one or more processors 103 may receive an input signal from the input device 101 (eg, an input signal corresponding to an input detected by the input device 101 ).
- One or more processors 103 may parse the received input signal to determine whether to provide one or more corresponding outputs in response to the input signal. If so, one or more processors 103 may send an output signal to output device 102 to provide an output.
- Fig. 3 shows a functional block diagram of a sound box 300 provided by another embodiment of the present application.
- the sound box 300 may be an example of the sound box 100 described in FIG. 2 .
- the sound box 300 includes a microphone 301 , a speaker 302 , a processor 303 , a memory 304 , a communication module 305 , a sensor module 306 , and a light 307 .
- the components shown in FIG. 3 do not constitute a specific limitation on the speaker 300, and the speaker 300 may also include more or fewer components than shown in the figure, or combine some components, or split some components, or different component arrangements.
- the processor 303 may include one or more processing units, for example: the processor 303 may include an application processor (application processor, AP), a modem processor, a graphics processing unit (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural network processor (neural-network processing unit, NPU) wait. Wherein, different processing units may be independent devices, or may be integrated in one or more processors. Wherein, the controller may be the nerve center and command center of the speaker 300 . The controller can generate an operation control signal according to the instruction opcode and timing signal, and complete the control of fetching and executing the instruction.
- a memory may also be set in the processor 303 for storing instructions and data.
- the memory in processor 303 is a cache memory.
- the memory may hold instructions or data that the processor 303 has just used or recycled. If the processor 303 needs to use the instruction or data again, it can be called directly from the memory, avoiding repeated access, reducing the waiting time of the processor 303, thus improving the efficiency of the system.
- the processor 303 may run software codes/modules of the method for building a stereo speaker system provided in some embodiments of the present application to realize the function of controlling the speakers.
- the microphone 301 also called “microphone” or “microphone” is used to collect sound signals (for example, to collect sounds from users), and convert the sound signals into electrical signals.
- one or more microphones 301 such as a microphone array, may be provided on the sound box 300 .
- the microphone 301 in addition to collecting sound signals, can also implement a noise reduction function for the sound signals, or can also identify the source of the sound signals, implement directional recording functions, and the like.
- the speaker 302 also called “horn” is used to convert audio electrical signals into sound signals.
- the speaker 300 can play sound signals such as music through the speaker 302 .
- a microphone 301 and a speaker 302 are coupled with the processor 303 .
- the microphone 301 sends the sound signal or an audio electrical signal converted from the sound signal to the processor 303 .
- the processor 303 judges whether to respond to the sound signal or audio electrical signal, and if so, outputs a corresponding output signal, such as playing music through the speaker 302 .
- the memory 304 may be used to store computer executable program codes, where the executable program codes include instructions.
- the processor 303 executes various functional applications and data processing of the speaker 300 by executing instructions stored in the memory.
- the memory may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash memory (universal flash storage, UFS), etc., which are not limited in the embodiment of the present application.
- the memory 304 may store information such as "wake-up word”.
- the memory 304 may also store audio information (eg, songs, comic dialogue, storytelling, etc.).
- the communication module 305 may be a wireless communication module (such as Bluetooth, wireless).
- the speaker 300 is connected to other devices, such as another speaker, a mobile phone, and a television, through a communication module 305 .
- the sensor module 306 may include an air pressure sensor 306A, a temperature sensor 306B, an acceleration sensor 306C, and the like. It should be understood that FIG. 3 is only an example of several sensors, and in practical applications, the sound box 300 may also include more or fewer sensors, or use other sensors with the same or similar functions to replace the sensors listed above, etc. , the embodiment of this application is not limited.
- the air pressure sensor 306A is used for measuring air pressure.
- the processor 303 may be coupled to the air pressure sensor 306A, and the air pressure value measured by the air pressure sensor 306A may be used to assist calculations, such as calculating sound attenuation coefficients and the like.
- the temperature sensor 306B is used to detect temperature.
- the processor 303 may be coupled with the temperature sensor 306B, and the temperature value measured by the temperature sensor 306B may be used to assist calculations, such as calculating sound attenuation coefficients and the like.
- the acceleration sensor 306C is used to collect the acceleration information of the speaker when the user shakes the speaker, so as to construct a stereo speaker system according to the acceleration information.
- the sound box 300 may include a display (or a display screen), or may not include a display.
- the display can be used to display the display interface of the application, such as the currently playing song.
- the display includes a display panel.
- the display panel can adopt liquid crystal display (liquid crystal display, LCD), organic light-emitting diode (organic light-emitting diode, OLED), active matrix organic light-emitting diode or active-matrix organic light-emitting diode (active-matrix organic light emitting diodes, AMOLED), flexible light-emitting diodes (flex light-emitting diodes, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light emitting diodes (quantum dot light emitting diodes, QLED), etc.
- a touch sensor may be disposed in the display to form a touch screen, which is not limited in this embodiment of the present application.
- the touch sensor is used to detect a touch operation on or near it.
- the touch sensor can transmit the detected touch operation to the processor 303 to determine the type of the touch event.
- the processor 303 may provide visual output related to the touch operation through the display.
- FIG. 3 may further include more devices, such as a battery, a USB interface, etc., which will not be described in detail in this embodiment of the present application.
- Fig. 4A shows a schematic structural diagram of a sound box provided by an embodiment of the present application.
- the sound box 400 may be an example of the sound box described in FIG. 2 or FIG. 3 .
- the sound box 400 may include a base 401 and a casing 402 .
- the base 401 can function as a support.
- the base 401 can support the casing 402 and the components enclosed in the casing 402 (such as a processor, a microphone, a speaker, etc.).
- the base 401 may be made of metal, plastic, ceramics and any other supporting material, or a combination of these materials.
- one or more speakers 406 may be supported on the base 401 .
- the base 401 can support a fixing part 404, and one or more speakers 406 can be arranged on the fixing part 404.
- the base 401 can support the fixture 404 through a support column 405 or other means.
- the fixing piece 404 can be in any shape, such as round, square and so on.
- one or more speakers 406 may be arranged in a certain arrangement on the fixing member 404 .
- one or more speakers 406 may be evenly distributed on the edge of the fixing member 404, for example, the distance between each speaker is the same.
- one or more speakers 406 may be coupled to processor 403 .
- Processor 403 may output audio signals through one or more speakers 406 .
- the housing 402 may be any three-dimensional shape such as a cylinder, a cube, or a cube.
- the casing 402 may enclose components such as a processor 403 , a fixing member 404 , and one or more speakers 406 .
- Housing 402 may be a single housing member, or more than two housing members.
- the housing 402 may include an upper housing 402a and a side housing 402b.
- One or more housing components can be metal, plastic, ceramic, crystal, or a combination of these materials, or any other housing components suitable for being arranged on the sound box, and the like.
- the side shell 402b may be a shell with a mesh structure, for example, the mesh may be in the shape of round holes, square holes, hexagonal holes and the like.
- the shell with mesh structure can play the roles of decoration, dustproof, and protection of devices inside the shell (such as speakers, microphones, etc.), and the shell with mesh structure can reduce the blocking of sound output by the speaker.
- the upper shell 402a may be a mesh structure, or a shell that is not a mesh structure.
- the upper housing 402a can be provided with input devices, such as switches, buttons or keys.
- a switch is used to turn a speaker on or off.
- Buttons or keys can be used to adjust functions such as volume.
- a display screen 409 (such as a touch display screen) may be provided on the upper housing 402a, and may be used for receiving input, providing visual output, and the like. For example, the title of the currently played song, the name of the singer, etc. can be displayed on the display screen 409 .
- the speaker may not be provided with a display screen, which is not limited in this embodiment of the present application.
- the upper case 402a can be connected with the fixing part 407 .
- One or more microphones 408 may be disposed on the fixing member 407 .
- the fixing piece 407 can be in any shape, such as round, square and so on.
- one or more microphones 408 may be arranged in a certain manner on the fixing member 407 .
- one or more microphones 408 may be evenly distributed on the edge of the fixing member 407, for example, the distance between each microphone is the same.
- the central angle a corresponding to every two adjacent microphones (for example, the angle formed by the straight line connecting the two microphones to the center point of the fixing member 407 ) may be fixed, for example, 30 degrees, 60 degrees, etc.
- one or more microphones 408 may be coupled to processor 403 .
- the processor 403 can obtain input signals (such as voice signals from the user) through one or more microphones 408 .
- the application scenario in FIG. 1 is taken as an example, and the speaker 1 and/or the speaker 2 in FIG. 1 is the speaker 400 shown in FIG. 4A as an example.
- one of the speakers 1 and 2 is called the main speaker, and the other is called the slave speaker.
- the master sound box and the slave sound box can be used together.
- the master speaker is used to play the left channel and the slave speaker is used to play the right channel, or the master speaker is used to play the right channel and the slave speaker is used to play the left channel. That is to say, the cooperation of the main speaker and the slave speaker can realize the stereo sound effect of the audio.
- whether a speaker is the master speaker or the slave speaker can be set before the speaker leaves the factory, or it can be user-defined (for example, the speaker is connected to the input operation through the touch screen, and the input operation is used to select Whether the speaker is the master speaker or the slave speaker), in this embodiment of the application, it can also be selected according to certain rules.
- the structure of the main sound box and the slave sound box may be the same, for example, both the main sound box and the slave sound box have the structures shown in FIG. 4A .
- the structure of the main speaker and the slave speaker may not be exactly the same, for example, the master speaker may be provided with a display screen, while the slave speaker may not be provided with a display screen.
- the functions of some components in the master sound box and the slave sound box may not be exactly the same.
- the processor in the main speaker can be used to calculate the delay difference (for example, the time difference between the first duration and the second duration, the first duration can be the duration required for the sound to travel from the main speaker to the user, and the second duration can be It is the time required for the sound to travel from the speaker to the user), loudness gain, etc., but the processor in the slave speaker does not have this function.
- the delay difference for example, the time difference between the first duration and the second duration
- the first duration can be the duration required for the sound to travel from the main speaker to the user
- the second duration can be It is the time required for the sound to travel from the speaker to the user
- loudness gain etc.
- the master speaker and/or the slave speaker can store audio files (such as songs, comic dialogue, storytelling, etc.) in the memory, and the master speaker and the slave speaker can play the stored audio files.
- the master speaker can receive input (such as receiving input operations through a touch screen, or receiving speech input through a microphone), and this input can be used to activate the master speaker and/or slave speakers, or to control the master speaker and slave speakers. , switch songs, etc.
- one or more microphones in the main sound box collects a sound signal (for example, a sound signal emitted by a user), and the processor recognizes that the sound signal contains "wake word + play song", and the processor determines the memory
- a sound signal for example, a sound signal emitted by a user
- the processor recognizes that the sound signal contains "wake word + play song"
- the processor determines the memory
- the song can be downloaded from the network side, or a prompt message (such as language information) can be output to prompt the user that the song does not exist.
- the main speaker and/or the slave speaker can be connected with other electronic devices (such as mobile phones, televisions), and can be connected in a wired or wireless manner.
- the connection between the main speaker and the mobile phone for example, Bluetooth connection
- the mobile phone can send the audio signal to the master speaker, so that the master speaker and the slave speaker can play the audio signal (for example, after the master speaker receives the audio signal, it can send the audio signal to the slave speaker).
- the mobile phone is running a music player application (for example, Kugou Music), and the song "All the way to the north" is being played, the mobile phone can send the audio signal of the song to the main speaker, so that the main speaker and the slave speaker play the audio signal.
- a music player application for example, Kugou Music
- the user can control the mobile phone to perform corresponding operations through the main speaker.
- the user sends out a sound signal of "Xiaobai plays a song and listens to mother's words" in the room.
- the main speaker collects the sound signal, and can pause the playback all the way to the north, but output the prompt message "I am playing for you Find out what your mother says.”
- the main speaker can find out whether there is a song "Listen to Mom" from the local storage. If it does not exist, the main speaker can download it from the network side, or the main speaker can send instructions to the mobile phone. After receiving the instruction, download or play the song online, and send the audio signal of the song to the main speaker, so that the main speaker and the slave speaker play the audio signal of the song (that is, listen to the mother's words).
- both the master speaker and the slave speaker can activate the function of automatically recognizing the "wake word”.
- the main speaker as an example, after the main speaker activates the function of automatically recognizing the "wake word”, all or some components (for example, one or more microphones, processors, etc.) in the main speaker are enabled.
- the sound signal from the user in the room is picked up by one or more microphones in the main speaker.
- One or more microphones send the received sound signal to the processor, and when the processor judges that the sound signal contains a "wake word", it activates other components (for example, one or more speakers).
- the "wake-up word” can be set by default when the speaker leaves the factory, or it can be user-defined.
- the "wake-up word” can be "Xiaobai", “Xiaoyin”, “Xiaoyi” and so on.
- both the master speaker and the slave speaker can activate the function of automatically recognizing "wake word + playing a song".
- the main speaker activates the function of the automatic device "wake word + play song"
- all or part of the components (eg, one or more microphones, processors, etc.) in the main speaker are enabled.
- the sound signal from the user in the room is picked up by one or more microphones in the main speaker.
- One or more microphones send the received sound signal to the processor, and when the processor judges that the sound signal contains "wake word + play song", other components (for example, one or more speakers) are activated.
- the user sends "Xiaobai plays all the way north" in the room.
- the sound signal collected by the microphone in the main speaker is then sent to the processor, and the processor recognizes that the sound signal includes the wake-up word: Xiaobai, and also includes: playing a song, and the processor starts other components (for example, one or more loudspeakers ).
- the main speaker can receive an input operation through an input device (such as a touch screen on the main speaker) or receive an input operation through other devices connected to the main speaker such as a mobile phone, and in response to the input operation, the automatic recognition of the "wake word” is started.
- an input device such as a touch screen on the main speaker
- receive an input operation through other devices connected to the main speaker such as a mobile phone
- the master speaker can send an instruction to the slave speaker, which is used to instruct the slave speaker to start the function of automatically recognizing "wake word” or "wake word + play song”.
- the user may be at any position in the room, and the distances between the master speaker and the slave speaker may be different from the user.
- the main speaker and the slave speakers can automatically recognize the "wake word” or "wake word + play song” function. Sound signals are collected from the microphones in the main speakers and the slave speakers.
- the master speaker and the slave speaker determine that the sound signal contains "wake-up word” or "wake-up word + play song”
- the position of the user can be determined, and then the sound parameters of the master speaker and the slave speaker can be controlled according to the user's position.
- the sound parameters may include a time delay difference between a master speaker and a slave speaker, loudness gain, and the like. Therefore, in this embodiment, when the master speaker and the slave speakers recognize that the collected sound signal contains "wake-up word” or "wake-up word + play song", the sound parameters of the master speaker and the slave speaker are adjusted according to the position of the user. .
- the process of judging the position of the user by the master speaker and the slave speakers may include: the master speaker collects the sound signal 1 . Acoustic signal 2 is collected from the speaker. The master speaker determines that the sound signal 1 includes the "wake-up word", and the slave speaker determines that the sound signal 2 includes the "wake-up word”. Of course, in order to improve the accuracy, the slave speaker can also send the sound signal 2 or the "wake-up word" included in the sound signal 2 to the master speaker, and the master speaker determines whether the "wake-up word" in the sound signal 1 or the sound signal 2 is Same wake word.
- the main speaker can determine the first direction/orientation of the user relative to the main speaker according to the sound signal 1, for example, the first direction/orientation can be expressed as the first angle between the user and the x-axis in the coordinate system constructed by the main speaker .
- the slave speaker can determine the second direction/orientation of the user relative to the slave speaker according to the sound signal 2, for example, the second direction/orientation can be expressed as the second direction between the user and the x-axis in the coordinate system constructed by the slave speaker. angle.
- the slave speaker may send the second angle to the master speaker, and the master speaker determines the position of the user according to the first angle and the second angle, and the distance D between the master speaker and the slave speaker.
- the construction of the coordinate system by the master speaker and the slave speakers, and the process of determining the user's position by the master speaker and the slave speakers will be described in detail later.
- microphone array positioning technology for example, estimating the direction of the sound source based on the time difference of the sound signals received by at least two microphones in the microphone array on the main speaker
- beam pointing (steered-beamformer) positioning method based on high-resolution Spectral analysis (high-resolution spectral analysis) positioning method
- sound source positioning technology based on sound time difference time-delay estimation, TDE, etc.
- TDE time-delay estimation
- the process for the main speaker to determine the first direction/orientation of the user relative to the main speaker according to the sound signal 1 may include; the microphone array 408 in the main speaker collects the sound signal, assuming that the microphone 408-1 and The intensity of the sound signal collected by 408-2 is relatively high, and the main speaker can collect the sound signal at the first moment t1 according to the microphone 408-1, and the second moment t2 of the sound signal collected by the microphone 408-2, and the microphone 408- 1 and the distance L1 between the microphone 408-2 (this distance can be stored in the main speaker after leaving the factory), calculate the sound source, that is, the first orientation of the user relative to the main speaker. Referring to FIG.
- the main speaker can determine the angle A of the user relative to the microphone 408-1 according to (t1-t2)*c and L1, and the trigonometric function relationship, and the angle A can be used as the angle A between the user and the main speaker. or, since the included angle A is the included angle of the user relative to the microphone 408-1, the main speaker can convert the included angle A into the coordinate system constructed by the main speaker to obtain the included angle B, The included angle B can also be used as the first orientation of the user relative to the main speaker.
- the structure of the slave speaker and the master speaker may be the same, so the process of determining the second position of the user relative to the slave speaker may be similar to the above process.
- the main speaker and the slave speakers can continuously collect sound signals in real time (the sound signals may not contain "wake-up words” or “wake-up words + play songs”) , and then determine the user's position, adjust the sound parameters of the master speaker and the slave speaker according to the user's position, until a sound signal containing "wake-up word” or "wake-up word+play song” is detected, the adjusted sound parameters (such as , delay difference between the master speaker and the slave speaker, loudness gain, etc.) to control the master speaker and the slave speaker to play audio signals.
- the sound signals may not contain "wake-up words” or “wake-up words + play songs”
- the adjusted sound parameters such as , delay difference between the master speaker and the slave speaker, loudness gain, etc.
- the distance D between the master speaker and the slave speakers may be detected so as to determine whether to build a stereo speaker system, and the distance may be a straight-line distance between the master speaker and the slave speakers.
- the master speaker detects the distance D, it can send the distance D to the slave speaker, and the slave speaker does not need to detect the distance D; or, the slave speaker can also detect the distance D to the master speaker for use.
- the slave speaker can detect the distance D to the master speaker, and then send it to the master speaker, that is, the master speaker does not need to detect the distance D and so on.
- the distance between the master speaker and the slave speaker can be detected by a distance sensor.
- the distance sensor may be a laser distance sensor, an infrared distance sensor, or the like.
- the distance sensor on the main speaker emits infrared light of a specific frequency, which is reflected by the secondary speaker, and the main speaker receives the light emitted from the secondary speaker.
- the master speaker can calculate the distance between the master speaker and the slave speaker according to the first time when the infrared light is emitted and the second time when the reflected light is received.
- the master speaker can also communicate with the slave speakers to achieve the purpose of measuring the distance between the master speaker and the slave speakers.
- the master speaker transmits a detection signal to the slave speakers, and after receiving the detection signal, the slave speaker sends a feedback signal to the master speaker, and the master speaker receives the feedback signal.
- the master speaker can determine the distance between the master speaker and the slave speaker according to the second time of receiving the feedback signal and the first time of sending the detection signal.
- the master speaker may also receive an input operation through an input device (such as a touch screen on the master speaker), where the input operation is used to input the distance between the master speaker and the slave speaker.
- the main speaker and the slave speaker can also use the microphone array positioning technology (for example, to estimate the direction of the sound source according to the time difference of the sound signals received by at least two microphones in the microphone array on the main speaker), beam pointing (steered- Beamformer) positioning method, based on high-resolution spectral analysis (high-resolution spectral analysis) positioning method, and based on sound time difference (time-delay estimation, TDE) sound source positioning technology, etc. to determine the distance between the master and slave speakers.
- the main speaker can emit sound
- the slave speaker can determine the distance between itself and the main speaker based on the microphone array.
- the master speaker and the slave speaker are not determined, at least one of the two speakers can emit sound to realize distance measurement.
- Motion state that is, the motion of the speaker.
- the first motion state and the second motion state mentioned in the following embodiments are parameters describing the movement of the sound box.
- the motion state may be data collected by the acceleration sensor, or information obtained by further processing the data collected by the acceleration sensor.
- the first indication information used to describe the correspondence between the acceleration and its duration. For example, defining the same acceleration refers to the same acceleration when the acceleration change is less than the preset change. It should be noted that different acceleration directions mean different accelerations. For example, the acceleration range [1,2] is considered to be the same acceleration within the range of 0.5 based on 1.5. Then, the duration of each acceleration is counted, and each acceleration and its corresponding duration can be expressed by using the corresponding first indication information.
- the first indication information is (A, T), where A represents an acceleration, and T represents a duration corresponding to the acceleration A.
- the duration T of the acceleration A is described explicitly, and in another embodiment, the duration T of the acceleration A may also be described implicitly.
- the acceleration sensor collects data at intervals of time, it can be defined that the duration of each acceleration is expressed by this time interval (such as 5ms), so the first indication information can also be expressed as (A), that is, the default duration is 5ms.
- the acceleration sensor can simultaneously collect acceleration information in three directions (ie, X, Y, and Z directions). Then (A, T) can be expressed as (X1, Xt1, Y1, Yt1, Z1, Zt1) or (X1, Y1, Z1, Xt1, Yt1, Zt1). Among them, X1, Y1, and Z1 represent the collected accelerations in various directions, Xt1 represents the duration of acceleration X1, Yt1 represents the duration of acceleration Y1, and Zt1 represents the duration of acceleration Z1.
- the first acceleration sequence (time sequence information): used to describe the organization of the first indication information, that is, the time domain signal of the motion state.
- the first indication information may be sorted in time sequence to obtain the first acceleration sequence of the sound box.
- the first acceleration sequence S ⁇ (X1, Xt1, Y1, Yt1, Z1, Zt1), (X2, Xt2, Y2, Yt2, Z2, Zt2)...(Xn, Xtn, Yn, Ytn, Zn, Ztn) ⁇ , n is a positive integer.
- the second indication information the frequency domain information of each point in the motion state of the speaker.
- the frequency domain information of each acceleration in the embodiment of the present application may be referred to as second indication information.
- One way of expressing the frequency-domain information is, for example, expressing the frequency-domain information of the statistical acceleration A as P, then its second indication information is expressed as (A, P). For example, assuming that 9s is a sampling time, the obtained sampling result of the acceleration in the X direction is (1,2,3,4,1,2,3,5,1,2,3,4,1,2, 3, 5, ...), convert the sampling result to the frequency domain, and obtain the frequency domain information in the X direction.
- the Y direction and the Z direction refer to the processing method of the X direction, which will not be repeated in this application.
- the second acceleration sequence (frequency domain information): used to describe the organization of the second indication information, and used to express the motion state of the speaker in the frequency domain (ie, the frequency domain signal of the motion state).
- the acceleration in the X direction includes (1,2,3,4,1,2,3,5,1,2,3,4,1,2,3,5,...), then about X
- the preset actions can be pre-defined in the embodiment of the present application.
- the preset action may be to draw a certain figure (such as a figure 8, a wavy line, or a five-pointed star) with the hand-held speaker. For example, shake the speaker left and right, shake the speaker up and down, or make the speaker collide.
- the way of generating the collision may be to collide with another speaker or another object, as long as the speaker collides.
- preset actions can be configured according to actual needs during implementation, and even users can customize preset actions, which are applicable to the embodiments of the present application.
- the characteristics of the preset action are used in the embodiment of the present application to describe the preset action.
- the acquisition method of the characteristics of the preset action can be implemented as, after the preset action is pre-defined, then operate the speaker according to the preset action (for example, draw a figure-eight pattern with the speaker), and collect the motion state of the speaker (such as collecting the above-mentioned the first acceleration sequence or the second acceleration sequence), thereby obtaining the characteristics of the preset action and storing it in the memory of the sound box.
- features of preset actions applicable to different user groups may be constructed for the same preset action.
- the preset action is to draw a figure of eight
- the data of drawing a figure of eight for the elderly can be collected to obtain the characteristics of drawing a figure of eight suitable for the elderly
- the data of drawing a figure of eight for children can be collected to obtain the characteristics of drawing a figure of eight suitable for children.
- the embodiment of the present application also supports user-defined preset actions.
- the user triggers the business logic of a custom preset action through a button on the speaker.
- the speaker can then prompt the user to customize the preset actions for the speaker.
- the custom preset action as drawing a five-pointed star as an example
- the user draws a five-pointed star while holding the speaker, and the speaker collects its own motion state as a sample of the custom preset action, and executes the custom preset action. count.
- the speaker collects 3 samples. At this time, the preset counting requirement is reached, and the sample collection is stopped.
- the three samples are analyzed (for example, averaged), and then the characteristics of the custom preset actions are obtained and stored in the memory. Afterwards, when the user draws a five-pointed star with the speaker in hand, it will be matched with the characteristics of the preset actions stored in the memory, and the matching will trigger the formation of a stereo speaker system.
- the first sequence template and the second sequence template are the characteristics of preset actions, and are used to match the collected motion states of the speakers. Since the previous section 3) and 5) respectively introduced the timing information obtained by sorting and analyzing the acceleration information, that is, the first acceleration sequence, and the frequency domain information, that is, the second acceleration sequence. Therefore, in the embodiment of the present application, a first sequence template corresponding to the first acceleration sequence is provided, and a second sequence template corresponding to the second acceleration sequence is provided.
- the first sequence template and the first acceleration sequence are used for matching.
- the second sequence template and the second acceleration sequence are used for matching. Then it is determined whether to trigger the operation of forming a stereo speaker system based on the matching result.
- FIG. 7 shows a schematic flowchart of a method for constructing a stereo speaker system provided by an embodiment of the present application. Taking two speakers as an example, it includes the following steps:
- the speaker will prompt the user with the preset actions required to build the stereo speaker system, such as a voice reminder: "Please hold the speaker and draw a number 8 to build a stereo speaker system” , if the speaker has a display screen, it can also be prompted by outputting the words “please draw 8 characters with the speaker to build a stereo speaker system” through the display screen, and the specific prompt method is not limited in this application.
- Step 701 when the user moves the first speaker, the first speaker will obtain the first motion state of the first speaker. Similarly, when the user moves the second speaker, the second speaker will also obtain the first motion state of the second speaker.
- timing for the user to move the first speaker and the second speaker is not limited, that is, the first speaker can be moved first and then the second speaker, or the second speaker can be moved first and then the first speaker, or both can be moved at the same time.
- the second speaker and the first speaker may be the same user or different users.
- step 702 the first speaker determines whether the first motion state of the first speaker matches the feature of the preset action. Similarly, the second speaker determines whether the first motion state of the second speaker matches the feature of the preset action.
- the preset action of the first speaker and the preset action of the second speaker may be the same or different.
- the preset action of the first speaker and the second speaker may both be to draw a character 8, or the default action of the first speaker is to draw a character 8, and the default action of the second speaker is to draw a five-pointed star.
- the first sound box is taken as an example to describe whether the first motion state matches the feature of the preset action.
- the characteristics of the defined preset actions are represented by the first sequence template.
- the first sequence template includes first sequence sub-templates in X, Y, and Z directions respectively.
- FIG. 8 a schematic diagram of the first sequence of sub-templates in the X direction is given.
- the first sequence of sub-templates in the X direction shows the trend graph of acceleration in the X direction changing with time (by default, the duration of each acceleration is the sampling time interval).
- the acceleration sensor outputs acceleration values
- a first acceleration sequence is obtained, and the first acceleration sequence includes first acceleration subsequences in the three directions of X, Y, and Z respectively.
- FIG. 8 a schematic diagram of the first acceleration subsequence in the X direction is given.
- FIG. 8 shows the trend of the acceleration in the X direction over time (by default, the duration of each acceleration is the sampling time interval).
- the acceleration value When the acceleration value is collected, start to use the first sequence sub-template in the X direction and the first acceleration sub-sequence collected in the X direction to carry out template matching, as shown in Figure 8, it can be known that three first-sequence sub-templates are matched (as shown in Figure 8 A1, A2 and A3 in 8).
- the Y direction can use the first sub-template sequence in the Y direction and the first acceleration sub-sequence in the Y direction to perform template matching
- the Z direction can use the first sequence sub-template in the Z direction and the first acceleration sub-sequence in the Z direction. match, and will not be repeated here.
- the first acceleration sub-sequence in the Y direction matches the feature of the preset action in the Y direction.
- the number of times the Z direction is matched to the first sequence sub-template in the Z direction is greater than a specified number of times, it is determined that the first acceleration sub-sequence in the Z direction matches the feature of the preset action in the Z direction.
- the acceleration sensor may not collect acceleration for a period of time. Then, in order to accurately determine whether it matches the feature of the preset action during implementation, a duration threshold t' can be set. Taking the X direction as an example, for example, when the number of times the X direction matches the first sequence sub-template in the X direction within the duration threshold t' range is greater than the specified number of times, determine the first acceleration sub-sequence in the X direction and the preset action in the X direction feature matches, otherwise it does not match.
- the first sequence of sub-templates in the X direction is first matched within the time period A1, and the number of matching times is 1, and then no sub-templates in the X direction are collected or matched within the time t (t>t').
- the first sequence of sub-templates previously matched to the X direction are invalidated, and the matching times are re-recorded as 0. Then start counting the number of matching times again until the number of matching times is greater than the specified number of times, and determine the feature matching of the preset action in the X direction.
- the time difference threshold of the first sequence sub-templates matched twice in the X direction can also be defined. If the time difference of the first sequence sub-templates matched twice in the X direction is less than the time difference threshold, the matching to The number of sub-templates of the first sequence in the X direction is counted cumulatively, otherwise, counting is performed again.
- the Y direction and the Z direction are also processed in the same way, which will not be repeated here.
- the specific matching method is not limited in this application, and the matching method of the time-domain signal can be referred to, or the difference value can be matched point by point, and if the difference value is within the preset difference value range, it is determined to be a good match.
- the above describes the implementation of how to determine whether the first motion matches the characteristics of the preset action when the first acceleration sequence is used to express the first motion state of the speaker.
- the following describes the frequency domain signal, that is, the second acceleration sequence to express the first motion state of the speaker. 1. The matching method in the state of motion.
- the second acceleration sequence in the embodiment of the present application has a corresponding second sequence template, which can be implemented as if the number of times the second sequence template is matched in the second acceleration sequence is greater than or equal to the second specified number of times, It is then determined that the first motion state matches the features of the preset action; if the number of times the second sequence template is matched in the second acceleration sequence is less than the second specified number of times, it is determined that the first motion state does not match the feature of the preset action.
- the second sequence template may include the second sequence sub-templates in X, Y, and Z directions, taking the X direction as an example, the corresponding second sequence sub-template, the frequency domain information in the X direction and the second sequence in the X direction Sub-template matching, if the times of matching the second sequence of sub-templates is greater than the second specified number of times, it is determined that the X-direction matches, otherwise the frequency domain information in the X-direction does not match the second sequence of sub-templates in the X-direction.
- the acceleration sensor does not collect data within a certain period of time (the period of time is greater than the duration threshold t'), or does not match the second sequence template, the matching fails this time, and the motion state of the speaker and the characteristics of the preset action Mismatch.
- the time difference between two adjacent matches to the second sequence template is less than the time difference threshold, the number of times the second sequence template is matched will be counted cumulatively; otherwise, if the time difference between two adjacent matches to the second sequence template is not If it is less than the time difference threshold, the matching fails this time and the counting is repeated.
- the operation of forming a stereo sound box system may be triggered.
- whether the moving distance of the sound box is long enough can also be used as another trigger condition, which can be implemented as step 703 shown in FIG. 7 .
- Step 703 Determine the moving distance of the first sound box in the first motion state of the first sound box, and determine the moving distance of the second sound box in the first motion state of the second sound box.
- step 702 and step 703 are not limited.
- Step 704 if the first motion state of the first speaker matches the feature of the preset action, and the moving distance of the first speaker is greater than the specified distance, trigger the operation of building a stereo speaker system, that is, the first speaker searches for surrounding speakers, and simultaneously If the first motion state of the second speaker matches the feature of the preset action, and the moving distance of the second speaker is greater than the specified distance, the operation of building a stereo speaker system is triggered, that is, the second speaker searches for surrounding speakers.
- the specific search method may be that both the first speaker and the second speaker scan the information of the peer device, such as system version, signal strength and other information.
- the first speaker and the second speaker each broadcast their own device information, then the first speaker can search for the device information of the second speaker, and the second speaker can search for the device information of the first speaker.
- the conditions for building a stereo speaker system can be added in the embodiment of the present application.
- step 705 after the first speaker searches for the second speaker, the positional relationship between the first speaker and the second speaker can be determined. Similarly, after the second speaker searches for the first speaker, the relationship between the second speaker and the first speaker can be determined. speaker position.
- step 706 if the positional relationship between the first sound box and the second sound box is a specified positional relationship, then the first sound box and the second sound box form a stereo sound box system.
- the first speaker and the second speaker can also prompt the user which speakers can construct a stereo speaker system.
- the first speaker and the second speaker can be lit, or controlled to generate a certain lighting effect to remind the user that the first speaker and the second speaker can form a stereo speaker system.
- the sound effect prompt method can also be used in this application.
- a display prompt can also be used, of course, in other embodiments, a combination of prompts can also be used, such as light effect + sound effect prompt, sound effect + display screen prompt, etc. are applicable In the embodiment of this application.
- the positional relationship can be expressed in terms of distance.
- the first speaker can broadcast its own device information so that surrounding speakers can perceive itself, and similarly, the second speaker broadcasts its own device information so that it can be perceived by surrounding speakers.
- the distance to the second speaker is determined.
- the distance between the second speaker and the first speaker can be determined. When the distance between the two speakers is less than the distance threshold D, it is determined that the two speakers meet the team formation condition. Then the two speakers began to form a stereo speaker system.
- Setting up the stereo speaker system includes the following aspects: the one is to filter out the master speaker and the slave speaker (can be implemented as step 707), and the 2nd is to determine the role of each speaker, that is, determine the channel configuration of each speaker (can be implemented as Step 708).
- step 707 based on the main speaker screening rule, the main speaker is selected.
- the main speaker screening rules include at least one of the following rules.
- the order of the following rules is in order of priority (of course, it can also be prioritized according to actual needs.
- the level is also applicable to the embodiment of this application):
- the first priority is to select the speaker that has been connected to the Internet as the main speaker
- the second priority is to select the speaker that has been configured but not connected to the network as the main speaker
- the third priority is to select the speaker connected to the smart terminal device as the main speaker
- the fourth priority is to select the speaker that matches the feature of the preset action the earliest as the main speaker;
- the fifth priority is to select the speaker with the largest MAC address as the main speaker.
- the speaker that has been connected to the network is first selected as the main speaker, so that the required speaker playback resources can be obtained through the network. If each speaker is not connected to the Internet, select the speaker that has already been connected to the network as the main speaker. It can communicate with the network to obtain playback resources.
- the speaker that matches the feature of the preset action the earliest may be selected as the main speaker. For example, when a stereo speaker system is built by shaking, the speaker that was shaken the earliest serves as the main speaker.
- the speaker that first matches the feature of the preset action may be selected as the main speaker.
- the screening rule is to prioritize the selection of speakers that can communicate with the network as the main speaker. If there is no such speaker, it is enough to ensure that each speaker can be uniformly selected as a main speaker. This application does not limit this .
- the user may be prompted which speaker is the main speaker through sound effects or light effects.
- the search can be repeated several times. If the search is not found for many times, the operation of building a stereo speaker system can be ended, and the user can also be prompted to fail to build and the reason for the failure , for example, the reason is that no surrounding speakers have been found.
- Step 708 each speaker can determine its own sound channel based on its own position information, and prompt the user.
- each speaker can obtain its relative positional relationship in the stereo speaker system, and the illustration of the relative positional relationship can include several situations as shown in Figure 10-13 for illustration.
- the first sound box when the first sound box is on the left side and the second sound box is on the right side, the first sound box is used as the left sound channel, and the second sound box is used as the right sound channel. Then the first speaker prompts that the first speaker is the left channel and the second speaker is the right channel through audio and/or light effects.
- the first speaker determines itself as the left channel
- the second speaker determines itself as the right channel
- the third speaker determines itself as the subwoofer.
- the first speaker determines itself as the left channel
- the second speaker as the right channel
- the third speaker as the subwoofer
- the fourth speaker as the center
- the fifth speaker as the The left rear channel and the sixth speaker are the right rear channel.
- the first speaker determines itself as the left channel
- the second speaker as the right channel
- the third speaker as the subwoofer
- the fourth speaker as the center
- the fifth speaker is the center.
- the sixth speaker is the right rear channel
- the seventh speaker is the center left channel
- the eighth speaker is the center right channel
- the ninth to twelfth speakers are all surround as shown in Figure 13 Voice.
- each speaker may not determine its own channel, but prompts the user to configure the channel of each speaker.
- a prompt may be given to the user so as to confirm the sound channel configuration or modify the sound channel configuration of each sound box.
- the user can configure sound channels for each speaker by moving the speaker.
- the first speaker acquires the second motion state of the first speaker, and then configures the channel corresponding to the second motion state as its own channel.
- the second sound box is in its own second motion state, and then the channel corresponding to the second motion state is configured as its own channel.
- the second motion states of the first sound box and the second sound box are different. Different second motion states may correspond to different channel configurations. For example, during implementation, the second motion state may be described by at least one of the following parameters:
- the number of shakes For example, shake once as the left channel, shake twice as the right channel, and so on. The number of shakes is different and the channel configuration is different.
- the speed range can be set, the first speed range corresponds to the left channel, the second speed range corresponds to the right channel, and so on, different speed ranges have different channel configurations.
- Acceleration of being shaken similarly, in addition to determining the channel configuration by velocity, it can also be determined by acceleration. For example, slight shaking corresponds to the first acceleration range, and its sound channel is the left sound channel; violent shaking is represented by the second acceleration range, which corresponds to the right sound channel, and so on.
- multiple acceleration ranges can be stored in the speaker, and the user's shaking acceleration is compared with the acceleration range to check which acceleration range the shaking acceleration belongs to, and then determine which channel the speaker belongs to.
- the direction of being shaken For example, moving the speaker to the left is the left channel, moving the speaker to the right is the right channel, moving the speaker forward is the subwoofer, and so on, different moving directions correspond to different channel configurations .
- the first moving distance range is the left channel
- the second moving distance range is the right channel.
- the number 1 is the left channel
- the number 2 is the right channel
- the circle is the surround sound.
- the above parameters can be used not only alone, but also in combination.
- a slight shake once is the left channel
- a slight shake twice is the right channel.
- Shake 1 time for the left rear channel violently shake 2 times for the right rear channel. Any manner that can be combined is also applicable to this embodiment of the application.
- the channel configuration of the speaker can also be determined by judging whether the user shakes the speaker or touches the speaker. During implementation, it can be judged whether to shake or touch the speaker according to the change frequency or magnitude of the acceleration and further combined with the moving distance.
- One possible implementation is to consider the acceleration greater than g/2 and the distance between the two devices when triggered to be less than the distance threshold (eg 5cm) as a touch, otherwise as a shake.
- the user can also customize the second motion state corresponding to different channel configurations.
- each speaker when each speaker configures its own sound channel, it may simultaneously prompt the user which sound channel it is configured for, so as to facilitate the user's confirmation.
- step 710 the user is prompted to build a stereo speaker system, and the channel configuration of each speaker is prompted so that the user can understand the building situation.
- the third speaker is added.
- the first speaker is the main speaker, as shown in Figure 14:
- Step 1401 the user shakes the third speaker, and the third speaker obtains its first motion state.
- the third speaker determines that the first motion state matches the feature of the preset action, and searches for nearby speakers after the moving distance in the first motion state is greater than a specified distance.
- the user can move simultaneously, or move the first speaker and the third speaker successively within a certain time range so that the first speaker and the third speaker can search for each other.
- step 1403 the first speaker is found, so the first speaker interacts with the third speaker, and the distance between the first speaker and the third speaker is confirmed.
- the first sound box may notify the third sound box that the first sound box has established a stereo sound box system, and thus step 1404 may be implemented.
- Step 1404 if the distance between the first speaker and the third speaker is less than the distance threshold, the third speaker prompts the user to configure its own sound channel. That is, the third speaker can prompt the user to configure a sound channel for itself when it knows that it has joined the existing stereo speaker system.
- Step 1405 after the sound channel is configured, the third speaker prompts the user with the configuration result of the sound channel, that is, what kind of sound channel it is.
- the first speaker can determine the relative positional relationship of each speaker, automatically determine the channel configuration of the third speaker based on the positional relationship and notify the third speaker, and then the third speaker prompts the user whether to confirm the Channel configuration. If the user agrees to the channel configuration, the third speaker will play content based on the channel configuration; otherwise, if the user disagrees, the user can customize the channel configuration of the third speaker, and the channel configuration of the third speaker is completed.
- the channel configuration of each speaker may change.
- the user is also supported to re-customize the channel configuration of each speaker, or the first speaker may The positional relationship re-determines the channel configuration of each speaker.
- the main speaker can be re-screened or the main speaker can be kept unchanged.
- a possible implementation is to filter out the speakers according to the priority mentioned above. If the priority of the first speaker is higher, the first speaker can still be used as the main speaker. If the priority of the first speaker is lower, consider whether to Replace the third speaker as the main speaker. For example, if the first speaker is already connected to the network, the operation of screening the main speaker may not be performed when the third speaker is added. If the first speaker cannot be connected to the network, after the third speaker is added, the main speaker can be re-screened.
- the third speaker is to request to join the established stereo speaker system (that is, the speaker system formed by the first speaker and the second speaker), during implementation, for an established speaker system It may be required to allow other speakers (such as the third speaker) to join within the first duration threshold when the formation is completed, or the first duration threshold may not be set.
- the first duration threshold is not set, new speakers can be added to the established speaker system at any time.
- the first duration threshold can be set by a user.
- the first sound box, the second sound box, and the third sound box are moved by the user almost simultaneously (for example, within the second duration threshold range) to form a stereo sound box system.
- it can be implemented as:
- Step 1501 the first speaker, the second speaker and the third speaker are moved by the user, and each speaker acquires its first motion state.
- each speaker determines that its first motion state matches the feature of its preset action, and after determining that its moving distance is greater than a specified distance, it searches for nearby speakers.
- step 1503 the speakers interact with each other to confirm the distance between the speakers.
- Step 1504 if the distance between the speakers is less than the distance threshold, each speaker prompts the user to configure its own channel and selects the main speaker.
- Step 1505 after the user configures the sound channel, each speaker prompts the user to configure the sound channel for itself, and the configuration is completed.
- the main speaker can respond to the user's voice and obtain the audio resources indicated by the user through the network. After obtaining the audio resource, play it to each slave speaker respectively.
- the mobile phone can obtain audio resources in the network and then the master speaker can distribute them to each slave speaker for playback.
- the embodiment of the present application also provides a first speaker, as shown in Figure 18, the speaker includes:
- An acquisition module 1801 configured to acquire a first motion state of the first sound box
- a search module 1802 configured to search for a second sound box if the first movement state matches the characteristics of a preset action
- the system building module 1803 is configured to form a stereo speaker system with the second speaker if the second speaker is found.
- the speaker further includes:
- the prompting module is configured to prompt the user with the preset action to be executed for building a stereo speaker system before acquiring the first motion state of the first speaker.
- the acquiring module is specifically used for:
- a first acceleration sequence of the first sound box is generated, and first indication information arranged in time sequence is stored in the first acceleration sequence of the first sound box, and the first indication information Used to express the correspondence between acceleration and the duration of said acceleration;
- the characteristics of the preset action include a first sequence template, and the speaker also includes:
- a first matching module configured to match the first sequence template with the first acceleration sequence of the first sound box
- the acquiring module is specifically used for:
- a second acceleration sequence of the first sound box is generated, and second indication information is stored in the second acceleration sequence of the first sound box, and the second indication information is used to express acceleration and collecting the frequency domain information of the acceleration;
- the characteristics of the preset action include a second sequence template, and the speaker also includes:
- a second matching module configured to match the second sequence template with the second acceleration sequence of the first sound box
- the speaker further includes:
- a distance determination module configured to determine the moving distance of the first sound box based on the first movement state of the first sound box
- the speaker also includes:
- the distance determination module is configured to determine that the moving distance of the first sound box is greater than a specified distance before the search for the second sound box.
- the system builds modules specifically for:
- the main speaker screening rules include at least one of the following rules:
- system building modules are specifically used for:
- system building modules are specifically used for:
- the second motion state includes at least one of the following parameters:
- the number of times the first sound box is shaken the speed of being shaken, the acceleration of being shaken, the direction of being shaken, the moving distance, the number of times the first sound box is collided, and the moving track of the first sound box.
- system building blocks are also used for:
- the specified position relationship includes: a distance between the first sound box and the second sound box is smaller than a distance threshold.
- the methods provided in the embodiments of the present application are introduced from the perspective of the speakers (master speaker and/or slave speaker) as the execution subject.
- the terminal device may include a hardware structure and/or a software module, and realize the above-mentioned functions in the form of a hardware structure, a software module, or a hardware structure plus a software module. Whether one of the above-mentioned functions is executed in the form of a hardware structure, a software module, or a hardware structure plus a software module depends on the specific application and design constraints of the technical solution.
- the terms “when” or “after” may be interpreted to mean “if” or “after” or “in response to determining" or “in response to detecting ".
- the phrases “in determining” or “if detected (a stated condition or event)” may be interpreted to mean “if determining" or “in response to determining" or “on detecting (a stated condition or event)” or “in response to detecting (a stated condition or event)”.
- relational terms such as first and second are used to distinguish one entity from another, without limiting any actual relationship and order between these entities.
- all or part of them may be implemented by software, hardware, firmware or any combination thereof.
- software When implemented using software, it may be implemented in whole or in part in the form of a computer program product.
- the computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present invention will be generated in whole or in part.
- the computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices.
- the computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transmitted from a website, computer, server or data center Transmission to another website site, computer, server, or data center by wired (eg, coaxial cable, optical fiber, DSL) or wireless (eg, infrared, wireless, microwave, etc.) means.
- the computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media.
- the available medium may be a magnetic medium (such as a floppy disk, a hard disk, or a magnetic tape), an optical medium (such as a DVD), or a semiconductor medium (such as a solid state disk (solid state disk, SSD)), etc.
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Stereophonic System (AREA)
Abstract
本申请提供一种立体声音箱系统的组建方法及相关装置。该方法涉及智能终端、人机交互等领域。该方法中可支持用户移动音箱产生第一运动状态,当音箱的第一运行状态和预设的动作匹配时,可触发组建立体声音箱系统。例如用户可手持音箱绘制移动的图案,当绘制的图案为指定图案时,可和其他音箱组件立体声音箱系统。这种组建方式对于用户而言,操作简便。此外,本申请中为了避免误触发组建音箱系统设置了相应的条件,也为避免误组建音箱系统设置了相应的条件,由此提高触发组建音箱系统和组建音箱系统的准确性。进一步的,本申请在用户操作之前,还可提示用户执行何种操作以便于引导用户创建音箱系统。
Description
相关申请的交叉引用
本申请要求在2021年10月12日提交中国专利局、申请号为202111186340.4、申请名称为“立体声音箱系统的组建方法及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
本申请涉及终端技术领域,尤其涉及一种立体声音箱系统的组建方法及相关装置。
音箱已成为人们生活中普遍存在的电子产品。为了能够随时随地享受较高品质的音频,便携式音箱也成为电子发烧友酷爱的产品。
当有多个音箱时,可通过组建立体声音箱系统来创造立体声效果。例如两个音箱可分别作为左声道和右声道,可提供相对一个音箱而言更为立体的音效效果。
一种组建立体声音箱系统的常规方式是,首先采用相应的应用程序(application,APP)扫描附近的音箱,若扫描到多个同类音箱则可以组建立体声音箱系统。如图1所示以扫描到两个音箱为例,用户可以在界面中手动选择将哪个音箱设置为左声道,将哪个音箱设置为右声道,进而完成立体声音箱系统的组建。
另一种实现方式是同时按压音箱上的按键来触发组建立体声音箱系统。
然而,组建立体声音箱系统的操作方式单一,需要一种新的组建立体声音箱系统的方法。
发明内容
本申请的目的在于提供了一种立体声音箱系统的组建方法及相关装置,用于解决相关技术中组建立体声音箱系统的操作方式单一的问题。
上述目标和其他目标将通过独立权利要求中的特征来达成。进一步的实现方式在从属权利要求、说明书和附图中体现。
第一方面,提供了一种立体声音箱系统的组建方法,应用于第一音箱,所述方法包括:
获取所述第一音箱的第一运动状态;
若所述第一运动状态与预设动作的特征匹配,则搜索第二音箱;
若搜索到第二音箱,则与所述第二音箱组建立体声音箱系统。
由此,本申请中用户使音箱产生一定的运动状态,当该运动状态与预设动作的特征匹配时,即可组建立体声音箱系统,由此对于用户而言,持有音箱做相应的动作即可组建立体声音箱系统,操作方便。
在一种可能的设计中,所述获取所述第一音箱的第一运动状态之前,所述方法还包括:
将用于组建立体声音箱系统所需执行的所述预设动作提示给用户。
由此,通过动作提示,能够方便用户了解采取何种动作来组建音箱系统,帮助用户了 解和实施组建音箱系统的操作。
在一种可能的设计中,所述获取所述第一音箱的第一运动状态,包括:
基于所述第一音箱的加速度信息,生成所述第一音箱的第一加速度序列,所述第一音箱的第一加速度序列中存储有依时序排列的第一指示信息,所述第一指示信息用于表达加速度和所述加速度的持续时长之间的对应关系;
所述预设动作的特征包括第一序列模板,所述方法还包括:
将所述第一序列模板与所述第一音箱的第一加速度序列进行匹配操作;
若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数大于或等于第一指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;
若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数小于所述第一指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
由此,本申请中采用模板匹配的方式来确定采集的第一运行状态是否和预设动作的特征匹配,模板匹配的方式实施起来简便易实施,且预设动作的特征的模板采用时域信息来表达能够适用不同的动作,适用范围不受限制。
在一种可能的设计中,所述获取所述第一音箱的第一运动状态,包括:
基于所述第一音箱的加速度信息,生成所述第一音箱的第二加速度序列,所述第一音箱的第二加速度序列中存储有第二指示信息,所述第二指示信息用于表达加速度和采集到所述加速度的频域信息;
所述预设动作的特征包括第二序列模板,所述方法还包括:
将所述第二序列模板与所述第一音箱的第二加速度序列进行匹配操作;
若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数大于或等于第二指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;
若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数小于所述第二指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
由此,本申请中采用模板匹配的方式来确定采集的第一运行状态是否和预设动作的特征匹配,模板匹配的方式实施起来简便易实施,且预设动作的特征的模板采用频域信息来表达能够适用不同的动作,适用范围不受限制。
在一种可能的设计中,所述方法还包括:
基于所述第一音箱的所述第一运动状态确定所述第一音箱的移动距离;
所述搜索第二音箱之前,所述方法还包括:
确定所述第一音箱的移动距离大于指定距离。
由此,本申请中当第一音箱的移动距离足够大时,才会触发组建立体声音箱系统,能够提高触发组建立体声音箱系统的准确性,尽可能避免误触发。
在一种可能的设计中,所述与所述第二音箱组建立体声音箱系统,包括:
基于主音箱筛选规则,确定所述第一音箱是否作为所述立体声音箱系统中的主音箱;并,
为所述第一音箱配置声道。
由此,本申请中能够基于设定的规则,自主筛选出主音箱,并为各音箱自主配置声道。
在一种可能的设计中,所述主音箱筛选规则包括以下规则中的至少一种:
选择已经联网的音箱作为主音箱;
选择已经配网但未联网的音箱作为主音箱;
选择连接智能终端设备的音箱作为主音箱;
选择最早匹配到所述预设动作的特征的音箱作为主音箱;
选择媒体存取控制位址(media access control address,MAC)地址最大的音箱作为主音箱。
该筛选规则简单易实施,能够尽可能无歧义的筛选出主音箱。
在一种可能的设计中,所述为所述第一音箱配置声道,包括:
提示用户为所述第一音箱配置声道;
获取所述第一音箱的第二运动状态;
将所述第二运动状态对应的声道配置为所述第一音箱的声道。
由此,本申请中用户可以采用一定的动作配置各音箱的声道,例如摇一摇即可配置为一个声道,摇动两次即可配置为另一个声道,增加了声道配置的灵活性和操作便利性。
在一种可能的设计中,所述提示用户为所述第一音箱配置声道,包括:
采用声效、光效和屏幕显示中的至少一种方式提示用户为所述第一音箱配置声道。
由此,本申请中,可以通过生效、光效、屏显等这种易实施的方式来提示用户,以便于引导用户完成组建立体声音箱系统的操作。
在一种可能的设计中,所述第二运动状态包括以下参数中的至少一种:
所述第一音箱被摇动的次数、被摇动的速度、被摇动的加速度、被摇动的方向、移动的距离、所述第一音箱产生碰撞的次数、所述第一音箱的移动轨迹。
由此,本申请中,可支持用户采用易于操作的方式来手动控制音箱,完成声道的配置。
在一种可能的设计中,所述方法还包括:
确定与所述第二音箱的位置关系;
并若确定与所述第二音箱的位置关系为指定位置关系,则执行所述与所述第二音箱组建立体声音箱系统的操作。
由此,本申请中基于位置关系能够保证组建音箱系统的准确性,尽可能避免误组建音箱系统。
在一种可能的设计中,所述指定位置关系包括:所述第一音箱与所述第二音箱的距离小于距离阈值。
由此,本申请中采用距离较近的音箱来组建音箱系统,简单易实施且能够保证一定距离范围内的音箱组建音箱系统,保证立体声效果。
第二方面,本申请实施例还提供了一种第一音箱,所述音箱包括:
获取模块,用于获取所述第一音箱的第一运动状态;
搜索模块,用于若所述第一运动状态与预设动作的特征匹配,则搜索第二音箱;
系统组建模块,用于若搜索到第二音箱,则与所述第二音箱组建立体声音箱系统。
在一种可能的设计中,所述音箱还包括:
提示模块,用于在所述获取所述第一音箱的第一运动状态之前,将用于组建立体声音箱系统所需执行的所述预设动作提示给用户。
在一种可能的设计中,所述获取模块,具体用于:
基于所述第一音箱的加速度信息,生成所述第一音箱的第一加速度序列,所述第一音箱的第一加速度序列中存储有依时序排列的第一指示信息,所述第一指示信息用于表达加 速度和所述加速度的持续时长之间的对应关系;
所述预设动作的特征包括第一序列模板,所述音箱还包括:
第一匹配模块,用于将所述第一序列模板与所述第一音箱的第一加速度序列进行匹配操作;
若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数大于或等于第一指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;
若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数小于所述第一指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
在一种可能的设计中,所述获取模块,具体用于:
基于所述第一音箱的加速度信息,生成所述第一音箱的第二加速度序列,所述第一音箱的第二加速度序列中存储有第二指示信息,所述第二指示信息用于表达加速度和采集到所述加速度的频域信息;
所述预设动作的特征包括第二序列模板,所述音箱还包括:
第二匹配模块,用于将所述第二序列模板与所述第一音箱的第二加速度序列进行匹配操作;
若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数大于或等于第二指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;
若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数小于所述第二指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
在一种可能的设计中,所述音箱还包括:
距离确定模块,用于基于所述第一音箱的所述第一运动状态确定所述第一音箱的移动距离;
所述音箱还包括:
距离确定模块,用于在所述搜索第二音箱之前,确定所述第一音箱的移动距离大于指定距离。
在一种可能的设计中,系统组建模块,具体用于:
基于主音箱筛选规则,确定所述第一音箱是否作为所述立体声音箱系统中的主音箱;并,
为所述第一音箱配置声道。
在一种可能的设计中,所述主音箱筛选规则包括以下规则中的至少一种:
选择已经联网的音箱作为主音箱;
选择已经配网但未联网的音箱作为主音箱;
选择连接智能终端设备的音箱作为主音箱;
选择最早匹配到所述预设动作的特征的音箱作为主音箱;
选择媒体存取控制位址MAC地址最大的音箱作为主音箱。
在一种可能的设计中,所述系统组建模块,具体用于:
提示用户为所述第一音箱配置声道;
获取所述第一音箱的第二运动状态;
将所述第二运动状态对应的声道配置为所述第一音箱的声道。
在一种可能的设计中,所述系统组建模块,具体用于:
采用声效、光效和屏幕显示中的至少一种方式提示用户为所述第一音箱配置声道。
在一种可能的设计中,所述第二运动状态包括以下参数中的至少一种:
所述第一音箱被摇动的次数、被摇动的速度、被摇动的加速度、被摇动的方向、移动的距离、所述第一音箱产生碰撞的次数、所述第一音箱的移动轨迹。
在一种可能的设计中,所述系统组建模块还用于:
确定与所述第二音箱的位置关系;
并若确定与所述第二音箱的位置关系为指定位置关系,则执行所述与所述第二音箱组建立体声音箱系统的操作。
在一种可能的设计中,所述指定位置关系包括:所述第一音箱与所述第二音箱的距离小于距离阈值。
第三方面,本申请实施例还提供了一种音箱,该音箱包括:一个或多个处理器,一个或多个存储器,一个或多个扬声器,一个或多个麦克风,通信模块;其中,一个或多个麦克风,用于采集声音信号;通信模块,用于与其它音箱进行通信;一个或多个扬声器,用于发出声音信号;一个或多个处理器与所述一个或多个存储器耦合;其中,一个或多个存储器用于存储计算机可执行程序代码;其中,所述程序代码包括指令,当所述一个或多个处理器执行所述指令时,使所述音箱执行上述第一方面及其第一方面任一可能设计的技术方案。
第四方面,本申请实施例提供一种芯片,述芯片包括处理器和接口;所述接口,用于接收代码指令,并将接收的所述代码指令传输至所述处理器;所述芯片与音箱中的存储器耦合,使处理器执行本申请实施例第一方面及其第一方面任一可能设计的技术方案;本申请实施例中“耦合”是指两个部件彼此直接或间接地结合。
第五方面,本申请实施例提供的一种音箱系统,所述音箱系统包括一个或多个音箱,其中至少一个音箱是如上述第二方面、第三方面所述的音箱,该音箱可以执行上述第一方面中的第一音箱的全部或部分步骤。
第六方面,本申请实施例提供的一种音箱系统,所述音箱系统包括第一音箱和至少一个第二音箱,所述第一音箱和所述第二音箱被设置在不同的位置;所述第一音箱和所述第二音箱之间能够通信;所述第一音箱为如上述第二方面、第三方面所述的音箱(该音箱可以是上述第一方面及其第一方面任一可能设计的技术方案中的所述第一音箱)。
第七方面,本申请实施例的一种计算机可读存储介质,所述计算机可读存储介质包括计算机程序,当计算机程序在计算机上运行时,使得所述计算机执行本申请实施例第一方面及其第一方面任一可能设计的技术方案。
第八方面,本申请实施例的中一种程序产品,当所述计算机程序产品在计算机上运行时,使得所述计算机执行本申请实施例第一方面及其第一方面任一可能设计的技术方案。
另外,第二方面至第八方面中任一种可能设计方式所带来的技术效果可参见方法部分相关中不同设计方式所带来的技术效果,此处不再赘述。
图1为本申请一实施例提供的一种应用场景的示意图;
图2为本申请一实施例提供的音箱的结构的示意图之一;
图3为本申请一实施例提供的音箱的结构的示意图之二;
图4A为本申请一实施例提供的音箱的结构的示意图之三;
图4B为本申请一实施例提供的音箱确定用户方位的示意图;
图5为本申请一实施例提供的用户可手持音箱执行的操作的示意图;
图6为本申请一实施例提供的自定义预设动作的示意图;
图7为本申请一实施例提供的立体声音箱系统的组建方法的流程示意图;
图8为本申请一实施例提供的第一序列子模板的示意图;
图9为本申请一实施例提供的和预设动作的特征进行匹配的过程示意图;
图10为本申请一实施例提供的具有两个音箱时的声道配置的示意图;
图11为本申请一实施例提供的具有三个音箱时的声道配置的示意图;
图12为本申请一实施例提供的具有六个音箱时的声道配置的示意图;
图13为本申请一实施例提供的具有十二个音箱时的声道配置的示意图;
图14为本申请一实施例提供的立体声音箱系统的组建方法的另一流程示意图;
图15为本申请一实施例提供的立体声音箱系统的组建方法的又一流程示意图;
图16为本申请一实施例提供的立体声音箱系统的获取网络资源的一种应用场景示意图;
图17为本申请一实施例提供的立体声音箱系统的获取网络资源的另一种应用场景示意图;
图18为本申请一实施例提供的音箱的另一结构示意图。
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述。
本申请实施例涉及的多个,是指大于或等于两个。需要说明的是,在本申请实施例的描述中,“第一”、“第二”等词汇,仅用于区分描述的目的,而不能理解为指示或暗示相对重要性,也不能理解为指示或暗示顺序。
以下实施例中所使用的术语只是为了描述特定实施例的目的,而并非旨在作为对本申请的限制。如在本申请的说明书和所附权利要求书中所使用的那样,单数表达形式“一个”、“一种”、“所述”、“上述”、“该”和“这一”旨在也包括例如“一个或多个”这种表达形式,除非其上下文中明确地有相反指示。还应当理解,在本申请实施例中,“一个或多个”是指一个、两个或两个以上;“和/或”,描述关联对象的关联关系,表示可以存在三种关系;例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B的情况,其中A、B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。
在本说明书中描述的参考“一个实施例”或“一些实施例”等意味着在本申请的一个或多个实施例中包括结合该实施例描述的特定特征、结构或特点。由此,在本说明书中的不同之处出现的语句“在一个实施例中”、“在一些实施例中”、“在其他一些实施例中”、“在另外一些实施例中”等不是必然都参考相同的实施例,而是意味着“一个或多个但不是所有的实施例”,除非是以其他方式另外特别强调。术语“包括”、“包含”、“具有”及它们的变形都意味着“包括但不限于”,除非是以其他方式另外特别强调。
图1示出了本申请一实施例提供的一种应用场景的示意图。如图1所示,为活动聚会时,多个人采用各自的音箱来组建立体声音箱系统的示意图。如图1所示,假设具有两个 音箱(包括音箱1和音箱2)。用户A持有音箱1、用户B持有音箱2,用户A和用户B可同时摇动各自的音箱,即可实现音箱1和音箱2来组建立体声音箱系统。由此,摇一摇即可组建立体声音箱系统,增加了组建的趣味性和便利性。
图2示出了本申请一实施例提供的音箱的功能框图。在一些实施例中,音箱100可以包括一个或多个输入设备(input device)101,一个或多个输出设备(output device)102和一个或多个处理器(processor)103。其中,输入设备101可以检测各种类型的输入信号(可以简称:输入),输出设备102可以提供各种类型的输出信息(可以简称:输出)。处理器103可以从一个或多个输入设备101处接收输入信号,响应于该输入信号,产生输出信息,通过一个或多个输出设备102输出。
在一些实施例中,一个或多个输入设备101可以检测各种类型的输入,并提供与检测到的输入相对应的信号(比如,输入信号),然后一个或多个输入设备101可以将输入信号提供给一个或多个处理器103。在一些示例中,一个或多个输入设备101可以是包括任何能够检测输入信号的部件或组件。比如,输入设备101可以包括音频传感器(比如,一个或多个麦克风),加速度传感器、距离传感器、光学或视觉传感器(比如,摄像头,可见光传感器或不可见光传感器),接近光传感器,触摸传感器,压力传感器,机械设备(比如,表冠,开关,按钮或按键等),温度传感器,通信设备(比如,有线或无线通信装置)等,或者,输入设备101也可以是上述各种部件的一些组合。在本申请实施例中,可采用加速度传感器的数据来确定用户是否摇动或碰一碰音箱,以便于组建立体声音箱。
在一些实施例中,一个或多个输出设备102可以提供各种类型的输出。比如,一个或多个输出设备102可以接收一个或多个信号(比如,由一个或多个处理器103提供的输出信号),并提供与该信号对应的输出。在一些示例中,输出设备102可以包括用于提供输出的任何合适的部件或组件。比如,输出设备102可以包括音频输出设备(比如,一个或多个扬声器),视觉输出设备(比如,一个或多个灯或显示器),触觉输出设备,通信设备(比如,有线或无线通信设备)等等,或者,输出设备102还可以是上述各种部件的一些组合。
在一些实施例中,一个或多个处理器103可以耦合到输入设备101和输出设备102。处理器103可以与输入设备101和输出设备102之间通信。比如,一个或多个处理器103可以从输入设备101接收输入信号(比如,与输入设备101检测到的输入相对应的输入信号)。一个或多个处理器103可以解析接收到的输入信号以确定是否响应于该输入信号提供一个或多个对应的输出。若是,一个或多个处理器103可以向输出设备102发送输出信号,以提供输出。
图3示出了本申请另一实施例提供的音箱300的功能框图。音箱300可以为图2所描述的音箱100的一种示例。如图3所示,音箱300包括麦克风301、扬声器302、处理器303、存储器304、通信模块305、传感器模块306、灯307。可以理解的是,图3所示的部件并不构成对音箱300的具体限定,音箱300还可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。
处理器303可以包括一个或多个处理单元,例如:处理器303可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。其中,控制器可以是音箱300的神经中枢和指挥中心。控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。在另一些实施例中,处理器303中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器303中的存储器为高速缓冲存储器。该存储器可以保存处理器303刚用过或循环使用的指令或数据。如果处理器303需要再次使用该指令或数据,可从所述存储器中直接调用,避免了重复存取,减少了处理器303的等待时间,因而提高了系统的效率。处理器303可以运行本申请一些实施例提供的立体声音箱系统的组建方法的软件代码/模块,实现控制音箱的功能。
麦克风301,也称“话筒”,“传声器”,用于采集声音信号(比如采集用户发出的声音),将声音信号转换为电信号。在一些实施例中,音箱300上可以设置一个或多个麦克风301,比如麦克风阵列。在另一些实施例中,麦克风301除了采集声音信号,还可以实现对声音信号降噪功能,或者还可以识别声音信号的来源、实现定向录音功能等。
扬声器302,也称“喇叭”,用于将音频电信号转换为声音信号。音箱300可以通过扬声器302播放音乐等声音信号。
在一些实施例中,麦克风301和扬声器302与处理器303耦合。比如,麦克风301接收到声音信号后,将声音信号或者由声音信号转换而成的音频电信号发送给处理器303。处理器303判断是否响应该声音信号或者音频电信号,若是,则输出相应的输出信号,比如通过扬声器302播放音乐。
存储器304,可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。处理器303通过运行存储在存储器的指令,从而执行音箱300的各种功能应用以及数据处理。存储器可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,通用闪存存储器(universal flash storage,UFS)等,本申请实施例不作限定。在一些实施例中,存储器304中可以存储“唤醒词”等信息。在另一些实施例中,存储器304中还可以存储音频信息(比如,歌曲、相声、评书等)。
通信模块305,可以是无线通信模块(比如蓝牙、无线)。音箱300通过通信模块305与其他设备,比如另一个音箱,手机,电视机等连接。
传感器模块306可以包括气压传感器306A、温度传感器306B、加速度传感器306C等。应理解,图3仅是列举了几种传感器的示例,在实际应用中,音箱300还可以包括更多或很少的传感器,或者使用其他具有相同或类似功能的传感器替换上述列举的传感器等等,本申请实施例不作限定。
气压传感器306A,用于测量气压。在一些实施例中,处理器303可以与气压传感器306A耦合,通过气压传感器306A测得的气压值辅助计算,比如计算声音的衰减系数等。
温度传感器306B,用于检测温度。在一些实施例中,处理器303可以与温度传感器306B耦合,通过温度传感器306B测得的温度值辅助计算,比如计算声音的衰减系数等。
加速度传感器306C,用于在用户摇动音箱时,采集音箱的加速度信息,以便于根据该加速度信息来组建立体声音箱系统。
在一些实施例中,音箱300可以包含显示器(或显示屏),也可以不包含显示器。显示器,可以用于显示应用的显示界面,比如当前播放的歌曲等。显示器包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diodes,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。在一些实施例中,显示器中可以设置触摸传感器,形成触摸屏,本申请实施例不作限定。触摸传感器用于检测作用于其上或附近的触摸操作。触摸传感器可以将检测到的触摸操作传递给处理器303,以确定触摸事件类型。处理器303可以通过显示器提供与触摸操作相关的视觉输出。
在一些实施例中,图3还可以包含更多的器件,比如电池、USB接口等等,本申请实施例不多赘述。
图4A示出了本申请一实施例提供的音箱的结构示意图。音箱400可以为图2或图3所描述的音箱的一种示例。如图4A所示,音箱400可以包括底座401和壳体402。
在一些实施例中,底座401可以起到支撑的作用。比如,底座401可以支撑壳体402以及壳体402内所包围的部件(比如、处理器、麦克风、扬声器等)。在一些示例中,底座401可以由金属,塑料,陶瓷等其它的任何能够起到支撑作用的材料,或者这些材料的组合而构成。
在一些实施例中,底座401上可以支撑一个或多个扬声器406。比如,底座401可以支撑一个固定件404,该固定件404上可以设置一个或多个扬声器406。在一些示例中,底座401可以通过支撑柱405或其他方式支撑固定件404。固定件404可以是任何形状,比如圆形,方形等。在一些实施例中,一个或多个扬声器406可以在固定件404上按照一定的排布方式。比如,一个或多个扬声器406可以均匀分布在固定件404上的边缘,比如每个扬声器之间的距离间隔相同。在一些实施例中,一个或多个扬声器406可以与处理器403耦合。处理器403可以通过一个或多个扬声器406输出音频信号。
在一些实施例中,壳体402可以是圆柱体、立方体、正方体等任何立体形状。壳体402可以包裹处理器403、固定件404、一个或多个扬声器406等部件。壳体402可以是单个壳体构件,或多于两个的壳体构件组成。比如,壳体402可以包括上壳体402a和侧壳体402b。一个或多个壳体构件可以是金属,塑料,陶瓷,晶体、或者这些材料的组合,或其它的任何适合设置在音箱上的壳体构件等等。在一些实施例中,侧壳体402b可以是具有网眼结构的壳体,比如,网眼可以是圆孔,方孔,六角孔等形状。网眼结构的壳体可以起到装饰、防尘、保护壳体内部的器件(比如扬声器、麦克风等)等作用,且网眼结构的壳体可以减少对扬声器输出的声音的阻挡。
在一些实施例中,上壳体402a可以是网眼结构,或者不是网眼结构的壳体。上壳体402a可以设置输入设备,比如开关,按钮或按键等。比如,开关用于开启或关闭音箱。按钮或按键可以用于调节音量等功能。在另一些实施例中,上壳体402a上可以设置显示屏409(比如触摸显示屏),可以用于接收输入、提供视觉输出等。比如,显示屏409上可以显示当前播放的歌曲的名称、歌手的名字等。当然,音箱上也可以不设置显示屏,本申请实施例不作限定。
在一些实施例中,上壳体402a可以与固定件407连接。固定件407上可以设置一个或多个麦克风408。固定件407可以是任何形状,比如圆形,方形等。在一些实施例中,一个或多个麦克风408可以在固定件407上按照一定的排布方式。比如,一个或多个麦克风408可以均匀分布在固定件407上的边缘,比如每个麦克风之间的距离间隔相同。再比如, 每相邻两个麦克风对应的中心角a(比如,两个麦克风分别与固定件407的中心点连接的直线所形成的夹角)可以是固定的,比如,30、60度等。
在一些实施例中,一个或多个麦克风408可以与处理器403耦合。处理器403可以通过一个或多个麦克风408获得输入信号(比如用户发出的声音信号)。
本申请以下的实施例中,以图1的应用场景为例,且以图1中的音箱1和/或音箱2是上述图4A所示的音箱400为例。为了方便描述,下文将音箱1和音箱2中的一个称为主音箱,另一个称为从音箱。在一些实施例中,主音箱和从音箱可以是配套使用的。比如,主音箱用于播放左声道,从音箱用于播放右声道,或者主音箱用于播放右声道,从音箱用于播放左声道。也就是说,主音箱和从音箱的配合可以实现音频的立体声音效果。在一些实施例中,一个音箱是主音箱还是从音箱,可以是该音箱出厂之前设置好,也可以是用户自定义的(比如,音箱通过触摸显示屏接入输入操作,该输入操作用于选择该音箱是主音箱还是从音箱),在本申请实施例中还可以是根据一定规则筛选出来的。
在一些实施例中,主音箱和从音箱的结构可以相同,比如,主音箱和从音箱都是图4A所示的结构。在另一些实施例中,主音箱和从音箱的结构也可以不完全相同,比如,主音箱可以设置有显示屏,而从音箱不设置显示屏等。在其他实施例中,主音箱和从音箱中部分部件的功能可以不完全相同。比如,主音箱中的处理器可以用于计算时延差(比如,第一时长和第二时长之间的时间差,第一时长可以是声音从主音箱到用户所需的时长,第二时长可以是声音从音箱到用户所需的时长),响度增益等,而从音箱中的处理器不具有该功能。
在一些实施例中,主音箱和/或从音箱中的存储器中可以存储音频文件(比如,歌曲、相声、评书等),主音箱和从音箱可以播放存储的音频文件。比如,主音箱可以接收输入(比如,通过触摸显示屏接收输入操作,或者通过麦克风接收语言输入),该输入可以用于启动主音箱和/或从音箱,或用于控制主音箱和从音箱播放、切换歌曲等。在一些实施例中,主音箱中的一个或多个麦克风采集到声音信号(比如,用户发出的声音信号),处理器识别出该声音信号中包含“唤醒词+播放歌曲”,处理器确定存储器中不存在该歌曲时,可以从网络侧下载该歌曲,或者输出提示信息(比如语言信息)提示用户不存在该歌曲。
在另一些实施例中,主音箱和/或从音箱可以与其它电子设备(比如手机、电视机)连接,可以通过有线或无线的方式连接。以主音箱与手机连接(比如,蓝牙连接)为例。手机可以将音频信号发送给主音箱,使得主音箱和从音箱播放该音频信号(比如,主音箱接收到音频信号之后,可以将音频信号发送给从音箱)。比如,手机正在运行音乐播放应用(比如,酷狗音乐),且正在播放歌曲“一路向北”,手机可以将该歌曲的音频信号发送给主音箱,使得主音箱和从音箱播放该音频信号。在另一些实施例中,主音箱和手机连接之后,用户可以通过主音箱控制手机执行相应的操作。继续以前面的例子为例,用户在房间内发出“小白播放歌曲听妈妈的话”的声音信号,主音箱采集到该声音信号,可以暂停播放一路向北,而是输出提示信息“正在为您寻找听妈妈的话”。比如,主音箱可以从本地存储器中寻找是否存在歌曲听妈妈的话,若不存在,主音箱可以从网络侧下载,或者主音箱可以向手机发送指令,该指令用于指示手机播放听妈妈的话,手机接收到该指令后,下载或者在线播放该歌曲,将该歌曲的音频信号发送给主音箱,使得主音箱和从音箱播放该歌曲(即听妈妈的话)的音频信号。
在一些实施例中,主音箱和从音箱均可以启动自动识别“唤醒词”的功能。以主音箱为 例,主音箱启动自动识别“唤醒词”的功能之后,主音箱中的全部或部分部件(比如,一个或多个麦克风、处理器等)处于使能状态。用户在房间内发出的声音信号被主音箱中的一个或多个麦克风接收。一个或多个麦克风将接收到的声音信号发送给处理器,处理器判断声音信号中包含“唤醒词”时,启动其它部件(比如,一个或多个扬声器)。在一些实施例中“唤醒词”可以是音箱出厂时默认设置好的,也可以是用户自定义的,比如“唤醒词”可以是“小白”、“小音”、“小艺”等。
在另一些实施例中,主音箱和从音箱均可以启动自动识别“唤醒词+播放歌曲”的功能。以主音箱为例,主音箱启动自动设备“唤醒词+播放歌曲”的功能之后,主音箱中的全部或部分部件(比如,一个或多个麦克风、处理器等)处于使能状态。用户在房间内发出的声音信号被主音箱中的一个或多个麦克风接收。一个或多个麦克风将接收到的声音信号发送给处理器,当处理器判断声音信号中包含“唤醒词+播放歌曲”时,启动其它部件(比如,一个或多个扬声器)。举例来说,用户在房间内发出“小白播放一路向北”。主音箱中的麦克风采集的该声音信号,然后发送给处理器,处理器识别出声音信号中包括唤醒词:小白,还包括:播放歌曲,处理器启动其它部件(比如,一个或多个扬声器)。
在一些实施例中,主音箱可以通过输入设备(比如主音箱上的触摸屏)接收输入操作或者通过与主音箱连接的其它设备比如手机接收输入操作,响应于该输入操作,启动自动识别“唤醒词”或“唤醒词+播放歌曲”的功能时,主音箱可以向从音箱发送一指令,该指令用于指示从音箱启动自动识别“唤醒词”或“唤醒词+播放歌曲”的功能。
在一些实施例中,用户可能处于房间内的任何位置,主音箱和从音箱与用户之间的距离可能不同。主音箱和从音箱启动自动识别“唤醒词”或“唤醒词+播放歌曲”的功能。主音箱和从音箱中的麦克风采集到声音信号。当主音箱和从音箱确定该声音信号中包含“唤醒词”或“唤醒词+播放歌曲”时,可以判断用户的位置,然后根据用户的位置控制主音箱和从音箱的声音参数。比如,声音参数可以包括主音箱和从音箱的时延差、响度增益等。因此,在该实施例中,当主音箱和从音箱识别出采集到的声音信号中包含“唤醒词”或“唤醒词+播放歌曲”时,才根据用户的位置调整主音箱和从音箱的声音参数。
以图4A所示的结构为例,主音箱和从音箱判断用户的位置的过程可以包括:主音箱采集到声音信号1。从音箱采集到声音信号2。主音箱确定声音信号1中包括“唤醒词”,从音箱确定声音信号2中包括“唤醒词”。当然,为了提升准确性,从音箱还可以将声音信号2或声音信号2中所包括的“唤醒词”发送给主音箱,由主音箱确定声音信号1和声音信号2中的“唤醒词”是同一个唤醒词。主音箱可以根据声音信号1确定用户相对于主音箱的第一方向/方位,例如,该第一方向/方位可以表示为用户在主音箱所构建的坐标系中与x轴之间的第一角度。从音箱可以根据声音信号2确定用户相对于从音箱的第二方向/方位,例如,该第二方向/方位可以表示为用户在从音箱所构建的坐标系中的与x轴之间的第二角度。从音箱可以将第二角度发送给主音箱,主音箱根据第一角度和第二角度,以及主音箱和从音箱之间的距离D确定用户的位置。具体的,主音箱和从音箱构建坐标系,以及主音箱和从音箱确定用户位置的过程将在后文详细介绍。
继续以图4A所示的结构为例,主音箱根据声音信号1确定用户相对于主音箱的第一方向/方位的方式可以有多种。例如,麦克风阵列定位技术(比如,根据主音箱上的麦克风阵列中的至少两个麦克风接收的声音信号的时间差来估计声源的方位)、波束指向(steered-beamformer)定位方法,基于高分辩率谱分析(high-resolution spectral analysis) 定位方法,和基于声音时间差(time-delay estimation,TDE)声源定位技术等等,本申请实施例不作限定。以麦克风阵列定位技术为例,主音箱根据声音信号1确定用户相对于主音箱的第一方向/方位的过程可以包括;主音箱中的麦克风阵列408采集到声音信号,假设麦克风408-1和麦克风408-2采集到声音信号的强度较大,主音箱可以根据麦克风408-1采集到声音信号的第一时刻t1,和麦克风408-2采集到的声音信号的第二时刻t2,以及麦克风408-1和麦克风408-2之间的距离L1(该距离可以出厂之后存储在主音箱中),计算声源即用户相对于主音箱的第一方位。参见图4B所示,主音箱可以根据(t1-t2)*c和L1,以及三角函数关系,确定出用户相对于麦克风408-1的夹角A,该夹角A可以作为用户相对于主音箱的第一方位,或者,由于该夹角A是用户相对于麦克风408-1的夹角,主音箱可以将夹角A进行坐标转换进而转换到主音箱构建的坐标系中,得到夹角B,该夹角B也可以作为用户相对于主音箱的第一方位。从音箱与主音箱的结构可以相同,所以从音箱确定用户相对于从音箱的第二方位的过程,可以与上述过程类似。
在另一些实施例中,用户不断发出声音信号的过程中,主音箱和从音箱可以实时的、不断的采集声音信号(该声音信号可能不包含“唤醒词”或“唤醒词+播放歌曲”),然后判断用户的位置,根据用户的位置调整主音箱和从音箱的声音参数,直到检测到包含“唤醒词”或“唤醒词+播放歌曲”的声音信号时,以调整后的声音参数(比如,主音箱和从音箱的时延差、响度增益等)控制主音箱和从音箱播放音频信号。
在一些实施例中,主音箱可以检测与从音箱之间的距离D,以便于确定是否组建立体声音箱系统,该距离可以是主音箱和从音箱之间的直线距离。主音箱检测到距离D之后,可以将该距离D发送给从音箱,从音箱无需检测距离D;或者,从音箱也自己可以检测与主音箱之间的距离D,以备使用。当然,从音箱可以检测与主音箱之间的距离D,然后发送给主音箱,即主音箱无需检测距离D等等。
以主音箱为例,作为一种示例,主音箱可以通过距离传感器检测与从音箱之间的距离。距离传感器可以是激光距离传感器、红外距离传感器等。例如,主音箱上的距离传感器发出特定频率的红外光,被从音箱反射,主音箱接收到从音箱发射的光。主音箱可以根据发射红外光的第一时间和接收到反射光的第二时间计算主音箱和从音箱之间的距离。作为另一种示例,主音箱还可以通过与从音箱通信实现测量主音箱和从音箱之间的距离的目的。例如,主音箱向从音箱发射一探测信号,从音箱接收到该探测信号后向主音箱发送反馈信号,主音箱接收到反馈信号。主音箱可以根据接收反馈信号的第二时间,和发送探测信号的第一时间,确定主音箱和从音箱之间的距离。作为又一种示例,主音箱还可以通过输入设备(比如主音箱上的触摸屏)接收输入操作,该输入操作用于输入主音箱和从音箱之间的距离。
类似的,主音箱和从音箱也可以通过麦克风阵列定位技术(比如,根据主音箱上的麦克风阵列中的至少两个麦克风接收的声音信号的时间差来估计声源的方位)、波束指向(steered-beamformer)定位方法,基于高分辩率谱分析(high-resolution spectral analysis)定位方法,和基于声音时间差(time-delay estimation,TDE)声源定位技术等等确定主从音箱之间的距离。例如主音箱可以发出声音,从音箱基于麦克风阵列确定自身和主音箱的距离。当未确定主音箱和从音箱时,可以两个音箱中的至少一个音箱发出声音,来实现距离的测定。
以下实施例介绍组建立体声音箱系统的可能的实现方式。
首先,为便于理解对本申请实施例涉及的一些关键名词进行解释:
1)、运动状态:即音箱的运动情况。例如下面实施例提及的第一运动状态,第二运动状态,均为描述音箱的移动情况的参数。运动状态可以为加速度传感器采集的数据,也可以是对加速度传感器采集的数据做进一步加工处理得到的信息。
2)、第一指示信息:用于描述加速度和其持续时长之间的对应关系。例如,定义同一加速度是指加速度变化小于预设变化时为同一加速度,需要说明的是如果加速度方向不同表示不同加速度。例如加速度范围[1,2]认为以1.5为基准加速度变化在0.5范围内的同一加速度。然后,统计每个加速度的持续时长,每个加速度和其对应的持续时长均可以采用对应的第一指示信息来表达。例如,一种可能的实施方式中,第一指示信息为(A,T),其中,A表示加速度,T表示加速度A对应的持续时长。这个例子中采用显式表达的方式说明了加速度A的持续时长T,在另一种实施例中也可以采用隐式表达的方式来说明加速度A的持续时长T。例如,加速度传感器等时间间隔采集数据,则可定义每个加速度的持续时长都采用这个时间间隔(如5ms)来表达,由此第一指示信息也可表达为(A),即默认持续时长为5ms。
需要说明的是,加速度传感器可同时采集三个方向(即X、Y、Z方向)的加速度信息。则(A,T)可表示为(X1,Xt1,Y1,Yt1,Z1,Zt1)或(X1,Y1,Z1,Xt1,Yt1,Zt1)。其中,X1,Y1,Z1表示采集到的各个方向的加速度,Xt1表示加速度X1的持续时长、Yt1表示加速度Y1的持续时长,Zt1表示加速度Z1的持续时长。
3)、第一加速度序列(时序信息):用于描述第一指示信息的组织方式,即运动状态的时域信号。当具有多个加速度时,可以按时序排序第一指示信息,得到音箱的第一加速度序列。如第一加速度序列S={(X1,Xt1,Y1,Yt1,Z1,Zt1),(X2,Xt2,Y2,Yt2,Z2,Zt2)…(Xn,Xtn,Yn,Ytn,Zn,Ztn)},n为正整数。当然,也可以分不同加速度方向排序,例如S={(X1,Xt1,X2,Xt2…Xn,Xtn,),(Y1,Yt1,Y2,Yt2…Yn,Ytn)…(Z1,Zt1,Z2,Zt2…Zn,Ztn)}。
由此,基于第一加速度序列能够表达加速度及其对应的持续时长随时间的变化情况,由此得到在时序上表达的音箱的运动状态。
4)、第二指示信息:音箱运动状态中每个点的频域信息。
本申请实施例中除了前文叙述的可采用第一加速度序列表达运动状态,还可以采用加速度传感器采集的加速度的频域信息来表达。本申请实施例中每个加速度的频域信息可称之为第二指示信息。频域信息的一种表达方式如统计加速度A的频域信息表达为P,则其第二指示信息表示为(A,P)。举个例子,假设9s为一次采样时长,获得的X方向的加速度的采样结果为(1,2,3,4,1,2,3,5,1,2,3,4,1,2,3,5,……),将该采样结果转换为频域,得到X方向的频域信息。Y方向和Z方向参考X方向的处理方式,本申请不再赘述。
5)、第二加速度序列(频域信息):用于描述第二指示信息的组织方式,用于在频域上表达音箱的运动状态(即运动状态的频域信号)。当具有多个第二指示信息时,可以得到第二加速度序列为S={Xp,Yp,Zp},其中,Xp,Yp,Zp为XYZ三个方向上各自的第二指示信息。继续前面的例子,X方向的加速度包括(1,2,3,4,1,2,3,5,1,2,3,4,1,2,3,5,……),则关于X方向的第二加速度序列Xp={(1,4),(2,4)(3,4),(4,8),(5,8)},以此类推可得到Y方向和Z方向的第二加速度序列。
6)、预设动作以及预设动作的特征:为了提高触发组建立体声音箱系统的准确性,避 免音箱的任何运动都会触发执行组建立体声音箱的操作,本申请实施例中可预先定义预设动作。当音箱的运动状态与该预设动作的特征匹配时,才认为需要组建立体声音箱系统由此可避免误组建。例如图5所示,该预设动作可以是手持音箱画一定的图形(例如8字形、波浪线、五角星形)。例如是左右摇动音箱,上下摇动音箱,还可以是让音箱产生碰撞。产生碰撞的方式可以是碰撞另一音箱也可以是碰撞另外的物体,只要是音箱产生碰撞即可。
当然,需要说明的是,实施时可以根据实际需求配置预设动作,甚至用户也可以自定义预设动作均适用于本申请实施例。
为了便于确定用户对音箱执行的操作是否为预设动作,本申请实施例中采用预设动作的特征来描述预设动作。
预设动作的特征的获取方式可实施为,在预先定义好预设动作之后,然后按照该预设动作操作音箱(例如手持音箱绘制8字图案),并采集音箱的运动状态(如采集前文的第一加速度序列或第二加速度序列),由此得到预设动作的特征,并存储在音箱的存储器中。
为了便于不同用户群能够方便的控制音箱组建立体声音箱系统,本申请实施例中,可以针对同一预设动作构建适用于不同用户群的预设动作的特征。例如预设动作为绘制8字,可以采集老年人绘制8字的数据,得到适用于老年人的绘制8字的特征;采集儿童绘制8字的数据,得到适用于儿童的绘制8字的特征。
此外,本申请实施例中还支持用户自定义预设动作。例如,如图6所示,用户通过音箱上的按键触发自定义预设动作的业务逻辑。然后音箱可提示用户对音箱自定义预设动作。以自定义预设动作为绘制五角星形字为例,用户一边手持音箱绘制五角星形,音箱一边采集自身的运动状态作为自定义预设动作的样本,并对执行的自定义预设动作进行计数。假设用户手持音箱绘制了3次五角星形,则音箱采集到3个样本,此时达到预置的计数要求,停止采集样本。然后对这3个样本进行分析(例如求均值),进而得到自定义预设动作的特征保存在存储器中。之后,用户手持音箱绘制五角星形,则会和存储器中存储的预设动作的特征进行匹配,匹配上则会触发组建立体声音箱系统。
7)、第一序列模板、第二序列模板:这两个模板均为预设动作的特征,用于和采集到的音箱的运动状态进行匹配。由于前文在3)中和5)中分别介绍了对加速度信息整理分析得到的时序信息即第一加速度序列,和频域信息即第二加速度序列。故此,本申请实施例中提供和第一加速度序列对应的第一序列模板,并提供第二加速度序列对应的第二序列模板。当音箱的运动状态采用第一加速度序列表达时,采用第一序列模板和第一加速度序列进行匹配。当音箱的运动状态采用第二加速度序列表达时,采用第二序列模板和第二加速度序列进行匹配。然后基于匹配结果确定是否触发组建立体声音箱系统的操作。
如图7所示,示出了本申请一实施例提供的立体声音箱系统的组建方法的流程示意图以两个音箱为例,包括以下步骤:
为了便于用户组建立体声音箱系统,可以在用户移动音箱后,音箱向用户提示用于组建立体声音箱系统所需执行的预设动作,例如语音提醒:“请手持音箱绘制数字8来组建立体声音箱系统”,若音箱带有显示屏也可以通过显示屏输出文字“请手持音箱绘制8字来组建立体声音箱系统”来进行提示,具体的提示方式本申请对此不作限定。
步骤701,用户移动第一音箱,第一音箱会获取到第一音箱的第一运动状态,同理,用户移动第二音箱,第二音箱也会获取到第二音箱的第一运动状态。
需要说明的是,用户移动第一音箱和第二音箱的时机不受限,即可以先移动第一音箱再移动第二音箱,也可以先移动第二音箱再移动第一音箱,也可以同时移动第二音箱和第一音箱。此外,移动第一音箱和第二音箱的用户可以是同一用户也可以是不同用户。
提取到的运动状态,例如是前文所述的第一加速度序列S={(X1,Xt1,Y1,Yt1,Z1,Zt1),(X2,Xt2,Y2,Yt2,Z2,Zt2)…(Xn,Xtn,Yn,Ytn,Zn,Ztn)}。
提取到的运动状态,例如是前文提及的对加速度信息进行频域分析之后得到的第二加速度序列,如X方向的第二加速度序列Xp={(1,4),(2,4),(3,4),(4,8),(5,8)}。
步骤702,第一音箱确定第一音箱的第一运动状态是否和预设动作的特征匹配,同理,第二音箱确定第二音箱的第一运动状态是否和预设动作的特征匹配。
需要说明的是,第一音箱的预设动作和第二音箱的预设动作可以相同也可以不同。例如第一音箱和第二音箱的预设动作可以均是绘制8字,也可以第一音箱的预设动作是绘制8字,第二音箱的预设动作是绘制五角星。
便于理解,这里以第一音箱为例,对第一运动状态是否和预设动作的特征匹配进行说明。定义的预设动作的特征由第一序列模板来表示。第一序列模板中包括X、Y、Z三个方向各自的第一序列子模板。如图8所示给出了X方向的第一序列子模板示意图。X方向的第一序列子模板中展示出了X方向加速度随时间变化的趋势图(默认每个加速度的持续时长为采样的时间间隔)。当加速度传感器输出加速度值时,会得到第一加速度序列,第一加速度序列中包括X、Y、Z三个方向各自的第一加速度子序列。如图8所示,给出了X方向的第一加速度子序列的示意图,图8中示出了X方向的加速度随时间变化的趋势(默认每个加速度的持续时长为采样的时间间隔)。
当采集到加速度值时,开始采用X方向的第一序列子模板和X方向采集到的第一加速度子序列进行模板匹配,如图8所示可知匹配到3个第一序列子模板(如图8中的A1、A2和A3)。类似的,Y方向可以采用Y方向的第一序列子模板和Y方向的第一加速度子序列进行模板匹配,Z方向可以采用Z方向的第一序列子模板和Z方向的第一加速度子序列进行匹配,在此不再赘述。
为提高触发组建立体声音箱系统的精度,当X方向匹配到X方向的第一序列子模板的次数大于指定次数时,确定X方向的第一加速度子序列和X方向的预设动作的特征匹配。
同理,当Y方向匹配到Y方向的第一序列子模板的次数大于指定次数时,确定Y方向的第一加速度子序列和Y方向的预设动作的特征匹配。当Z方向匹配到Z方向的第一序列子模板的次数大于指定次数时,确定Z方向的第一加速度子序列和Z方向的预设动作的特征匹配。
当X、Y、Z三个方向均和各自方向的预设动作的特征匹配时,确定第一音箱的第一运动状态和预设动作的特征匹配,否则,第一音箱的第一运动状态和预设动作的特征不匹配。
在另一些实施例中,可能一段时间内加速度传感器未采集到加速度。则实施时为了准确的确定是否和预设动作的特征匹配可以设定时长阈值t’。以X方向为例,例如在时长阈值t’范围内X方向匹配到X方向的第一序列子模板的次数大于指定次数时,确定X方向的第一加速度子序列和X方向的预设动作的特征匹配,否则不匹配。如图9所示,首先在A1时间段内匹配到X方向的第一序列子模板,匹配次数计数为1,然后在时间t(t>t’)内未采集到或未匹配到X方向的第一序列子模板,则之前匹配到X方向的第一序列子模板 作废,匹配次数重新记为0。然后开始重新对匹配次数进行计数,直至匹配到的次数大于指定次数,确定X方向的预设动作的特征匹配。
此外,还可以限定相邻两次匹配到X方向的第一序列子模板的时间差阈值,则定义若相邻两次匹配到X方向的第一序列子模板的时间差小于时间差阈值,则对匹配到X方向的第一序列子模板的次数进行累计计数,否则重新计数。Y方向和Z方向也做相同的处理,在此不再赘述。
需要说明的是,本申请实施例中不仅可以对X、Y、Z三个方向分别进行匹配,也可以直接和第一序列模板进行匹配。例如,第一序列模板表示为P={(Xp1,Xpt1,Yp1,Ypt1,Zp1,Zpt1),(Xp2,Xpt2,Yp2,Ypt2,Zp2,Zpt2)…(Xpm,Xptm,Ypm,Yptm,Zpm,Zptm)},其中m为正整数。假设第一加速度序列为S={(X1,Xt1,Y1,Yt1,Z1,Zt1),(X2,Xt2,Y2,Yt2,Z2,Zt2)…(Xn,Xtn,Yn,Ytn,Zn,Ztn)},则匹配的方式为(X1,Xt1,Y1,Yt1,Z1,Zt1)与(Xp1,Xpt1,Yp1,Ypt1,Zp1,Zpt1)进行匹配,(Xp2,Xpt2,Yp2,Ypt2,Zp2,Zpt2)与(X2,Xt2,Y2,Yt2,Z2,Zt2)进行匹配,以此类推,若匹配上第一序列模板P,其中P={(Xp1,Xpt1,Yp1,Ypt1,Zp1,Zpt1),(Xp2,Xpt2,Yp2,Ypt2,Zp2,Zpt2)…(Xpm,Xptm,Ypm,Yptm,Zpm,Zptm)},则记为匹配到一次第一序列模板。具体匹配的方式本申请不作限定,可以参考时域信号的匹配方式即可,也可以逐点匹配差值,其中差值若在预设差值范围内则确定为匹配上。例如(X1,Xt1,Y1,Yt1,Z1,Zt1)与(Xp1,Xpt1,Yp1,Ypt1,Zp1,Zpt1)匹配时,如表1所示,分别计算X1与Xp1、Xt1与Xpt1、Y1与Yp1、Yt1与Ypt1、Z1与Zp1、Zt1与Zpt1的差值,取最大差值(假设为α6),若最大差值α6的绝对值小于或等于预设差值,则确定(X1,Xt1,Y1,Yt1,Z1,Zt1)与(Xp1,Xpt1,Yp1,Ypt1,Zp1,Zpt1)匹配上,否则不匹配。
表1
X1 | Xt1 | Y1 | Yt1 | Z1 | Zt1 |
Xp1 | Xpt1 | Yp1 | Ypt1 | Zp1 | Zpt1 |
α1 | α2 | α3 | α4 | α5 | α6 |
以上介绍了采用第一加速度序列表达音箱的第一运动状态时,如何确定第一运动是否和预设动作的特征匹配的实施方式,下面介绍以频域信号,即第二加速度序列表达音箱的第一运动状态时的匹配方式。与第一加速度序列类似,本申请实施例中第二加速度序列有对应的第二序列模板,可实施为若在第二加速度序列中匹配到第二序列模板的次数大于或等于第二指定次数,则确定第一运动状态与预设动作的特征匹配;若在第二加速度序列中匹配到第二序列模板的次数小于第二指定次数,则确定第一运动状态与预设动作的特征不匹配。
例如,第二序列模板可包括X、Y、Z各方向的第二序列子模板,以X方向为例,其对应的第二序列子模板,X方向的频域信息和X方向的第二序列子模板匹配,若匹配上该第二序列子模板的次数大于第二指定次数,则确定X方向匹配,否则X方向的频域信息和X方向的第二序列子模板不匹配。在X方向匹配的同时,同理若Y方向和Z方向也匹配,则确定第一运动状态与预设动作的特征匹配,否则只要存在一个方向不匹配,则确定第一运动状态与预设动作的特征不匹配。
类似的,当加速度传感器一段时长(该段时长大于时长阈值t’)内未采集到数据,或未匹配到第二序列模板时,则本次匹配失败,音箱的运动状态与预设动作的特征不匹配。
同理,定义若相邻两次匹配到第二序列模板的时间差小于时间差阈值,则对匹配到第二序列模板的次数进行累计计数,否则若相邻两次匹配到第二序列模板的时间差不小于时间差阈值,则本次匹配失败,重新计数。
在一些实施例中,当第一音箱的第一运动状态和预设动作的特征匹配时,可以触发组建立体声音箱系统的操作。为了进一步避免误触发,本申请实施例中还可以将音箱的移动距离是否足够长作为另一条触发条件,可实施为如图7所示的步骤703。
步骤703,确定第一音箱在第一音箱的第一运动状态下的移动距离,并确定第二音箱在第二音箱的第一运动状态下的移动距离。
其中,需要说明的是,步骤702和步骤703的执行时机不受限。
步骤704,若第一音箱的第一运动状态和预设动作的特征匹配,且第一音箱的移动距离大于指定距离,则触发组建立体声音箱系统的操作,即第一音箱搜索周围的音箱,同理,若第二音箱的第一运动状态和预设动作的特征匹配,且第二音箱的移动距离大于指定距离,则触发组建立体声音箱系统的操作,即第二音箱搜索周围的音箱。
具体的搜索方式,可以是第一音箱和第二音箱均扫描对端设备信息,如系统版本、信号强度等信息。实施时,如第一音箱和第二音箱各自广播自己的设备信息,然后第一音箱可以搜索到第二音箱的设备信息,第二音箱可以搜索到第一音箱的设备信息。
当两音箱距离较远或中间有障碍物时,其实不适合组建立体声音箱系统,故此,为了便于准确的组建立体声音箱系统,本申请实施例中可以增加组建立体声音箱系统的条件。
可在步骤705中,第一音箱搜索到第二音箱之后可以确定第一音箱和第二音箱之间的位置关系,同理,第二音箱搜索到第一音箱之后可以确定第二音箱和第一音箱的位置关系。
在步骤706中,若第一音箱和第二音箱的位置关系为指定位置关系时,则第一音箱和第二音箱组建立体声音箱系统。此外,第一音箱和第二音箱还可以提示用户哪些音箱可构建立体声音箱系统。例如,第一音箱和第二音箱可以亮灯,或者控制产生一定的灯光效果来提示用户第一音箱和第二音箱可组建立体声音箱系统,除了光效提示还可以采用声效提示方式也适用于本申请实施例,例如输出音频“我参与构建立体声音箱系统”,此外还可以采用显示屏提示,当然在其他实施例中还可以组合提示,例如光效+音效提示,音效+显示屏提示等均适用于本申请实施例。
实施时,位置关系可以采用距离来表述。第一音箱可以广播自身的设备信息以便于周围的音箱感知到自己,同理第二音箱广播自身的设备信息以便于被周围的音箱感知到。第一音箱获得第二音箱的设备信息后确定和第二音箱的距离,同理,第二音箱搜索到第一音箱之后可以确定第二音箱和第一音箱的距离。当两音箱的距离小于距离阈值D时,确定两音箱满足组队条件。然后两音箱开始组建立体声音箱系统。
组建立体声音箱系统包括以下几方面的内容:一是筛选出主音箱和从音箱(可实施为步骤707),二是确定好各个音箱的角色,即确定好各个音箱的声道配置(可实施为步骤708)。
步骤707,基于主音箱筛选规则,从中筛选出主音箱。
主音箱筛选规则包括以下规则中的至少一种,一种可能的实施方式中当主音箱筛选规则多个规则时,以下规则的顺序时按优先级先后进行顺序的(当然也可以根据实际需求排 序优先级也适用于本申请实施例):
第1优先级,选择已经联网的音箱作为主音箱;
第2优先级,选择已经配网但未联网的音箱作为主音箱;
第3优先级,选择连接智能终端设备的音箱作为主音箱;
第4优先级,选择最早匹配到所述预设动作的特征的音箱作为主音箱;
第5优先级,选择MAC地址最大的音箱作为主音箱。
也即,首先筛选已经联网的音箱作为主音箱,以便于可以通过网络获取需要音箱播放资源。若各个音箱均未联网,则选择已经配网的音箱作为主音箱,如果各音箱也均未配网,则选择能够和终端设备如手机通信的音箱作为主音箱,由此筛选出的主音箱都能够和网络通信获取播放资源。
如果以上条件都不满足,则进而可以选择最早匹配到预设动作的特征的音箱作为主音箱。例如,通过摇一摇来组建立体声音箱系统时,最早摇动的音箱作为主音箱。
除此之外,还可以筛选最早匹配到所述预设动作的特征的音箱作为主音箱。当然,需要说明的是,筛选的规则是优先筛选能跟进行网络通信的音箱作为主音箱,若没有该类音箱则可以保证各音箱统一的筛选出一个主音箱即可,本申请对此不作限定。
在筛选出主音箱之后,可以通过声效或光效方式提示用户哪个音箱为主音箱。
需要说明的是,如果未搜索到周围的其他音箱,可以重复搜索多次,若多次均未搜索到,则可以结束组建立体声音箱系统的操作,并且还可以提示用户组建失败,以及失败的原因,例如该原因是未搜索到周围音箱。
步骤708,各音箱可以基于自身的位置信息,确定自己的声道,并提示给用户。
实施时,各音箱可以获取自身在立体声音箱系统中的相对位置关系,相对位置关系的示意可包括如图10-13几种情况进行说明。
如图10所示,第一音箱左侧、第二音箱在右侧时,第一音箱作为左声道,第二音箱作为右声道。然后第一音箱通过音频和/或光效,提示用于第一音箱为左声道,第二音箱为右声道。
如图11所示,若包含三个音箱时,则第一音箱确定自己为左声道、第二音箱确定自己为右声道,第三音箱确定自己为重低音。
如图12所示,若包含六个音箱时,则第一音箱确定自己为左声道、第二音箱为右声道、第三音箱为重低音、第四音箱为中置,第五音箱为左后声道、第六音箱为右后声道。
如图13所示,若包含十二个音箱时,则第一音箱确定自己为左声道、第二音箱为右声道、第三音箱为重低音、第四音箱为中置,第五音箱为左后声道、第六音箱为右后声道、第七音箱为中左声道,第八音箱为中右声道,第九音箱至第十二音箱按图13示出位置均为环绕声。
当然,在另一些实施例中,各音箱也可以不确定自己的声道,而是提示用户去配置各音箱的声道。此外,即使各音箱的声道配置好之后,也可以提示给用户以便于用户确认声道配置或修改各音箱配置的声道。
步骤709,用户可通过移动音箱来为各音箱配置声道。例如,第一音箱获取第一音箱的第二运动状态,然后将第二运动状态对应的声道配置为自己的声道。同理,如图7所示,第二音箱自己的第二运动状态,然后将第二运动状态对应的声道配置为自己的声道。
需要说明的是,第一音箱和第二音箱的第二运动状态不同。不同的第二运动状态可对 应不同的声道配置。例如实施时,第二运动状态可采用如下的参数中的至少一种来描述:
1)、被摇动的次数:例如摇动一次作为左声道,摇动两次作为右声道,以此类推,被摇动的次数不同声道配置不同。
2)、被摇动的速度:例如,可以设定速度范围,第一速度范围对应左声道、第二速度范围对应右声道,以此类推,不同的速度范围的声道配置不同。
3)、被摇动的加速度:类似的,除了用速度确定声道配置,也可以采用加速度来确定。例如轻微摇动对应第一加速度范围,其声道为左声道、猛烈摇动采用第二加速度范围表示,对应右声道,以此类推。实施时,音箱中可存储多个加速度范围,将用户摇动的加速度和加速度范围进行比较,查看摇动的加速度属于哪个加速度范围,进而确定出音箱属于哪个声道。
4)、被摇动的方向:例如,向左移动音箱为左声道,向右移动音箱为右声道,想前移动音箱为重低音,以此类推,不同移动方向对应不同声道配置即可。
5)、移动的距离:例如移动第一距离范围为左声道,移动第二距离范围为右声道。
6)、产生碰撞的次数:例如,碰撞1次为左声道、碰撞2次为右声道,以此类推。
7)、移动轨迹等。其中,例如轨迹为数字1为左声道,轨迹为数字2为右声道,轨迹为圆圈则为环绕声。
当然实施时,以上其中参数不仅可以单独使用,也可以组合使用,例如,被摇动的次数和被摇动的加速度组合时,轻微摇动1次为左声道,轻微摇动2次为右声道、猛烈摇动1次为左后声道,猛烈摇动2次为右后声道。任何能够组合的方式也适用于本申请实施例。
再例如,还可以通过判断用户是摇一摇音箱还是碰一碰音箱来确定音箱的声道配置。实施时,可根据加速度的变化频率或者大小,并进一步结合移动距离判断对音箱的操作是摇一摇还是碰一碰。一种可能的实施方式为,将加速度大于g/2且触发时两个设备距离小于距离门限(如5cm)认为是碰一碰,否则确认为摇一摇。
在另一些实施例中,用户也可以自定义不同声道配置对应的第二运动状态。
在一些实施例中,各音箱在配置自己的声道时可以同时提示用户自己配置为何种声道以便于用户确认。
步骤710,提示用户组建好立体声音箱系统,并提示各音箱的声道配置以便于用户了解组建情况。
综上所述,详细介绍了两音箱组建音箱系统的情况。当更多音箱组建时,以三音箱为例,对组建方式进行说明。
实施例一:
假设,该实施例中第一音箱和第二音箱按照图7所示的流程组建立体声音箱系统之后,第三音箱加入。假设第一音箱为主音箱,如图14所示:
步骤1401,用户摇一摇第三音箱,第三音箱获取自己的第一运动状态。
步骤1402,第三音箱确定第一运动状态和预设动作的特征匹配,且在第一运动状态下的移动距离大于指定距离之后,搜索附近的音箱。
用户可以同时移动,或在一定时间范围内先后移动第一音箱和第三音箱以便于第一音箱和第三音箱能够搜索到对方。
步骤1403,搜索到第一音箱,所以第一音箱和第三音箱交互,确认第一音箱和第三音箱之间的距离。
在一种可能的实施方式中,第一音箱可通知第三音箱第一音箱已成立立体声音箱系统,由此可实施为步骤1404。
步骤1404,若第一音箱和第三音箱之间的距离小于距离阈值,则第三音箱提示用户配置自己的声道。即第三音箱可以在得知自己加入已有的立体声音箱系统时,可以提示用户为自己配置声道。
步骤1405,配置好声道之后,第三音箱向用户提示声道的配置结果,即提示为何种声道。
在另一些可能的实施方式中,第一音箱可以确定各音箱的相对位置关系,基于该位置关系自动确定第三音箱的声道配置并通知给第三音箱,然后第三音箱提示用户是否确认该声道配置。若用户确认同意该声道配置,则第三音箱基于该声道配置播放内容,否则若用户不同意,可以由用户自定义第三音箱的声道配置,至此第三音箱的声道配置完成。
当然,在一些实施例中,由于第三音箱的加入,各音箱的声道配置可能发生变化,此时,也支持用户重新自定义各音箱的声道配置,或者第一音箱根据各个音箱的相对位置关系重新确定各个音箱的声道配置。
此外,由于第三音箱的加入,主音箱可以重新筛选也可以保持主音箱不变。一种可能的实施方式是按照前文所述的优先级筛选出音箱时,若第一音箱的优先级较高则可以保持第一音箱仍然作为主音箱,如果第一音箱优先级较低则考虑是否更换第三音箱为主音箱。例如,第一音箱已是连接网络的音箱,则第三音箱的加入可不执行筛选主音箱的操作,若第一音箱不能连接网络,则第三音箱加入之后,可以重新筛选主音箱。
还需要说明的是,该实施例中,第三音箱是请求加入已组建的立体声音箱系统(即由第一音箱和第二音箱组建的音箱系统),在实施时,对于一个已组建的音箱系统可以要求在组建完成时的第一时长阈值内允许其他音箱(如第三音箱)加入,也可以不设置该第一时长阈值。当不设置该第一时长阈值时,新的音箱可以随时加入已组建的音箱系统。该第一时长阈值可以支持用户设置。
实施例二
假设该实施例中,第一音箱、第二音箱和第三音箱几乎同时(例如在第二时长阈值范围内)被用户移动组建立体声音箱系统。如图15所示,可实施为:
步骤1501,第一音箱、第二音箱和第三音箱被用户移动,各音箱获取自己的第一运动状态。
步骤1502,各音箱确定各自第一运动状态和各自的预设动作的特征匹配,且确定各自的移动距离大于指定距离之后,搜索附近的音箱。
步骤1503,各音箱之间进行交互,确认各音箱之间的距离。
步骤1504,若各音箱之间的距离小于距离阈值,则各音箱提示用户配置自己的声道并筛选出主音箱。
步骤1505,用户配置好声道之后,各音箱向用户提示为自己配置的声道,组建完毕。
组建完毕之后如图16所示,若主音箱联网,则主音箱可响应用户语音,通过网络获取用户指示的音频资源。获取音频资源之后,分别给各个从音箱进行播放。
此外,如图17所示,若主音箱未联网但连接了用户的手机,则可以由手机获取网络中的音频资源之后由主音箱分发给各个从音箱进行播放。
基于相同的发明构思,本申请实施例还提供了一种第一音箱,如图18所示,所述音箱 包括:
获取模块1801,用于获取所述第一音箱的第一运动状态;
搜索模块1802,用于若所述第一运动状态与预设动作的特征匹配,则搜索第二音箱;
系统组建模块1803,用于若搜索到第二音箱,则与所述第二音箱组建立体声音箱系统。
在一种可能的设计中,所述音箱还包括:
提示模块,用于在所述获取所述第一音箱的第一运动状态之前,将用于组建立体声音箱系统所需执行的所述预设动作提示给用户。
在一种可能的设计中,所述获取模块,具体用于:
基于所述第一音箱的加速度信息,生成所述第一音箱的第一加速度序列,所述第一音箱的第一加速度序列中存储有依时序排列的第一指示信息,所述第一指示信息用于表达加速度和所述加速度的持续时长之间的对应关系;
所述预设动作的特征包括第一序列模板,所述音箱还包括:
第一匹配模块,用于将所述第一序列模板与所述第一音箱的第一加速度序列进行匹配操作;
若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数大于或等于第一指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;
若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数小于所述第一指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
在一种可能的设计中,所述获取模块,具体用于:
基于所述第一音箱的加速度信息,生成所述第一音箱的第二加速度序列,所述第一音箱的第二加速度序列中存储有第二指示信息,所述第二指示信息用于表达加速度和采集到所述加速度的频域信息;
所述预设动作的特征包括第二序列模板,所述音箱还包括:
第二匹配模块,用于将所述第二序列模板与所述第一音箱的第二加速度序列进行匹配操作;
若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数大于或等于第二指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;
若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数小于所述第二指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
在一种可能的设计中,所述音箱还包括:
距离确定模块,用于基于所述第一音箱的所述第一运动状态确定所述第一音箱的移动距离;
所述音箱还包括:
距离确定模块,用于在所述搜索第二音箱之前,确定所述第一音箱的移动距离大于指定距离。
在一种可能的设计中,系统组建模块,具体用于:
基于主音箱筛选规则,确定所述第一音箱是否作为所述立体声音箱系统中的主音箱;并,
为所述第一音箱配置声道。
在一种可能的设计中,所述主音箱筛选规则包括以下规则中的至少一种:
选择已经联网的音箱作为主音箱;
选择已经配网但未联网的音箱作为主音箱;
选择连接智能终端设备的音箱作为主音箱;
选择最早匹配到所述预设动作的特征的音箱作为主音箱;
选择媒体存取控制位址MAC地址最大的音箱作为主音箱。
在一种可能的设计中,所述系统组建模块,具体用于:
提示用户为所述第一音箱配置声道;
获取所述第一音箱的第二运动状态;
将所述第二运动状态对应的声道配置为所述第一音箱的声道。
在一种可能的设计中,所述系统组建模块,具体用于:
采用声效、光效和屏幕显示中的至少一种方式提示用户为所述第一音箱配置声道。
在一种可能的设计中,所述第二运动状态包括以下参数中的至少一种:
所述第一音箱被摇动的次数、被摇动的速度、被摇动的加速度、被摇动的方向、移动的距离、所述第一音箱产生碰撞的次数、所述第一音箱的移动轨迹。
在一种可能的设计中,所述系统组建模块还用于:
确定与所述第二音箱的位置关系;
并若确定与所述第二音箱的位置关系为指定位置关系,则执行所述与所述第二音箱组建立体声音箱系统的操作。
在一种可能的设计中,所述指定位置关系包括:所述第一音箱与所述第二音箱的距离小于距离阈值。
本申请的各个实施方式可以任意进行组合,以实现不同的技术效果。
上述本申请提供的实施例中,从音箱(主音箱和/或从音箱)作为执行主体的角度对本申请实施例提供的方法进行了介绍。为了实现上述本申请实施例提供的方法中的各功能,终端设备可以包括硬件结构和/或软件模块,以硬件结构、软件模块、或硬件结构加软件模块的形式来实现上述各功能。上述各功能中的某个功能以硬件结构、软件模块、还是硬件结构加软件模块的方式来执行,取决于技术方案的特定应用和设计约束条件。
以上实施例中所用,根据上下文,术语“当…时”或“当…后”可以被解释为意思是“如果…”或“在…后”或“响应于确定…”或“响应于检测到…”。类似地,根据上下文,短语“在确定…时”或“如果检测到(所陈述的条件或事件)”可以被解释为意思是“如果确定…”或“响应于确定…”或“在检测到(所陈述的条件或事件)时”或“响应于检测到(所陈述的条件或事件)”。另外,在上述实施例中,使用诸如第一、第二之类的关系术语来区份一个实体和另一个实体,而并不限制这些实体之间的任何实际的关系和顺序。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线)或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务 器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘(solid state disk,SSD))等。
需要指出的是,本专利申请文件的一部分包含受著作权保护的内容。除了对专利局的专利文件或记录的专利文档内容制作副本以外,著作权人保留著作权。
Claims (27)
- 一种立体声音箱系统的组建方法,其特征在于,应用于第一音箱,所述方法包括:获取所述第一音箱的第一运动状态;若所述第一运动状态与预设动作的特征匹配,则搜索第二音箱;若搜索到第二音箱,则与所述第二音箱组建立体声音箱系统。
- 根据权利要求1所述的方法,其特征在于,所述获取所述第一音箱的第一运动状态之前,所述方法还包括:将用于组建立体声音箱系统所需执行的所述预设动作提示给用户。
- 根据权利要求1或2所述的方法,其特征在于,所述获取所述第一音箱的第一运动状态,包括:基于所述第一音箱的加速度信息,生成所述第一音箱的第一加速度序列,所述第一音箱的第一加速度序列中存储有依时序排列的第一指示信息,所述第一指示信息用于表达加速度和所述加速度的持续时长之间的对应关系;所述预设动作的特征包括第一序列模板,所述方法还包括:将所述第一序列模板与所述第一音箱的第一加速度序列进行匹配操作;若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数大于或等于第一指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数小于所述第一指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
- 根据权利要求1或2所述的方法,其特征在于,所述获取所述第一音箱的第一运动状态,包括:基于所述第一音箱的加速度信息,生成所述第一音箱的第二加速度序列,所述第一音箱的第二加速度序列中存储有第二指示信息,所述第二指示信息用于表达加速度和采集到所述加速度的频域信息;所述预设动作的特征包括第二序列模板,所述方法还包括:将所述第二序列模板与所述第一音箱的第二加速度序列进行匹配操作;若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数大于或等于第二指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数小于所述第二指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
- 根据权利要求3或4所述的方法,其特征在于,所述方法还包括:基于所述第一音箱的所述第一运动状态确定所述第一音箱的移动距离;所述搜索第二音箱之前,所述方法还包括:确定所述第一音箱的移动距离大于指定距离。
- 根据权利要求1-5中任一所述的方法,其特征在于,所述与所述第二音箱组建立体声音箱系统,包括:基于主音箱筛选规则,确定所述第一音箱是否作为所述立体声音箱系统中的主音箱;并,为所述第一音箱配置声道。
- 根据权利要求6所述的方法,其特征在于,所述主音箱筛选规则包括以下规则中的至少一种:选择已经联网的音箱作为主音箱;选择已经配网但未联网的音箱作为主音箱;选择连接智能终端设备的音箱作为主音箱;选择最早匹配到所述预设动作的特征的音箱作为主音箱;选择媒体存取控制位址MAC地址最大的音箱作为主音箱。
- 根据权利要求6所述的方法,其特征在于,所述为所述第一音箱配置声道,包括:提示用户为所述第一音箱配置声道;获取所述第一音箱的第二运动状态;将所述第二运动状态对应的声道配置为所述第一音箱的声道。
- 根据权利要求8所述的方法,其特征在于,所述提示用户为所述第一音箱配置声道,包括:采用声效、光效和屏幕显示中的至少一种方式提示用户为所述第一音箱配置声道。
- 根据权利要求8所述的方法,其特征在于,所述第二运动状态包括以下参数中的至少一种:所述第一音箱被摇动的次数、被摇动的速度、被摇动的加速度、被摇动的方向、移动的距离、所述第一音箱产生碰撞的次数、所述第一音箱的移动轨迹。
- 根据权利要求1-10中任一所述的方法,其特征在于,所述方法还包括:确定与所述第二音箱的位置关系;并若确定与所述第二音箱的位置关系为指定位置关系,则执行所述与所述第二音箱组建立体声音箱系统的操作。
- 根据权利要求11所述的方法,其特征在于,所述指定位置关系包括:所述第一音箱与所述第二音箱的距离小于距离阈值。
- 一种第一音箱,其特征在于,所述音箱包括:获取模块,用于获取所述第一音箱的第一运动状态;搜索模块,用于若所述第一运动状态与预设动作的特征匹配,则搜索第二音箱;系统组建模块,用于若搜索到第二音箱,则与所述第二音箱组建立体声音箱系统。
- 根据权利要求13所述的音箱,其特征在于,所述音箱还包括:提示模块,用于在所述获取所述第一音箱的第一运动状态之前,将用于组建立体声音箱系统所需执行的所述预设动作提示给用户。
- 根据权利要求13或14所述的音箱,其特征在于,所述获取模块,具体用于:基于所述第一音箱的加速度信息,生成所述第一音箱的第一加速度序列,所述第一音箱的第一加速度序列中存储有依时序排列的第一指示信息,所述第一指示信息用于表达加速度和所述加速度的持续时长之间的对应关系;所述预设动作的特征包括第一序列模板,所述音箱还包括:第一匹配模块,用于将所述第一序列模板与所述第一音箱的第一加速度序列进行匹配操作;若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数大于或等于第一指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;若在所述第一音箱的第一加速度序列中匹配到所述第一序列模板的次数小于所述第一指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
- 根据权利要求13或14所述的音箱,其特征在于,所述获取模块,具体用于:基于所述第一音箱的加速度信息,生成所述第一音箱的第二加速度序列,所述第一音箱的第二加速度序列中存储有第二指示信息,所述第二指示信息用于表达加速度和采集到所述加速度的频域信息;所述预设动作的特征包括第二序列模板,所述音箱还包括:第二匹配模块,用于将所述第二序列模板与所述第一音箱的第二加速度序列进行匹配操作;若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数大于或等于第二指定次数,则确定所述第一运动状态与所述预设动作的特征匹配;若在所述第一音箱的第二加速度序列中匹配到所述第二序列模板的次数小于所述第二指定次数,则确定所述第一运动状态与所述预设动作的特征不匹配。
- 根据权利要求15或16所述的音箱,其特征在于,所述音箱还包括:距离确定模块,用于基于所述第一音箱的所述第一运动状态确定所述第一音箱的移动距离;所述音箱还包括:距离确定模块,用于在所述搜索第二音箱之前,确定所述第一音箱的移动距离大于指定距离。
- 根据权利要求13-17中任一所述的音箱,其特征在于,系统组建模块,具体用于:基于主音箱筛选规则,确定所述第一音箱是否作为所述立体声音箱系统中的主音箱;并,为所述第一音箱配置声道。
- 根据权利要求18所述的音箱,其特征在于,所述主音箱筛选规则包括以下规则中的至少一种:选择已经联网的音箱作为主音箱;选择已经配网但未联网的音箱作为主音箱;选择连接智能终端设备的音箱作为主音箱;选择最早匹配到所述预设动作的特征的音箱作为主音箱;选择媒体存取控制位址MAC地址最大的音箱作为主音箱。
- 根据权利要求18所述的音箱,其特征在于,所述系统组建模块,具体用于:提示用户为所述第一音箱配置声道;获取所述第一音箱的第二运动状态;将所述第二运动状态对应的声道配置为所述第一音箱的声道。
- 根据权利要求20所述的音箱,其特征在于,所述系统组建模块,具体用于:采用声效、光效和屏幕显示中的至少一种方式提示用户为所述第一音箱配置声道。
- 根据权利要求20所述的音箱,其特征在于,所述第二运动状态包括以下参数中的至少一种:所述第一音箱被摇动的次数、被摇动的速度、被摇动的加速度、被摇动的方向、移动的距离、所述第一音箱产生碰撞的次数、所述第一音箱的移动轨迹。
- 根据权利要求13-22中任一所述的音箱,其特征在于,所述系统组建模块还用于:确定与所述第二音箱的位置关系;并若确定与所述第二音箱的位置关系为指定位置关系,则执行所述与所述第二音箱组建立体声音箱系统的操作。
- 根据权利要求23所述的音箱,其特征在于,所述指定位置关系包括:所述第一音箱与所述第二音箱的距离小于距离阈值。
- 一种第一音箱,其特征在于,包括:一个或多个处理器、一个或多个存储器、一个或多个麦克风、一个或多个扬声器、以及通信模块;所述一个或多个麦克风,用于采集声音信号;所述通信模块,用于与其它音箱进行通信;所述一个或多个扬声器,用于发出声音信号;所述一个或多个存储器,用于存储程序指令,所述程序指令被所述一个或多个处理器执行,使得所述音箱执行如权利要求1-12任一所述的方法。
- 一种芯片,其特征在于,所述芯片包括处理器和接口;所述接口,用于接收代码指令,并将接收的所述代码指令传输至所述处理器;所述处理器,用于运行接收到的所述接口发送的所述代码指令,执行如权利要求1-12任一所述的方法。
- 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序包括程序指令,所述程序指令当被计算机执行时,使所述计算机执行如权利要求1-12中任意一项所述的方法。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22880085.0A EP4398602A1 (en) | 2021-10-12 | 2022-09-15 | Method for constructing stereo speaker system, and related apparatus |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111186340.4A CN115967897A (zh) | 2021-10-12 | 2021-10-12 | 立体声音箱系统的组建方法及相关装置 |
CN202111186340.4 | 2021-10-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023061145A1 true WO2023061145A1 (zh) | 2023-04-20 |
Family
ID=85903586
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2022/119134 WO2023061145A1 (zh) | 2021-10-12 | 2022-09-15 | 立体声音箱系统的组建方法及相关装置 |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4398602A1 (zh) |
CN (1) | CN115967897A (zh) |
WO (1) | WO2023061145A1 (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117119352A (zh) * | 2023-09-26 | 2023-11-24 | 深圳市安牛智能创新有限公司 | 一种蓝牙音箱控制方法、系统及蓝牙音箱 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160165343A1 (en) * | 2014-12-04 | 2016-06-09 | Jazz Hipster Corporation | Wireless connection and control method for wireless sound box and system thereof |
CN207427432U (zh) * | 2017-11-17 | 2018-05-29 | 深圳市易顺科技有限公司 | 一种霍尔开关感应磁吸互配启动的无线立体声效蓝牙音箱 |
CN108684023A (zh) * | 2018-04-25 | 2018-10-19 | 江沛岭 | 一种以手势动作互联多个蓝牙设备的控制方法 |
CN113377325A (zh) * | 2021-05-27 | 2021-09-10 | 深圳市亚昱科技有限公司 | 蓝牙音箱及其控制方法 |
CN115129368A (zh) * | 2021-03-24 | 2022-09-30 | 阿里巴巴新加坡控股有限公司 | 智能终端的控制方法、装置和智能音箱 |
-
2021
- 2021-10-12 CN CN202111186340.4A patent/CN115967897A/zh active Pending
-
2022
- 2022-09-15 EP EP22880085.0A patent/EP4398602A1/en active Pending
- 2022-09-15 WO PCT/CN2022/119134 patent/WO2023061145A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160165343A1 (en) * | 2014-12-04 | 2016-06-09 | Jazz Hipster Corporation | Wireless connection and control method for wireless sound box and system thereof |
CN207427432U (zh) * | 2017-11-17 | 2018-05-29 | 深圳市易顺科技有限公司 | 一种霍尔开关感应磁吸互配启动的无线立体声效蓝牙音箱 |
CN108684023A (zh) * | 2018-04-25 | 2018-10-19 | 江沛岭 | 一种以手势动作互联多个蓝牙设备的控制方法 |
CN115129368A (zh) * | 2021-03-24 | 2022-09-30 | 阿里巴巴新加坡控股有限公司 | 智能终端的控制方法、装置和智能音箱 |
CN113377325A (zh) * | 2021-05-27 | 2021-09-10 | 深圳市亚昱科技有限公司 | 蓝牙音箱及其控制方法 |
Also Published As
Publication number | Publication date |
---|---|
CN115967897A (zh) | 2023-04-14 |
EP4398602A1 (en) | 2024-07-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110677801B (zh) | 一种音箱控制方法、音箱以及音箱系统 | |
CN109473095B (zh) | 一种智能家居控制系统及控制方法 | |
CN106652996B (zh) | 一种提示音生成方法和装置、移动终端 | |
CN108763316B (zh) | 一种音频列表管理方法及移动终端 | |
EP3246790B1 (en) | Information processing of combined voice and gesture input operations | |
WO2023082703A1 (zh) | 语音控制方法、装置、电子设备及可读存储介质 | |
CN104252226B (zh) | 一种信息处理的方法及电子设备 | |
JP2016506639A (ja) | サウンドの触知可能な制御を提供するための方法および装置 | |
CN105103457A (zh) | 便携式终端、助听器以及在便携式终端中指示声源的位置的方法 | |
US20210027800A1 (en) | Method for processing audio, electronic device and storage medium | |
CN105208056B (zh) | 信息交互的方法及终端 | |
WO2019105376A1 (zh) | 手势识别方法、终端及存储介质 | |
WO2023061145A1 (zh) | 立体声音箱系统的组建方法及相关装置 | |
WO2021244057A1 (zh) | 一种交互方法、装置、耳机和耳机收纳装置 | |
WO2023284418A1 (zh) | 定位方法、装置、电子设备以及计算机可读存储介质 | |
JPWO2005057549A1 (ja) | 情報処理システム、情報処理方法および情報処理用プログラム | |
US11144130B2 (en) | Information processing apparatus, information processing system, and information processing method | |
CN110568926A (zh) | 一种声音信号处理方法及终端设备 | |
WO2017101260A1 (zh) | 音频切换方法、装置以及存储介质 | |
CN107464548A (zh) | 非接触式音乐传感装置及音乐演奏方法 | |
CN110808019A (zh) | 一种歌曲生成方法及电子设备 | |
KR20180076830A (ko) | 오디오 장치 및 그 제어방법 | |
KR102161237B1 (ko) | 사운드 출력 방법 및 장치 | |
CN107526510A (zh) | 声音播放方法及装置 | |
CN108614263A (zh) | 移动终端、位置检测方法及相关产品 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22880085 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2022880085 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2022880085 Country of ref document: EP Effective date: 20240403 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |