EP3776880A1 - Synchronized voice control module, loudspeaker system and method for incorporating VC functionality into a separate loudspeaker system - Google Patents
Info
- Publication number
- EP3776880A1 (application EP19736223.9A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- module
- loudspeaker system
- synchronized
- dsp
- host
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H04R1/406—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
Definitions
- the present invention relates to Voice Controlled (“VC”) media playback systems or Smart Speakers adapted to receive and respond to a user’s spoken commands.
- Some VC speaker systems are capable of running existing third-party voice-based software (“chat-bots”) or assistant applications (e.g., Skills™ or Actions™) and can respond to a user’s spoken commands with voice-based synthesized audible responses generated as part of Voice Assistance (“VA”) operations.
- the VC speaker senses or detects user-spoken trigger phrases (i.e., “wake” words or phrases) or commands and generates an audible VA reply or acknowledgement in response.
- Amazon’s VA or voice software system is known as “Alexa”; Google’s VA or voice software system may be summoned by “Hey Google”; and Apple’s VA or voice software system may be summoned by addressing “Siri”.
- Each of these VA systems is programmable to respond to a user’s “wake word” or response-triggering phrase, whereupon the VA takes over control of the VC speaker and responds to the user with an audible response or reply.
- VC loudspeaker systems reproduce several types of audio program material, including music, movie soundtracks, news, podcasts, etc., but many of the VC speakers currently offered do a mediocre job of reproducing anything more sonically demanding than news reports. Most loudspeaker products with voice-control systems embedded within them, though desirable for obvious reasons, present some unwelcome issues for many consumers.
- Amazon Echo™ and similar VC speaker devices fail to recognize when and how the host system is operating and offer nothing that would allow the user to alter the
- the present invention is directed to an improved Voice-Controlled (“VC”) Loudspeaker system and method which incorporates a synchronized VC module or “smart puck” programmed to provide a method for incorporating VC functionality into separate high-performance host loudspeaker systems.
- the invention is directed to an improved VC loudspeaker system and method for incorporating VC functionality into separate yet synchronized high-performance host loudspeaker systems, where the synchronized VC module is programmed to optimize its own performance and the performance of the separate host loudspeaker system.
- the synchronized VC module not only accounts for its characteristic acoustic environment, but also recognizes when and how the host system is operating, while providing a
- an exemplary embodiment of the present invention would be an improved or enhanced smart or Voice-Controlled (“VC”) loudspeaker system incorporating a synchronized Voice-Control (“VC”) “Smart Puck” device configured to operate in conjunction with a separate host loudspeaker system.
- a suitable separate high-performance loudspeaker system would be as full featured as one of applicant’s existing high performance Soundbar systems.
- the invention further is directed to a method for incorporating VC functionality into other separate host loudspeaker systems.
- the applicant for the present invention has developed many great-sounding, feature-rich loudspeaker and audio reproduction devices and systems that are adapted for use with a user’s Wi-Fi system. Such devices and systems incorporate DSP elements programmable to achieve specific sonic goals for specific audio program playback applications.
- US Patents 9,277,044, 9,374,640, 9,584,935, 9,706,320, 9,767,786 and 9,807,484 provide useful context and background for the present invention and are incorporated herein in their entireties by reference.
- the VC speaker system of the present invention includes at least one synchronized Voice-Control (“VC”) “Smart Puck” module which incorporates a Controller or Computing module with a pre-programmed DSP system to provide optimized sound quality during audio program playback as well as a subjectively pleasant-sounding Voice Assistance response as perceived by the user when used, for example, with an Amazon Alexa or a Google voice device.
- the synchronized VC module of the present invention provides several important advantages.
- the “Smart Puck” synchronized VC module can be initiated to capture the complex Transfer Function (“TF”) of audio signals between a separate host loudspeaker (such as a soundbar) and a microphone array on the Smart Puck housing.
- the captured TFs, most efficiently expressed as Finite Impulse Response (FIR) filter parameters, may be utilized to improve the Smart Puck’s recognition of the user’s voice queries or commands. Because the Smart Puck and the separate host audio system are synchronized (indeed, requests for program material should be expected to pass through the Puck), the Smart Puck synchronized VC module will be able to monitor both the program broadcast from the separate host loudspeaker(s) and user voice commands at any given time.
- transfer functions that model room effects at the location of the Smart Puck also are derived over a range of DSP settings via acquisition routines involving noise stimuli emitted by the loudspeaker and captured by the microphone array.
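As an illustration of such an acquisition routine, the following minimal Python sketch estimates an FIR approximation of the loudspeaker-to-microphone transfer function from a white-noise stimulus by cross-correlation. The function names, the tap count, and the simulated room response are assumptions made for this example and do not come from the disclosure; an actual routine would use captured microphone data and would be repeated for each DSP setting of interest.

```python
import random


def convolve_fir(signal, taps):
    """Filter `signal` through an FIR filter (direct-form convolution)."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(taps):
            if n - k >= 0:
                acc += h * signal[n - k]
        out.append(acc)
    return out


def estimate_fir(stimulus, captured, num_taps):
    """Estimate FIR taps by cross-correlating the captured signal with a
    white-noise stimulus. Assumes the stimulus is approximately white, so
    its autocorrelation is close to an impulse."""
    energy = sum(x * x for x in stimulus)
    taps = []
    for k in range(num_taps):
        corr = sum(captured[n] * stimulus[n - k] for n in range(k, len(stimulus)))
        taps.append(corr / energy)
    return taps


# Hypothetical "room": a short impulse response standing in for the real
# loudspeaker-to-microphone path.
rng = random.Random(0)
stimulus = [rng.gauss(0.0, 1.0) for _ in range(20000)]
true_fir = [0.8, 0.0, 0.3, -0.2]
captured = convolve_fir(stimulus, true_fir)

estimated = estimate_fir(stimulus, captured, num_taps=4)
print([round(t, 2) for t in estimated])
```

The estimate is only approximate because a finite noise burst is never perfectly white; longer stimuli reduce the error.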
- the inverse of the transfer function that best matches the current state of the separate Host speaker system’s DSP settings is recalled from the Smart Puck’s memory and imposed on the signals acquired by the Smart Puck microphone array as a means of “subtracting” the separate host loudspeaker’s contribution from the microphone array’s acquired signal, thereby greatly improving the “signal” (voice commands/queries) to “noise” (separate host loudspeaker output) ratio and permitting greatly improved responsiveness to a user’s voice commands.
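The subtraction step can be sketched as follows. This minimal Python example assumes that the module has access to the program signal sent to the host loudspeaker and to a stored FIR model of the loudspeaker-to-microphone path; it filters the program signal through that model and subtracts the result from the microphone signal. This is a simplification chosen for clarity, not the inverse-transfer-function formulation described in the text, and all names and values are hypothetical.

```python
def fir_filter(signal, taps):
    """Filter `signal` through an FIR filter (direct-form convolution)."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(taps):
            if n - k >= 0:
                acc += h * signal[n - k]
        out.append(acc)
    return out


def remove_host_contribution(mic, program, stored_fir):
    """Subtract the predicted loudspeaker contribution from the mic signal."""
    predicted = fir_filter(program, stored_fir)
    return [m - p for m, p in zip(mic, predicted)]


# Hypothetical data: the program material, the user's voice, and a stored
# FIR model of the loudspeaker-to-microphone path.
program = [1.0, -0.5, 0.25, 0.0, 0.75, -1.0]
voice = [0.0, 0.1, 0.2, 0.1, 0.0, -0.1]
stored_fir = [0.6, 0.0, -0.2]

# The microphone hears the voice plus the filtered program material.
mic = [v + s for v, s in zip(voice, fir_filter(program, stored_fir))]

cleaned = remove_host_contribution(mic, program, stored_fir)
```

When the stored model matches the real path exactly, as it does in this toy case, `cleaned` equals the voice signal up to rounding error; in practice the improvement depends on how well the stored model matches the room.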
- the Smart Puck synchronized VC module can monitor ambient environmental noise apart from program material, and by virtue of the methods and system elements described and illustrated in Polk Audio’s commonly owned Patent 9,767,786 (Starobin, Lyons, et al, the entire disclosure of which is hereby
- a “micro quiet zone” centered about the smart puck can be established for purposes of further improving the signal to noise ratio associated with voice commands.
- Another aspect of the system and method of the present invention pertains to the use of the Smart Puck synchronized VC module as means for capturing the separate host loudspeaker system’s acoustic frequency response at the primary listening location in conjunction with “room-smoothing” algorithms such as
- Another advantageous aspect of the system and method of the present invention exploits the smart puck’s utility as a microphone array capable of integrating with a host soundbar-subwoofer system for purposes of determining the location of the soundbar and subwoofer relative to a customer’s primary listening position. Once the subwoofer’s location is known relative to the soundbar, certain DSP settings may be modified in order to optimize overall system performance.
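One building block for such a location estimate is a time-of-flight measurement. The minimal Python sketch below assumes that the emission time of a test burst is known (because the module and host are synchronized) and estimates the propagation delay to one microphone by cross-correlation, then converts it to a distance. The function names, the sample rate, and the speed-of-sound constant are assumptions for illustration; the disclosure does not specify the localization algorithm, and a full solution would combine such distances across the microphone array.

```python
import random

SPEED_OF_SOUND_M_S = 343.0  # approximate value for air at room temperature


def estimate_delay_samples(emitted, captured, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation
    between the emitted burst and the captured microphone signal."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        score = sum(
            emitted[n] * captured[n + lag]
            for n in range(len(emitted))
            if n + lag < len(captured)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_to_distance_m(lag_samples, sample_rate_hz):
    """Convert a propagation delay in samples to a distance in metres."""
    return lag_samples / sample_rate_hz * SPEED_OF_SOUND_M_S


# Hypothetical measurement: a noise burst arrives 48 samples after emission.
rng = random.Random(1)
burst = [rng.gauss(0.0, 1.0) for _ in range(256)]
true_lag = 48
captured = [0.0] * true_lag + burst + [0.0] * 100

lag = estimate_delay_samples(burst, captured, max_lag=100)
distance = delay_to_distance_m(lag, sample_rate_hz=48000)
```

Repeating the measurement for the soundbar and for the subwoofer would give two distances from the module, which is the kind of information the DSP adjustments described above could use.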
- an enhanced Voice-Controlled (“VC”) Loudspeaker system and method in accordance with the invention includes providing a Smart Puck synchronized VC module with a microphone array and configured to generate an audio output linked to a separate host loudspeaker system, where the Smart Puck synchronized VC module is programmed to provide a method for incorporating VC functionality into the separate host loudspeaker system.
- the system and method further includes programming the Smart Puck synchronized VC module to optimize its performance with the separate host loudspeaker system, the programming including, for example, the steps of determining the characteristic acoustic
- the host loudspeaker system is separate and remote from the VC module; that is, it is a separate loudspeaker which may incorporate a soundbar and may also include a subwoofer.
- the Smart Puck synchronized VC module of the present invention incorporates a processor linked to the microphone array and to the separate host loudspeaker system and is further linked to remote entities by way of a network for obtaining responses to signals from the Smart Puck synchronized VC module’s microphone array and directing the responses to the separate host loudspeaker system.
- the processor includes a microprocessor incorporating suitable memory and operating system components connected to a pre-programmed digital signal processing (DSP) engine to provide optimized sound quality during audio program playback as well as subjectively pleasant-sounding Voice Assistance response as perceived by a user.
- the host loudspeaker system may include user controls for providing system settings, and the Smart Puck synchronized VC module’s DSP system is preprogrammed to account for such system settings.
- the processor includes an operating system that is configured and programmed to operate the DSP engine and to manage hardware such as a wireless unit, a USB unit, and a Codec within the synchronized module as well as to operate a variety of control element modules in the processor.
- an enhanced Voice-Controlled (“VC”) Loudspeaker system incorporates a microphone array, a synchronized VC module linked to the array and having an audio output linked to an existing remotely located bespoke or high-performance host loudspeaker system, where the Smart Puck synchronized VC module includes a processor programmed to provide a method for incorporating VC functionality into the separate host loudspeaker system, and operates by capturing (at the microphone array, through the use of acquisition routines) noise stimuli emitted by the separate host
- the method further includes providing a digital signal processing (DSP) engine preprogrammed to include a range of transfer functions corresponding to the separate host loudspeaker’s system settings, capturing transfer functions TF that reflect room effects derived over a range of preprogrammed DSP settings, recalling from memory the inverse of the transfer function that best matches the current state of the DSP settings, and imposing on the signals acquired by the microphone array the best match inverse signals to subtract the separate host loudspeaker’s contribution from user voice signals acquired by the microphone array, thereby greatly improving the signal to noise ratio of voice commands/queries signals to loudspeaker output noise to provide greatly improved responsiveness to user voice commands.
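The recall step can be pictured as a nearest-match lookup. The following Python sketch assumes that inverse filters are stored in a table keyed by a tuple of DSP settings and that the closest key (by Euclidean distance) is taken as the best match. The table contents, the choice of settings, and the distance metric are all hypothetical; the disclosure does not state how the best match is determined.

```python
import math

# Hypothetical table: (bass_db, treble_db, volume) -> stored inverse-filter taps.
STORED_INVERSE_FILTERS = {
    (0.0, 0.0, 0.5): [1.0, -0.2],
    (6.0, 0.0, 0.5): [0.8, -0.3],
    (0.0, 6.0, 0.8): [0.9, -0.1],
}


def recall_best_match(current_settings, table):
    """Return the stored key and filter closest to the current DSP settings.

    Plain Euclidean distance is a crude metric when the settings use
    different units; a real system would likely weight each setting."""
    best_key = min(table, key=lambda key: math.dist(key, current_settings))
    return best_key, table[best_key]


key, taps = recall_best_match((5.0, 1.0, 0.5), STORED_INVERSE_FILTERS)
print(key, taps)
```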
- the method may further include monitoring ambient environmental noise apart from desired program material produced by the host loudspeaker system and modifying the DSP settings in accordance with the monitored environmental noise to provide a “micro quiet zone” centered about the module for further improving the signal to noise ratio associated with voice commands.
- the method further includes capturing the acoustic frequency response of the loudspeaker system at the user’s primary listening location in conjunction with “room-smoothing” algorithms and modifying the DSP settings by inverse magnitude shaping, in accordance with the captured acoustic frequency response, to optimize system performance.
- the method may modify the DSP settings in accordance with the captured acoustic frequency response by imposing time-delayed correction signals to optimize system performance.
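Inverse magnitude shaping can be illustrated with a short Python sketch. It assumes the captured response is available as per-band levels in dB, smooths them with a moving average, and derives a correction gain per band that is the inverse of the smoothed deviation from a flat target, limited to a maximum boost or cut. The band values, the smoothing window, and the gain limit are assumptions for the example, not parameters from the disclosure.

```python
def smooth(values_db, window=3):
    """Moving-average smoothing; edge bands average over available neighbours."""
    half = window // 2
    smoothed = []
    for i in range(len(values_db)):
        lo = max(0, i - half)
        hi = min(len(values_db), i + half + 1)
        smoothed.append(sum(values_db[lo:hi]) / (hi - lo))
    return smoothed


def inverse_magnitude_correction(measured_db, target_db=0.0, max_gain_db=4.0):
    """Per-band correction gain (dB): inverse of the smoothed deviation from
    the target, clipped to +/- max_gain_db to avoid extreme boosts or cuts."""
    correction = []
    for level in smooth(measured_db):
        gain = target_db - level
        correction.append(max(-max_gain_db, min(max_gain_db, gain)))
    return correction


# Hypothetical measured response (dB relative to target) in six bands.
measured_db = [0.0, 6.0, -3.0, 0.0, 9.0, 0.0]
correction_db = inverse_magnitude_correction(measured_db)
print(correction_db)  # [-3.0, -1.0, -1.0, -2.0, -3.0, -4.0]
```

Smoothing before inversion keeps the correction from chasing narrow peaks and dips, and the clip keeps the last band at -4.0 dB rather than the unclipped -4.5 dB.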
- the method of the invention may also include determining, in a host loudspeaker system containing a sound bar SB and a subwoofer SW, the location of the sound bar relative to the subwoofer and modifying the DSP settings in
- the system and the method of operating the present invention enable the user of a bespoke or high-performance audio or home theater system (a host loudspeaker system) incorporating, for example, a high-performance soundbar to add VC functionality without requiring the user to replace the entire bespoke or high-performance loudspeaker system.
- the invention adds a synchronized module or smart puck which is compatible with an existing (e.g., the Amazon Echo™) system architecture and which may be configured as a puck-like Wi-Fi enabled device with a host of capabilities prompted via voice control.
- synchronized module is also configured to permit the user to optimize its
- Figs. 1A-1C illustrate typical Voice-Control (“VC”) speaker architectures and Methods, in accordance with the Prior Art.
- FIG. 2 is a diagram illustrating a synchronized voice-controlled “smart puck” module and loudspeaker system architecture and method for incorporating VC functionality into a separate high-performance loudspeaker system, in accordance with the present invention.
- FIG. 3 is a block diagram of the synchronized voice-controlled “smart puck” module and system architecture of Fig. 2, illustrating the interconnected processor components for incorporating VC functionality into a separate high-performance loudspeaker system, in accordance with the present invention.
- Patents Nos. 8971543 and 9060224 are illustrative of typical VC speaker systems such as those sold as the Amazon Echo™ “voice-controlled assistant”; Fig. 1A thus illustrates a first exemplary (typical) prior art system
- VC speaker use environment 102 which includes a typical VC speaker system 104 and at least a first user 106.
- the user 106 is typically near or proximal to VC speaker system 104 in the use environment 102.
- the VC speaker system 104 as illustrated is communicatively coupled to one or more remote entities 110 over a network 112.
- the remote entities 110 may include individual people, such as person 114, or automated systems (not shown) that serve as far end talkers to verbally interact with the user 106. Additionally, or alternatively, the remote entities 110 may comprise cloud services 116 hosted, for example, on one or more servers 118(1) . . . 118(S). These servers 118(1)-(S) may be arranged in any number of ways, such as server farms, stacks, and the like that are commonly used in data centers.
- the cloud services 116 generally refer to a network accessible platform implemented as a computing infrastructure of
- Cloud services 116 do not require end-user knowledge of the physical location and configuration of the system that delivers the services. Common expressions associated with cloud services include “on-demand computing”, “Software as a Service (SaaS)”, “platform computing”, “network accessible platform”, and so forth.
- the cloud services 116 may host any number of applications that can process the user input received from the VC speaker system 104 and produce a suitable response. Examples of typical applications might include web browsing, online shopping, banking, email, work tools, productivity, entertainment, educational, and so forth.
- user 106 is shown communicating with the remote entities 110 via VC speaker system 104.
- a VC speaker system 104 Voice Assist module outputs an audible question such as, "What do you want to do?" as represented by dialog bubble 120.
- This output may represent a question from a far end talker 114, or from a cloud service 116 (e.g., an entertainment service).
- the user 106 is shown replying to the question by stating, "I'd like to buy tickets to a movie" as represented by the dialog bubble 122.
- the VC speaker system 104 (or voice-controlled assistant 104) is equipped with an array 124 of microphones 126(1) . . . 126(M) to receive the voice input from the user 106 as well as any other audio sounds in the environment 102.
- the microphones 126(1) - (M) are generally arranged at a second or top end of the VC speaker system 104 opposite the base end seated on the table 108. Although multiple microphones are illustrated, in some implementations, the VC speaker system 104 may be embodied with only one microphone.
- the VC speaker system 104 may further include a speaker array 128 of speakers 130(1) . . . 130(P) to output sounds in humanly perceptible frequency ranges.
- the speakers 130(1) - (P) may be configured to emit sounds at various frequency ranges, so that each speaker has a different range. In this manner, the VC speaker system 104 may output high frequency signals, mid frequency signals, and low frequency signals.
- the speakers 130(1) - (P) are generally arranged at the first or base end of the VC speaker system 104 and are oriented to emit the sound in a downward direction toward the base end and in an outward direction generally opposite to or away from the microphone array 124 in the top end.
- the voice-controlled assistant or VC speaker system 104 may further include computing components 132 that process the voice input received by the microphone array 124, enable communication with the remote entities 110 over the network 112, and generate the audio to be output by the speaker array 128.
- the computing components 132 are generally positioned between the microphone array 124 and the speaker array 128, although essentially any other arrangement may be used.
- the VC speaker system 104 may be configured to produce stereo or non-stereo output.
- the speakers 130(1) - (P) may receive a mono signal for output in a non-stereo configuration.
- the computing components 132 may generate as an output to the speakers 130(1) - (P) two different channel signals for stereo output.
- a first channel signal (e.g., left channel signal) is provided to one of the speakers, such as the larger speaker 130(1).
- a second channel signal (e.g., right channel signal) is provided to the other of the speakers, such as the smaller speaker 130(P). Due to the vertically stacked arrangement of the speakers, however, the two-channel stereo output may not be appreciated by the user 106.
- Fig. 1B illustrates at 200 another implementation of voice interactive computing architecture that is similar to the architecture 100 of Fig. 1A, but in this illustration a voice-controlled assistant or VC speaker system 204 has a different physical packaging layout.
- speaker system 204 has a laterally spaced arrangement of the speakers to better provide stereo output, rather than the vertically stacked arrangement found in the system 104 of Fig. 1A. More particularly, the speakers 130(1) - (P) are shown at a horizontally spaced distance from one another.
- VC speaker system 204 is able to play full spectrum stereo using only two speakers of different sizes.
- the VC speaker system 204 is communicatively coupled over the network 112 to an entertainment service 206 that is part of the cloud services 116.
- the entertainment service 206 is hosted on one or more servers, such as servers 208(1) . . . 208(K), which may be arranged in any number of configurations, such as server farms, stacks, and the like that are commonly used in data centers.
- the entertainment service 206 may be configured to stream or otherwise download entertainment content, such as movies, music, audio books, and the like to the voice-controlled assistant.
- the voice-controlled assistant 204 can play the audio in stereo with full spectrum sound quality, even though the device has a small form factor and only two speakers.
- the user 106 is shown using the audible statement, "Pause the music" (in dialog bubble 210) to direct the VC speaker system 204 to pause the music being played.
- the VC speaker system 204 is not only designed to play music in full spectrum stereo, but is also configured with acoustic echo cancellation (AEC).
- Fig. 1C shows selected functional components of the voice-controlled assistant or VC speaker systems 104 and 204 in more detail.
- each of the VC speaker systems 104 and 204 may be implemented as a standalone device that is relatively simple in terms of functional capabilities with limited input/output components, memory, and processing capabilities.
- the VC speaker systems 104 and 204 do not have a keyboard, keypad, or other form of mechanical input. Nor do they have a display or touch screen to facilitate visual presentation and user touch input.
- the assistants 104 and 204 may be implemented with the ability to receive and transmit audio signals, a network interface (wireless or wire-based), power input, and limited processing/memory capabilities.
- each VC speaker system 104/204 includes the microphone array 124, a speaker array 128, a processor 302, and memory 304.
- the microphone array 124 is used to capture speech input from the user 106, or other sounds in the environment 102.
- the speaker array 128 is used to output speech from a far end talker, audible responses provided by the cloud services, forms of entertainment (e.g., music, audible books, etc.), or any other form of sound.
- the speaker array 128 produces a wide range of output audio frequencies including both human perceptible and non-perceptible frequencies.
- the speaker array 128 is formed of two speakers capable of outputting full spectrum stereo sound, as will be described below in more detail. Two speaker array arrangements are shown, including the vertically stacked arrangement 128A and the horizontally spaced arrangement 128B.
- the memory 304 may include computer-readable storage media (“CRSM”), which may be any available physical media accessible by the processor 302 to execute instructions stored on the memory.
- CRSM may include random access memory (“RAM”) and Flash memory.
- CRSM may include, but is not limited to, read-only memory (“ROM”), electrically erasable programmable read-only memory (“EEPROM”), or any other medium which can be used to store the desired information and which can be accessed by the processor 302.
- modules such as instructions, datastores, and so forth may be stored within the memory 304 and configured to execute on the processor 302.
- An operating system module 306 is configured to manage hardware and services (e.g., wireless unit, USB, Codec) within and coupled to the assistant 104/204 for the benefit of other modules.
- Several other modules may be provided to process verbal input from the user 106.
- a speech recognition module 308 provides some level of speech recognition functionality.
- this functionality may be limited to specific commands that perform fundamental tasks like waking up the device, configuring the device, and the like.
- the amount of speech recognition capability included on the VC speaker system 104/204 is an implementation detail, but the architecture described herein can support having some speech recognition at the local VC speaker system 104/204 together with more expansive speech recognition at the cloud service 116.
- An acoustic echo cancellation module 310 and a double talk reduction module 312 are provided to process the audio signals to substantially cancel acoustic echoes and substantially reduce double talk that might occur. These modules may work together to identify times where echoes are present, where double talk is likely, or where background noise is present, and attempt to reduce these external factors to isolate and focus on the “near talker” (i.e., user 106). By isolating on the near talker, better signal quality is provided to the speech recognition module 308 to enable more accurate interpretation of the speech utterances.
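The text does not say which algorithm the echo cancellation module 310 uses. As one common possibility, the following minimal Python sketch shows a normalized least-mean-squares (NLMS) adaptive filter that learns the echo path from the signal sent to the speakers and subtracts the estimated echo from the microphone signal. The function name, step size, tap count, and simulated echo path are assumptions for illustration only.

```python
import random


def nlms_echo_cancel(far_end, mic, num_taps=4, mu=0.5, eps=1e-8):
    """Adaptive echo canceller using the NLMS update.

    far_end: samples sent to the loudspeaker (the echo source).
    mic:     samples captured by the microphone (echo plus any near-end speech).
    Returns the echo-reduced residual and the learned filter weights."""
    weights = [0.0] * num_taps
    history = [0.0] * num_taps  # most recent far-end samples, newest first
    residual = []
    for x, d in zip(far_end, mic):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        error = d - echo_estimate
        norm = sum(h * h for h in history) + eps  # eps avoids division by zero
        weights = [w + mu * error * h / norm for w, h in zip(weights, history)]
        residual.append(error)
    return residual, weights


# Hypothetical scenario: the microphone hears only the loudspeaker echo.
rng = random.Random(2)
far_end = [rng.gauss(0.0, 1.0) for _ in range(2000)]
echo_path = [0.5, -0.3, 0.1]
mic = [
    sum(h * far_end[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
    for n in range(len(far_end))
]

residual, weights = nlms_echo_cancel(far_end, mic)
```

In this noise-free toy case the weights converge to the simulated echo path and the residual decays toward zero. With a near talker present, adaptation is usually paused during double talk, which is the role the double talk reduction module 312 would play.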
- a query formation module 314 may also be provided to receive the parsed speech content output by the speech recognition module 308 and to form a search query or some form of request. This query formation module 314 may utilize natural language processing (NLP) tools as well as various language modules to enable accurate construction of queries based on the user's speech input.
- the modules shown stored in the memory 304 are merely representative. Other modules 316 for processing the user voice input, interpreting that input, and/or performing functions based on that input may be provided.
- the voice controlled assistant 104/204 might further include a codec 318 coupled to the microphones of the microphone array 124 and the speakers of the speaker array 128 to encode and/or decode the audio signals.
- the codec 318 may convert audio data between analog and digital formats. In this case, a user interacts with the assistant 104/204 by speaking to it, and the microphone array 124 receives the user speech.
- the codec 318 encodes the user speech and transfers that audio data to other components.
- the assistant 104/204 can communicate back to the user by emitting audible statements passed through the codec 318 and output through the speaker array 128. In this manner, the user interacts with the voice-controlled assistant simply through speech, without use of a keyboard or display common to other types of devices.
- the VC speaker system or voice controlled assistant 104/204 includes a wireless unit 320 coupled to an antenna 322 to facilitate a wireless connection to network 112.
- the wireless unit 320 may implement one or more of various wireless technologies, such as Wi-Fi, Bluetooth, RF, and so on.
- a USB port 324 may further be provided as part of the assistant 104/204 to facilitate a wired connection to a network, or a plug-in network device that communicates with other wireless networks. In addition to the USB port 324, or as an alternative thereto, other forms of wired connections may be employed, such as a broadband connection.
- a power unit 326 is further provided to distribute power to the various components on the assistant 104/204.
- a stereo component 328 is optionally provided to output stereo signals to the various speakers in the speaker array 128.
- the VC speaker system 400 of the present invention incorporates a Voice Controlled (“VC”) Smart Puck synchronized module 404 that is synchronized with and responsive to a separate high performance loudspeaker system 604 (which may be referred to as a “host” loudspeaker system) such that the Smart Puck 404 and the separate host loudspeaker system 604 are linked.
- the high performance host loudspeaker system 604 may incorporate a selected set of the applicant’s commonly owned loudspeaker performance improving developments, including those adapted for use with users’ Wi-Fi systems and incorporating DSP elements programmable to achieve specific sonic goals for specific audio program playback applications.
- US Patents 9,277,044; 9,374,640; 9,584,935; 9,706,320; 9,767,786 and 9,807,484 provide useful context and background for this aspect of the present invention and are hereby incorporated herein in their entireties by reference.
- the VC speaker system 400 of the present invention incorporates a Voice Controlled (“VC”) Smart Puck synchronized module 404 that is preferably configured as a compact puck-shaped product having a housing with a base 406 that rests on a support such as a table 408 and a top 410 which carries and aims an array 424 of multiple (e.g., eight) microphones.
- Smart Puck VC module 404 preferably is not limited to any particular industrial design and is synchronized with and connected to a separate high performance loudspeaker system 604 (which may be referred to as a “host” loudspeaker system) such that the Smart Puck 404 and the separate host loudspeaker system 604 are linked in several particular ways that permit superior responsiveness to voice commands 122 when compared to a conventional voice-controlled puck and its incorporated loudspeaker system.
- the synchronized voice control module 404 of system 400 includes a digital processor 432 including components such as a controller or computing module 434 having a pre-programmed DSP system to provide optimized sound quality on the user’s host bespoke or high-performance loudspeaker product 604 during audio program playback.
- the DSP module also provides a subjectively pleasant-sounding Voice Assistance response as perceived by a user when used, for example, with VC devices such as an Amazon Alexa or a Google voice device. This result is obtained regardless of the audio settings that the user may have selected on the user’s bespoke or high-performance loudspeaker product even when those settings might otherwise inhibit VA intelligibility.
- the system and the method of operating the present invention enable the user of a separate loudspeaker system 604 (e.g., optionally an existing bespoke or high-performance audio or home theater system which is configured or
- the invention adds a synchronized module which is compatible with, for example, the Amazon Echo™ architecture and which may be configured as the puck-like Wi-Fi enabled device 404 with a host of capabilities prompted via voice control.
- the Smart Puck synchronized VC module 404 is also configured to incorporate control modules that optimize its performance with the separate host loudspeaker system 604 not only to account for the
- the VC speaker system illustrated in Fig. 2 at 400 has at least one Smart Puck synchronized VC module 404.
- Fig. 2 also illustrates the Smart Puck synchronized VC module 404 in an enlarged diagrammatic view at 404 as incorporating a controller or computing module 432 containing processor components further illustrated in detail in Fig. 3.
- Computing module 432 incorporates a pre-programmed DSP system 434 (as illustrated in Figs 2-3) which, as is known in the art, provides optimized sound quality during audio program playback as well as subjectively pleasant sounding Voice Assistance response (as indicated at 120) as perceived by user 106 when Smart Puck synchronized VC module 404 is connected to the user’s high performance loudspeaker product 604 and when used with, for example, an Amazon Alexa or a Google voice assist, regardless of the audio settings that the user may have selected and which might otherwise inhibit VA intelligibility.
- the system 400, smart puck module 404, and the method of the present invention allow the user 106 of the existing high-performance audio or home theater loudspeaker system 604, which may, for example, be a conventional soundbar SB, may include a separate subwoofer SW, and may include conventional user controls 608, to add VC functionality without requiring the user to replace an entire separate high-performance loudspeaker system (e.g., such as a separate high performance soundbar-subwoofer system 604).
- This is accomplished through the addition of the synchronized VC module 404, which is compatible with, for example, the Amazon Echo™ architecture and is connectable to the remote (i.e., separate) host loudspeaker system 604 either by a direct wired connection or wirelessly (e.g., by a Bluetooth connection via a TX/RX module retrofitted to, or included in, the separate host loudspeaker system 604, as indicated at 606).
- the synchronized module 404 may be configured as a puck-shaped cylindrical Wi-Fi enabled device with a large number of capabilities prompted by voice control.
- the synchronized module 404 is also configured for optimized performance with the host loudspeaker system 604 by incorporating components which not only account for the characteristic acoustic environment 102, but also recognize and account for when and how the host system 604 is operating. By doing this the performance characteristics of the Smart Puck synchronized VC module 404 are altered to optimize overall VC loudspeaker system performance.
- the audio performance of the VC speaker system 400 is enhanced by incorporation of pre-programmed features in synchronized module 404, as will be described.
- the system 400 and synchronized VC module 404 of the present invention provide several important advantages.
- the Smart Puck synchronized VC module 404 may optionally be initiated to sense and record and so capture signal(s) revealing the complex Transfer Function (“TF”) between its host loudspeaker 604 and the puck’s microphone array.
- the captured TFs most efficiently expressed as Finite Impulse Response (FIR) filters, may be utilized to improve recognition of user’s voice queries 122. That the puck 404 and separate host loudspeaker system 604 are
- AEC Acoustic or Automatic Echo Cancellation
- Smart Puck synchronized VC module 404 may optionally be used to monitor ambient environmental noise apart from program material, and by virtue of the methods and system elements described and illustrated in Polk’s commonly owned Patent No. 9,767,786 (to Starobin, Lyons, et al., the entire disclosure of which is hereby incorporated herein by reference), a “micro quiet zone” centered about Smart Puck synchronized VC module 404 can be established for purposes of further improving the signal-to-noise ratio associated with voice commands.
- Another aspect of the system 400 and method of the present invention pertains to the use of the smart puck 404 for capturing the loudspeaker system’s acoustic frequency response at a user’s primary listening location in conjunction with“room-smoothing” algorithms such as
- a final advantageous aspect of system 400 and the method of the present invention further exploits the puck’s utility as a microphone array capable of integrating with a host soundbar-subwoofer system for the purpose of determining the location of the soundbar and subwoofer relative to a customer’s primary listening position. Once the subwoofer’s location is known relative to the soundbar, certain DSP settings may be modified in order to optimize system performance.
- FIGs 2 and 3 illustrate selected functional components of the Smart Puck synchronized VC module 404 which are utilized to carry out the method of the present invention. These components are preferably implemented as a standalone device that is relatively simple in terms of functional capabilities, requiring only limited input/output components, memory, and processing capabilities with the ability to receive and output audio, and including a network interface (wireless or wire-based), power, and limited processing/memory capabilities.
- each synchronized module 404 includes the microphone array 424 and a computing/communications/audio processor 432.
- processor 432 includes a microprocessor or controller 800 and a memory 802.
- the microphone array 424 is used to sense and capture speech input 122 from the user 106 or other sounds in the environment 102.
- the synchronized module 404 uses separate host loudspeaker system 604 to output speech (e.g., 120) from a far end talker, audible responses provided by the cloud services, forms of entertainment (e.g., music, audible books, etc.), or any other form of sound.
- the separate host loudspeaker system 604 may output a wide range of audio frequencies and in one implementation, the host speaker 604 in the illustrated example comprises a soundbar SB and subwoofer SW (Fig. 2) which may be connected to module 404 by way of the wired or wireless connection 606.
- the memory component 802 of processor 432 may include computer-readable storage media (“CRSM”), which may be any available physical media accessible by the microprocessor 800 to execute instructions stored in the memory.
- the CRSM may include both random access memory (“RAM”) and Flash memory.
- the CRSM may include, but is not limited to, read-only memory (“ROM”), electrically erasable programmable read-only memory (“EEPROM”), or any other medium which can be used to store the desired information, and which can be accessed by the microprocessor 800.
- Several programming modules such as instructions, datastores, and so forth may be stored within the memory 802 and configured to execute on the microprocessor 800.
- the processor 432 incorporates an operating system 804 configured and programmed to operate the Digital Signal Processing (“DSP”) engine 434 and to manage hardware and services (e.g., wireless unit 806, USB unit 808, Codec 810) within synchronized module 404 for the benefit of other control elements 812-832 in the processor.
- the control elements and their interconnections as shown in the diagram of Fig. 3 are merely representative; other elements or sub-circuits for processing the user voice input, interpreting that input, and/or performing functions based on that input may be provided.
- the codec unit 810 illustrated as part of the synchronized module 404 converts audio data between analog and digital formats and is coupled to the microphones of the microphone array 424 and to the processor to encode the received audio signals, and similarly is connected via the communications module 822 to the host loudspeaker 604 to decode the audio signals to be broadcast.
- a user 106 may interact with the Smart Puck synchronized VC module 404 by speaking to it: the microphone array 424 receives the user speech 122, and the codec unit encodes the user speech and transfers that audio data to other components via the operating system for use in carrying out commands or responding to queries, for example, in a known manner.
- the Smart Puck synchronized VC module 404 can then communicate back to the user by way of the audio output module 816 and the communication module 822, which produce signals that pass through the codec and are sent to the host loudspeaker system 604, either by a direct wired connection or wirelessly (e.g., via a Bluetooth connection, as indicated at 606), to be output, or broadcast, as audible statements 120 through the host speaker 604.
- the user interacts with the VC loudspeaker system 400 and synchronized module 404 simply through speech 122, without use of the keyboard or display that is common in other types of devices.
- the cancellation signal ID module and the cancellation signal generating module 832 cooperate with the microprocessor 800 and the DSP component 434 to cancel extraneous environmental sounds that would otherwise degrade the response of the system to voice commands or queries.
- the onboard sensor element 818 and the sensor manager module 820 control various sensor components that may be incorporated in the module 404.
- the wireless unit 806 of the VC speaker system’s synchronized module or smart puck 404 is coupled to an antenna 840 to facilitate a wireless connection to the network 112.
- the wireless unit 806 may implement one or more of various wireless technologies, such as Wi-Fi, Bluetooth, RF, and so on.
- the USB port 808 may be provided as part of the synchronized module 404 to facilitate a wired connection to a network, or a plug-in network device that communicates with other wireless networks. In addition to the USB port, or as an alternative thereto, other forms of wired connections may be employed, such as a broadband connection.
- the power supply module, or unit, 812 is provided to distribute power to the various components in synchronized module 404.
- a stereo component may be provided optionally in the communication module 822 to produce stereo signals to the host speaker 604 or other speakers.
- the VC speaker system 400 and synchronized module 404 are designed to support audio interactions with the user 106, in the form of receiving voice
- the synchronized module 404 may include non-input control mechanisms, such as basic volume control button(s) for increasing/decreasing volume, as well as power and reset buttons as a part of the user interface module 814. There may also be a simple light element (e.g., LED) to indicate a state such as, for example, when power is on. But otherwise, synchronized module 404 does not need any input devices or displays to perform its functions.
- the method of the present invention further exploits the utility of Smart Puck synchronized VC module 404 as a microphone array capable of integrating with a separate host soundbar-subwoofer system 604 by providing suitable processor components, such as the location ID module 826, for the purpose of determining the location of the soundbar and subwoofer loudspeaker system 604 relative to a customer’s primary listening position.
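One way to picture how a microphone could locate a loudspeaker is a time-of-flight estimate. The following is a minimal sketch under stated assumptions, not the method of the location ID module 826: it assumes the host emits a known test burst at a known instant (plausible because puck and host are synchronized), ignores processing latency, and uses a single microphone. The function names, the burst, and the sample rate are hypothetical.

```python
SPEED_OF_SOUND = 343.0  # m/s, approximate value at room temperature


def arrival_lag(reference, captured):
    """Return the sample offset at which `reference` best matches `captured`."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(len(captured) - len(reference) + 1):
        # Cross-correlation score of the reference against this window.
        score = sum(r * captured[lag + i] for i, r in enumerate(reference))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def distance_m(lag_samples, sample_rate):
    """Convert a propagation delay in samples into a path length in metres."""
    return lag_samples / sample_rate * SPEED_OF_SOUND


# Hypothetical capture: the burst arrives attenuated, 420 samples after emission.
sample_rate = 48000
burst = [1.0, -1.0, 0.5, -0.5]
captured = [0.0] * 600
for i, value in enumerate(burst):
    captured[420 + i] = 0.3 * value

lag = arrival_lag(burst, captured)
subwoofer_distance = distance_m(lag, sample_rate)
```

Repeating the measurement for the soundbar and for the subwoofer would give two path lengths whose difference feeds the delay adjustment described elsewhere in the description; resolving direction as well as distance would need several microphones of the array.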
- DSP system 434 incorporates digital processor components such as a controller or computing module with a pre-programmed DSP system 434 to provide optimized sound quality on the user’s bespoke or high-performance loudspeaker product, or host
- the Smart Puck’s Cancellation signal generation module 832 is initiated to generate signal(s) which can then be sensed and captured to determine the complex Transfer Function (“TF”) signal(s) between separate host loudspeaker system 604 and the puck’s microphone array.
- the captured TFs, most efficiently expressed as Finite Impulse Response (FIR) filters, may be utilized to improve recognition of voice queries. Since the puck and separate host loudspeaker system 604 are synchronized, the puck will be able to monitor the program at any given time.
- the Smart Puck synchronized VC module 404 functions as a sophisticated listening device: the puck can monitor ambient environmental noise apart from the desired program material.
- a “micro quiet zone” centered about the puck 404 can be established for purposes of further improving the signal-to-noise ratio associated with voice commands.
- Another aspect of the system 400 and the method of the present invention enables the smart puck 404 to capture the loudspeaker system’s acoustic frequency response at the primary listening location in conjunction with “room-smoothing” algorithms such as commercially available ones (e.g., Audyssey™) or Polk Audio’s proprietary one as described in commonly owned Patent No. 8,194,874 to Starobin, Lyons, et al.
- More than one smart puck 404 may be used in a system 400 with more than one host speaker system 604 and once plugged in, each synchronized module 404 may automatically self-configure, or may be configured with a slight amount of assistance by the user and be ready to use.
- the VC speaker system 400 with synchronized module 404 is very convenient to use and may be installed and set up for use at a low cost.
- the Puck that most clearly“hears” a voice command 122 shall control the associated separate host loudspeaker system 604; any other Puck(s) shall cede control to that which hears the particular command best.
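The arbitration rule above can be sketched as a simple comparison. This is an illustrative sketch only: it assumes that "hears most clearly" is scored by signal-to-noise ratio (which the description elsewhere calls the best measure of responsiveness) and that each puck reports hypothetical signal and noise power estimates for the same command. The function names and room labels are invented for the example.

```python
import math


def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels."""
    return 10.0 * math.log10(signal_power / noise_power)


def select_controlling_puck(reports):
    """Pick the puck whose capture of the command has the highest SNR.

    `reports` maps a puck identifier to (signal_power, noise_power).
    """
    return max(reports, key=lambda puck_id: snr_db(*reports[puck_id]))


# Hypothetical power estimates reported by two pucks for one voice command.
reports = {
    "living_room": (4.0e-3, 1.0e-4),  # about 16 dB
    "kitchen": (1.0e-3, 2.0e-4),      # about 7 dB
}
winner = select_controlling_puck(reports)
```

Here the living-room puck would take control of its associated host loudspeaker system and the kitchen puck would cede.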
- the system incorporates Smart Puck synchronized VC module 404 that is preferably configured as a compact puck-shaped product having a housing with a base that rests on a support and which carries and orients a microphone array 424 of multiple (e.g., eight) microphones; Smart Puck synchronized VC module 404 is configured and programmed to employ beamforming and aiming capabilities when sensing with microphone array 424.
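The description does not say which beamforming technique the puck uses. As a hedged illustration, the sketch below shows delay-and-sum beamforming, one of the simplest options, for a circular array of eight microphones. The array radius, sample rate, far-field assumption, and whole-sample delays are all assumptions made for the example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value at room temperature


def steering_lags(num_mics, radius_m, azimuth_rad, sample_rate):
    """Arrival lag of each microphone, in whole samples, for a far-field source."""
    raw = []
    for i in range(num_mics):
        mic_angle = 2.0 * math.pi * i / num_mics
        # Microphones facing the source receive the wavefront earlier.
        raw.append(-radius_m * math.cos(azimuth_rad - mic_angle) / SPEED_OF_SOUND)
    earliest = min(raw)
    return [round((t - earliest) * sample_rate) for t in raw]


def delay_and_sum(channels, lags):
    """Advance each channel by its lag so the look direction adds coherently."""
    length = len(channels[0]) - max(lags)
    return [
        sum(ch[n + lag] for ch, lag in zip(channels, lags)) / len(channels)
        for n in range(length)
    ]


# Hypothetical test: one click from azimuth 0 reaches each microphone at its own lag.
lags = steering_lags(num_mics=8, radius_m=0.04, azimuth_rad=0.0, sample_rate=48000)
channels = []
for lag in lags:
    channel = [0.0] * 64
    channel[10 + lag] = 1.0
    channels.append(channel)

beam = delay_and_sum(channels, lags)
```

When the array is steered at the talker, the click lines up in every channel and the averaged output keeps its full amplitude; sounds from other directions add out of step and are attenuated.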
- Synchronized VC module 404 preferably is not limited to any particular industrial design and is synchronized with a host loudspeaker system such that the smart puck 404 and the host loudspeaker 604 are linked in several particular ways that permit superior responsiveness to voice commands 122 relative to a
- the captured TFs are then appropriately imposed, in inverse fashion, on the known audio signal reproduced by the loudspeaker system as a means of extracting the loudspeaker audio signals from the open microphone array’s signals.
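A minimal sketch of the extraction step described above, read here as classic echo subtraction: the known playback signal is filtered through the captured FIR taps to predict what the microphones hear from the loudspeaker, and that prediction is subtracted from the microphone signal. This is one common interpretation, not necessarily the exact procedure intended; the tap values and signals are hypothetical.

```python
def fir_filter(signal, taps):
    """Convolve `signal` with FIR `taps`, returning len(signal) samples."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, tap in enumerate(taps):
            if n - k >= 0:
                acc += tap * signal[n - k]
        out.append(acc)
    return out


def remove_playback(mic, playback, taps):
    """Subtract the predicted loudspeaker contribution from the microphone signal."""
    predicted = fir_filter(playback, taps)
    return [m - p for m, p in zip(mic, predicted)]


# Hypothetical loudspeaker-to-microphone transfer function and signals.
taps = [0.5, 0.3, -0.1]
playback = [1.0, 0.0, -1.0, 0.5, 0.25, 0.0]
voice = [0.0, 0.2, 0.0, -0.2, 0.1, 0.0]
mic = [v + e for v, e in zip(voice, fir_filter(playback, taps))]

residual = remove_playback(mic, playback, taps)
```

With a perfect transfer function estimate the residual equals the voice signal; in practice the estimate drifts with room changes, which is why adaptive echo cancellers keep updating their taps.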
- any loudspeaker system settings (such as master volume, Bass, Voice Adjust™, etc.) and the various program modes (Movie, Music, Sports, etc.) are also taken into account, to the extent that they have been determined to significantly affect the puck’s responsiveness to voice commands (the best measure of which is signal-to-noise ratio). These settings are relayed to the puck, which applies the appropriate inverse transfer function (as expressed by the previously computed FIR filter and its unique set of coefficients).
- a host, or family, of TFs is acquired to reflect the range of permutations of possible audio settings to the extent that the VC performance is significantly improved by taking them into account.
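A family of transfer functions indexed by audio settings could be held in a small lookup structure. The sketch below is purely illustrative: the key of (program mode, bass level), the nearest-level fallback, and the class name are assumptions, and the tap values are placeholders rather than measured data.

```python
class TransferFunctionBank:
    """Stores one set of FIR taps per (program mode, bass level) combination."""

    def __init__(self):
        self._taps = {}

    def store(self, mode, bass_level, taps):
        self._taps[(mode, bass_level)] = list(taps)

    def lookup(self, mode, bass_level):
        """Return the exact match, else the nearest bass level in the same mode."""
        candidates = [key for key in self._taps if key[0] == mode]
        if not candidates:
            raise KeyError(f"no transfer function captured for mode {mode!r}")
        nearest = min(candidates, key=lambda key: abs(key[1] - bass_level))
        return self._taps[nearest]


# Placeholder taps captured during a hypothetical set-up sequence.
bank = TransferFunctionBank()
bank.store("Movie", 0, [1.0, 0.0])
bank.store("Movie", 4, [0.8, 0.2])
bank.store("Music", 0, [0.9, 0.1])

# The host relays its current settings; the puck picks the closest captured TF.
selected = bank.lookup("Movie", 3)
```

Capturing only the settings that measurably change recognition performance keeps the bank small, which matches the description's qualifier about taking settings into account only where they help.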
- the transfer function acquisition procedure is semi-automated or may be completely automated, allowing the user to set program modes and to establish other settings by voice command (with affirmation by user 106).
- a second major aspect of the invention, which pertains only to host soundbar-subwoofer systems (e.g., 604), is the use of the puck 404 as a means for locating the subwoofer SW relative to the soundbar SB and communicating to the system’s DSP alternative settings for optimal integration between the soundbar SB and subwoofer SW.
- the delay imposed on audio signals sent to the soundbar may be adjusted so as to synchronize the time of arrival of incident sounds from both soundbar and subwoofer at the listening location. So long as audio does not lag video by more than 30 ms, no “lip synch” issues should be expected. This constraint does imply certain limits on the placement location of the subwoofer relative to the soundbar for which optimization is possible without incurring lip synch issues.
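The arithmetic behind this constraint can be sketched as follows. The 30 ms limit comes from the text; the speed of sound (343 m/s), the function names, and the example distances are assumptions, and the sketch ignores any other latency in the signal chain (for example, a wireless subwoofer link).

```python
SPEED_OF_SOUND = 343.0     # m/s, approximate value at room temperature
LIP_SYNC_LIMIT_MS = 30.0   # maximum audio lag stated in the description


def soundbar_delay_ms(soundbar_dist_m, subwoofer_dist_m):
    """Delay to add to the soundbar feed so both sources arrive together."""
    extra_path_m = subwoofer_dist_m - soundbar_dist_m
    # If the subwoofer is the closer source, the soundbar needs no added delay.
    return max(0.0, extra_path_m / SPEED_OF_SOUND * 1000.0)


def within_lip_sync(delay_ms):
    """True when the added audio lag stays inside the stated limit."""
    return delay_ms <= LIP_SYNC_LIMIT_MS


# Hypothetical room: soundbar 3.0 m and subwoofer 6.43 m from the listener.
delay = soundbar_delay_ms(3.0, 6.43)   # roughly 10 ms
ok = within_lip_sync(delay)
```

Under these assumptions the limit corresponds to an extra subwoofer path of about 0.030 s × 343 m/s ≈ 10.3 m, which is the kind of placement bound the paragraph above alludes to.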
- a third aspect of the method of the present invention concerns use of the puck 404 for purposes of acquiring the loudspeaker system’s in-room acoustic frequency response in accordance with available “room-smoothing” algorithms.
- Audyssey™, available in several mass-marketed brand-name AVRs such as Denon™, Marantz™, Onkyo™ and others, attempts to improve a loudspeaker system’s low-frequency performance by imposing inverse magnitude shaping (equalization) in accordance with the acquired in-room, time-averaged acoustic response.
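Inverse magnitude shaping can be illustrated generically: for each low-frequency band, the correction gain is the target level minus the measured level, limited so that deep nulls are not boosted excessively. This is a sketch of the general idea only, not the commercial algorithm named above; the band levels, the choice of the mean as target, and the boost and cut limits are hypothetical.

```python
def inverse_eq_gains_db(measured_db, target_db=None, max_boost_db=6.0, max_cut_db=12.0):
    """Per-band correction gains (dB) that flatten a measured response."""
    if target_db is None:
        # Assumption: aim for the average measured level.
        target_db = sum(measured_db) / len(measured_db)
    gains = []
    for level in measured_db:
        gain = target_db - level
        # Limit boost and cut to protect the amplifier and drivers.
        gains.append(max(-max_cut_db, min(max_boost_db, gain)))
    return gains


# Hypothetical time-averaged in-room levels for four low-frequency bands.
measured = [70.0, 82.0, 64.0, 72.0]
gains = inverse_eq_gains_db(measured)
```

In this example the 82 dB peak is cut by 10 dB, while the 64 dB dip would need 8 dB of boost but is limited to 6 dB.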
- An alternative room-smoothing technique, as described in Patent No. 8,194,874, imposes time-delayed “correction signals” as a means of addressing troublesome room modes.
- room-smoothing techniques involve placing Smart Puck synchronized VC module 404 within the primary listening area during the set-up sequence which involves initiating pink-noise, swept-sine or other stimuli so as to capture the loudspeaker-to-puck transfer function which necessarily includes acoustic room artifacts such as resonances and reflections. That said, “room-smoothing” techniques are most effective at addressing low-order room modes, or low-frequency resonances.
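Capturing a loudspeaker-to-puck transfer function from a known stimulus can be sketched as frequency-domain deconvolution: divide the spectrum of the recording by the spectrum of the stimulus, then transform back to obtain FIR taps. The sketch below is a toy under stated assumptions: it uses a plain DFT for self-containment, a short stimulus whose spectrum has no zeros in place of pink noise or a swept sine, zero-padding so that circular and linear convolution agree, and a small regularization constant `eps`. All names and values are hypothetical.

```python
import cmath


def dft(x, inverse=False):
    """Naive discrete Fourier transform (adequate for a short illustration)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def estimate_impulse_response(stimulus, recording, num_taps, eps=1e-12):
    """Regularized spectral division H = Y * conj(X) / (|X|^2 + eps)."""
    spectrum_x = dft(stimulus)
    spectrum_y = dft(recording)
    spectrum_h = [
        y * x.conjugate() / (abs(x) ** 2 + eps)
        for x, y in zip(spectrum_x, spectrum_y)
    ]
    return [value.real for value in dft(spectrum_h, inverse=True)[:num_taps]]


def convolve(signal, taps, length):
    """Linear convolution truncated or zero-padded to `length` samples."""
    out = [0.0] * length
    for n, s in enumerate(signal):
        for k, tap in enumerate(taps):
            if n + k < length:
                out[n + k] += s * tap
    return out


# Hypothetical room response and zero-padded stimulus.
room = [0.6, 0.3, -0.2]
stimulus = [1.0, 0.5] + [0.0] * 14
recording = convolve(stimulus, room, len(stimulus))

estimated = estimate_impulse_response(stimulus, recording, num_taps=3)
```

The recovered taps include whatever the room adds (reflections, resonances), which is exactly why the same measurement can serve both echo removal and room-smoothing.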
- a fourth aspect of the present invention pertains to use of the host loudspeaker 604 as secondary (sound-cancelling) source(s) for creating a “micro quiet zone” in accordance with commonly owned US Patent No. 9,767,786 (the entire disclosure of which is hereby incorporated herein by reference).
- the host loudspeaker system 604 further improves the signal-to-noise ratio for the smart puck microphone array 424 in the presence of external noise, in addition to performing its primary audio broadcasting duty. Its combined output when fulfilling this noise-reduction function will generally reflect the phase-inverted noise detected by the puck’s microphone array 424 but transformed to account for both its spatial displacement relative to the puck and room effects.
- the user may have a desire to use the system 400 with more than one designated micro-quiet zone and optionally more than one Smart Puck synchronized VC module 404. While the fourth aspect describes a micro quiet zone about the puck where it will normally be positioned for receiving voice commands (e.g., on table 408), a fifth (related) aspect involves a similar advantage whereby Smart Puck synchronized VC module 404 may be positioned within another targeted micro quiet zone (e.g., in accordance with patent # 9,767,786).
- a transfer function between a second spatially displaced micro-quiet zone, such as the primary seating area or the user’s head pillow for a bedroom system, and the normal, fixed location of the smart puck may optionally be established so as to determine the appropriate TF ratio that shall be applied to the noise cancellation signal that is reproduced by the separate host loudspeaker system 604.
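The idea of a TF ratio can be sketched per frequency bin. This is a simplified illustration with loudly stated assumptions: the ratio is taken as the noise path to the quiet zone divided by the noise path to the puck, the noise at the zone is predicted from what the puck measures, and the cancellation signal is that prediction with inverted phase. It ignores the loudspeaker's own path to the zone, which a real system would also have to compensate. All names and values are hypothetical.

```python
def tf_ratio(h_zone, h_puck, floor=1e-9):
    """Per-bin ratio of the path to the quiet zone over the path to the puck."""
    return [z / p if abs(p) > floor else 0j for z, p in zip(h_zone, h_puck)]


def anti_noise_for_zone(noise_at_puck, ratio):
    """Predict the noise at the zone from the puck's measurement, then invert it."""
    return [-(r * n) for r, n in zip(ratio, noise_at_puck)]


# Hypothetical two-bin transfer functions and puck noise spectrum.
h_puck = [1 + 0j, 0.5j]
h_zone = [0.5 + 0j, 0.25 + 0j]
ratio = tf_ratio(h_zone, h_puck)

noise_at_puck = [2 + 0j, 1 + 0j]
anti_noise = anti_noise_for_zone(noise_at_puck, ratio)
```

Under these assumptions the anti-noise sums to zero with the predicted noise at the zone in every bin, which is the condition a micro quiet zone aims for.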
- the separate host loudspeaker system 604 may radiate a tightly controlled beam, characterized by a high directivity factor, towards the selected micro quiet zone.
- background noise within the listening area may be reduced by this method when noise cancellation signals are combined with program material and any other corrective signals such as those associated with room-smoothing techniques.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Circuit For Audible Band Transducer (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862614726P | 2018-01-08 | 2018-01-08 | |
PCT/US2019/012738 WO2019136460A1 (en) | 2018-01-08 | 2019-01-08 | Synchronized voice-control module, loudspeaker system and method for incorporating vc functionality into a separate loudspeaker system |
Publications (2)
Publication Number | Publication Date |
---|---|
EP3776880A1 (de) | 2021-02-17 |
EP3776880A4 (de) | 2022-06-22 |
Family
ID=67144498
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19736223.9A Withdrawn EP3776880A4 (de) | 2018-01-08 | 2019-01-08 | Synchronized voice-control module, loudspeaker system and method for incorporating VC functionality into a separate loudspeaker system |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP3776880A4 (de) |
WO (1) | WO2019136460A1 (de) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021010884A1 (en) * | 2019-07-18 | 2021-01-21 | Dirac Research Ab | Intelligent audio control platform |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5828768A (en) * | 1994-05-11 | 1998-10-27 | Noise Cancellation Technologies, Inc. | Multimedia personal computer with active noise reduction and piezo speakers |
US9008331B2 (en) * | 2004-12-30 | 2015-04-14 | Harman International Industries, Incorporated | Equalization system to improve the quality of bass sounds within a listening area |
US8194874B2 (en) * | 2007-05-22 | 2012-06-05 | Polk Audio, Inc. | In-room acoustic magnitude response smoothing via summation of correction signals |
US9015612B2 (en) * | 2010-11-09 | 2015-04-21 | Sony Corporation | Virtual room form maker |
US8971543B1 (en) * | 2012-06-25 | 2015-03-03 | Rawles Llc | Voice controlled assistant with stereo sound from two speakers |
BR112015004288B1 (pt) * | 2012-08-31 | 2021-05-04 | Dolby Laboratories Licensing Corporation | System for rendering sound using reflected sound elements |
US9251787B1 (en) * | 2012-09-26 | 2016-02-02 | Amazon Technologies, Inc. | Altering audio to improve automatic speech recognition |
US9813808B1 (en) * | 2013-03-14 | 2017-11-07 | Amazon Technologies, Inc. | Adaptive directional audio enhancement and selection |
US10147441B1 (en) * | 2013-12-19 | 2018-12-04 | Amazon Technologies, Inc. | Voice controlled system |
CN107113527A (zh) * | 2014-09-30 | 2017-08-29 | Apple Inc. | Method for determining a change in loudspeaker position |
NL2013704B1 (en) * | 2014-10-29 | 2016-10-04 | Ouborg & Gatin Ip B V | Audio reproduction system comprising speaker modules and control module. |
US10657949B2 (en) * | 2015-05-29 | 2020-05-19 | Sound United, LLC | System and method for integrating a home media system and other home systems |
US9704509B2 (en) * | 2015-07-29 | 2017-07-11 | Harman International Industries, Inc. | Active noise cancellation apparatus and method for improving voice recognition performance |
US9820039B2 (en) * | 2016-02-22 | 2017-11-14 | Sonos, Inc. | Default playback devices |
CN109688442B (zh) * | 2017-05-16 | 2021-06-04 | Apple Inc. | Methods and interfaces for home media control |
Also Published As
Publication number | Publication date |
---|---|
EP3776880A4 (de) | 2022-06-22 |
WO2019136460A1 (en) | 2019-07-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6615300B2 (ja) | Hands-free beam pattern configuration | |
CN112584273B (zh) | Spatially ducking audio produced through a beamforming loudspeaker array | |
US10123119B1 (en) | Voice controlled assistant with stereo sound from two speakers | |
EP3128767B1 (de) | System and method for enhancing loudspeakers connected to devices with microphones | |
CN103329576B (zh) | Audio system and method of operation therefor | |
US11809775B2 (en) | Conversation assistance audio device personalization | |
KR20190039646A (ko) | Apparatus and method using multiple voice command devices | |
US7889872B2 (en) | Device and method for integrating sound effect processing and active noise control | |
EP3090576A1 (de) | Methods and systems for designing and applying numerically optimized binaural room impulse responses | |
US20220272454A1 (en) | Managing playback of multiple streams of audio over multiple speakers | |
KR20220044204A (ko) | Acoustic echo cancellation control for distributed audio devices | |
CN118102179A (zh) | Audio processing method and system and related non-transitory medium | |
WO2019133942A1 (en) | Voice-control soundbar loudspeaker system with dedicated dsp settings for voice assistant output signal and mode switching method | |
EP3776880A1 (de) | Synchronized voice-control module, loudspeaker system and method for incorporating VC functionality into a separate loudspeaker system | |
WO2019139991A1 (en) | System and method for generating an improved voice assist algorithm signal input | |
US20240114309A1 (en) | Progressive calculation and application of rendering configurations for dynamic applications | |
US12003673B2 (en) | Acoustic echo cancellation control for distributed audio devices | |
US12003933B2 (en) | Rendering audio over multiple speakers with multiple activation criteria | |
US20220360899A1 (en) | Dynamics processing across devices with differing playback capabilities | |
Gan et al. | Assisted Listening for Headphones and Hearing Aids | |
Dumčius | Simulation of sound field in a classroom |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
 | PUAJ | Public notification under rule 129 epc | Free format text: ORIGINAL CODE: 0009425 |
 | PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
 | STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
 | 111Z | Information provided on other rights and legal means of execution | Free format text: AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR Effective date: 20201208 |
 | 17P | Request for examination filed | Effective date: 20201207 |
 | AK | Designated contracting states | Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
 | AX | Request for extension of the european patent | Extension state: BA ME |
 | 111Z | Information provided on other rights and legal means of execution | Free format text: AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR Effective date: 20201208 |
 | DAV | Request for validation of the european patent (deleted) | |
 | DAX | Request for extension of the european patent (deleted) | |
 | RAP3 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: POLK AUDIO, LLC |
 | RIN1 | Information on inventor provided before grant (corrected) | Inventor name: LYONS, MATTHEW, P. Inventor name: STAROBIN, BRADLEY, M. Inventor name: CRISCO, JOHN |
 | R11X | Information provided on other rights and legal means of execution (corrected) | Free format text: AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR Effective date: 20201208 |
 | D11X | Information provided on other rights and legal means of execution (deleted) | |
 | RIC1 | Information provided on ipc code assigned before grant | Ipc: H04S 7/00 20060101ALN20211220BHEP Ipc: G06F 3/16 20060101ALN20211220BHEP Ipc: H04R 1/40 20060101AFI20211220BHEP |
 | REG | Reference to a national code | Ref country code: DE Ref legal event code: R079 Free format text: PREVIOUS MAIN CLASS: H04B0003000000 Ipc: H04R0001400000 |
 | A4 | Supplementary search report drawn up and despatched | Effective date: 20220524 |
 | RIC1 | Information provided on ipc code assigned before grant | Ipc: G10L 21/02 20130101ALN20220518BHEP Ipc: H04S 7/00 20060101ALN20220518BHEP Ipc: G06F 3/16 20060101ALN20220518BHEP Ipc: H04R 1/40 20060101AFI20220518BHEP |
 | STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
 | 18D | Application deemed to be withdrawn | Effective date: 20221224 |