US20170195779A9 - Psycho-acoustic noise suppression - Google Patents
Psycho-acoustic noise suppression Download PDFInfo
- Publication number
- US20170195779A9 US20170195779A9 US13/471,423 US201213471423A US2017195779A9 US 20170195779 A9 US20170195779 A9 US 20170195779A9 US 201213471423 A US201213471423 A US 201213471423A US 2017195779 A9 US2017195779 A9 US 2017195779A9
- Authority
- US
- United States
- Prior art keywords
- voice
- ambient noise
- binaural
- signals
- pair
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000001629 suppression Effects 0.000 title description 3
- 238000000034 method Methods 0.000 claims abstract description 23
- 210000005069 ears Anatomy 0.000 claims abstract description 15
- 230000008447 perception Effects 0.000 claims abstract description 11
- 238000004891 communication Methods 0.000 claims abstract description 4
- 230000004936 stimulating effect Effects 0.000 claims abstract 7
- 230000000694 effects Effects 0.000 claims description 10
- 230000006870 function Effects 0.000 claims 5
- 238000012545 processing Methods 0.000 description 8
- 230000003044 adaptive effect Effects 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 239000002360 explosive Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H04S1/005—For headphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/22—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired frequency characteristic only
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers without distortion of the input signal
- H03G3/20—Automatic control
- H03G3/30—Automatic control in amplifiers having semiconductor devices
- H03G3/32—Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
Abstract
Description
- This application claims priority to U.S. Patent Application No. 61/486,088, filed on May 13, 2011, and entitled “PSYCHO-ACOUSTIC NOISE SUPPRESSION,” which is incorporated herein by reference.
- Recent developments in the art of manufacturing has brought significant reduction in cost and form factor of mobile consumer devices—tablet, blue tooth headset, net book, net TV etc. As a result, there is an explosive growth in consumption of these consumer devices. Besides communication applications such as voice and video telephony, voice driven machine applications are becoming increasing popular as well. Voice based machine applications include voice driven automated attendants, command recognition, speech recognition, voice based search engine, networked games and such. Video conferencing and other display oriented applications require the user to watch the screen from a hand-held distance. In the hand-held mode, the signal to noise ratio of the desired voice signal at the microphone is severely degraded, both due to the exposure to ambient noise and the exposure to loud acoustic echo feedback from the loudspeakers in close proximity. This is further exacerbated by the fact that voice driven applications and improved voice communications require wide band voice.
- Binaural headsets, both wired and wireless have been increasing along with the explosive growth of mobile consumer devices. However, the noise environments in which the headsets are used are becoming ever more challenging, especially in the presence of ambient noise in the environment of the listener.
- It is within this context that the embodiments arise.
- The embodiments provide take advantage of the observation that the human brain has naturally evolved to perform noise suppression by taking advantage of several cues in the environment. On a lighter note, it is this ability which makes a husband completely miss hearing his wife, while he may be busy watching a TV show.
- In the present embodiments, an arbitrary number of microphones are bifurcated into two groups. The microphones in each group are summed together to form two microphone arrays. Due to the computing ease of the processing operation, i.e., summing, these arrays by themselves provide very little improvement of signal to noise ratio in the desired look direction. However, the microphones are arranged such that the characteristics of the ambient noise from other directions orthogonal to the look direction, is substantially different between the outputs of the two microphone arrays. The embodiments employ a source separation adaptive filtering process between these two outputs to generate the desired signal with substantially improved signal to noise ratio. The separation process also provides ambient noise with significantly reduced voice. There are applications where the ambient noise is of use. The outputs of a multiplicity of microphones is reduced or encoded into two signals, i.e., the virtual microphones. With the reduced bandwidth and fixed signal dimension, it is easier to perform the processing through existing hardware and software systems, such that the processing of interest may be performed either on the end hosts or the network cloud.
- The above summary does not include all aspects of the present invention. The invention includes all systems and methods disclosed in the Detailed Description below and particularly pointed out in the claims.
- The embodiments of the invention are illustrated by way of examples and not be interpreted by way of limitation in the accompanying drawings.
-
FIG. 1 illustrates noise and voice signals originating from a same direction. -
FIG. 2 illustrates noise and voice signals originating from different directions. -
FIG. 3 illustrates a simplified schematic diagram illustrating a technique to excite the ears of a listener with a pair of stereo or binaural signals to create psycho acoustic effects in accordance with one embodiment of the invention. - While several details are set forth, it is understood that some embodiments of the invention may be practiced without these details. In some instances, well-known circuits and techniques have not been shown in detail so as not to obscure the understanding of this description.
- Active Noise Control has been utilized to generate anti-noise in the ears of the listener. However, active noise control requires a fully-covered headset, making it expensive and inconvenient Implementations which use smaller headsets have not fared well in the quality of enhancement.
- Adaptive Volume Control, where the volume of the remote talker's voice is increased in the listener's ears based on the loudness of ambient noise has also been utilized to address this problem. However, this method has a limited useful range in which the volume can be varied without hurting the listener's ears.
- The following psycho-acoustic observations of interest provide a basis for the embodiments described below:
-
- a. Human hearing is more sensitized to directional sound as compared to diffused sound coming from all over.
- b. Human hearing can differentiate two sources of sound better if it comes from two different directions as shown in
FIG. 2 , rather than coming from the same direction as shown inFIG. 1 . - c. Human hearing is more sensitized to a whisper in the ear rather than an equally loud sound from a distance.
- d. Human hearing is sensitized to a shout out from a distance. Shouting makes the voice different—it has the so called Lombard's effect.
- e. Human hearing is sensitized to a moving sound source as compared to a diffused source of sound
- In the embodiments described above, a plurality of microphones is bifurcated into two groups. As shown in
FIG. 3 , methods to excite the two ears of the listener with a pair of stereo, or binaural signals, creating the desired psycho-acoustic effects is provided. The effects processing is made adaptive with the ambient noise characteristics such as direction of arrival of the noise and its loudness. The effects include the following: -
- Playing the remote talker's voice in the two ears with different delays to create a perception of desired directionality.
- Playing the remote talker's voice in the two ears with different attenuation to further support the perception of desired directionality.
- Making the diffused ambient noise seem directional by mixing some of the noise sensed from the environment into one of the ears. It should be appreciated that this makes the listener hear the noise more in one of the ears.
- Playing the remote talker's voice in the two ears with a phase inversion to create the perception of the voice emanating from within the head of the listener.
- Introducing Lombard's effect into the remote talker's voice through signal processing, to create a perception that the remote talker is shouting to keep up with the ambient noise around the listener.
- The received remote talker's voice is mono. We make two copies for the left and right channels to be processed and fed to the two speakers of the stereo headset as shown in
FIG. 3 . Using the local microphones in the device, we sense the ambient noise. There are several ways to sense ambient noise, some described by the authors in other patents. From this sensor, we can determine the intensity and direction of the ambient noise. The two copies of the remote talker's signals are manipulated based on the sensed ambient noise. In one example of the manipulation, we delay the two copies to make it seem to arrive from a direction different from that of the ambient noise. In the event the ambient noise does not have a preferred direction, such as the vase of diffused noise, we artificially make it seem as though it is directional. This is achieved by taking some of the observed noise and adding it towards one of the ears, so the listener is led to believe that the noise is arriving from that direction. In addition, we also propose introducing Lombard's effect, in effect making it sound as though the remote talker's is shouting in response to the ambient noise in the local listener's environment. It should be appreciated that whileFIG. 3 illustrates a binaural headset, the same effect can be realized using stereo or surround speaker systems as well. - With the above embodiments in mind, it should be understood that the embodiments might employ various computer-implemented operations involving data stored in computer systems. These operations are those requiring physical manipulation of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. Further, the manipulations performed are often referred to in terms, such as producing, identifying, determining, or comparing. Any of the operations described herein that form part of the invention are useful machine operations. The embodiments also relates to a device or an apparatus for performing these operations. The apparatus can be specially constructed for the required purpose, or the apparatus can be a general-purpose computer selectively activated or configured by a computer program stored in the computer. In particular, various general-purpose machines can be used with computer programs written in accordance with the teachings herein, or it may be more convenient to construct a more specialized apparatus to perform the required operations
- The invention can also be embodied as computer readable code on a computer readable medium. The computer readable medium is any data storage device that can store data, which can be thereafter read by a computer system. Examples of the computer readable medium include hard drives, network attached storage (NAS), read-only memory, random-access memory, CD-ROMs, CD-Rs, CD-RWs, magnetic tapes, and other optical and non-optical data storage devices. The computer readable medium can also be distributed over a network coupled computer system so that the computer readable code is stored and executed in a distributed fashion. Embodiments of the present invention may be practiced with various computer system configurations including hand-held devices, microprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers and the like. The invention can also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a wire-based or wireless network.
- Although the method operations were described in a specific order, it should be understood that other operations may be performed in between described operations, described operations may be adjusted so that they occur at slightly different times or the described operations may be distributed in a system which allows the occurrence of the processing operations at various intervals associated with the processing.
- Although the foregoing invention has been described in some detail for purposes of clarity of understanding, it will be apparent that certain changes and modifications can be practiced within the scope of the appended claims. Accordingly, the present embodiments are to be considered as illustrative and not restrictive, and the invention is not to be limited to the details given herein, but may be modified within the scope and equivalents of the appended claims.
Claims (14)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/471,423 US9794678B2 (en) | 2011-05-13 | 2012-05-14 | Psycho-acoustic noise suppression |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161486088P | 2011-05-13 | 2011-05-13 | |
US201161486080P | 2011-05-13 | 2011-05-13 | |
US13/471,423 US9794678B2 (en) | 2011-05-13 | 2012-05-14 | Psycho-acoustic noise suppression |
Publications (3)
Publication Number | Publication Date |
---|---|
US20120288125A1 US20120288125A1 (en) | 2012-11-15 |
US20170195779A9 true US20170195779A9 (en) | 2017-07-06 |
US9794678B2 US9794678B2 (en) | 2017-10-17 |
Family
ID=47141909
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/471,423 Active 2034-01-15 US9794678B2 (en) | 2011-05-13 | 2012-05-14 | Psycho-acoustic noise suppression |
Country Status (1)
Country | Link |
---|---|
US (1) | US9794678B2 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2839461A4 (en) | 2012-04-19 | 2015-12-16 | Nokia Technologies Oy | An audio scene apparatus |
EP3441966A1 (en) * | 2014-07-23 | 2019-02-13 | PCMS Holdings, Inc. | System and method for determining audio context in augmented-reality applications |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3924072A (en) * | 1974-07-10 | 1975-12-02 | Koss Corp | Headphone with cross feeding ambience control |
US5386082A (en) * | 1990-05-08 | 1995-01-31 | Yamaha Corporation | Method of detecting localization of acoustic image and acoustic image localizing system |
US5590241A (en) | 1993-04-30 | 1996-12-31 | Motorola Inc. | Speech processing system and method for enhancing a speech signal in a noisy environment |
JP3685812B2 (en) | 1993-06-29 | 2005-08-24 | ソニー株式会社 | Audio signal transmitter / receiver |
DE69327396T2 (en) | 1993-07-28 | 2000-05-11 | Pan Communications Inc | Two-way communication earphones |
DE69527731T2 (en) | 1994-05-18 | 2003-04-03 | Nippon Telegraph & Telephone | Transceiver with an acoustic transducer of the earpiece type |
US5692059A (en) | 1995-02-24 | 1997-11-25 | Kruger; Frederick M. | Two active element in-the-ear microphone system |
US7012630B2 (en) * | 1996-02-08 | 2006-03-14 | Verizon Services Corp. | Spatial sound conference system and apparatus |
US6978159B2 (en) * | 1996-06-19 | 2005-12-20 | Board Of Trustees Of The University Of Illinois | Binaural signal processing using multiple acoustic sensors and digital filtering |
JP4478220B2 (en) * | 1997-05-29 | 2010-06-09 | ソニー株式会社 | Sound field correction circuit |
US7206421B1 (en) * | 2000-07-14 | 2007-04-17 | Gn Resound North America Corporation | Hearing system beamformer |
US7246058B2 (en) | 2001-05-30 | 2007-07-17 | Aliph, Inc. | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US7116787B2 (en) * | 2001-05-04 | 2006-10-03 | Agere Systems Inc. | Perceptual synthesis of auditory scenes |
US6701170B2 (en) | 2001-11-02 | 2004-03-02 | Nellcor Puritan Bennett Incorporated | Blind source separation of pulse oximetry signals |
US7260209B2 (en) * | 2003-03-27 | 2007-08-21 | Tellabs Operations, Inc. | Methods and apparatus for improving voice quality in an environment with noise |
EP1519628A3 (en) * | 2003-09-29 | 2009-03-04 | Siemens Aktiengesellschaft | Method and device for the reproduction of a binaural output signal which is derived from a monaural input signal |
GB2414369B (en) | 2004-05-21 | 2007-08-01 | Hewlett Packard Development Co | Processing audio data |
WO2007018293A1 (en) | 2005-08-11 | 2007-02-15 | Asahi Kasei Kabushiki Kaisha | Sound source separating device, speech recognizing device, portable telephone, and sound source separating method, and program |
US7612793B2 (en) * | 2005-09-07 | 2009-11-03 | Polycom, Inc. | Spatially correlated audio in multipoint videoconferencing |
US7813923B2 (en) | 2005-10-14 | 2010-10-12 | Microsoft Corporation | Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset |
JP5098176B2 (en) * | 2006-01-10 | 2012-12-12 | カシオ計算機株式会社 | Sound source direction determination method and apparatus |
FR2899424A1 (en) * | 2006-03-28 | 2007-10-05 | France Telecom | Audio channel multi-channel/binaural e.g. transaural, three-dimensional spatialization method for e.g. ear phone, involves breaking down filter into delay and amplitude values for samples, and extracting filter`s spectral module on samples |
US8374365B2 (en) * | 2006-05-17 | 2013-02-12 | Creative Technology Ltd | Spatial audio analysis and synthesis for binaural reproduction and format conversion |
JP4722878B2 (en) | 2007-04-19 | 2011-07-13 | ソニー株式会社 | Noise reduction device and sound reproduction device |
US8180064B1 (en) * | 2007-12-21 | 2012-05-15 | Audience, Inc. | System and method for providing voice equalization |
KR101082839B1 (en) | 2008-12-22 | 2011-11-11 | 한국전자통신연구원 | Method and apparatus for multi channel noise reduction |
JP4883103B2 (en) * | 2009-02-06 | 2012-02-22 | ソニー株式会社 | Signal processing apparatus, signal processing method, and program |
EP2839461A4 (en) * | 2012-04-19 | 2015-12-16 | Nokia Technologies Oy | An audio scene apparatus |
-
2012
- 2012-05-14 US US13/471,423 patent/US9794678B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20120288125A1 (en) | 2012-11-15 |
US9794678B2 (en) | 2017-10-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3424229B1 (en) | Systems and methods for spatial audio adjustment | |
CN107637095B (en) | Privacy preserving, energy efficient speaker for personal sound | |
US10469976B2 (en) | Wearable electronic device and virtual reality system | |
US11037544B2 (en) | Sound output device, sound output method, and sound output system | |
JP5762550B2 (en) | 3D sound acquisition and playback using multi-microphone | |
US9560445B2 (en) | Enhanced spatial impression for home audio | |
US9161149B2 (en) | Three-dimensional sound compression and over-the-air transmission during a call | |
JP2013546253A (en) | System, method, apparatus and computer readable medium for head tracking based on recorded sound signals | |
CN109310525B (en) | Media compensation pass-through and mode switching | |
EP3459231B1 (en) | Device for generating audio output | |
US11611840B2 (en) | Three-dimensional audio systems | |
CN113784274A (en) | Three-dimensional audio system | |
WO2016042410A1 (en) | Techniques for acoustic reverberance control and related systems and methods | |
US9794678B2 (en) | Psycho-acoustic noise suppression | |
US20190246230A1 (en) | Virtual localization of sound | |
US20190245503A1 (en) | Method for dynamic sound equalization | |
US11217268B2 (en) | Real-time augmented hearing platform | |
KR101111734B1 (en) | Sound reproduction method and apparatus distinguishing multiple sound sources | |
US20230319492A1 (en) | Adaptive binaural filtering for listening system using remote signal sources and on-ear microphones | |
US20240107259A1 (en) | Spatial Capture with Noise Mitigation | |
WO2017211448A1 (en) | Method for generating a two-channel signal from a single-channel signal of a sound source | |
Shabtai et al. | Spherical array processing with binaural sound reproduction for improved speech intelligibility | |
JP2022128177A (en) | Sound generation device, sound reproduction device, sound reproduction method, and sound signal processing program | |
KR20240062489A (en) | Electronic device and sound output method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: AURENTA, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MUKUND, SHRIDHAR K;REEL/FRAME:033271/0083 Effective date: 20140708 Owner name: PLANTRONICS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AURENTA, INC.;REEL/FRAME:033271/0164 Effective date: 20140708 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: WELLS FARGO BANK, NATIONAL ASSOCIATION, NORTH CAROLINA Free format text: SECURITY AGREEMENT;ASSIGNORS:PLANTRONICS, INC.;POLYCOM, INC.;REEL/FRAME:046491/0915 Effective date: 20180702 Owner name: WELLS FARGO BANK, NATIONAL ASSOCIATION, NORTH CARO Free format text: SECURITY AGREEMENT;ASSIGNORS:PLANTRONICS, INC.;POLYCOM, INC.;REEL/FRAME:046491/0915 Effective date: 20180702 |
|
FEPP | Fee payment procedure |
Free format text: SURCHARGE FOR LATE PAYMENT, LARGE ENTITY (ORIGINAL EVENT CODE: M1554); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: POLYCOM, INC., CALIFORNIA Free format text: RELEASE OF PATENT SECURITY INTERESTS;ASSIGNOR:WELLS FARGO BANK, NATIONAL ASSOCIATION;REEL/FRAME:061356/0366 Effective date: 20220829 Owner name: PLANTRONICS, INC., CALIFORNIA Free format text: RELEASE OF PATENT SECURITY INTERESTS;ASSIGNOR:WELLS FARGO BANK, NATIONAL ASSOCIATION;REEL/FRAME:061356/0366 Effective date: 20220829 |
|
AS | Assignment |
Owner name: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P., TEXAS Free format text: NUNC PRO TUNC ASSIGNMENT;ASSIGNOR:PLANTRONICS, INC.;REEL/FRAME:065549/0065 Effective date: 20231009 |