WO2001084884A2 - Audio system - Google Patents

Audio system Download PDF

Info

Publication number
WO2001084884A2
WO2001084884A2 PCT/EP2001/004323 EP0104323W WO0184884A2 WO 2001084884 A2 WO2001084884 A2 WO 2001084884A2 EP 0104323 W EP0104323 W EP 0104323W WO 0184884 A2 WO0184884 A2 WO 0184884A2
Authority
WO
WIPO (PCT)
Prior art keywords
audio
listener
reproduction system
loudspeakers
microphones
Prior art date
Application number
PCT/EP2001/004323
Other languages
French (fr)
Other versions
WO2001084884A3 (en
Inventor
Harm J. W. Belt
David A. C. M. Roovers
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to JP2001581575A priority Critical patent/JP2003533110A/en
Priority to EP01947224A priority patent/EP1279318A2/en
Priority to KR1020017016712A priority patent/KR20020028918A/en
Publication of WO2001084884A2 publication Critical patent/WO2001084884A2/en
Publication of WO2001084884A3 publication Critical patent/WO2001084884A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/60Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
    • H04N5/607Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals for more than one sound signal, e.g. stereo, multilanguages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation

Definitions

  • the invention relates to an audio reproduction system as described in the preamble of Claim 1.
  • the invention further relates to audio processing means for use in such an audio reproduction system.
  • the invention further relates to a voice controlled system comprising such an audio reproduction system.
  • the invention further relates to a video reproduction system comprising such an audio reproduction system.
  • a disadvantage of known audio reproduction systems with stereo or multichannel audio signals is the fact that the audio quality is strongly dependent on the position of the listener in relation to the loudspeakers. When this position differs from the "ideal" position, the reproduction quality is deteriorated to a great extent. This is caused by unwanted amplitude and phase differences between the acoustic transfer functions of the different loudspeakers to the listener.
  • a further disadvantage is that also in case the listener has taken a position which is geometrically the "ideal" position the acoustic of the room can cause local amplitude and phase differences between the acoustic transfer functions resulting in a deteriorated reproduction.
  • an audio reproduction system comprises the features of claim 1.
  • the microphones By using at least two microphones it is possible to determine the location of the listener, or at least certain characteristics thereof, when the listener speaks. These techniques are commonly known as "Blind Identification" If the microphones are located close to the loudspeakers, then also the acoustic transfer functions from the listener to the loudspeakers or characteristics thereof, are obtained. The invention is based on the insight that by obtaining the transfer function from the listener to the loudspeakers also the transfer function from the loudspeaker to the listener can be obtained using the reciprocity theorem. Hereafter it is possible to amend the audio signal as supplied to the different loudspeakers to optimize the audio quality at the position of the listener by correcting for the identified amplitude and/or phase differences.
  • a disadvantage of such a sound system is that for audio systems the location of the remote control and the listener's position (especially his ears/head) is not the same.
  • An embodiment of the invention comprises the features of claim 2.
  • the echo cancellation means by the microphone(s), received echo signals from the loudspeakers can be cancelled before the speech signal(s) are further processed.
  • FIG. 1 schematically an example of an audio reproduction system according to the invention
  • FIG. 1 shows an audio reproduction system AS comprising audio processing means APM which audio processing means receive an input audio signal IAS and supply after processing a first output audio signal OAS1 to a first loudspeaker LSI and a second output audio signal OAS2 to a second loudspeaker LS2.
  • the loudspeakers LSI and LS2 supply sound signals SSI and SS2.
  • the audio reproduction system AS further comprises a first and a second microphone MP1 respectively MP2 for receiving a voice controlled command NCC from a listener P.
  • the first and second microphone MP1, MP2 are coupled to a command unit CU for handling the microphone output signals and supplying a signal to the audio processing means APM.
  • the audio processing means further comprise echo-cancellation means ECM to cancel the echo signals received with the microphones from the loudspeakers.
  • the microphones are located in the neighborhood of the respective loudspeakers.
  • the invention is based on the insight that by obtaining the transfer function from the listener to the loudspeakers also the transfer function from the loudspeaker to the listener can be obtained using the reciprocity theorem.
  • the echo cancellation means by the microphone(s), received echo signals from the loudspeakers can be cancelled before the voice commands are further processed.
  • Fig. 2 describes in more detail part of the audio reproducing system AS2.
  • This example of a sound reproduction system comprises two closely spaced loudspeakers LS21 and LS22 and two microphones MP21 and MP22.
  • the microphones can be positioned below or above the loudspeaker boxes or they can be integrated into the front panels, or closely in the neighborhood.
  • the Sound filters Hi and H operate on the left and right channels (L and R) of the input stereo signal AL and AR, as indicated in fig. 2.
  • a speaker localization algorithm in the audio reproduction system estimates the difference between the acoustic propagation delays from the user's position to the left and right microphone, respectively. Using the reciprocity theorem, this is also the delay difference between the two acoustic paths from the loudspeakers to the listener, as explained above.
  • this acoustic propagation time delay is compensated for with a delay T ⁇ n in the left channel in a delay device TD2 (if the acoustic path length L 2 between the listener and the right loudspeaker is larger than the acoustic path length L] between the listener and the left speaker) or a delay T d2 in the right channel with a delay device TD1 (if ⁇ > L 2 ).
  • the speaker localization is done at low frequencies only, since at frequencies higher than 4 kHz the speech signal contains little power. Also, time delays are ambiguous at higher frequencies due to the short acoustic wavelengths.
  • the speaker localization algorithm can be combined with a speech detector so that adaptation is halted during non-speech periods.
  • the invention can also be seen as a first step towards voice controlled electronic consumer products, which would be acceptable for the greater public.
  • Applications of the invention can be found in stereo and multi-channel sound reproduction systems such as televisions, in portable stereo sets, and in others.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)

Abstract

Audio reproduction systems are used to reproduce audio signals. A disadvantage of known audio reproduction systems is that in most cases with stereo or multi-channel signals the quality performance of reproduction is very much dependent on the listening position. The invention proposes a solution by making it possible for the audio reproduction system to localize the position of the listener and amend the reproduced audio signal in dependence of the location.

Description

Audio system.
The invention relates to an audio reproduction system as described in the preamble of Claim 1.
The invention further relates to audio processing means for use in such an audio reproduction system. The invention further relates to a voice controlled system comprising such an audio reproduction system.
The invention further relates to a video reproduction system comprising such an audio reproduction system.
A disadvantage of known audio reproduction systems with stereo or multichannel audio signals is the fact that the audio quality is strongly dependent on the position of the listener in relation to the loudspeakers. When this position differs from the "ideal" position, the reproduction quality is deteriorated to a great extent. This is caused by unwanted amplitude and phase differences between the acoustic transfer functions of the different loudspeakers to the listener.
A further disadvantage is that also in case the listener has taken a position which is geometrically the "ideal" position the acoustic of the room can cause local amplitude and phase differences between the acoustic transfer functions resulting in a deteriorated reproduction.
It is inter alia an object of the invention to obtain an improved audio reproduction system.
To achieve these object(s) an audio reproduction system according to the invention comprises the features of claim 1.
By using at least two microphones it is possible to determine the location of the listener, or at least certain characteristics thereof, when the listener speaks. These techniques are commonly known as "Blind Identification" If the microphones are located close to the loudspeakers, then also the acoustic transfer functions from the listener to the loudspeakers or characteristics thereof, are obtained. The invention is based on the insight that by obtaining the transfer function from the listener to the loudspeakers also the transfer function from the loudspeaker to the listener can be obtained using the reciprocity theorem. Hereafter it is possible to amend the audio signal as supplied to the different loudspeakers to optimize the audio quality at the position of the listener by correcting for the identified amplitude and/or phase differences.
It is to be noticed that from the US Patent US-A-5,386,478 a sound system remote control with an acoustic sensor is known to optimize the sound quality at a particular listening location as sensed by a microphone in a hand-held remote control unit.
A disadvantage of such a sound system is that for audio systems the location of the remote control and the listener's position (especially his ears/head) is not the same.
An embodiment of the invention comprises the features of claim 2.
By using the echo cancellation means the, by the microphone(s), received echo signals from the loudspeakers can be cancelled before the speech signal(s) are further processed.
It is further to be noticed here that the not-prepublished international application no. PCT/EP99/08253 (Applicant's reference: PHN 17.163) describes a signal source localization arrangement for use in video conferencing systems. Herein the localization is used to make the videoconference more "real" by steering a camera towards the source.
Further embodiments of the invention are described in the dependent claims.
These and other aspects of the invention will be apparent from and elucidated with reference to the examples described hereinafter. Herein shows
Figure 1 schematically an example of an audio reproduction system according to the invention, and
Figure 2 a second schematically example of an audio reproduction system according to the invention. Fig. 1 shows an audio reproduction system AS comprising audio processing means APM which audio processing means receive an input audio signal IAS and supply after processing a first output audio signal OAS1 to a first loudspeaker LSI and a second output audio signal OAS2 to a second loudspeaker LS2. The loudspeakers LSI and LS2 supply sound signals SSI and SS2.
The audio reproduction system AS further comprises a first and a second microphone MP1 respectively MP2 for receiving a voice controlled command NCC from a listener P. The first and second microphone MP1, MP2 are coupled to a command unit CU for handling the microphone output signals and supplying a signal to the audio processing means APM.
The audio processing means further comprise echo-cancellation means ECM to cancel the echo signals received with the microphones from the loudspeakers.
The microphones are located in the neighborhood of the respective loudspeakers.
By using at least two microphones it is possible to determine the location of the listener, or at least certain characteristics thereof, when the listener speaks. These techniques are commonly known as "Blind Identification" If the microphones are located close to the loudspeakers, then also the acoustic transfer functions from the listener to the loudspeakers or characteristics thereof, are obtained.
The invention is based on the insight that by obtaining the transfer function from the listener to the loudspeakers also the transfer function from the loudspeaker to the listener can be obtained using the reciprocity theorem.
Hereafter it is possible to amend the audio signal as supplied to the different loudspeakers to optimize the audio quality at the position of the listener by correcting for the identified amplitude and/or phase differences.
By using the echo cancellation means the, by the microphone(s), received echo signals from the loudspeakers can be cancelled before the voice commands are further processed.
In this way it is possible to locate the listener by using e.g. the time difference between the received voice-controlled command at the first respectively second microphone. Fig. 2 describes in more detail part of the audio reproducing system AS2. This example of a sound reproduction system comprises two closely spaced loudspeakers LS21 and LS22 and two microphones MP21 and MP22. The microphones can be positioned below or above the loudspeaker boxes or they can be integrated into the front panels, or closely in the neighborhood.
The Sound filters Hi and H operate on the left and right channels (L and R) of the input stereo signal AL and AR, as indicated in fig. 2. A speaker localization algorithm in the audio reproduction system estimates the difference between the acoustic propagation delays from the user's position to the left and right microphone, respectively. Using the reciprocity theorem, this is also the delay difference between the two acoustic paths from the loudspeakers to the listener, as explained above.
Next, this acoustic propagation time delay is compensated for with a delay T<n in the left channel in a delay device TD2 (if the acoustic path length L2 between the listener and the right loudspeaker is larger than the acoustic path length L] between the listener and the left speaker) or a delay Td2 in the right channel with a delay device TD1 (if \ > L2).
It is noted that the speaker localization algorithm operates on a narrow band speech signal sampled at e.g. Fs = 8 kHz, while the Sound filters operate on the full audio bandwidth. The speaker localization is done at low frequencies only, since at frequencies higher than 4 kHz the speech signal contains little power. Also, time delays are ambiguous at higher frequencies due to the short acoustic wavelengths.
The algorithm presented above only works when the music is not playing; with no additional measures the sound emitted by the loudspeakers and picked up by the microphones interferes with the user's speech, leading to incorrect speaker localization. To enable adaptation when the music is playing, two stereo echo cancellers (depicted in figure 1 by ECM) can be used in order to cancel the music signals picked up by the two microphones. In this way, the speaker localization algorithm is not affected by the music. With only one stereo echo canceller operating on one microphone it would be possible to detect a speech command, after which the music can be stopped and the speaker localization can be performed before the music starts playing again.
The speaker localization algorithm can be combined with a speech detector so that adaptation is halted during non-speech periods.
With a speech detector the robustness against background noise is increased. Above the acoustic speaker localization was combined with the so-called Incredible Sound scheme.
However, combinations with any other type of sound processing (other stereo base wideners or normal stereo) are possible.
Until now Incredible Sound could only be heart at full merits only on one position.
With this invention the applicability of Incredible Sound is greatly increased since the listener is no longer restricted to a certain position. The invention can also be seen as a first step towards voice controlled electronic consumer products, which would be acceptable for the greater public.
Applications of the invention can be found in stereo and multi-channel sound reproduction systems such as televisions, in portable stereo sets, and in others.
Above two examples of an audio reproduction system according to the invention are described wherein the speech signals are also used for voice control.
It will be understand by the man skilled in the art that also in cases where no voice controlled operation of the audio reproduction system is available the invention can be used to advantage. The only need is for each loudspeaker a microphone in the neighborhood of the loudspeaker to receive a speech signal from the listener and the audio processing means will amend the audio signal to improve the reproduced audio signal at the location of the listener.
Further it will be noticed that using this audio reproduction system in a video reproduction system is also possible to improve the sound when viewing the pictures on a screen.

Claims

CLAIMS:
1. Audio reproduction system comprising audio processing means for processing an input audio signal, at least two loudspeakers to reproduce the processed audio signal, characterized in that the audio reproduction system comprises means to obtain characteristics of the transfer functions from the loudspeakers to the listener, which means comprise for each loudspeaker a microphone, whereby the microphones are located at least in the neighborhood of the loudspeakers, and the audio processing means comprise means to amend the audio signal in dependence of the location of the listener.
2. Voice-operated audio and/or video reproducing system as claimed in claim 1, characterized in that the audio processing means comprise echo-cancellation means for canceling the echo signals received by the microphones.
3. Audio processing means for use in audio and/or video reproduction system as claimed in claim 1.
4. Voice controlled system comprising an audio reproduction system according to claim 1.
5. Video reproduction system comprising an audio reproduction system according to claim 1.
PCT/EP2001/004323 2000-04-28 2001-04-13 Audio system WO2001084884A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2001581575A JP2003533110A (en) 2000-04-28 2001-04-13 Audio system
EP01947224A EP1279318A2 (en) 2000-04-28 2001-04-13 Audio system
KR1020017016712A KR20020028918A (en) 2000-04-28 2001-04-13 Audio system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP00201529 2000-04-28
EP00201529.5 2000-04-28

Publications (2)

Publication Number Publication Date
WO2001084884A2 true WO2001084884A2 (en) 2001-11-08
WO2001084884A3 WO2001084884A3 (en) 2002-06-06

Family

ID=8171420

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2001/004323 WO2001084884A2 (en) 2000-04-28 2001-04-13 Audio system

Country Status (6)

Country Link
US (1) US20010050995A1 (en)
EP (1) EP1279318A2 (en)
JP (1) JP2003533110A (en)
KR (1) KR20020028918A (en)
CN (1) CN1401203A (en)
WO (1) WO2001084884A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100636193B1 (en) 2004-11-18 2006-10-19 삼성전자주식회사 Method and apparatus for setting sound parameter using motion detection
WO2013095920A1 (en) * 2011-12-19 2013-06-27 Qualcomm Incorporated Automated user/sensor location recognition to customize audio performance in a distributed multi-sensor environment

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1359437A1 (en) * 2002-04-29 2003-11-05 Siemens Aktiengesellschaft Method for determining a position of a user of a communication terminal
US20120148075A1 (en) * 2010-12-08 2012-06-14 Creative Technology Ltd Method for optimizing reproduction of audio signals from an apparatus for audio reproduction
CA2908654C (en) * 2013-04-10 2019-08-13 Nokia Technologies Oy Audio recording and playback apparatus
CN105794231B (en) 2013-11-22 2018-11-06 苹果公司 Hands-free beam pattern configuration
JP2016100617A (en) * 2014-11-18 2016-05-30 学校法人千葉工業大学 Acoustic signal processor
JP6642989B2 (en) * 2015-07-06 2020-02-12 キヤノン株式会社 Control device, control method, and program

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5255326A (en) * 1992-05-18 1993-10-19 Alden Stevenson Interactive audio control system
US5555310A (en) * 1993-02-12 1996-09-10 Kabushiki Kaisha Toshiba Stereo voice transmission apparatus, stereo signal coding/decoding apparatus, echo canceler, and voice input/output apparatus to which this echo canceler is applied

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6535609B1 (en) * 1997-06-03 2003-03-18 Lear Automotive Dearborn, Inc. Cabin communication system
US6483532B1 (en) * 1998-07-13 2002-11-19 Netergy Microelectronics, Inc. Video-assisted audio signal processing system and method
US6778671B1 (en) * 1998-12-14 2004-08-17 Intel Corporation Method of reference to echo time alignment for facilitation of echo cancellation

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5255326A (en) * 1992-05-18 1993-10-19 Alden Stevenson Interactive audio control system
US5555310A (en) * 1993-02-12 1996-09-10 Kabushiki Kaisha Toshiba Stereo voice transmission apparatus, stereo signal coding/decoding apparatus, echo canceler, and voice input/output apparatus to which this echo canceler is applied

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100636193B1 (en) 2004-11-18 2006-10-19 삼성전자주식회사 Method and apparatus for setting sound parameter using motion detection
WO2013095920A1 (en) * 2011-12-19 2013-06-27 Qualcomm Incorporated Automated user/sensor location recognition to customize audio performance in a distributed multi-sensor environment
US9408011B2 (en) 2011-12-19 2016-08-02 Qualcomm Incorporated Automated user/sensor location recognition to customize audio performance in a distributed multi-sensor environment
US10492015B2 (en) 2011-12-19 2019-11-26 Qualcomm Incorporated Automated user/sensor location recognition to customize audio performance in a distributed multi-sensor environment

Also Published As

Publication number Publication date
EP1279318A2 (en) 2003-01-29
JP2003533110A (en) 2003-11-05
KR20020028918A (en) 2002-04-17
US20010050995A1 (en) 2001-12-13
WO2001084884A3 (en) 2002-06-06
CN1401203A (en) 2003-03-05

Similar Documents

Publication Publication Date Title
JP5177820B2 (en) System and method for enhanced subjective stereo audio
JP4946305B2 (en) Sound reproduction system, sound reproduction apparatus, and sound reproduction method
US7477735B2 (en) System and method for enhanced stereo audio
US8699742B2 (en) Sound system and a method for providing sound
JP5540581B2 (en) Audio signal processing apparatus and audio signal processing method
JP4359779B2 (en) Sound reproduction apparatus and sound reproduction method
US20110144779A1 (en) Data processing for a wearable apparatus
EP1545154A2 (en) A virtual surround sound device
US7889872B2 (en) Device and method for integrating sound effect processing and active noise control
US20100290615A1 (en) Echo canceller operative in response to fluctuation on echo path
EP1225789B1 (en) A stereo widening algorithm for loudspeakers
US6665645B1 (en) Speech recognition apparatus for AV equipment
KR20020086671A (en) Asymmetric multichannel filter
JP2002159096A (en) Personal on-demand audio entertainment device that is untethered and allows wireless download of content
US7062041B2 (en) Device and method for carrying out multichannel acoustic echo cancellation with a variable number of channels
CN114424583A (en) Hybrid near-field/far-field speaker virtualization
US20050286727A1 (en) Apparatus for expanding sound image upward
US20010050995A1 (en) Audio system
US7474754B2 (en) Method for canceling unwanted loudspeaker signals
JP2006352728A (en) Audio apparatus
US20220343888A1 (en) Audio device with acoustic echo cancellation
JPH05268700A (en) Stereo listening aid device
Corey et al. Immersive Enhancement and Removal of Loudspeaker Sound Using Wireless Assistive Listening Systems and Binaural Hearing Devices
JPH05227110A (en) Side tone suppressing device for sng
JP3308780B2 (en) Noise suppression circuit

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 01801102.0

Country of ref document: CN

AK Designated states

Kind code of ref document: A2

Designated state(s): CN JP KR

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

WWE Wipo information: entry into national phase

Ref document number: 2001947224

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2001 581575

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 1020017016712

Country of ref document: KR

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): CN JP KR

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

WWP Wipo information: published in national office

Ref document number: 2001947224

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2001947224

Country of ref document: EP