GB2327835B - Method and apparatus for speech enhancement in a speech communication system - Google Patents
Method and apparatus for speech enhancement in a speech communication systemInfo
- Publication number
- GB2327835B GB2327835B GB9814279A GB9814279A GB2327835B GB 2327835 B GB2327835 B GB 2327835B GB 9814279 A GB9814279 A GB 9814279A GB 9814279 A GB9814279 A GB 9814279A GB 2327835 B GB2327835 B GB 2327835B
- Authority
- GB
- United Kingdom
- Prior art keywords
- speech
- unit
- determines
- intelligible
- listener
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000003595 spectral effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephonic Communication Services (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Interconnected Communication Systems, Intercoms, And Interphones (AREA)
- Telephone Function (AREA)
Abstract
The characteristics of the speech received by the decoding unit are altered by a processing unit 10 based upon an analysis of the listener's current background noise before the speech is output to enhance its intelligibility to a listener. An analysis unit 12 determines the type and level of the background noise by use of a microphone 13. A decision unit 11 then determines whether the speech currently being received and replayed would be intelligible to an average listener in the current background noise. If unit 11 determines that the speech is readily intelligible then no processing is necessary and the processing unit 10 does not alter the speech which has been passed to it. However, if unit 11 determines that the speech would be unintelligible, then unit 10 alters the speech before passing it to the output to make the speech more intelligible. In a particularly preferred embodiment, the speech characteristics are altered by altering line spectral pair/formant data representing the speech.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9920667A GB2336978B (en) | 1997-07-02 | 1998-07-01 | Method and apparatus for speech enhancement in a speech communication system |
GB0001586A GB2343822B (en) | 1997-07-02 | 1998-07-01 | Method and apparatus for speech enhancement in a speech communication system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB9714001.6A GB9714001D0 (en) | 1997-07-02 | 1997-07-02 | Method and apparatus for speech enhancement in a speech communication system |
Publications (3)
Publication Number | Publication Date |
---|---|
GB9814279D0 GB9814279D0 (en) | 1998-09-02 |
GB2327835A GB2327835A (en) | 1999-02-03 |
GB2327835B true GB2327835B (en) | 2000-04-19 |
Family
ID=10815285
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GBGB9714001.6A Ceased GB9714001D0 (en) | 1997-07-02 | 1997-07-02 | Method and apparatus for speech enhancement in a speech communication system |
GB9814279A Expired - Fee Related GB2327835B (en) | 1997-07-02 | 1998-07-01 | Method and apparatus for speech enhancement in a speech communication system |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GBGB9714001.6A Ceased GB9714001D0 (en) | 1997-07-02 | 1997-07-02 | Method and apparatus for speech enhancement in a speech communication system |
Country Status (12)
Country | Link |
---|---|
EP (1) | EP0993670B1 (en) |
JP (1) | JP2002507291A (en) |
KR (1) | KR20010014352A (en) |
CN (1) | CN1265217A (en) |
AT (1) | ATE214832T1 (en) |
AU (1) | AU8227798A (en) |
CA (1) | CA2235455A1 (en) |
DE (1) | DE69804310D1 (en) |
GB (2) | GB9714001D0 (en) |
PL (1) | PL337717A1 (en) |
WO (1) | WO1999001863A1 (en) |
ZA (1) | ZA985607B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9264836B2 (en) | 2007-12-21 | 2016-02-16 | Dts Llc | System for adjusting perceived loudness of audio signals |
US9312829B2 (en) | 2012-04-12 | 2016-04-12 | Dts Llc | System for adjusting loudness of audio signals in real time |
Families Citing this family (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE9903553D0 (en) * | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing conceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL) |
FR2794322B1 (en) * | 1999-05-27 | 2001-06-22 | Sagem | NOISE SUPPRESSION PROCESS |
US7120579B1 (en) | 1999-07-28 | 2006-10-10 | Clear Audio Ltd. | Filter banked gain control of audio in a noisy environment |
US6876968B2 (en) * | 2001-03-08 | 2005-04-05 | Matsushita Electric Industrial Co., Ltd. | Run time synthesizer adaptation to improve intelligibility of synthesized speech |
DE10124189A1 (en) * | 2001-05-17 | 2002-11-21 | Siemens Ag | Signal reception in digital communications system involves generating output background signal with bandwidth greater than that of background signal characterized by received data |
JP2003255993A (en) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | System, method, and program for speech recognition, and system, method, and program for speech synthesis |
WO2004002028A2 (en) * | 2002-06-19 | 2003-12-31 | Koninklijke Philips Electronics N.V. | Audio signal processing apparatus and method |
US20060126859A1 (en) * | 2003-01-31 | 2006-06-15 | Claus Elberling | Sound system improving speech intelligibility |
KR20050049103A (en) * | 2003-11-21 | 2005-05-25 | 삼성전자주식회사 | Method and apparatus for enhancing dialog using formant |
KR101215944B1 (en) * | 2004-09-07 | 2012-12-27 | 센시어 피티와이 엘티디 | Hearing protector and Method for sound enhancement |
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
GB2433849B (en) | 2005-12-29 | 2008-05-21 | Motorola Inc | Telecommunications terminal and method of operation of the terminal |
DE102006001730A1 (en) | 2006-01-13 | 2007-07-19 | Robert Bosch Gmbh | Sound system, method for improving the voice quality and / or intelligibility of voice announcements and computer program |
EP1814109A1 (en) * | 2006-01-27 | 2007-08-01 | Texas Instruments Incorporated | Voice amplification apparatus for modelling the Lombard effect |
JP2007295347A (en) * | 2006-04-26 | 2007-11-08 | Mitsubishi Electric Corp | Voice processor |
KR101414233B1 (en) | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving speech intelligibility |
JP4926005B2 (en) * | 2007-11-13 | 2012-05-09 | ソニー・エリクソン・モバイルコミュニケーションズ株式会社 | Audio signal processing apparatus, audio signal processing method, and communication terminal |
JP5453740B2 (en) * | 2008-07-02 | 2014-03-26 | 富士通株式会社 | Speech enhancement device |
US8538042B2 (en) | 2009-08-11 | 2013-09-17 | Dts Llc | System for increasing perceived loudness of speakers |
EP2372700A1 (en) * | 2010-03-11 | 2011-10-05 | Oticon A/S | A speech intelligibility predictor and applications thereof |
US9117455B2 (en) | 2011-07-29 | 2015-08-25 | Dts Llc | Adaptive voice intelligibility processor |
CN103002105A (en) * | 2011-09-16 | 2013-03-27 | 宏碁股份有限公司 | Mobile communication method capable of improving articulation of communication contents |
CN103297896B (en) * | 2012-02-27 | 2016-07-06 | 联想(北京)有限公司 | A kind of audio-frequency inputting method and electronic equipment |
US9015044B2 (en) | 2012-03-05 | 2015-04-21 | Malaspina Labs (Barbados) Inc. | Formant based speech reconstruction from noisy signals |
EP3010017A1 (en) * | 2014-10-14 | 2016-04-20 | Thomson Licensing | Method and apparatus for separating speech data from background data in audio communication |
JP6565206B2 (en) * | 2015-02-20 | 2019-08-28 | ヤマハ株式会社 | Audio processing apparatus and audio processing method |
EP3107097B1 (en) | 2015-06-17 | 2017-11-15 | Nxp B.V. | Improved speech intelligilibility |
US9847093B2 (en) | 2015-06-19 | 2017-12-19 | Samsung Electronics Co., Ltd. | Method and apparatus for processing speech signal |
JP6790732B2 (en) * | 2016-11-02 | 2020-11-25 | ヤマハ株式会社 | Signal processing method and signal processing device |
EP3566469B1 (en) | 2017-01-03 | 2020-04-01 | Lizn APS | Speech intelligibility enhancing system |
WO2019127112A1 (en) * | 2017-12-27 | 2019-07-04 | 深圳前海达闼云端智能科技有限公司 | Voice interaction method and device and intelligent terminal |
CN109346058B (en) * | 2018-11-29 | 2024-06-28 | 西安交通大学 | Voice acoustic feature expansion system |
US11817114B2 (en) * | 2019-12-09 | 2023-11-14 | Dolby Laboratories Licensing Corporation | Content and environmentally aware environmental noise compensation |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4532648A (en) * | 1981-10-22 | 1985-07-30 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle |
US4538295A (en) * | 1982-08-16 | 1985-08-27 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle |
WO1987000366A1 (en) * | 1985-07-01 | 1987-01-15 | Motorola, Inc. | Noise supression system |
WO1989006877A1 (en) * | 1988-01-18 | 1989-07-27 | British Telecommunications Public Limited Company | Noise reduction |
EP0505645A1 (en) * | 1991-03-27 | 1992-09-30 | R.G.A. & Associates Ltd. | Public address intelligibility enhancement system |
EP0763888A2 (en) * | 1995-09-13 | 1997-03-19 | Nokia Mobile Phones Ltd. | Method and circuit arrangement for processing audio signal |
GB2306086A (en) * | 1995-10-06 | 1997-04-23 | Richard Morris Trim | Improved adaptive audio systems |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5235669A (en) * | 1990-06-29 | 1993-08-10 | At&T Laboratories | Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec |
-
1997
- 1997-07-02 GB GBGB9714001.6A patent/GB9714001D0/en not_active Ceased
-
1998
- 1998-04-21 CA CA002235455A patent/CA2235455A1/en not_active Abandoned
- 1998-06-26 ZA ZA9805607A patent/ZA985607B/en unknown
- 1998-07-01 JP JP50665899A patent/JP2002507291A/en active Pending
- 1998-07-01 GB GB9814279A patent/GB2327835B/en not_active Expired - Fee Related
- 1998-07-01 EP EP98932337A patent/EP0993670B1/en not_active Expired - Lifetime
- 1998-07-01 KR KR1019997012508A patent/KR20010014352A/en not_active Application Discontinuation
- 1998-07-01 PL PL98337717A patent/PL337717A1/en unknown
- 1998-07-01 WO PCT/GB1998/001936 patent/WO1999001863A1/en not_active Application Discontinuation
- 1998-07-01 DE DE69804310T patent/DE69804310D1/en not_active Expired - Lifetime
- 1998-07-01 AT AT98932337T patent/ATE214832T1/en not_active IP Right Cessation
- 1998-07-01 CN CN98807458A patent/CN1265217A/en active Pending
- 1998-07-01 AU AU82277/98A patent/AU8227798A/en not_active Abandoned
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4532648A (en) * | 1981-10-22 | 1985-07-30 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle |
US4538295A (en) * | 1982-08-16 | 1985-08-27 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle |
WO1987000366A1 (en) * | 1985-07-01 | 1987-01-15 | Motorola, Inc. | Noise supression system |
WO1989006877A1 (en) * | 1988-01-18 | 1989-07-27 | British Telecommunications Public Limited Company | Noise reduction |
EP0505645A1 (en) * | 1991-03-27 | 1992-09-30 | R.G.A. & Associates Ltd. | Public address intelligibility enhancement system |
EP0763888A2 (en) * | 1995-09-13 | 1997-03-19 | Nokia Mobile Phones Ltd. | Method and circuit arrangement for processing audio signal |
GB2306086A (en) * | 1995-10-06 | 1997-04-23 | Richard Morris Trim | Improved adaptive audio systems |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9264836B2 (en) | 2007-12-21 | 2016-02-16 | Dts Llc | System for adjusting perceived loudness of audio signals |
US9312829B2 (en) | 2012-04-12 | 2016-04-12 | Dts Llc | System for adjusting loudness of audio signals in real time |
US9559656B2 (en) | 2012-04-12 | 2017-01-31 | Dts Llc | System for adjusting loudness of audio signals in real time |
Also Published As
Publication number | Publication date |
---|---|
GB9814279D0 (en) | 1998-09-02 |
JP2002507291A (en) | 2002-03-05 |
CA2235455A1 (en) | 1999-01-02 |
DE69804310D1 (en) | 2002-04-25 |
EP0993670B1 (en) | 2002-03-20 |
ZA985607B (en) | 2000-06-01 |
GB9714001D0 (en) | 1997-09-10 |
PL337717A1 (en) | 2000-08-28 |
CN1265217A (en) | 2000-08-30 |
GB2327835A (en) | 1999-02-03 |
ATE214832T1 (en) | 2002-04-15 |
WO1999001863A1 (en) | 1999-01-14 |
KR20010014352A (en) | 2001-02-26 |
EP0993670A1 (en) | 2000-04-19 |
AU8227798A (en) | 1999-01-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2327835B (en) | Method and apparatus for speech enhancement in a speech communication system | |
HK1003399A1 (en) | Method and apparatus for detection and bypass of tandem vocoding | |
RU2146394C1 (en) | Method and device for alternating rate voice coding using reduced encoding rate | |
EP0785541B1 (en) | Usage of voice activity detection for efficient coding of speech | |
WO1998057436A3 (en) | Source coding enhancement using spectral-band replication | |
SE9500321L (en) | Procedure for noise suppression by spectral subtraction | |
MX9602391A (en) | Method and apparatus for reproducing speech signals and method for transmitting same. | |
JPH1097296A (en) | Method and device for voice coding, and method and device for voice decoding | |
JP3131249B2 (en) | Mixed audio signal receiver | |
AU1324592A (en) | Method and apparatus for the teaching of languages | |
Paul | A 500-800 bps adaptive vector quantization vocoder using a perceptually motivated distance measure | |
US7643991B2 (en) | Speech enhancement for electronic voiced messages | |
GB2343822A (en) | Using LSP to alter frequency characteristics of speech | |
JP3166797B2 (en) | Voice coding method, voice decoding method, and voice codec | |
Cox | Current methods of speech coding | |
SU1674226A1 (en) | Method and apparatus for detecting speech signals and their boundaries | |
Brandenburg et al. | Fast signal processor encodes 48 kHz/16-bit audio into 3-bit in real time | |
Rabiner et al. | Current methods of digital speech processing | |
Gan et al. | Implementation of silence compression scheme for G. 723.1 speech coder using TI TMS320C51 DSP chip | |
Riedhammer et al. | A software kit for automatic voice descrambling | |
Bertrand | Secure narrowband digital conferencing | |
JPS5853349B2 (en) | Speech analysis and synthesis method | |
Bunnell et al. | Speech processing program | |
Batchelor | Noise cancellation to improve the quality of LPC processed speech degraded by noise | |
Okazaki et al. | Implementation of a 4.8 kbps voice codec based on pitch-synchronous DFT coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
732E | Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977) | ||
PCNP | Patent ceased through non-payment of renewal fee |
Effective date: 20020701 |