ZA985607B - Method and apparatus for speech enhancement in a speech communication system. - Google Patents
Method and apparatus for speech enhancement in a speech communication system.Info
- Publication number
- ZA985607B ZA985607B ZA9805607A ZA985607A ZA985607B ZA 985607 B ZA985607 B ZA 985607B ZA 9805607 A ZA9805607 A ZA 9805607A ZA 985607 A ZA985607 A ZA 985607A ZA 985607 B ZA985607 B ZA 985607B
- Authority
- ZA
- South Africa
- Prior art keywords
- speech
- unit
- determines
- intelligible
- listener
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
Abstract
The characteristics of the speech received by the decoding unit are altered by a processing unit 10 based upon an analysis of the listener's current background noise before the speech is output to enhance its intelligibility to a listener. An analysis unit 12 determines the type and level of the background noise by use of a microphone 13. A decision unit 11 then determines whether the speech currently being received and replayed would be intelligible to an average listener in the current background noise. If unit 11 determines that the speech is readily intelligible then no processing is necessary and the processing unit 10 does not alter the speech which has been passed to it. However, if unit 11 determines that the speech would be unintelligible, then unit 10 alters the speech before passing it to the output to make the speech more intelligible. In a particularly preferred embodiment, the speech characteristics are altered by altering line spectral pair/formant data representing the speech.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB9714001.6A GB9714001D0 (en) | 1997-07-02 | 1997-07-02 | Method and apparatus for speech enhancement in a speech communication system |
Publications (1)
Publication Number | Publication Date |
---|---|
ZA985607B true ZA985607B (en) | 2000-06-01 |
Family
ID=10815285
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ZA9805607A ZA985607B (en) | 1997-07-02 | 1998-06-26 | Method and apparatus for speech enhancement in a speech communication system. |
Country Status (12)
Country | Link |
---|---|
EP (1) | EP0993670B1 (en) |
JP (1) | JP2002507291A (en) |
KR (1) | KR20010014352A (en) |
CN (1) | CN1265217A (en) |
AT (1) | ATE214832T1 (en) |
AU (1) | AU8227798A (en) |
CA (1) | CA2235455A1 (en) |
DE (1) | DE69804310D1 (en) |
GB (2) | GB9714001D0 (en) |
PL (1) | PL337717A1 (en) |
WO (1) | WO1999001863A1 (en) |
ZA (1) | ZA985607B (en) |
Families Citing this family (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE9903553D0 (en) * | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing conceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL) |
FR2794322B1 (en) * | 1999-05-27 | 2001-06-22 | Sagem | NOISE SUPPRESSION PROCESS |
EP1210765B1 (en) | 1999-07-28 | 2007-03-07 | Clear Audio Ltd. | Filter banked gain control of audio in a noisy environment |
US6876968B2 (en) * | 2001-03-08 | 2005-04-05 | Matsushita Electric Industrial Co., Ltd. | Run time synthesizer adaptation to improve intelligibility of synthesized speech |
DE10124189A1 (en) * | 2001-05-17 | 2002-11-21 | Siemens Ag | Signal reception in digital communications system involves generating output background signal with bandwidth greater than that of background signal characterized by received data |
JP2003255993A (en) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | System, method, and program for speech recognition, and system, method, and program for speech synthesis |
WO2004002028A2 (en) * | 2002-06-19 | 2003-12-31 | Koninklijke Philips Electronics N.V. | Audio signal processing apparatus and method |
WO2004068467A1 (en) * | 2003-01-31 | 2004-08-12 | Oticon A/S | Sound system improving speech intelligibility |
KR20050049103A (en) * | 2003-11-21 | 2005-05-25 | 삼성전자주식회사 | Method and apparatus for enhancing dialog using formant |
EA011361B1 (en) * | 2004-09-07 | 2009-02-27 | Сенсир Пти Лтд. | Apparatus and method for sound enhancement |
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
GB2433849B (en) | 2005-12-29 | 2008-05-21 | Motorola Inc | Telecommunications terminal and method of operation of the terminal |
DE102006001730A1 (en) | 2006-01-13 | 2007-07-19 | Robert Bosch Gmbh | Sound system, method for improving the voice quality and / or intelligibility of voice announcements and computer program |
EP1814109A1 (en) * | 2006-01-27 | 2007-08-01 | Texas Instruments Incorporated | Voice amplification apparatus for modelling the Lombard effect |
JP2007295347A (en) * | 2006-04-26 | 2007-11-08 | Mitsubishi Electric Corp | Voice processor |
WO2018127263A2 (en) | 2017-01-03 | 2018-07-12 | Lizn Aps | Speech intelligibility enhancing system |
KR101414233B1 (en) | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving speech intelligibility |
JP4926005B2 (en) | 2007-11-13 | 2012-05-09 | ソニー・エリクソン・モバイルコミュニケーションズ株式会社 | Audio signal processing apparatus, audio signal processing method, and communication terminal |
EP2232700B1 (en) | 2007-12-21 | 2014-08-13 | Dts Llc | System for adjusting perceived loudness of audio signals |
JP5453740B2 (en) * | 2008-07-02 | 2014-03-26 | 富士通株式会社 | Speech enhancement device |
US8538042B2 (en) | 2009-08-11 | 2013-09-17 | Dts Llc | System for increasing perceived loudness of speakers |
EP2372700A1 (en) * | 2010-03-11 | 2011-10-05 | Oticon A/S | A speech intelligibility predictor and applications thereof |
KR102060208B1 (en) * | 2011-07-29 | 2019-12-27 | 디티에스 엘엘씨 | Adaptive voice intelligibility processor |
CN103002105A (en) * | 2011-09-16 | 2013-03-27 | 宏碁股份有限公司 | Mobile communication method capable of improving articulation of communication contents |
CN103297896B (en) * | 2012-02-27 | 2016-07-06 | 联想(北京)有限公司 | A kind of audio-frequency inputting method and electronic equipment |
US9020818B2 (en) * | 2012-03-05 | 2015-04-28 | Malaspina Labs (Barbados) Inc. | Format based speech reconstruction from noisy signals |
US9312829B2 (en) | 2012-04-12 | 2016-04-12 | Dts Llc | System for adjusting loudness of audio signals in real time |
EP3010017A1 (en) * | 2014-10-14 | 2016-04-20 | Thomson Licensing | Method and apparatus for separating speech data from background data in audio communication |
JP6565206B2 (en) * | 2015-02-20 | 2019-08-28 | ヤマハ株式会社 | Audio processing apparatus and audio processing method |
EP3107097B1 (en) | 2015-06-17 | 2017-11-15 | Nxp B.V. | Improved speech intelligilibility |
US9847093B2 (en) | 2015-06-19 | 2017-12-19 | Samsung Electronics Co., Ltd. | Method and apparatus for processing speech signal |
JP6790732B2 (en) * | 2016-11-02 | 2020-11-25 | ヤマハ株式会社 | Signal processing method and signal processing device |
WO2019127112A1 (en) * | 2017-12-27 | 2019-07-04 | 深圳前海达闼云端智能科技有限公司 | Voice interaction method and device and intelligent terminal |
CN109346058A (en) * | 2018-11-29 | 2019-02-15 | 西安交通大学 | A kind of speech acoustics feature expansion system |
US11817114B2 (en) | 2019-12-09 | 2023-11-14 | Dolby Laboratories Licensing Corporation | Content and environmentally aware environmental noise compensation |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5870292A (en) * | 1981-10-22 | 1983-04-26 | 日産自動車株式会社 | Voice recognition equipment for vehicle |
US4538295A (en) * | 1982-08-16 | 1985-08-27 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle |
KR940009391B1 (en) * | 1985-07-01 | 1994-10-07 | 모토로라 인코포레이티드 | Noise rejection system |
GB8801014D0 (en) * | 1988-01-18 | 1988-02-17 | British Telecomm | Noise reduction |
US5235669A (en) * | 1990-06-29 | 1993-08-10 | At&T Laboratories | Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec |
CA2056110C (en) * | 1991-03-27 | 1997-02-04 | Arnold I. Klayman | Public address intelligibility system |
FI102337B1 (en) * | 1995-09-13 | 1998-11-13 | Nokia Mobile Phones Ltd | Method and circuit arrangement for processing an audio signal |
GB2306086A (en) * | 1995-10-06 | 1997-04-23 | Richard Morris Trim | Improved adaptive audio systems |
-
1997
- 1997-07-02 GB GBGB9714001.6A patent/GB9714001D0/en not_active Ceased
-
1998
- 1998-04-21 CA CA002235455A patent/CA2235455A1/en not_active Abandoned
- 1998-06-26 ZA ZA9805607A patent/ZA985607B/en unknown
- 1998-07-01 AT AT98932337T patent/ATE214832T1/en not_active IP Right Cessation
- 1998-07-01 PL PL98337717A patent/PL337717A1/en unknown
- 1998-07-01 JP JP50665899A patent/JP2002507291A/en active Pending
- 1998-07-01 DE DE69804310T patent/DE69804310D1/en not_active Expired - Lifetime
- 1998-07-01 KR KR1019997012508A patent/KR20010014352A/en not_active Application Discontinuation
- 1998-07-01 WO PCT/GB1998/001936 patent/WO1999001863A1/en not_active Application Discontinuation
- 1998-07-01 AU AU82277/98A patent/AU8227798A/en not_active Abandoned
- 1998-07-01 CN CN98807458A patent/CN1265217A/en active Pending
- 1998-07-01 EP EP98932337A patent/EP0993670B1/en not_active Expired - Lifetime
- 1998-07-01 GB GB9814279A patent/GB2327835B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
DE69804310D1 (en) | 2002-04-25 |
PL337717A1 (en) | 2000-08-28 |
EP0993670A1 (en) | 2000-04-19 |
CA2235455A1 (en) | 1999-01-02 |
CN1265217A (en) | 2000-08-30 |
KR20010014352A (en) | 2001-02-26 |
GB2327835A (en) | 1999-02-03 |
EP0993670B1 (en) | 2002-03-20 |
GB9714001D0 (en) | 1997-09-10 |
ATE214832T1 (en) | 2002-04-15 |
AU8227798A (en) | 1999-01-25 |
JP2002507291A (en) | 2002-03-05 |
WO1999001863A1 (en) | 1999-01-14 |
GB2327835B (en) | 2000-04-19 |
GB9814279D0 (en) | 1998-09-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2327835B (en) | Method and apparatus for speech enhancement in a speech communication system | |
HK1003399A1 (en) | Method and apparatus for detection and bypass of tandem vocoding | |
RU2146394C1 (en) | Method and device for alternating rate voice coding using reduced encoding rate | |
EP0785541B1 (en) | Usage of voice activity detection for efficient coding of speech | |
WO1998057436A3 (en) | Source coding enhancement using spectral-band replication | |
SE9500321L (en) | Procedure for noise suppression by spectral subtraction | |
JP3131249B2 (en) | Mixed audio signal receiver | |
AU1324592A (en) | Method and apparatus for the teaching of languages | |
Paul | A 500-800 bps adaptive vector quantization vocoder using a perceptually motivated distance measure | |
US7643991B2 (en) | Speech enhancement for electronic voiced messages | |
GB2343822A (en) | Using LSP to alter frequency characteristics of speech | |
JP3166797B2 (en) | Voice coding method, voice decoding method, and voice codec | |
Brandenburg et al. | Fast signal processor encodes 48 kHz/16-bit audio into 3-bit in real time | |
Cox | Current methods of speech coding | |
SU1674226A1 (en) | Method and apparatus for detecting speech signals and their boundaries | |
Dimolitsas et al. | Objective assessment methodology and evaluation of low-rate digital voice processors | |
CN116168699A (en) | Security platform control method and device based on voice recognition, storage medium and equipment | |
Gan et al. | Implementation of silence compression scheme for G. 723.1 speech coder using TI TMS320C51 DSP chip | |
Riedhammer et al. | A software kit for automatic voice descrambling | |
Servetti et al. | " Dipartimento di Automatica e Informatica" IRITI-CNR | |
Bunnell et al. | Speech processing program | |
JPS5853349B2 (en) | Speech analysis and synthesis method | |
Bunnell et al. | Speech processing program | |
Okazaki et al. | Implementation of a 4.8 kbps voice codec based on pitch-synchronous DFT coding | |
KR20050092961A (en) | Apparatus and method for improving a ring back tone |