JP4761506B2 - 音声処理方法と装置及びプログラム並びに音声システム - Google Patents
音声処理方法と装置及びプログラム並びに音声システム Download PDFInfo
- Publication number
- JP4761506B2 JP4761506B2 JP2005056342A JP2005056342A JP4761506B2 JP 4761506 B2 JP4761506 B2 JP 4761506B2 JP 2005056342 A JP2005056342 A JP 2005056342A JP 2005056342 A JP2005056342 A JP 2005056342A JP 4761506 B2 JP4761506 B2 JP 4761506B2
- Authority
- JP
- Japan
- Prior art keywords
- spectrum
- envelope
- spectral
- deformation
- spectrum envelope
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K11/00—Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
- G10K11/16—Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
- G10K11/175—Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
- G10K11/1752—Masking
- G10K11/1754—Speech masking
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Health & Medical Sciences (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
- Telephone Function (AREA)
Priority Applications (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2005056342A JP4761506B2 (ja) | 2005-03-01 | 2005-03-01 | 音声処理方法と装置及びプログラム並びに音声システム |
| KR1020077019988A KR100931419B1 (ko) | 2005-03-01 | 2006-02-23 | 음성 처리 방법과 장치, 기억 매체 및 음성 시스템 |
| DE602006014096T DE602006014096D1 (de) | 2005-03-01 | 2006-02-23 | Sprachverarbeitungsverfahren und -einrichtung, Speichermedium und Sprachsystem |
| EP06714430A EP1855269B1 (en) | 2005-03-01 | 2006-02-23 | Speech processing method and device, storage medium, and speech system |
| PCT/JP2006/303290 WO2006093019A1 (ja) | 2005-03-01 | 2006-02-23 | 音声処理方法と装置及び記憶媒体並びに音声システム |
| CN2006800066680A CN101138020B (zh) | 2005-03-01 | 2006-02-23 | 声音处理方法和装置及存储媒体以及声音系统 |
| US11/849,106 US8065138B2 (en) | 2005-03-01 | 2007-08-31 | Speech processing method and apparatus, storage medium, and speech system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2005056342A JP4761506B2 (ja) | 2005-03-01 | 2005-03-01 | 音声処理方法と装置及びプログラム並びに音声システム |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2006243178A JP2006243178A (ja) | 2006-09-14 |
| JP2006243178A5 JP2006243178A5 (https=) | 2007-08-30 |
| JP4761506B2 true JP4761506B2 (ja) | 2011-08-31 |
Family
ID=36941053
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2005056342A Expired - Lifetime JP4761506B2 (ja) | 2005-03-01 | 2005-03-01 | 音声処理方法と装置及びプログラム並びに音声システム |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US8065138B2 (https=) |
| EP (1) | EP1855269B1 (https=) |
| JP (1) | JP4761506B2 (https=) |
| KR (1) | KR100931419B1 (https=) |
| CN (1) | CN101138020B (https=) |
| DE (1) | DE602006014096D1 (https=) |
| WO (1) | WO2006093019A1 (https=) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4757158B2 (ja) * | 2006-09-20 | 2011-08-24 | 富士通株式会社 | 音信号処理方法、音信号処理装置及びコンピュータプログラム |
| US8229130B2 (en) * | 2006-10-17 | 2012-07-24 | Massachusetts Institute Of Technology | Distributed acoustic conversation shielding system |
| JP5082541B2 (ja) * | 2007-03-29 | 2012-11-28 | ヤマハ株式会社 | 拡声装置 |
| US8140326B2 (en) * | 2008-06-06 | 2012-03-20 | Fuji Xerox Co., Ltd. | Systems and methods for reducing speech intelligibility while preserving environmental sounds |
| JP5511342B2 (ja) * | 2009-12-09 | 2014-06-04 | 日本板硝子環境アメニティ株式会社 | 音声変更装置、音声変更方法および音声情報秘話システム |
| JP5489778B2 (ja) * | 2010-02-25 | 2014-05-14 | キヤノン株式会社 | 情報処理装置およびその処理方法 |
| JP5605062B2 (ja) * | 2010-08-03 | 2014-10-15 | 大日本印刷株式会社 | 騒音源の快音化方法および快音化装置 |
| JP5569291B2 (ja) * | 2010-09-17 | 2014-08-13 | 大日本印刷株式会社 | 騒音源の快音化方法および快音化装置 |
| JP6007481B2 (ja) * | 2010-11-25 | 2016-10-12 | ヤマハ株式会社 | マスカ音生成装置、マスカ音信号を記憶した記憶媒体、マスカ音再生装置、およびプログラム |
| WO2012128678A1 (en) | 2011-03-21 | 2012-09-27 | Telefonaktiebolaget L M Ericsson (Publ) | Method and arrangement for damping of dominant frequencies in an audio signal |
| JP2014513320A (ja) * | 2011-03-21 | 2014-05-29 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | オーディオ信号におけるドミナント周波数を減衰する方法及び装置 |
| US8972251B2 (en) | 2011-06-07 | 2015-03-03 | Qualcomm Incorporated | Generating a masking signal on an electronic device |
| US8583425B2 (en) * | 2011-06-21 | 2013-11-12 | Genband Us Llc | Methods, systems, and computer readable media for fricatives and high frequencies detection |
| WO2013012312A2 (en) * | 2011-07-19 | 2013-01-24 | Jin Hem Thong | Wave modification method and system thereof |
| JP5849508B2 (ja) * | 2011-08-09 | 2016-01-27 | 株式会社大林組 | Bgmのマスキング効果評価方法及びbgmのマスキング効果評価装置 |
| JP5925493B2 (ja) * | 2012-01-11 | 2016-05-25 | グローリー株式会社 | 会話保護システム及び会話保護方法 |
| EP2862169A4 (en) * | 2012-06-15 | 2016-03-02 | Jemardator Ab | DIFFERENCE OF CEPSTRAL SEPARATION |
| US8670986B2 (en) | 2012-10-04 | 2014-03-11 | Medical Privacy Solutions, Llc | Method and apparatus for masking speech in a private environment |
| CN103818290A (zh) * | 2012-11-16 | 2014-05-28 | 黄金富 | 一种用于汽车司机与老板的隔声装置 |
| CN103826176A (zh) * | 2012-11-16 | 2014-05-28 | 黄金富 | 一种用于汽车司机与乘客之间的司机专用保密耳筒 |
| JP2014130251A (ja) * | 2012-12-28 | 2014-07-10 | Glory Ltd | 会話保護システム及び会話保護方法 |
| JP5929786B2 (ja) * | 2013-03-07 | 2016-06-08 | ソニー株式会社 | 信号処理装置、信号処理方法及び記憶媒体 |
| JP6371516B2 (ja) * | 2013-11-15 | 2018-08-08 | キヤノン株式会社 | 音響信号処理装置および方法 |
| JP6098654B2 (ja) * | 2014-03-10 | 2017-03-22 | ヤマハ株式会社 | マスキング音データ生成装置およびプログラム |
| JP7145596B2 (ja) * | 2017-09-15 | 2022-10-03 | 株式会社Lixil | 擬音装置 |
| CN108540680B (zh) * | 2018-02-02 | 2021-03-02 | 广州视源电子科技股份有限公司 | 讲话状态的切换方法及装置、通话系统 |
| US10757507B2 (en) * | 2018-02-13 | 2020-08-25 | Ppip, Llc | Sound shaping apparatus |
| WO2019245916A1 (en) * | 2018-06-19 | 2019-12-26 | Georgetown University | Method and system for parametric speech synthesis |
| US12556927B2 (en) | 2024-01-19 | 2026-02-17 | Cisco Technology, Inc. | Speech confidentiality monitoring and alerting |
Family Cites Families (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3681530A (en) * | 1970-06-15 | 1972-08-01 | Gte Sylvania Inc | Method and apparatus for signal bandwidth compression utilizing the fourier transform of the logarithm of the frequency spectrum magnitude |
| US4827516A (en) * | 1985-10-16 | 1989-05-02 | Toppan Printing Co., Ltd. | Method of analyzing input speech and speech analysis apparatus therefor |
| JPH0522391A (ja) | 1991-07-10 | 1993-01-29 | Sony Corp | 音声マスキング装置 |
| JP3557662B2 (ja) * | 1994-08-30 | 2004-08-25 | ソニー株式会社 | 音声符号化方法及び音声復号化方法、並びに音声符号化装置及び音声復号化装置 |
| JPH09319389A (ja) * | 1996-03-28 | 1997-12-12 | Matsushita Electric Ind Co Ltd | 環境音発生装置 |
| US6904404B1 (en) * | 1996-07-01 | 2005-06-07 | Matsushita Electric Industrial Co., Ltd. | Multistage inverse quantization having the plurality of frequency bands |
| JP3246715B2 (ja) * | 1996-07-01 | 2002-01-15 | 松下電器産業株式会社 | オーディオ信号圧縮方法,およびオーディオ信号圧縮装置 |
| JP3266819B2 (ja) * | 1996-07-30 | 2002-03-18 | 株式会社エイ・ティ・アール人間情報通信研究所 | 周期信号変換方法、音変換方法および信号分析方法 |
| JP3707153B2 (ja) * | 1996-09-24 | 2005-10-19 | ソニー株式会社 | ベクトル量子化方法、音声符号化方法及び装置 |
| US6073100A (en) * | 1997-03-31 | 2000-06-06 | Goodridge, Jr.; Alan G | Method and apparatus for synthesizing signals using transform-domain match-output extension |
| SE512719C2 (sv) * | 1997-06-10 | 2000-05-02 | Lars Gustaf Liljeryd | En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion |
| JP3706249B2 (ja) * | 1998-06-16 | 2005-10-12 | ヤマハ株式会社 | 音声変換装置、音声変換方法、および音声変換プログラムを記録した記録媒体 |
| GB9927131D0 (en) * | 1999-11-16 | 2000-01-12 | Royal College Of Art | Apparatus for acoustically improving an environment and related method |
| FR2813722B1 (fr) * | 2000-09-05 | 2003-01-24 | France Telecom | Procede et dispositif de dissimulation d'erreurs et systeme de transmission comportant un tel dispositif |
| JP3590342B2 (ja) * | 2000-10-18 | 2004-11-17 | 日本電信電話株式会社 | 信号符号化方法、装置及び信号符号化プログラムを記録した記録媒体 |
| FR2819362A1 (fr) * | 2001-01-05 | 2002-07-12 | Rene Travere | Attenuateur, brouilleur, de conversation applique au telephone |
| JP3703394B2 (ja) * | 2001-01-16 | 2005-10-05 | シャープ株式会社 | 声質変換装置および声質変換方法およびプログラム記憶媒体 |
| JP2002251199A (ja) * | 2001-02-27 | 2002-09-06 | Ricoh Co Ltd | 音声入力情報処理装置 |
| AU2003213439A1 (en) * | 2002-03-08 | 2003-09-22 | Nippon Telegraph And Telephone Corporation | Digital signal encoding method, decoding method, encoding device, decoding device, digital signal encoding program, and decoding program |
| JP4195267B2 (ja) * | 2002-03-14 | 2008-12-10 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置、その音声認識方法及びプログラム |
| US20030187663A1 (en) * | 2002-03-28 | 2003-10-02 | Truman Michael Mead | Broadband frequency translation for high frequency regeneration |
| US7143028B2 (en) * | 2002-07-24 | 2006-11-28 | Applied Minds, Inc. | Method and system for masking speech |
| US7451082B2 (en) * | 2003-08-27 | 2008-11-11 | Texas Instruments Incorporated | Noise-resistant utterance detector |
| JP4336552B2 (ja) * | 2003-09-11 | 2009-09-30 | グローリー株式会社 | マスキング装置 |
-
2005
- 2005-03-01 JP JP2005056342A patent/JP4761506B2/ja not_active Expired - Lifetime
-
2006
- 2006-02-23 CN CN2006800066680A patent/CN101138020B/zh not_active Expired - Fee Related
- 2006-02-23 KR KR1020077019988A patent/KR100931419B1/ko not_active Expired - Fee Related
- 2006-02-23 DE DE602006014096T patent/DE602006014096D1/de not_active Expired - Lifetime
- 2006-02-23 WO PCT/JP2006/303290 patent/WO2006093019A1/ja not_active Ceased
- 2006-02-23 EP EP06714430A patent/EP1855269B1/en not_active Expired - Lifetime
-
2007
- 2007-08-31 US US11/849,106 patent/US8065138B2/en not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| DE602006014096D1 (de) | 2010-06-17 |
| US8065138B2 (en) | 2011-11-22 |
| CN101138020A (zh) | 2008-03-05 |
| WO2006093019A1 (ja) | 2006-09-08 |
| EP1855269A4 (en) | 2009-04-22 |
| EP1855269A1 (en) | 2007-11-14 |
| US20080281588A1 (en) | 2008-11-13 |
| KR20070099681A (ko) | 2007-10-09 |
| CN101138020B (zh) | 2010-10-13 |
| KR100931419B1 (ko) | 2009-12-11 |
| JP2006243178A (ja) | 2006-09-14 |
| EP1855269B1 (en) | 2010-05-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP4761506B2 (ja) | 音声処理方法と装置及びプログラム並びに音声システム | |
| Darwin | Listening to speech in the presence of other sounds | |
| Binns et al. | The role of fundamental frequency contours in the perception of speech against interfering speech | |
| KR100643310B1 (ko) | 음성 데이터의 포먼트와 유사한 교란 신호를 출력하여송화자 음성을 차폐하는 방법 및 장치 | |
| JP2017538146A (ja) | インテリジェントな音声認識および処理のためのシステム、方法、およびデバイス | |
| Nathwani et al. | Speech intelligibility improvement in car noise environment by voice transformation | |
| Deroche et al. | Roles of the target and masker fundamental frequencies in voice segregation | |
| JP6087731B2 (ja) | 音声明瞭化装置、方法及びプログラム | |
| JP2014130251A (ja) | 会話保護システム及び会話保護方法 | |
| Huang et al. | Lombard speech model for automatic enhancement of speech intelligibility over telephone channel | |
| JP5662711B2 (ja) | 音声変更装置、音声変更方法および音声情報秘話システム | |
| Liu et al. | Application of spectral subtraction method on enhancement of electrolarynx speech | |
| JP2012008393A (ja) | 音声変更装置、音声変更方法および音声情報秘話システム | |
| JP4785563B2 (ja) | 音声処理装置および音声処理方法 | |
| JP4680099B2 (ja) | 音声処理装置および音声処理方法 | |
| Brouckxon et al. | Time and frequency dependent amplification for speech intelligibility enhancement in noisy environments | |
| RU2589298C1 (ru) | Способ повышения разборчивости и информативности звуковых сигналов в шумовой обстановке | |
| Alam et al. | Perceptual improvement of Wiener filtering employing a post-filter | |
| Jokinen et al. | Phase modification for increasing the intelligibility of telephone speech in near-end noise conditions–evaluation of two methods | |
| Upadhyay | Iterative-processed multiband speech enhancement for suppressing musical sounds | |
| Wang et al. | Investigation of the relative perceptual importance of temporal envelope and temporal fine structure between tonal and non-tonal languages. | |
| Song et al. | Smart Wristwatches Employing Finger-Conducted Voice Transmission System | |
| JP5662712B2 (ja) | 音声変更装置、音声変更方法および音声情報秘話システム | |
| JP2024166925A (ja) | 会話音声保護装置 | |
| Huang et al. | Biologically inspired algorithm for enhancement of speech intelligibility over telephone channel |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20070711 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20070711 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20110118 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20110318 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20110510 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20110606 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20140617 Year of fee payment: 3 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 4761506 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| S111 | Request for change of ownership or part of ownership |
Free format text: JAPANESE INTERMEDIATE CODE: R313117 |
|
| R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
| EXPY | Cancellation because of completion of term |