JP2005202335A - 音声処理方法と装置及びプログラム - Google Patents
音声処理方法と装置及びプログラム Download PDFInfo
- Publication number
- JP2005202335A JP2005202335A JP2004011111A JP2004011111A JP2005202335A JP 2005202335 A JP2005202335 A JP 2005202335A JP 2004011111 A JP2004011111 A JP 2004011111A JP 2004011111 A JP2004011111 A JP 2004011111A JP 2005202335 A JP2005202335 A JP 2005202335A
- Authority
- JP
- Japan
- Prior art keywords
- processing
- audio signal
- calculating
- coefficient
- mean square
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012545 processing Methods 0.000 title claims abstract description 86
- 238000000034 method Methods 0.000 title claims description 23
- 238000001228 spectrum Methods 0.000 claims abstract description 31
- 238000004364 calculation method Methods 0.000 claims abstract description 17
- 230000005236 sound signal Effects 0.000 claims description 54
- 230000008569 process Effects 0.000 claims description 21
- 230000001629 suppression Effects 0.000 claims description 14
- 238000003672 processing method Methods 0.000 claims description 5
- 238000012935 Averaging Methods 0.000 abstract 1
- 230000000873 masking effect Effects 0.000 description 13
- 230000000694 effects Effects 0.000 description 12
- 238000002474 experimental method Methods 0.000 description 11
- 238000012360 testing method Methods 0.000 description 11
- 230000007704 transition Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 230000008447 perception Effects 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 3
- 208000032041 Hearing impaired Diseases 0.000 description 2
- 235000016496 Panda oleosa Nutrition 0.000 description 2
- 240000000220 Panda oleosa Species 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 206010002953 Aphonia Diseases 0.000 description 1
- 101100188552 Arabidopsis thaliana OCT3 gene Proteins 0.000 description 1
- 206010036626 Presbyacusis Diseases 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000012353 t test Methods 0.000 description 1
Images
Landscapes
- Circuit For Audible Band Transducer (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004011111A JP2005202335A (ja) | 2004-01-19 | 2004-01-19 | 音声処理方法と装置及びプログラム |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004011111A JP2005202335A (ja) | 2004-01-19 | 2004-01-19 | 音声処理方法と装置及びプログラム |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2005202335A true JP2005202335A (ja) | 2005-07-28 |
| JP2005202335A5 JP2005202335A5 (enExample) | 2007-02-22 |
Family
ID=34823634
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2004011111A Pending JP2005202335A (ja) | 2004-01-19 | 2004-01-19 | 音声処理方法と装置及びプログラム |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JP2005202335A (enExample) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2007065285A (ja) * | 2005-08-31 | 2007-03-15 | Takayuki Arai | 音声信号処理方法、装置及びプログラム |
| JP2008245159A (ja) * | 2007-03-28 | 2008-10-09 | Toshiba Corp | 音響信号発生装置および方法 |
| KR100876794B1 (ko) | 2007-04-03 | 2009-01-09 | 삼성전자주식회사 | 이동 단말에서 음성의 명료도 향상 장치 및 방법 |
| US8675882B2 (en) | 2008-01-21 | 2014-03-18 | Panasonic Corporation | Sound signal processing device and method |
| WO2021031942A1 (zh) * | 2019-08-16 | 2021-02-25 | 阿里巴巴集团控股有限公司 | 一种针对目标频谱矩阵的处理方法及装置 |
| CN115485768A (zh) * | 2020-05-01 | 2022-12-16 | 谷歌有限责任公司 | 端到端多发言者重叠语音识别 |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001083978A (ja) * | 1999-07-15 | 2001-03-30 | Matsushita Electric Ind Co Ltd | 音声認識装置 |
| JP2001100763A (ja) * | 1999-09-29 | 2001-04-13 | Yamaha Corp | 波形分析方法 |
-
2004
- 2004-01-19 JP JP2004011111A patent/JP2005202335A/ja active Pending
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001083978A (ja) * | 1999-07-15 | 2001-03-30 | Matsushita Electric Ind Co Ltd | 音声認識装置 |
| JP2001100763A (ja) * | 1999-09-29 | 2001-04-13 | Yamaha Corp | 波形分析方法 |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2007065285A (ja) * | 2005-08-31 | 2007-03-15 | Takayuki Arai | 音声信号処理方法、装置及びプログラム |
| JP2008245159A (ja) * | 2007-03-28 | 2008-10-09 | Toshiba Corp | 音響信号発生装置および方法 |
| KR100876794B1 (ko) | 2007-04-03 | 2009-01-09 | 삼성전자주식회사 | 이동 단말에서 음성의 명료도 향상 장치 및 방법 |
| US8019603B2 (en) | 2007-04-03 | 2011-09-13 | Samsung Electronics Co., Ltd | Apparatus and method for enhancing speech intelligibility in a mobile terminal |
| US8675882B2 (en) | 2008-01-21 | 2014-03-18 | Panasonic Corporation | Sound signal processing device and method |
| WO2021031942A1 (zh) * | 2019-08-16 | 2021-02-25 | 阿里巴巴集团控股有限公司 | 一种针对目标频谱矩阵的处理方法及装置 |
| CN115485768A (zh) * | 2020-05-01 | 2022-12-16 | 谷歌有限责任公司 | 端到端多发言者重叠语音识别 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110473567B (zh) | 基于深度神经网络的音频处理方法、装置及存储介质 | |
| Skowronski et al. | Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments | |
| CN112086093A (zh) | 解决基于感知的对抗音频攻击的自动语音识别系统 | |
| JP2002014689A (ja) | デジタルに圧縮されたスピーチの了解度を向上させる方法および装置 | |
| US10176824B2 (en) | Method and system for consonant-vowel ratio modification for improving speech perception | |
| Rennies et al. | Intelligibility-Enhancing Speech Modifications-The Hurricane Challenge 2.0. | |
| EP3113183B1 (en) | Speech intelligibility improving apparatus and computer program therefor | |
| Kusumoto et al. | Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments | |
| Hazrati et al. | Simultaneous suppression of noise and reverberation in cochlear implants using a ratio masking strategy | |
| Roman et al. | Intelligibility of reverberant noisy speech with ideal binary masking | |
| Jayan et al. | Automated modification of consonant–vowel ratio of stops for improving speech intelligibility | |
| Huang et al. | Lombard speech model for automatic enhancement of speech intelligibility over telephone channel | |
| Hummersone | A psychoacoustic engineering approach to machine sound source separation in reverberant environments | |
| JP2005202335A (ja) | 音声処理方法と装置及びプログラム | |
| JP4876245B2 (ja) | 子音加工装置、音声情報伝達装置及び子音加工方法 | |
| JP4774255B2 (ja) | 音声信号処理方法、装置及びプログラム | |
| Kociński et al. | Time-compressed speech intelligibility in different reverberant conditions | |
| Arai et al. | Using steady-state suppression to improve speech intelligibility in reverberant environments for elderly listeners | |
| JP2008102551A (ja) | 音声信号の処理装置およびその処理方法 | |
| Bhattacharya et al. | Combined spectral and temporal enhancement to improve cochlear-implant speech perception | |
| Zorila et al. | On the Quality and Intelligibility of Noisy Speech Processed for Near-End Listening Enhancement. | |
| Hodoshima et al. | The effect of pre-processing approach for improving speech intelligibility in a hall: Comparison between diotic and dichotic listening conditions | |
| Kubo et al. | Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance | |
| Amano et al. | Acoustic features of pop-out voice in babble noise | |
| CN102222507B (zh) | 一种适用于汉语语言的听力损失补偿方法及设备 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20070109 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20070109 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20090817 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20090825 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20091026 |
|
| A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20100420 |