WO2006070560A1 - 雑音抑圧装置、雑音抑圧方法、雑音抑圧プログラムおよびコンピュータに読み取り可能な記録媒体 - Google Patents
雑音抑圧装置、雑音抑圧方法、雑音抑圧プログラムおよびコンピュータに読み取り可能な記録媒体 Download PDFInfo
- Publication number
- WO2006070560A1 WO2006070560A1 PCT/JP2005/022095 JP2005022095W WO2006070560A1 WO 2006070560 A1 WO2006070560 A1 WO 2006070560A1 JP 2005022095 W JP2005022095 W JP 2005022095W WO 2006070560 A1 WO2006070560 A1 WO 2006070560A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- spectrum
- noise
- frame
- speech
- gain
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 19
- 238000001228 spectrum Methods 0.000 claims abstract description 214
- 238000006243 chemical reaction Methods 0.000 claims abstract description 41
- 238000009499 grossing Methods 0.000 claims abstract description 39
- 230000001629 suppression Effects 0.000 claims description 44
- 230000003595 spectral effect Effects 0.000 claims description 25
- 238000001514 detection method Methods 0.000 claims description 8
- 238000010586 diagram Methods 0.000 description 13
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 4
- 238000007781 pre-processing Methods 0.000 description 3
- 238000012935 Averaging Methods 0.000 description 1
- 230000005534 acoustic noise Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
Definitions
- Noise suppression device noise suppression method, noise suppression program, and computer-readable recording medium
- the present invention relates to a noise suppression device, a noise suppression method, a noise suppression program, and a computer-readable recording medium that suppress noise from an audio signal on which noise is superimposed.
- use of the present invention is not limited to the above-described noise suppression device, noise suppression method, noise suppression program, and computer-readable recording medium.
- Spectral subtraction proposed by S. F. Boll is known as a simple and very effective technique for suppressing noise from a speech signal on which noise is superimposed. Based on this spectral subtraction, the gain is calculated using the noise superimposed speech power spectrum of the current frame (see Non-Patent Document 1, for example).
- Non-Patent Document 1 S 'F' Ball (SF Boll), "Suppression of Acoustic Noise in Speech Using Spectral SuDtraction", Z Le Eey Transactions on Acoustics, Speech Processing and Signal Processing (197 ESP, 27-2, p. 113—120
- Non-Patent Document 2 Norihide Kitaoka, Ichiro Akahori, Kiyoshi Nakagawa, “Speech recognition under noisy environment using spectral subtraction and time-direction smoothing”, IEICE Transactions, 2 February 000, J83—D—II ⁇ , No. 2, p. 500—508
- the noise suppression device includes a first frame dividing means for dividing an input voice on which noise is superimposed, and an input voice divided into frames by the first frame dividing means into a spectrum.
- First spectrum converting means speech section detecting means for discriminating whether each frame divided by the first frame dividing means is a speech section or a non-speech section, and the speech section detecting means being determined as a non-speech section
- Noise spectrum estimation means for estimating a noise spectrum using the input voice spectrum in a section
- second frame division means for dividing the input voice into frames longer than the frame length of the first frame division means
- second spectrum conversion means for converting the input speech divided by the second frame dividing means into a spectrum
- Smoothing means for smoothing the spectrum converted by the second spectral conversion means in the frequency direction, the spectrum smoothed by the smoothing means, and the estimated noise estimated by the noise spectrum estimation means
- Gain calculating means for calculating a gain based on the spectrum, and extra-subtracting means for subtracting extra-scale
- the noise suppression method according to the invention of claim 7 includes a first frame dividing step of dividing an input speech on which noise is superimposed, and an input divided into frames by the first frame dividing step.
- a first spectrum converting step for converting speech into a spectrum a speech segment detecting step for determining whether each frame divided by the first frame dividing means is a speech segment or a non-speech segment, and a non-speech step in the speech segment detecting step.
- the input speech is framed to a frame length longer than the frame length of the noise spectrum estimation step of estimating the noise spectrum using the input speech spectrum of the interval determined as the speech interval and the first frame dividing step.
- a noise suppression program according to the invention of claim 8 causes a computer to execute the noise suppression method according to claim 7.
- the noise suppression program described in 8 is recorded.
- FIG. 1 is a block diagram showing a functional configuration of a noise suppression apparatus according to an embodiment of the present invention.
- FIG. 2 is a flowchart showing the processing of the noise suppression method according to the embodiment of the present invention.
- FIG. 3 is a block diagram showing a functional configuration of a noise suppression apparatus using spectral subtraction in the prior art.
- FIG. 4 shows a noise suppression apparatus using a time-direction smoothed noise-powered speech power spectrum. It is a block diagram which shows the functional structure of these.
- FIG. 5 is a block diagram showing a functional configuration of the noise suppression device of this embodiment.
- FIG. 6 is an explanatory diagram for explaining frame division of input speech.
- FIG. 7 is an explanatory diagram for explaining gain calculation when smoothing is performed in the frequency direction. Explanation of symbols
- FIG. 1 is a block diagram showing a functional configuration of a noise suppression device according to an embodiment of the present invention.
- the noise suppression apparatus calculates an input speech force speech spectrum and a noise vector, calculates a gain based on the speech spectrum and the noise spectrum, and uses the calculated gain to determine the noise of the input speech. Repress. Further, this noise suppression apparatus includes a first frame division unit 101, a first conversion unit 102, a noise spectrum estimation unit 103, a second frame division unit 104, a second conversion unit 105, a smoothing unit 106, a gain calculation unit. 107 and a spectrum subtracting unit 108.
- the first frame dividing unit 101 divides the input speech into frames with a predetermined frame length.
- the first conversion unit 102 converts the input speech that has been frame-divided by the first frame division unit into a vector.
- the noise spectrum estimation unit 103 estimates a noise vector using a spectrum of a frame determined to be a non-speech interval among the spectra converted by the first conversion unit 102.
- the second frame dividing unit 104 divides the input speech into frames having a frame length longer than that of the first frame dividing unit 101. Also, the second frame dividing unit 104 can divide the input voice into frames that are an integral multiple of the frame length of the first frame dividing unit 101, for example, twice the frame length.
- the first frame dividing unit 101 and the second frame dividing unit 104 can respectively perform windowing processing on the divided input speech. Further, the first frame dividing unit 101 and the second frame dividing unit 104 can perform windowing processing on the divided input speech using a hanging window.
- the second conversion unit 105 converts the input speech that has been frame-divided by the second frame division unit 104 into a spectrum.
- the smoothing unit 106 smoothes the spectrum converted by the second conversion unit 105 in the frequency direction. For example, when the second frame division unit 104 divides the input speech into frames that are twice the frame length of the first frame division unit 101, the smoothing unit 106 uses the second conversion unit 105 to The even numbered spectrum of the converted spectrum can be smoothed using the numbered spectra before and after the even numbered spectrum. That is, the smoothing unit 106 converts the 2K-th spectrum converted by the second conversion unit 105 into the 2K-first spectrum, the 2K-th spectrum, and the 2K + Smooth using the first spectrum.
- Gain calculating section 107 calculates a gain based on the spectrum smoothed by smoothing section 103 and the noise spectrum estimated by noise spectrum estimating section 103.
- the vector subtractor 108 multiplies the input speech spectrum converted by the first converter 102 by the gain calculated by the gain calculator 107, thereby suppressing the noise of the input speech.
- the spectrum subtraction unit 108 can input the gain calculated by the gain calculation unit 107 and the input speech spectrum converted by the first conversion unit 102 at the same timing.
- FIG. 2 is a flowchart showing processing of the noise suppression method according to the embodiment of the present invention.
- first frame division section 101 divides audio into frames with a predetermined frame length (step S201).
- the first conversion unit 102 converts the input voice frame-divided by the first frame division unit into a spectrum (step S202).
- the noise spectrum estimation unit 103 estimates the noise spectrum using the spectrum of the frame determined to be a non-speech period among the spectrum converted by the first conversion unit 102 (step S203).
- the second frame dividing unit 104 divides the input voice into frames longer than the frame length of the first frame dividing unit 101 (step S204).
- the second conversion unit 105 converts the input voice frame-divided by the second frame division unit 104 into a spectrum (step S205).
- the smoothing unit 106 smoothes the vector converted by the second conversion unit 105 in the frequency direction (step S206).
- the gain calculation unit 107 calculates a gain based on the spectrum smoothed by the smoothing unit 103 and the noise spectrum estimated by the noise spectrum estimation unit 103 (step S207).
- the vector subtraction unit 108 subtracts the spectrum by multiplying the input speech spectrum converted by the first conversion unit 102 by the gain calculated by the gain calculation unit 107 (step S208).
- the third term on the right side of the above equation is a cross-correlation term between speech and noise.
- speech and noise are uncorrelated, they are approximated by the following equation (2).
- the clean speech power spectrum is estimated as the following formula (3) by subtracting the noise power spectrum power estimation noise power spectrum.
- I s (k) ⁇ 2 ⁇ l X (k)
- a is a subtraction coefficient, and is set to a value larger than 1 in order to subtract a large amount of the estimated noise power spectrum.
- ⁇ is a floor coefficient, and is set to a small positive value to avoid the spectrum after subtraction being negative or close to zero.
- the above formula is gay G (k) can be used as a filtering for I x (k) I.
- the estimated clean speech amplitude spectrum can be obtained by the following equation (6).
- the estimated clean speech spectrum is obtained by the following equation (7).
- FIG. 3 is a block diagram showing a functional configuration of a conventional spectral subtraction noise suppression apparatus.
- the noise suppression apparatus shown in FIG. 3 includes a signal frame division unit 401, a spectrum conversion unit 402, a speech section detection unit 403, a noise spectrum estimation unit 404, a gain calculation unit 405, a spectrum subtraction unit 406, a waveform conversion unit 407,
- the waveform synthesis unit 408 is configured.
- the signal frame division unit 401 divides the noise-superimposed speech into frames having a certain number of sample powers, and sends the frames to the spectrum conversion unit 402 and the speech interval detection unit 403.
- the spectrum conversion unit 402 calculates the noise superimposed speech spectrum X (k) by discrete Fourier transform and sends it to the gain calculation unit 405 and the spectrum subtraction unit 406.
- the speech segment detection unit 403 discriminates the speech segment Z non-speech segment and sends the noise superimposed speech spectrum of the frame determined to be a non-speech segment to the noise spectrum estimation unit 404.
- the noise spectrum estimation unit 404 is a power spectrum of the past several frames determined to be non-speech. And calculate the estimated noise power spectrum.
- Gain calculation section 405 calculates gain G (k) using the noise superimposed speech power spectrum and the estimated noise power spectrum.
- Spectrum subtraction section 406 multiplies gain G (k) by noise-superimposed speech spectrum X (k) to estimate an estimated clean speech spectrum.
- the waveform converter 407 converts the estimated clean speech spectrum into a time waveform by inverse discrete Fourier transform.
- the waveform synthesis unit 408 synthesizes a continuous waveform by overlapping and adding time waveforms in frame units.
- FIG. 4 is a block diagram showing a functional configuration of a noise suppression device that uses a time-direction smoothed noise-superimposed speech power spectrum.
- the noise suppression apparatus shown in FIG. 4 has a configuration in which a time direction smoothing unit 409 is provided in front of the gain calculation unit 405 shown in FIG.
- the time-direction smooth noise-superimposed speech power spectrum at the current frame time t is obtained by a moving average of past L frames including the current frame as shown in the following equation (8).
- a is a weight for smoothing and is given by the following equation (9).
- Gain calculation section 405 calculates the noise superimposed speech power spectrum of the current frame in equation (5).
- FIG. 5 is a block diagram showing a functional configuration of the noise suppression apparatus of this embodiment.
- the noise suppression apparatus shown in FIG. 5 includes a signal frame division unit 401, a spectrum conversion unit 402, a speech interval detection unit 403, a noise spectrum estimation unit 404, a gain calculation unit 405, a spectrum subtraction unit 406, a waveform conversion unit 407, A waveform synthesis unit 408, a gain calculation frame division unit 601, a spectrum conversion unit 602, and a frequency direction smoothing unit 603 are configured.
- the actual processing is executed by using the RAM as a work area by reading the program written in the CPU power ROM.
- the embodiment will be described with reference to FIG. First, the noise superimposed speech is sent to the signal frame division unit 401 and the gain calculation frame division unit 601.
- the signal frame division unit 401 divides the noise-superimposed speech into frames composed of N (for example, 256) samples. At this time, a windowing process is performed in order to increase the frequency analysis accuracy of the discrete Fourier transform (DFT) in the spectrum conversion unit 402. In addition, when performing waveform synthesis processing, the frames are divided so as to overlap in order to prevent discontinuous waveforms at the frame boundaries.
- N for example, 256
- X (n) S (n) + d (n), 0 ⁇ n ⁇
- S (n) is a clean speech signal and d (n) is noise.
- Spectrum X (k) is sent to spectrum subtraction unit 406.s
- the speech segment detection unit 403 converts the noise-superimposed speech signal X (n) divided into frames in parallel.
- Noise spectrum estimation section 404 calculates a time average of power spectra for the past several frames determined to be non-speech intervals, and gives an estimated noise power spectrum DP by the following equation (11).
- the gain calculating frame dividing unit 601 divides the noise-superimposed speech into frames having more than N (for example, 512) sample forces. At this time, the center of the gain calculation frame division window is made to coincide with the center of the signal frame division window.
- the spectrum converter 602 converts the frame-divided noise-superimposed speech signal x (m) into discrete frames g
- X (1) S (1) + D (1), expressed as 0 ⁇ 1 ⁇ M—1.
- S (1) is the clean speech spectrum g g g g g
- the first component of Toll, D (1) is the first component of the noise spectrum.
- the frequency direction smoothing unit 603 smoothes the gain calculation spectrum X (1).
- Gain g the gain calculation spectrum X (1).
- the gain calculation unit 405 uses the estimated noise power spectrum DP and the frequency-direction smoothed power spectrum XP sent from the noise spectrum estimation unit 404 to calculate the gain G (k) as shown in the following equation (13). To calculate.
- ⁇ is a subtraction coefficient and is set to a value larger than 1 to subtract a large amount of the estimated noise power spectrum DP
- ⁇ is a floor coefficient
- the spectrum after subtraction is negative or 0 It is set to a small positive value to avoid a value close to.
- the calculated gain G (k) is sent to the spectrum subtraction unit 406.
- the spectrum subtraction unit 406 multiplies the spectrum X (k) calculated by the spectrum conversion unit 402 by the gain s in G (k) to obtain the estimated clean speech vector from which the estimated noise spectrum is subtracted. Calculated as shown in the following formula (14).
- the waveform converter 407 performs inverse discrete Fourier transform (InverSe Discrete Fourier Transform (IDFT) is used to obtain a time waveform in units of frames.
- the waveform synthesizer 408 synthesizes a continuous waveform by overlapping the time waveforms in units of frames, and outputs noise-suppressed speech.
- FIG. 6 is an explanatory diagram for explaining frame division of input speech.
- FIG. 6 (a) shows a case where the signal frame dividing unit 401 divides the noise-superimposed speech into frames having N (for example, 256) sample power.
- N for example, 256
- a windowing process is performed in order to increase the frequency analysis accuracy of Discrete Fourier Transform (DFT).
- DFT Discrete Fourier Transform
- the frames are divided so as to overlap to prevent discontinuous waveforms at the frame boundaries.
- FIG. 6B shows a case where the gain calculation frame dividing unit 601 divides the noise-superimposed speech into frames of more than N (for example, 512) samplers.
- N for example, 512
- the time width is twice that of Fig. 6 (a). In this way, the number of samples in the gain calculation frame is made larger than the number of samples in the signal frame. Also, the center of the gain calculation frame and the center of the signal frame are matched.
- FIG. 7 is an explanatory diagram illustrating gain calculation when smoothing is performed in the frequency direction. As shown in the diagram 801, the gain calculation spectrum X (1) is converted by the spectrum conversion unit 602.
- the number-direction smoothing uses a plurality of spectral components centered on a spectral component that matches the frequency of the signal spectral component.
- the gain 802 indicated by G (3) is calculated.
- the gain 802 is combined with the spectrum X (k) shown by the graph 803 in the spectrum subtraction unit 406.
- the window function will be described.
- the spectrum conversion of a long signal is a discrete Fourier transform because it uses force discrete value data that is divided into frame units and Fourier-transformed.
- the width of the main lobe (frequency force ⁇ the surrounding amplitude spectrum is large and the region) is narrow, and the amplitude of the side lobe (the amplitude spectrum where the frequency is away from 0 is small and the region) is small. It is a condition.
- Specific examples include rectangular windows, Hayung windows, windows, ming windows, and Gauss windows.
- the window function used in the present embodiment is a Hayung window.
- the window function of the Hayung window is 0 ⁇ n ⁇ N
- This window function has relatively low sidelobe amplitude, although the frequency resolution of the main lobe is relatively poor.
- smoothing is performed in the frequency direction using a plurality of spectral components of the noise superimposed speech power spectrum, so that the cross-correlation term between speech and noise can be reduced, and high The accuracy gain can be estimated. Further, since the center of the gain calculation frame and the signal frame coincide with each other, the gain can be calculated using a frame at almost the same time as the signal frame, so that the gain can be estimated with high accuracy. As a result, the musical noise can provide high-quality sound with less distortion of the sound spectrum. In addition, when the embodiment is used as preprocessing for voice recognition, the effect of improving the voice recognition rate under noise is significant.
- the noise suppression method described in the present embodiment can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation.
- This program is recorded on a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, and a DVD, and is executed by being read by the computer.
- This program is a transmission medium that can be distributed over a network such as the Internet. Also good.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuit For Audible Band Transducer (AREA)
- Noise Elimination (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006550638A JP4568733B2 (ja) | 2004-12-28 | 2005-12-01 | 雑音抑圧装置、雑音抑圧方法、雑音抑圧プログラムおよびコンピュータに読み取り可能な記録媒体 |
US11/794,130 US7957964B2 (en) | 2004-12-28 | 2005-12-01 | Apparatus and methods for noise suppression in sound signals |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004382163 | 2004-12-28 | ||
JP2004-382163 | 2004-12-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006070560A1 true WO2006070560A1 (ja) | 2006-07-06 |
Family
ID=36614685
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/022095 WO2006070560A1 (ja) | 2004-12-28 | 2005-12-01 | 雑音抑圧装置、雑音抑圧方法、雑音抑圧プログラムおよびコンピュータに読み取り可能な記録媒体 |
Country Status (3)
Country | Link |
---|---|
US (1) | US7957964B2 (ja) |
JP (1) | JP4568733B2 (ja) |
WO (1) | WO2006070560A1 (ja) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010055024A (ja) * | 2008-08-29 | 2010-03-11 | Toshiba Corp | 信号補正装置 |
JP2010532879A (ja) * | 2007-07-06 | 2010-10-14 | オーディエンス,インコーポレイテッド | アダプティブ・インテリジェント・ノイズ抑制システム及び方法 |
JP2011081033A (ja) * | 2009-10-02 | 2011-04-21 | Toshiba Corp | 信号処理装置、及び携帯端末装置 |
US8515098B2 (en) | 2008-10-24 | 2013-08-20 | Yamaha Corporation | Noise suppression device and noise suppression method |
CN112837703A (zh) * | 2020-12-30 | 2021-05-25 | 深圳市联影高端医疗装备创新研究院 | 医疗成像设备中语音信号获取方法、装置、设备和介质 |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100207689A1 (en) * | 2007-09-19 | 2010-08-19 | Nec Corporation | Noise suppression device, its method, and program |
DK2164066T3 (da) * | 2008-09-15 | 2016-06-13 | Oticon As | Støjspektrumsporing i støjende akustiske signaler |
JP5526524B2 (ja) | 2008-10-24 | 2014-06-18 | ヤマハ株式会社 | 雑音抑圧装置及び雑音抑圧方法 |
JP5245714B2 (ja) | 2008-10-24 | 2013-07-24 | ヤマハ株式会社 | 雑音抑圧装置及び雑音抑圧方法 |
JP5566846B2 (ja) * | 2010-10-15 | 2014-08-06 | 本田技研工業株式会社 | ノイズパワー推定装置及びノイズパワー推定方法並びに音声認識装置及び音声認識方法 |
EP2717263B1 (en) * | 2012-10-05 | 2016-11-02 | Nokia Technologies Oy | Method, apparatus, and computer program product for categorical spatial analysis-synthesis on the spectrum of a multichannel audio signal |
JP6477295B2 (ja) * | 2015-06-29 | 2019-03-06 | 株式会社Jvcケンウッド | 雑音検出装置、雑音検出方法及び雑音検出プログラム |
JP6597062B2 (ja) * | 2015-08-31 | 2019-10-30 | 株式会社Jvcケンウッド | 雑音低減装置、雑音低減方法、雑音低減プログラム |
JP6729187B2 (ja) | 2016-08-30 | 2020-07-22 | 富士通株式会社 | 音声処理プログラム、音声処理方法及び音声処理装置 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0822297A (ja) * | 1994-07-07 | 1996-01-23 | Matsushita Commun Ind Co Ltd | 雑音抑圧装置 |
JPH09311698A (ja) * | 1996-05-21 | 1997-12-02 | Oki Electric Ind Co Ltd | 背景雑音消去装置 |
JP2001134287A (ja) * | 1999-11-10 | 2001-05-18 | Mitsubishi Electric Corp | 雑音抑圧装置 |
JP2002221988A (ja) * | 2001-01-25 | 2002-08-09 | Toshiba Corp | 音声信号の雑音抑圧方法と装置及び音声認識装置 |
JP2003101445A (ja) * | 2001-09-20 | 2003-04-04 | Mitsubishi Electric Corp | エコー処理装置 |
JP2004234023A (ja) * | 2004-04-02 | 2004-08-19 | Mitsubishi Electric Corp | 雑音抑圧装置 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1376539B8 (en) * | 2001-03-28 | 2010-12-15 | Mitsubishi Denki Kabushiki Kaisha | Noise suppressor |
-
2005
- 2005-12-01 US US11/794,130 patent/US7957964B2/en not_active Expired - Fee Related
- 2005-12-01 WO PCT/JP2005/022095 patent/WO2006070560A1/ja not_active Application Discontinuation
- 2005-12-01 JP JP2006550638A patent/JP4568733B2/ja not_active Expired - Fee Related
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0822297A (ja) * | 1994-07-07 | 1996-01-23 | Matsushita Commun Ind Co Ltd | 雑音抑圧装置 |
JPH09311698A (ja) * | 1996-05-21 | 1997-12-02 | Oki Electric Ind Co Ltd | 背景雑音消去装置 |
JP2001134287A (ja) * | 1999-11-10 | 2001-05-18 | Mitsubishi Electric Corp | 雑音抑圧装置 |
JP2002221988A (ja) * | 2001-01-25 | 2002-08-09 | Toshiba Corp | 音声信号の雑音抑圧方法と装置及び音声認識装置 |
JP2003101445A (ja) * | 2001-09-20 | 2003-04-04 | Mitsubishi Electric Corp | エコー処理装置 |
JP2004234023A (ja) * | 2004-04-02 | 2004-08-19 | Mitsubishi Electric Corp | 雑音抑圧装置 |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010532879A (ja) * | 2007-07-06 | 2010-10-14 | オーディエンス,インコーポレイテッド | アダプティブ・インテリジェント・ノイズ抑制システム及び方法 |
JP2010055024A (ja) * | 2008-08-29 | 2010-03-11 | Toshiba Corp | 信号補正装置 |
US8108011B2 (en) | 2008-08-29 | 2012-01-31 | Kabushiki Kaisha Toshiba | Signal correction device |
US8515098B2 (en) | 2008-10-24 | 2013-08-20 | Yamaha Corporation | Noise suppression device and noise suppression method |
JP2011081033A (ja) * | 2009-10-02 | 2011-04-21 | Toshiba Corp | 信号処理装置、及び携帯端末装置 |
CN112837703A (zh) * | 2020-12-30 | 2021-05-25 | 深圳市联影高端医疗装备创新研究院 | 医疗成像设备中语音信号获取方法、装置、设备和介质 |
Also Published As
Publication number | Publication date |
---|---|
US20080010063A1 (en) | 2008-01-10 |
JP4568733B2 (ja) | 2010-10-27 |
US7957964B2 (en) | 2011-06-07 |
JPWO2006070560A1 (ja) | 2008-06-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2006070560A1 (ja) | 雑音抑圧装置、雑音抑圧方法、雑音抑圧プログラムおよびコンピュータに読み取り可能な記録媒体 | |
AU696152B2 (en) | Spectral subtraction noise suppression method | |
JP4958303B2 (ja) | 雑音抑圧方法およびその装置 | |
JP5791092B2 (ja) | 雑音抑圧の方法、装置、及びプログラム | |
KR101737824B1 (ko) | 잡음 환경의 입력신호로부터 잡음을 제거하는 방법 및 그 장치 | |
JP4856662B2 (ja) | 雑音除去装置、その方法、そのプログラム及び記録媒体 | |
JP4787851B2 (ja) | エコー抑圧ゲイン推定方法とそれを用いたエコー消去装置と、装置プログラムと記録媒体 | |
JP4454591B2 (ja) | 雑音スペクトル推定方法、雑音抑圧方法及び雑音抑圧装置 | |
CN115223583A (zh) | 一种语音增强方法、装置、设备及介质 | |
JP4965891B2 (ja) | 信号処理装置およびその方法 | |
EP1944754B1 (en) | Speech fundamental frequency estimator and method for estimating a speech fundamental frequency | |
JP5769671B2 (ja) | エコー抑圧ゲイン推定方法とそれを用いたエコー消去装置とプログラム | |
JP4123835B2 (ja) | 雑音抑圧装置および雑音抑圧方法 | |
JP5889224B2 (ja) | エコー抑圧ゲイン推定方法とそれを用いたエコー消去装置とプログラム | |
JP5413575B2 (ja) | 雑音抑圧の方法、装置、及びプログラム | |
JP5769670B2 (ja) | エコー抑圧ゲイン推定方法とそれを用いたエコー消去装置とプログラム | |
JP3279254B2 (ja) | スペクトル雑音除去装置 | |
JP5769672B2 (ja) | エコー抑圧ゲイン推定方法とそれを用いたエコー消去装置とプログラム | |
JP4325044B2 (ja) | 音声認識システム | |
JP5562451B1 (ja) | エコー抑圧ゲイン推定方法とそれを用いたエコー消去装置とプログラム | |
EP1635331A1 (en) | Method for estimating a signal to noise ratio | |
JP5700850B2 (ja) | 遅延推定方法とその方法を用いたエコー消去方法と、それらの装置とプログラムとその記録媒体 | |
JP2005099405A (ja) | 雑音除去方法、雑音除去装置およびプログラム | |
CN111226278B (zh) | 低复杂度的浊音语音检测和基音估计 | |
JP2019060976A (ja) | 音声処理プログラム、音声処理方法および音声処理装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006550638 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11794130 Country of ref document: US |
|
WWP | Wipo information: published in national office |
Ref document number: 11794130 Country of ref document: US |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 05811370 Country of ref document: EP Kind code of ref document: A1 |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 5811370 Country of ref document: EP |