EP1395981A1 - Dispositif et procede de traitement d'un signal audio. - Google Patents
Dispositif et procede de traitement d'un signal audio.Info
- Publication number
- EP1395981A1 EP1395981A1 EP02743323A EP02743323A EP1395981A1 EP 1395981 A1 EP1395981 A1 EP 1395981A1 EP 02743323 A EP02743323 A EP 02743323A EP 02743323 A EP02743323 A EP 02743323A EP 1395981 A1 EP1395981 A1 EP 1395981A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- windows
- processing
- segmentation
- audio signal
- samples
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012545 processing Methods 0.000 title claims abstract description 72
- 230000005236 sound signal Effects 0.000 title claims abstract description 33
- 238000000034 method Methods 0.000 title claims description 32
- 230000011218 segmentation Effects 0.000 claims abstract description 41
- 230000001360 synchronised effect Effects 0.000 claims abstract description 11
- 230000009466 transformation Effects 0.000 claims description 35
- 230000009467 reduction Effects 0.000 claims description 28
- 238000011282 treatment Methods 0.000 claims description 13
- 230000003595 spectral effect Effects 0.000 claims description 12
- 238000000844 transformation Methods 0.000 claims description 12
- 230000001755 vocal effect Effects 0.000 claims description 6
- 238000004590 computer program Methods 0.000 claims description 5
- 238000003672 processing method Methods 0.000 claims description 3
- 230000015654 memory Effects 0.000 description 10
- 230000006870 function Effects 0.000 description 7
- 238000004422 calculation algorithm Methods 0.000 description 6
- 238000004891 communication Methods 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000007493 shaping process Methods 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 238000004833 X-ray photoelectron spectroscopy Methods 0.000 description 1
- 238000010420 art technique Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000004513 sizing Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
Definitions
- the present invention relates to the field of processing audio signals. More specifically, the invention relates, in particular, to the reduction or cancellation of noise in an audio signal processed by a digital communication device, for example of the digital telephone type and / or radio hands-free type mobile telephones.
- this problem is remedied by inserting noise attenuators or cancellers, acting on the signal picked up by a microphone, before specific processing of the audio signal.
- an echo or noise cancellation and reduction device is inserted between a microphone intended to pick up an audio signal and a device for processing the audio signal.
- This device improves the useful signal-to-noise ratio or reduces the echo so that the signal can be further processed under optimized conditions.
- this technique of the prior art requires a specific dedicated device, which has the disadvantage of causing additional costs and increased complexity of use.
- the Ibruit reduction function based on the use of a fast Fourier transform (or FFT) applied to a continuous flow of vocal samples is integrated into the digital communication device.
- FFT fast Fourier transform
- the sample stream is divided into windows of 256 samples obtained by the application of a formatting window, the windows overlapping by half (the first 128 samples of a window corresponding to the last 128 samples from the previous window).
- An FFT is applied to each window then the FFT result is processed by a noise canceling or echo cancellation function.
- the result of this function is processed by an inverse fast Fourier transform (or IFFT) in order to reconstitute a flow of vocal samples which can be processed by a vocal processing function.
- IFFT inverse fast Fourier transform
- an objective of the invention is to provide a method and a device for audio processing in a device which allows a reduction in the complexity of a processing based on a mathematical transformation applying to blocks of data while optimizing the audio processing applied to audio frames.
- Another objective of the invention is to optimize the integration of the processing based on a mathematical transformation and of the audio processing.
- An objective of the invention is also to optimize the time periods for these treatments.
- Another objective of the invention is to reduce the computing power necessary for these treatments.
- the invention proposes a method for processing an audio signal, comprising:!
- a first step of processing a source audio signal implementing at least one mathematical transformation applied to first sequences of samples obtained by the application of first segmentation windows on the source audio signal;
- a second stage of audio processing applied to second sequences of samples obtained by the application of second segmentation windows to the signal delivered by the first stage, the second segmentation windows being separate from the first segmentation windows; J remarkable in that two successive first windows and / or j two i successive second windows overlap, the overlaps being such that the segmentations are synchronous.
- the audio processing steps can be implemented sequentially or in a multitasking environment. Otherwise,; this implementation is facilitated by the use of memory with predictable, precise and economical sizing. According to a particular characteristic, the method is remarkable in that the second segmentation windows are successive frames.
- the method is remarkable in that the last sample of a first sequence is also the last sample, after the first step, of the corresponding second sequence.
- the second audio processing step is carried out without unnecessary waiting to optimize the overall audio processing times.
- the method is remarkable in that
- each first segmentation window is a window with perfect reconstruction obtained by convolution:
- the first intermediate window being adapted to the mathematical transformation (s) (in particular there is an attenuation of the second relatively strong window lobe while the main lobe remains flat), the quality of the corresponding treatment is optimized.
- the second intermediate window being rectangular, the processing of the corresponding samples is simple and efficient. According to a particular characteristic, the method is remarkable in that i the first processing step applied to each first sequence further comprises:
- the method is remarkable in that the predetermined processing sub-step comprises a reduction or cancellation of noise in the audio signal. According to a particular characteristic, the method is remarkable in that the predetermined processing sub-step comprises at least one processing belonging to the group comprising:
- the method advantageously combines treatments such as the reduction and / or cancellation of noise and / or echo and / or voice recognition in a device (for example of the telephone, personal computer or remote control type) which allows a reduction of complexity while optimizing the efficiency of these treatments and / or a strong integration of the device (which consequently allows a reduction in costs and energy consumption which which is relatively important in particular for communication devices operating on battery).
- a device for example of the telephone, personal computer or remote control type
- the method is remarkable in that said one or more mathematical transformations belong to the group comprising:
- the invention advantageously makes it possible to use one or more mathematical transformations adapted to the first audio processing, these transformations being applied to blocks of size different from the size of the second segmentation windows.
- the method is remarkable in that the source audio signal is a voice signal.
- the invention is thus well suited to the second audio processing when it is specific to speech such as, for example, voice coding (“vocoding”) and / or voice compression for memorization and / or remote transmission.
- the invention also relates to a device for processing an audio signal, comprising: - first means for processing a source audio signal, implementing at least one mathematical transformation applied to first sequences of samples obtained by the application of the first segmentation windows to the source audio signal; and j
- step the second segmentation windows being distinct from the first segmentation windows; remarkable in that two successive first windows and / or two successive second windows overlap, the overlaps being such that the segmentations are synchronous. !
- the invention relates to a computer program product, remarkable in that the program comprises sequences of instructions adapted to the implementation of an audio processing method as previously described when the program is executed on a computer.
- FIG. 2 illustrates the successive treatments carried out by the radiotelephone of Figure 1, on a voice signal
- FIG. 3 shows a noise cancellation or reduction algorithm, according to Figure 2;
- FIG. 4 shows a voice processing applied to a frame, according to Figure 2; :
- FIG. 5 describes a windowing of the sample flow as performed by the processing of Figures 3 and 4;
- FIG. 7 illustrates a formatting window, optimized and used in the window operations of Figure 3 according to a preferred embodiment of the invention.
- ⁇ - FIG. 8 describes more precisely a reduction type treatment of
- the FFT and IFFT treat windows comprising a power of i
- the speech coding takes into account windows which do not have the same size (typically voice processing within the framework of GSM i considers windows of 160 samples).
- the voice signal is sampled at a frequency of 8 kHz before being transmitted in 20ms frames in compressed form to a recipient.
- the speech coding is carried out on frames of 160 samples, by a vocoder. This coding which is a function of the desired bit rate is notably specified in the following documents:
- EFR Enhanced Full Rate
- AMR Adaptive Multi-Rate
- the noise reduction or cancellation device and / or echo processes a window of length 256 which can cut up to three windows of length 160. It is, among other things, the asynchronism inherent in this state-of-the-art technique that makes these treatments complex and requires an oversizing of memories and computing power and / or the clock d 'a DSP (Signal Processing Processor' from the English 'Digital Signal Processor' used for calculations).
- the two types of processing are synchronized by making the end of a noise cancellation or reduction window and / or echo systematically coincide with a voice processing frame and preferably with the end of a voice processing frame.
- a formatting window (adapted to associated speech frames of 160 samples and to FFT at 256 points) is preferably:
- Such a window is, for example, obtained by the convolution of a Hanning window of width 97 (denoted Hanning (97)) with a rectangular window of width 160 (denoted Rect (160)).
- a 256-point FFT is then applied to each window of 256 samples synchronized on the frames of 160 samples.
- a noise reduction algorithm of any type known per se, is applied before performing an inverse transform operation (denoted IFFT) on the block of 256 samples considered.
- Blocks of 256 samples are thus processed successively.
- the first 96 processed samples from the current window are added to the last 96 processed samples from the previous window.
- the first 160 samples of the current window are transmitted to the vocoder to be processed according to the speech coding methods known per se, in accordance, where appropriate, with the applicable standard. !
- FIG. 1 a radiotelephone implementing the invention is presented.
- FIG. 1 schematically illustrates a general block diagram; of a radiotelephone, in accordance with the invention according to a preferred embodiment.
- the radiotelephone 100 comprises interconnected by an address and data bus 103:
- DSP signal processing processor
- FIG. 1 a human / machine relationship interface (typically a keyboard and a screen) 113.
- a human / machine relationship interface typically a keyboard and a screen 113.
- FIG. 1 Each of the elements illustrated in FIG. 1 is well known to those skilled in the art. These common elements are not described here.
- register designates in each of the memories mentioned, both a low-capacity memory area (some binary data) and a high-capacity memory area (allowing a program to be stored whole or an entire sequence of transaction data).
- the non-volatile memory 105 (or ROM) stores in registers which, for convenience, have the same names as the data they store:
- a value L (typically worth 256), representing a first size of segmentation window corresponding to a number of points taken into account by an FFT in a register 115; - a value V (typically worth 160), representing a second; window size corresponding to a frame size processed by a vo ⁇ deur in a register 115; and; values, ⁇ , ⁇ , K and used for noise reduction in the signal.
- L typically worth 256
- V typically worth 160
- the random access memory 106 stores data, variables and intermediate processing results and includes in particular:
- the DSP is particularly suitable for processing of the Fourier transformation and speech coding type.
- DSP GROUP registered trademark
- OAK registered trademark
- FIG. 2 illustrates the successive treatments carried out by the radiotelephone of FIG. 1, on a voice signal.
- the signal entering the microphone 107 is the sum 203: - of a voice signal which can be affected by an echo (symbolized by the sum of the produced signal 200 and the delayed produced signal); and - a noise 202
- the noisy signal picked up by the microphone 107 is delivered to the converter.
- Analog / Digital 204 where it converted into a series of digital samples during a step 204.
- the sampling is typically done at a frequency equal to 8 kHz.
- “vocoded” frames are shaped by the unit 112 to be transmitted by the radio module 111 according to techniques known per se (for example, according to the GSM standard).
- i i
- FIG. 3 presents an algorithm for canceling or reducing noise, implemented in the processing step 205 of FIG. 2.
- the DSP 104 initializes in the RAM 106, a first block of 96 samples at zero corresponding to the last samples received as well as all the variables necessary for the proper functioning of the processing 205.
- the DSP 104 stores in the RAM 106 following the samples previously received a sequence of 160 incoming samples from the converter 108.
- the DSP 104 applies a window of
- step 304 an inverse transformation from that of step 302, of IFFT type is applied to the sequence processed.
- the DSP 104 adds, if necessary (that is to say after a first iteration), the last 96 samples of the previous processed sequence to the first 96 processed samples of the current sequence .
- step 306 the sequence or frame formed of the first 160 current processed samples is transmitted to the vocoder. Then, during a step 307, the 160 samples received corresponding to the 160 samples transmitted during step 305 are erased from memory 106.
- step 301 is repeated.
- FIG. 4 presents a coding of the speech, implemented in step 206 of FIG. 2. ' ,
- the DSP 104 initializes in the RAM 106, all the variables necessary for the proper functioning of the coding 206.
- FIG. 5 describes a windowing of the sequences of samples as performed by the processing of FIGS. 3 and 4.; On a first graph, the curve 500 of the intensity 503 is represented.
- the time is divided into successive frames 507 and 508 of length L ′ equal to 160, not overlapping and obtained during the transmission step 306.
- the signal segmentation is such that windows 505 (respecuvement 506), and 507 (respectively 502) are perfectly synchronized.
- the windows 505 (respectively 506), and 507 (respectively 502) end on the same sample before or after treatment (according to steps 303, 304 and 305).
- FIG. 7 illustrates windows 700 and 701 for shaping, optimized according to the invention (corresponding to the windows 505 and 506 respectively of FIG. 5 but shown more precisely).
- the graph gives the amplitude 602 of a window as a function of the rank of a sample 601.
- windows 700 and 701 are Hanning windows ob: enue by convolution of an intermediate Hanning window of length 97 with a rectangular window of length 160. We thus obtain, with the successive shifts of the windows, equal to 160 samples of windows with perfect reconstruction.
- FIG. 8 specifies the step 303 of processing of the noise reduction type as illustrated with reference to FIG. 3.
- a frame 801 comprising 256 spectral components corresponding to a noisy voice signal is processed according to the processing 303 described below.
- the DSP 104 converts the components of the frame 801 from rectangular coordinates to polar coordinates to separate the phase from the spectral amplitude.
- the power P m) of the isignal j is first estimated in the short term according to the following relationships: i
- P xk (l) (1- oc)
- PJm) a PJm-1) + (1- ⁇ )
- PJm PJm
- ⁇ f corresponds to a spectral value floor, / ⁇ limit the attenuation of the noise reduction filter to a positive value to leave a 1 minimum noise in the signal.
- the DSP 104 multiplies the amplitude
- the DSP 104 constructs the signal 809 with reduced noise from the amplitude
- the signal 809 is then processed according to step 304 of inverse Fourier transformation.
- the person skilled in the art can make any variant in the application of the invention which is not limited to mobile telephony (in particular of GSM, UMTS, IS95 type, etc.) but extends to any type of device comprising an audio coding after or before a mathematical transformation on an incoming audio signal.
- mobile telephony in particular of GSM, UMTS, IS95 type, etc.
- any type of device comprising an audio coding after or before a mathematical transformation on an incoming audio signal.
- the invention applies not only to the processing of voice source signals but extends to any type of audio processing.
- the mathematical transformation applied is in particular of any type applying to blocks of samples of a particular length which is not equal to the size of the frames processed according to an audio processing or which is not a multiple or a divisor close to this size of (rame.
- the invention extends to the case where the size of the audio frames is equal to 1
- the invention applies to any type of processing associated with the mathematical transformation and carried out before or after a step of coding the speech, in particular in the case of voice recognition or cancellation; and / or echo reduction.
- the invention is not limited to a purely material implantation but that it can also be implemented in the form of a sequence of instructions of a computer program or any form mixes a material part and a part software.
- the corresponding sequence of instructions may be stored in a removable storage means (such as for example a floppy disk, a CD-ROM or a DVD-ROM) or no, this storage means being partially or totally readable by a computer than a microprocessor.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Telephone Function (AREA)
- Input Circuits Of Receivers And Coupling Of Receivers And Audio Equipment (AREA)
- Stereo-Broadcasting Methods (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
- Noise Elimination (AREA)
- Stereophonic System (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Description
Claims
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR0106412 | 2001-05-15 | ||
| FR0106412A FR2824978B1 (fr) | 2001-05-15 | 2001-05-15 | Dispositif et procede de traitement d'un signal audio |
| PCT/FR2002/001640 WO2002093558A1 (fr) | 2001-05-15 | 2002-05-15 | Dispositif et procede de traitement d'un signal audio. |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EP1395981A1 true EP1395981A1 (fr) | 2004-03-10 |
| EP1395981B1 EP1395981B1 (fr) | 2007-10-31 |
Family
ID=8863317
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP02743323A Expired - Lifetime EP1395981B1 (fr) | 2001-05-15 | 2002-05-15 | Dispositif et procede de traitement d'un signal audio. |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US7295968B2 (fr) |
| EP (1) | EP1395981B1 (fr) |
| JP (1) | JP2004527797A (fr) |
| KR (1) | KR20040005965A (fr) |
| CN (1) | CN1223991C (fr) |
| AT (1) | ATE377244T1 (fr) |
| DE (1) | DE60223246D1 (fr) |
| FR (1) | FR2824978B1 (fr) |
| IL (2) | IL158797A0 (fr) |
| WO (1) | WO2002093558A1 (fr) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118430527A (zh) * | 2024-07-05 | 2024-08-02 | 青岛珞宾通信有限公司 | 一种基于pda端边缘计算处理的声音识别方法 |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8219391B2 (en) * | 2005-02-15 | 2012-07-10 | Raytheon Bbn Technologies Corp. | Speech analyzing system with speech codebook |
| EP2024863B1 (fr) | 2006-05-07 | 2018-01-10 | Varcode Ltd. | Systeme et procede pour ameliorer la gestion de la qualite dans une chaine logistique de produits |
| US7562811B2 (en) | 2007-01-18 | 2009-07-21 | Varcode Ltd. | System and method for improved quality management in a product logistic chain |
| ATE520120T1 (de) * | 2006-06-29 | 2011-08-15 | Nxp Bv | Klangrahmenlängenanpassung |
| WO2008135962A2 (fr) | 2007-05-06 | 2008-11-13 | Varcode Ltd. | Système et procédé de gestion de qualité utilisant des indicateurs de code à barres |
| JP5638948B2 (ja) * | 2007-08-01 | 2014-12-10 | ジンジャー ソフトウェア、インコーポレイティッド | インターネットコーパスを用いた、文脈依存言語の自動的な修正および改善 |
| US8540156B2 (en) | 2007-11-14 | 2013-09-24 | Varcode Ltd. | System and method for quality management utilizing barcode indicators |
| US11704526B2 (en) | 2008-06-10 | 2023-07-18 | Varcode Ltd. | Barcoded indicators for quality management |
| KR20120125310A (ko) | 2010-02-01 | 2012-11-14 | 진저 소프트웨어 인코퍼레이티드 | 특히 소형 키보드 디바이스를 위한 인터넷 코퍼스를 사용하는 자동 문맥 감응식 언어 교정 |
| EP2372704A1 (fr) * | 2010-03-11 | 2011-10-05 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Processeur de signal et procédé de traitement d'un signal |
| US20140025374A1 (en) * | 2012-07-22 | 2014-01-23 | Xia Lou | Speech enhancement to improve speech intelligibility and automatic speech recognition |
| US8807422B2 (en) | 2012-10-22 | 2014-08-19 | Varcode Ltd. | Tamper-proof quality management barcode indicators |
| EP2848300A1 (fr) | 2013-09-13 | 2015-03-18 | Borealis AG | Procédé de production d'oléfines par métathèse et système de réacteur associé |
| JP6048596B2 (ja) * | 2014-01-28 | 2016-12-21 | 三菱電機株式会社 | 集音装置、集音装置の入力信号補正方法および移動機器情報システム |
| CN104914307B (zh) * | 2015-04-23 | 2017-09-12 | 深圳市鼎阳科技有限公司 | 一种频谱仪及其多参数并行扫频的频谱测量方法 |
| CN107615027B (zh) | 2015-05-18 | 2020-03-27 | 发可有限公司 | 用于可激活质量标签的热致变色墨水标记 |
| US10697837B2 (en) | 2015-07-07 | 2020-06-30 | Varcode Ltd. | Electronic quality indicator |
| US10594530B2 (en) * | 2018-05-29 | 2020-03-17 | Qualcomm Incorporated | Techniques for successive peak reduction crest factor reduction |
| US20210020191A1 (en) * | 2019-07-18 | 2021-01-21 | DeepConvo Inc. | Methods and systems for voice profiling as a service |
| US11532314B2 (en) * | 2019-12-16 | 2022-12-20 | Google Llc | Amplitude-independent window sizes in audio encoding |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1062963C (zh) * | 1990-04-12 | 2001-03-07 | 多尔拜实验特许公司 | 用于产生高质量声音信号的解码器和编码器 |
| JPH07264144A (ja) * | 1994-03-16 | 1995-10-13 | Toshiba Corp | 信号圧縮符号化装置および圧縮信号復号装置 |
| FI100840B (fi) * | 1995-12-12 | 1998-02-27 | Nokia Mobile Phones Ltd | Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin |
| WO1998006090A1 (fr) * | 1996-08-02 | 1998-02-12 | Universite De Sherbrooke | Codage parole/audio a l'aide d'une transformee non lineaire a amplitude spectrale |
| US5913191A (en) * | 1997-10-17 | 1999-06-15 | Dolby Laboratories Licensing Corporation | Frame-based audio coding with additional filterbank to suppress aliasing artifacts at frame boundaries |
| US5903872A (en) * | 1997-10-17 | 1999-05-11 | Dolby Laboratories Licensing Corporation | Frame-based audio coding with additional filterbank to attenuate spectral splatter at frame boundaries |
| US6370500B1 (en) * | 1999-09-30 | 2002-04-09 | Motorola, Inc. | Method and apparatus for non-speech activity reduction of a low bit rate digital voice message |
| US6418405B1 (en) * | 1999-09-30 | 2002-07-09 | Motorola, Inc. | Method and apparatus for dynamic segmentation of a low bit rate digital voice message |
| FI116643B (fi) * | 1999-11-15 | 2006-01-13 | Nokia Corp | Kohinan vaimennus |
-
2001
- 2001-05-15 FR FR0106412A patent/FR2824978B1/fr not_active Expired - Fee Related
-
2002
- 2002-05-15 DE DE60223246T patent/DE60223246D1/de not_active Expired - Lifetime
- 2002-05-15 JP JP2002590150A patent/JP2004527797A/ja active Pending
- 2002-05-15 US US10/477,816 patent/US7295968B2/en not_active Expired - Fee Related
- 2002-05-15 KR KR10-2003-7014895A patent/KR20040005965A/ko not_active Withdrawn
- 2002-05-15 AT AT02743323T patent/ATE377244T1/de not_active IP Right Cessation
- 2002-05-15 CN CNB028129784A patent/CN1223991C/zh not_active Expired - Fee Related
- 2002-05-15 IL IL15879702A patent/IL158797A0/xx active IP Right Grant
- 2002-05-15 WO PCT/FR2002/001640 patent/WO2002093558A1/fr not_active Ceased
- 2002-05-15 EP EP02743323A patent/EP1395981B1/fr not_active Expired - Lifetime
-
2003
- 2003-11-10 IL IL158797A patent/IL158797A/en not_active IP Right Cessation
Non-Patent Citations (1)
| Title |
|---|
| See references of WO02093558A1 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118430527A (zh) * | 2024-07-05 | 2024-08-02 | 青岛珞宾通信有限公司 | 一种基于pda端边缘计算处理的声音识别方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| FR2824978B1 (fr) | 2003-09-19 |
| WO2002093558A1 (fr) | 2002-11-21 |
| CN1223991C (zh) | 2005-10-19 |
| US20040236572A1 (en) | 2004-11-25 |
| KR20040005965A (ko) | 2004-01-16 |
| FR2824978A1 (fr) | 2002-11-22 |
| IL158797A (en) | 2009-02-11 |
| ATE377244T1 (de) | 2007-11-15 |
| US7295968B2 (en) | 2007-11-13 |
| CN1520589A (zh) | 2004-08-11 |
| JP2004527797A (ja) | 2004-09-09 |
| EP1395981B1 (fr) | 2007-10-31 |
| DE60223246D1 (de) | 2007-12-13 |
| IL158797A0 (en) | 2004-05-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP1395981B1 (fr) | Dispositif et procede de traitement d'un signal audio. | |
| EP1593116B1 (fr) | Procédé pour le traitement numérique différencié de la voix et de la musique, le filtrage de bruit, la création d'effets spéciaux et dispositif pour la mise en oeuvre dudit procédé | |
| EP0002998B1 (fr) | Procédé de compression de données relatives au signal vocal et dispositif mettant en oeuvre ledit procédé | |
| EP1356461B1 (fr) | Procede et dispositif de reduction de bruit | |
| EP1016072B1 (fr) | Procede et dispositif de debruitage d'un signal de parole numerique | |
| EP1789956A1 (fr) | Procede de traitement d'un signal sonore bruite et dispositif pour la mise en ceuvre du procede | |
| EP0557166A1 (fr) | Procédé de réduction de bruit acoustique dans un signal de parole | |
| EP0998166A1 (fr) | Dispositif de traitement audio récepteur et procédé pour filtrer un signal utile et le restituer en présence de bruit ambiant | |
| FR2789823A1 (fr) | Filtres a sous-bandes unilaterales | |
| EP0884926B1 (fr) | Procédé et dispositif de traitement optimisé d'un signal perturbateur lors d'une prise de son | |
| FR2739481A1 (fr) | Appareil et procede d'elimination du bruit | |
| CA2939213A1 (fr) | Systemes, procedes et dispositifs de communication ayant une meilleure immunite au bruit | |
| EP1849157B1 (fr) | Procede de mesure de la gene due au bruit dans un signal audio | |
| EP0692883A1 (fr) | Procédé d'égalisation aveugle et son application à la reconnaissance de la parole | |
| EP2515300A1 (fr) | Procédé et système de réduction du bruit | |
| EP2126905B1 (fr) | Procédés et dispositifs d'encodage et décodage de signaux audio, signal audio encodé | |
| EP1103138B1 (fr) | Dispositif de traitement numerique a filtrage frequentiel et a complexite de calcul reduite | |
| EP1021805B1 (fr) | Procede et disposition de conditionnement d'un signal de parole numerique | |
| EP0989544A1 (fr) | Dispositif et procédé de filtrage d'un signal de parole, récepteur et système de communications téléphonique | |
| EP3828886B1 (fr) | Procede et systeme pour separer dans un flux audio la composante voix et la composante bruit | |
| FR3161060A1 (fr) | Personnalisation d’un réseau de neurones pour le rehaussement de la parole | |
| EP1155497A1 (fr) | Procede et systeme de traitement de signaux d'antenne | |
| WO1999027523A1 (fr) | Procede de reconstruction, apres debruitage, de signaux sonores | |
| EP0824798B1 (fr) | Filtrage adaptatif a sous-bandes | |
| WO2006077005A2 (fr) | Dispositif d'annulation d'echo acoustique, procede et programme d'ordinateur correspondants |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20031117 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
| AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
| GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
| RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: WAVECOM |
|
| GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
| GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
| AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: FRENCH |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
| REF | Corresponds to: |
Ref document number: 60223246 Country of ref document: DE Date of ref document: 20071213 Kind code of ref document: P |
|
| NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | ||
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080131 Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080211 |
|
| GBV | Gb: ep patent (uk) treated as always having been void in accordance with gb section 77(7)/1977 [no translation filed] | ||
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080331 |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: FD4D |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
| 26N | No opposition filed |
Effective date: 20080801 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080201 Ref country code: IE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20080530 Year of fee payment: 7 |
|
| BERE | Be: lapsed |
Owner name: WAVECOM Effective date: 20080531 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080531 |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080531 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080201 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080531 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080531 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20100129 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20090602 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080515 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071031 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080531 |