CN1408110A - 基于正弦模型的音频信号编码 - Google Patents
基于正弦模型的音频信号编码 Download PDFInfo
- Publication number
- CN1408110A CN1408110A CN01805964A CN01805964A CN1408110A CN 1408110 A CN1408110 A CN 1408110A CN 01805964 A CN01805964 A CN 01805964A CN 01805964 A CN01805964 A CN 01805964A CN 1408110 A CN1408110 A CN 1408110A
- Authority
- CN
- China
- Prior art keywords
- function
- coding method
- input signal
- signal
- norm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000005236 sound signal Effects 0.000 title description 5
- 238000000034 method Methods 0.000 claims abstract description 56
- 230000008859 change Effects 0.000 claims description 6
- 238000009432 framing Methods 0.000 claims description 4
- 238000009795 derivation Methods 0.000 claims description 2
- 238000012216 screening Methods 0.000 claims description 2
- 238000006243 chemical reaction Methods 0.000 claims 1
- 230000006870 function Effects 0.000 abstract description 50
- 238000003786 synthesis reaction Methods 0.000 abstract description 6
- 230000000873 masking effect Effects 0.000 abstract 1
- 230000008569 process Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 230000008878 coupling Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 2
- 238000012804 iterative process Methods 0.000 description 2
- 238000013016 damping Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000000452 restraining effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
- G10L2019/0014—Selection criteria for distances
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (19)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00203856.0 | 2000-11-03 | ||
EP00203856 | 2000-11-03 | ||
EP01201685.3 | 2001-05-08 | ||
EP01201685 | 2001-05-08 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1408110A true CN1408110A (zh) | 2003-04-02 |
CN1216366C CN1216366C (zh) | 2005-08-24 |
Family
ID=26072835
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN018059643A Expired - Fee Related CN1216366C (zh) | 2000-11-03 | 2001-10-31 | 基于正弦模型的音频信号编码 |
Country Status (8)
Country | Link |
---|---|
US (1) | US7120587B2 (zh) |
EP (1) | EP1338001B1 (zh) |
JP (1) | JP2004513392A (zh) |
KR (1) | KR20020070373A (zh) |
CN (1) | CN1216366C (zh) |
AT (1) | ATE354850T1 (zh) |
DE (1) | DE60126811T2 (zh) |
WO (1) | WO2002037476A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1934619B (zh) * | 2004-03-17 | 2010-05-26 | 皇家飞利浦电子股份有限公司 | 音频编码 |
CN101563848B (zh) * | 2006-12-29 | 2013-02-13 | 三星电子株式会社 | 音频编码与解码装置及其方法 |
CN101606193B (zh) * | 2007-02-12 | 2013-11-13 | 三星电子株式会社 | 音频编码和解码装置和方法 |
CN103021416B (zh) * | 2011-09-26 | 2017-04-26 | 索尼公司 | 音频编码装置和方法、音频解码装置和方法 |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7079986B2 (en) * | 2003-12-31 | 2006-07-18 | Sieracki Jeffrey M | Greedy adaptive signature discrimination system and method |
US8478539B2 (en) | 2003-12-31 | 2013-07-02 | Jeffrey M. Sieracki | System and method for neurological activity signature determination, discrimination, and detection |
US8271200B2 (en) * | 2003-12-31 | 2012-09-18 | Sieracki Jeffrey M | System and method for acoustic signature extraction, detection, discrimination, and localization |
US7751572B2 (en) | 2005-04-15 | 2010-07-06 | Dolby International Ab | Adaptive residual audio coding |
KR100788706B1 (ko) * | 2006-11-28 | 2007-12-26 | 삼성전자주식회사 | 광대역 음성 신호의 부호화/복호화 방법 |
KR101346771B1 (ko) * | 2007-08-16 | 2013-12-31 | 삼성전자주식회사 | 심리 음향 모델에 따른 마스킹 값보다 작은 정현파 신호를효율적으로 인코딩하는 방법 및 장치, 그리고 인코딩된오디오 신호를 디코딩하는 방법 및 장치 |
KR101441898B1 (ko) | 2008-02-01 | 2014-09-23 | 삼성전자주식회사 | 주파수 부호화 방법 및 장치와 주파수 복호화 방법 및 장치 |
US8805083B1 (en) | 2010-03-21 | 2014-08-12 | Jeffrey M. Sieracki | System and method for discriminating constituents of image by complex spectral signature extraction |
US9886945B1 (en) | 2011-07-03 | 2018-02-06 | Reality Analytics, Inc. | System and method for taxonomically distinguishing sample data captured from biota sources |
US9691395B1 (en) | 2011-12-31 | 2017-06-27 | Reality Analytics, Inc. | System and method for taxonomically distinguishing unconstrained signal data segments |
US9558762B1 (en) | 2011-07-03 | 2017-01-31 | Reality Analytics, Inc. | System and method for distinguishing source from unconstrained acoustic signals emitted thereby in context agnostic manner |
JPWO2018198454A1 (ja) * | 2017-04-28 | 2019-06-27 | ソニー株式会社 | 情報処理装置、および情報処理方法 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1062963C (zh) * | 1990-04-12 | 2001-03-07 | 多尔拜实验特许公司 | 用于产生高质量声音信号的解码器和编码器 |
JP3446216B2 (ja) * | 1992-03-06 | 2003-09-16 | ソニー株式会社 | 音声信号処理方法 |
US5651090A (en) * | 1994-05-06 | 1997-07-22 | Nippon Telegraph And Telephone Corporation | Coding method and coder for coding input signals of plural channels using vector quantization, and decoding method and decoder therefor |
JP3707153B2 (ja) * | 1996-09-24 | 2005-10-19 | ソニー株式会社 | ベクトル量子化方法、音声符号化方法及び装置 |
FI973873A (fi) * | 1997-10-02 | 1999-04-03 | Nokia Mobile Phones Ltd | Puhekoodaus |
-
2001
- 2001-10-31 CN CN018059643A patent/CN1216366C/zh not_active Expired - Fee Related
- 2001-10-31 DE DE60126811T patent/DE60126811T2/de not_active Expired - Fee Related
- 2001-10-31 AT AT01980541T patent/ATE354850T1/de not_active IP Right Cessation
- 2001-10-31 US US10/169,345 patent/US7120587B2/en not_active Expired - Fee Related
- 2001-10-31 WO PCT/EP2001/012721 patent/WO2002037476A1/en active IP Right Grant
- 2001-10-31 JP JP2002540143A patent/JP2004513392A/ja not_active Withdrawn
- 2001-10-31 EP EP01980541A patent/EP1338001B1/en not_active Expired - Lifetime
- 2001-10-31 KR KR1020027008652A patent/KR20020070373A/ko not_active Application Discontinuation
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1934619B (zh) * | 2004-03-17 | 2010-05-26 | 皇家飞利浦电子股份有限公司 | 音频编码 |
CN101563848B (zh) * | 2006-12-29 | 2013-02-13 | 三星电子株式会社 | 音频编码与解码装置及其方法 |
US8725519B2 (en) | 2006-12-29 | 2014-05-13 | Samsung Electronics Co., Ltd. | Audio encoding and decoding apparatus and method thereof |
CN101606193B (zh) * | 2007-02-12 | 2013-11-13 | 三星电子株式会社 | 音频编码和解码装置和方法 |
CN103021416B (zh) * | 2011-09-26 | 2017-04-26 | 索尼公司 | 音频编码装置和方法、音频解码装置和方法 |
Also Published As
Publication number | Publication date |
---|---|
US7120587B2 (en) | 2006-10-10 |
EP1338001A1 (en) | 2003-08-27 |
DE60126811T2 (de) | 2007-12-06 |
JP2004513392A (ja) | 2004-04-30 |
US20030009332A1 (en) | 2003-01-09 |
CN1216366C (zh) | 2005-08-24 |
KR20020070373A (ko) | 2002-09-06 |
WO2002037476A1 (en) | 2002-05-10 |
ATE354850T1 (de) | 2007-03-15 |
DE60126811D1 (de) | 2007-04-05 |
EP1338001B1 (en) | 2007-02-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10609501B2 (en) | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field | |
CN1408110A (zh) | 基于正弦模型的音频信号编码 | |
US7680656B2 (en) | Multi-sensory speech enhancement using a speech-state model | |
US9037454B2 (en) | Efficient coding of overcomplete representations of audio using the modulated complex lapped transform (MCLT) | |
JP6574287B2 (ja) | ピラミッドベクトル量子化器形状サーチ | |
TWI657434B (zh) | 解碼壓縮高階保真立體音響表示之方法及裝置,及編碼壓縮高階保真立體音響表示之方法及裝置 | |
US20080219466A1 (en) | Low bit-rate universal audio coder | |
KR20070051857A (ko) | 스케일러블 오디오 코딩 | |
Goodwin | The STFT, sinusoidal models, and speech modification | |
US20180358025A1 (en) | Method and apparatus for audio object coding based on informed source separation | |
Nguyen et al. | Fregrad: Lightweight and Fast Frequency-Aware Diffusion Vocoder | |
CN114333891B (zh) | 一种语音处理方法、装置、电子设备和可读介质 | |
CN111326166B (zh) | 语音处理方法及装置、计算机可读存储介质、电子设备 | |
RU2660633C2 (ru) | Устройство и способ для кодирования, обработки и декодирования огибающей аудиосигнала путем разделения огибающей аудиосигнала с использованием квантования и кодирования распределения | |
US20070129939A1 (en) | Method for scale-factor estimation in an audio encoder | |
RU2823441C9 (ru) | Способ и устройство для сжатия и восстановления представления системы амбисоник высшего порядка для звукового поля | |
Vafin et al. | Rate-distortion optimized quantization in multistage audio coding | |
Petrovsky et al. | Audio coding with a masking threshold adapted wavelet packet based on run-time reconfigurable processor architecture | |
RU2823441C2 (ru) | Способ и устройство для сжатия и восстановления представления системы амбисоник высшего порядка для звукового поля | |
Christensen et al. | Amplitude modulated sinusoidal signal decomposition for audio coding | |
Chen | Parametric speech coding using short-time amplitude spectrum | |
Zahedi et al. | On Perceptual Audio Compression with Side Information at the Decoder | |
Pena et al. | Realtime implementations of MPEG-2 and MPEG-4 natural audio coders | |
Scanio | A Prony Speech Processing Technique |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: IPG ELECTRONICS 503 CO., LTD. Free format text: FORMER OWNER: ROYAL PHILIPS ELECTRONICS CO., LTD. Effective date: 20090828 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20090828 Address after: British Channel Islands Patentee after: Koninkl Philips Electronics NV Address before: Holland Ian Deho Finn Patentee before: Koninklike Philips Electronics N. V. |
|
ASS | Succession or assignment of patent right |
Owner name: PENDRAGON WIRELESS CO., LTD. Free format text: FORMER OWNER: IPG ELECTRONICS 503 LTD. Effective date: 20130110 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20130110 Address after: Washington State Patentee after: Pendragon wireless limited liability company Address before: British Channel Islands Patentee before: Koninkl Philips Electronics NV |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20050824 Termination date: 20141031 |
|
EXPY | Termination of patent right or utility model |