CA2982017A1 - Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
Info
- Publication number
- CA2982017A1
- Authority
- CA
- Canada
- Prior art keywords
- audio signals
- mixture
- time
- domain
- decoding
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Concepts
| ID | Concept | Category | Score | Sections | Count |
|---|---|---|---|---|---|
| 230000005236 | sound signal | Effects | 0.000 | title, claims, abstract, description | 59 |
| 239000000203 | mixture | Substances | 0.000 | title, claims, abstract, description | 46 |
| 238000000034 | method | Methods | 0.000 | title, claims, abstract, description | 43 |
| 238000000926 | separation method | Methods | 0.000 | title, description | 10 |
| 238000005070 | sampling | Methods | 0.000 | claims, abstract, description | 19 |
| 238000001228 | spectrum | Methods | 0.000 | claims, description | 12 |
| 238000003860 | storage | Methods | 0.000 | claims, description | 7 |
| 238000013459 | approach | Methods | 0.000 | description | 15 |
| 230000008901 | benefit | Effects | 0.000 | description | 15 |
| 238000013139 | quantization | Methods | 0.000 | description | 14 |
| 238000012545 | processing | Methods | 0.000 | description | 9 |
| 238000009826 | distribution | Methods | 0.000 | description | 8 |
| 239000011159 | matrix material | Substances | 0.000 | description | 7 |
| 239000013598 | vector | Substances | 0.000 | description | 6 |
| 238000001914 | filtration | Methods | 0.000 | description | 5 |
| 230000005540 | biological transmission | Effects | 0.000 | description | 3 |
| 238000007476 | Maximum Likelihood | Methods | 0.000 | description | 2 |
| 238000013461 | design | Methods | 0.000 | description | 2 |
| 230000006870 | function | Effects | 0.000 | description | 2 |
| 238000004519 | manufacturing process | Methods | 0.000 | description | 2 |
| 238000011084 | recovery | Methods | 0.000 | description | 2 |
| 238000006467 | substitution reaction | Methods | 0.000 | description | 2 |
| 230000002123 | temporal effect | Effects | 0.000 | description | 2 |
| HCUOEKSZWPGJIM-YBRHCDHNSA-N | (e,2e)-2-hydroxyimino-6-methoxy-4-methyl-5-nitrohex-3-enamide (`COCC([N+]([O-])=O)\C(C)=C\C(=N/O)\C(N)=O`) | Chemical compound | 0.000 | description | 1 |
| 101100401106 | Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) met-7 gene | Proteins | 0.000 | description | 1 |
| 230000009286 | beneficial effect | Effects | 0.000 | description | 1 |
| 238000004364 | calculation method | Methods | 0.000 | description | 1 |
| 238000007906 | compression | Methods | 0.000 | description | 1 |
| 230000006835 | compression | Effects | 0.000 | description | 1 |
| 230000021615 | conjugation | Effects | 0.000 | description | 1 |
| 238000000354 | decomposition reaction | Methods | 0.000 | description | 1 |
| 238000001514 | detection method | Methods | 0.000 | description | 1 |
| 238000011156 | evaluation | Methods | 0.000 | description | 1 |
| 238000009472 | formulation | Methods | 0.000 | description | 1 |
| 238000005259 | measurement | Methods | 0.000 | description | 1 |
| 238000012986 | modification | Methods | 0.000 | description | 1 |
| 230000004048 | modification | Effects | 0.000 | description | 1 |
| 230000002040 | relaxant effect | Effects | 0.000 | description | 1 |
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M1/00—Analogue/digital conversion; Digital/analogue conversion
- H03M1/12—Analogue/digital converters
- H03M1/124—Sampling or signal conditioning arrangements specially adapted for A/D converters
- H03M1/1245—Details of sampling arrangements or methods
- H03M1/1265—Non-uniform sampling
- H03M1/128—Non-uniform sampling at random intervals, e.g. digital alias free signal processing [DASP]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP15305536.3 | 2015-04-10 | ||
| EP15305536 | 2015-04-10 | ||
| EP15306144.5A EP3115992A1 (en) | 2015-07-10 | 2015-07-10 | Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation |
| EP15306144.5 | 2015-07-10 | ||
| EP15306425 | 2015-09-16 | ||
| EP15306425.8 | 2015-09-16 | ||
| PCT/EP2016/055135 WO2016162165A1 (en) | 2015-04-10 | 2016-03-10 | Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA2982017A1 (en) | 2016-10-13 |
Family
ID=55521726
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA2982017A (Abandoned) CA2982017A1 (en) | 2015-04-10 | 2016-03-10 | Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US20180082693A1 |
| EP (1) | EP3281196A1 |
| JP (1) | JP2018513996A |
| KR (1) | KR20170134467A |
| CN (1) | CN107636756A |
| BR (1) | BR112017021865A2 |
| CA (1) | CA2982017A1 |
| MX (1) | MX2017012957A |
| RU (1) | RU2716911C2 |
| WO (1) | WO2016162165A1 |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
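| CN112115918A (zh) * | 2020-09-29 | 2020-12-22 | 西北工业大学 | Time-frequency atom dictionary for sparse signal representation and reconstruction, and signal processing method |
| CN113314110B (zh) * | 2021-04-25 | 2022-12-02 | 天津大学 | Language model based on quantum measurement and unitary transformation techniques, and construction method |
| KR20220151953A (ko) * | 2021-05-07 | 2022-11-15 | 한국전자통신연구원 | Method for encoding and decoding an audio signal using side information, and encoder and decoder performing the method |
| CN115116465A (zh) * | 2022-05-23 | 2022-09-27 | 佛山智优人科技有限公司 | Sound source separation method and sound source separation device |
| CN120452467B (zh) * | 2025-07-14 | 2025-09-16 | 国网福建省电力有限公司信息通信分公司 | Codec-based method, apparatus, device, and medium for separating speech from background sound |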
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3622365B2 (ja) * | 1996-09-26 | 2005-02-23 | ヤマハ株式会社 | Speech coding transmission system |
| JP3580777B2 (ja) * | 1998-12-28 | 2004-10-27 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Method and apparatus for encoding or decoding an audio signal or bitstream |
| WO2005096274A1 (en) * | 2004-04-01 | 2005-10-13 | Beijing Media Works Co., Ltd | An enhanced audio encoding/decoding device and method |
| AU2006285538B2 (en) * | 2005-08-30 | 2011-03-24 | Lg Electronics Inc. | Apparatus for encoding and decoding audio signal and method thereof |
| US7873511B2 (en) * | 2006-06-30 | 2011-01-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic |
| AU2008215232B2 (en) * | 2007-02-14 | 2010-02-25 | Lg Electronics Inc. | Methods and apparatuses for encoding and decoding object-based audio signals |
| JP4932917B2 (ja) * | 2009-04-03 | 2012-05-16 | 株式会社エヌ・ティ・ティ・ドコモ | Speech decoding device, speech decoding method, and speech decoding program |
| CN101742313B (zh) * | 2009-12-10 | 2011-09-07 | 北京邮电大学 | Distributed source coding method based on compressed sensing |
| US8489403B1 (en) * | 2010-08-25 | 2013-07-16 | Foundation For Research and Technology—Institute of Computer Science ‘FORTH-ICS’ | Apparatuses, methods and systems for sparse sinusoidal audio processing and transmission |
| US8390490B2 (en) * | 2011-05-12 | 2013-03-05 | Texas Instruments Incorporated | Compressive sensing analog-to-digital converters |
| EP2688066A1 (en) * | 2012-07-16 | 2014-01-22 | Thomson Licensing | Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction |
| WO2014047025A1 (en) * | 2012-09-19 | 2014-03-27 | Analog Devices, Inc. | Source separation using a circular model |
| WO2014128275A1 (en) * | 2013-02-21 | 2014-08-28 | Dolby International Ab | Methods for parametric multi-channel encoding |
| KR101717006B1 (ko) * | 2013-04-05 | 2017-03-15 | 돌비 인터네셔널 에이비 | Audio processing system |
| US9576583B1 (en) * | 2014-12-01 | 2017-02-21 | Cedar Audio Ltd | Restoring audio signals with mask and latent variables |
| WO2016137871A1 (en) * | 2015-02-23 | 2016-09-01 | Metzler Richard E S Lister | Systems, apparatus, and methods for bit level representation for data processing and analytics |
-
2016
- 2016-03-10 EP EP16709072.9A patent/EP3281196A1/en not_active Withdrawn
- 2016-03-10 US US15/564,633 patent/US20180082693A1/en not_active Abandoned
- 2016-03-10 MX MX2017012957A patent/MX2017012957A/es unknown
- 2016-03-10 WO PCT/EP2016/055135 patent/WO2016162165A1/en not_active Ceased
- 2016-03-10 CA CA2982017A patent/CA2982017A1/en not_active Abandoned
- 2016-03-10 RU RU2017134722A patent/RU2716911C2/ru not_active IP Right Cessation
- 2016-03-10 BR BR112017021865A patent/BR112017021865A2/pt not_active Application Discontinuation
- 2016-03-10 JP JP2017552843A patent/JP2018513996A/ja not_active Ceased
- 2016-03-10 KR KR1020177028242A patent/KR20170134467A/ko not_active Withdrawn
- 2016-03-10 CN CN201680028431.6A patent/CN107636756A/zh active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| MX2017012957A (es) | 2018-02-01 |
| BR112017021865A2 (pt) | 2018-07-10 |
| RU2716911C2 (ru) | 2020-03-17 |
| RU2017134722A3 | 2019-10-08 |
| RU2017134722A (ru) | 2019-04-04 |
| KR20170134467A (ko) | 2017-12-06 |
| WO2016162165A1 (en) | 2016-10-13 |
| US20180082693A1 (en) | 2018-03-22 |
| CN107636756A (zh) | 2018-01-26 |
| JP2018513996A (ja) | 2018-05-31 |
| EP3281196A1 (en) | 2018-02-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8515767B2 (en) | | Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs |
| US9514759B2 (en) | | Method and apparatus for performing an adaptive down- and up-mixing of a multi-channel audio signal |
| JP4842265B2 (ja) | | Context-based encoding and decoding of signals |
| US9774975B2 (en) | | Method and apparatus for decoding a compressed HOA representation, and method and apparatus for encoding a compressed HOA representation |
| Ozerov et al. | | Informed source separation: source coding meets source separation |
| CA2982017A1 (en) | | Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation |
| JP2017516125A (ja) | | Encoder, decoder, and encoding and decoding methods |
| US8914280B2 (en) | | Method and apparatus for encoding/decoding speech signal |
| US10460738B2 (en) | | Encoding apparatus for processing an input signal and decoding apparatus for processing an encoded signal |
| CN105659320B (zh) | | Audio encoder and decoder |
| RU2678136C1 (ru) | | Device and method for processing an encoded audio signal |
| JPWO2018203471A1 (ja) | | Encoding device and encoding method |
| JP3590071B2 (ja) | | Predictive split-matrix quantization of spectral parameters for efficient speech coding |
| WO2011097963A1 (zh) | | Encoding method, decoding method, encoder, and decoder |
| Rohlfing et al. | | NMF-based informed source separation |
| EP2023339A1 (en) | | A low-delay audio coder |
| US20180075863A1 (en) | | Method for encoding signals, method for separating signals in a mixture, corresponding computer program products, devices and bitstream |
| Rohlfing et al. | | Very low bitrate spatial audio coding with dimensionality reduction |
| US11176954B2 (en) | | Encoding and decoding of multichannel or stereo audio signals |
| Bilen et al. | | Compressive sampling-based informed source separation |
| EP3115992A1 (en) | | Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation |
| CA2914418C (en) | | Apparatus and method for audio signal envelope encoding, processing and decoding by splitting the audio signal envelope employing distribution quantization and coding |
| KR20230023560A (ko) | | Encoding method and decoding method, and encoder and decoder performing the methods |
| Yang et al. | | Multi-stage encoding scheme for multiple audio objects using compressed sensing |
| Bläser et al. | | Adaptive coding of non-negative factorization parameters with application to informed source separation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | FZDE | Discontinued | Effective date: 20210910 |