CA2982017A1 - Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation - Google Patents

Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation Download PDF

Info

Publication number
CA2982017A1
CA2982017A1 CA2982017A CA2982017A CA2982017A1 CA 2982017 A1 CA2982017 A1 CA 2982017A1 CA 2982017 A CA2982017 A CA 2982017A CA 2982017 A CA2982017 A CA 2982017A CA 2982017 A1 CA2982017 A1 CA 2982017A1
Authority
CA
Canada
Prior art keywords
audio signals
mixture
time
domain
decoding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA2982017A
Other languages
English (en)
French (fr)
Inventor
Cagdas Bilen
Alexey Ozerov
Patrick Perez
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
InterDigital CE Patent Holdings SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from EP15306144.5A external-priority patent/EP3115992A1/en
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Publication of CA2982017A1 publication Critical patent/CA2982017A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M1/00Analogue/digital conversion; Digital/analogue conversion
    • H03M1/12Analogue/digital converters
    • H03M1/124Sampling or signal conditioning arrangements specially adapted for A/D converters
    • H03M1/1245Details of sampling arrangements or methods
    • H03M1/1265Non-uniform sampling
    • H03M1/128Non-uniform sampling at random intervals, e.g. digital alias free signal processing [DASP]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
CA2982017A 2015-04-10 2016-03-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation Abandoned CA2982017A1 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
EP15305536.3 2015-04-10
EP15305536 2015-04-10
EP15306144.5 2015-07-10
EP15306144.5A EP3115992A1 (en) 2015-07-10 2015-07-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
EP15306425 2015-09-16
EP15306425.8 2015-09-16
PCT/EP2016/055135 WO2016162165A1 (en) 2015-04-10 2016-03-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation

Publications (1)

Publication Number Publication Date
CA2982017A1 true CA2982017A1 (en) 2016-10-13

Family

ID=55521726

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2982017A Abandoned CA2982017A1 (en) 2015-04-10 2016-03-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation

Country Status (10)

Country Link
US (1) US20180082693A1 (https=)
EP (1) EP3281196A1 (https=)
JP (1) JP2018513996A (https=)
KR (1) KR20170134467A (https=)
CN (1) CN107636756A (https=)
BR (1) BR112017021865A2 (https=)
CA (1) CA2982017A1 (https=)
MX (1) MX2017012957A (https=)
RU (1) RU2716911C2 (https=)
WO (1) WO2016162165A1 (https=)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112115918A (zh) * 2020-09-29 2020-12-22 西北工业大学 一种信号稀疏表示及重构的时频原子字典及信号处理方法
CN113314110B (zh) * 2021-04-25 2022-12-02 天津大学 一种基于量子测量与酉变换技术的语言模型及构建方法
KR20220151953A (ko) * 2021-05-07 2022-11-15 한국전자통신연구원 부가 정보를 이용한 오디오 신호의 부호화 및 복호화 방법과 그 방법을 수행하는 부호화기 및 복호화기
CN115116465A (zh) * 2022-05-23 2022-09-27 佛山智优人科技有限公司 一种声源分离的方法及声源分离装置
CN120452467B (zh) * 2025-07-14 2025-09-16 国网福建省电力有限公司信息通信分公司 一种基于Codec的语音与背景音分离方法、装置、设备及介质

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3622365B2 (ja) * 1996-09-26 2005-02-23 ヤマハ株式会社 音声符号化伝送方式
AU754877B2 (en) * 1998-12-28 2002-11-28 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Method and devices for coding or decoding an audio signal or bit stream
EP1852851A1 (en) * 2004-04-01 2007-11-07 Beijing Media Works Co., Ltd An enhanced audio encoding/decoding device and method
US7761303B2 (en) * 2005-08-30 2010-07-20 Lg Electronics Inc. Slot position coding of TTT syntax of spatial audio coding application
US7873511B2 (en) * 2006-06-30 2011-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
CA2645915C (en) * 2007-02-14 2012-10-23 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
JP4932917B2 (ja) * 2009-04-03 2012-05-16 株式会社エヌ・ティ・ティ・ドコモ 音声復号装置、音声復号方法、及び音声復号プログラム
CN101742313B (zh) * 2009-12-10 2011-09-07 北京邮电大学 基于压缩感知技术的分布式信源编码的方法
US8489403B1 (en) * 2010-08-25 2013-07-16 Foundation For Research and Technology—Institute of Computer Science ‘FORTH-ICS’ Apparatuses, methods and systems for sparse sinusoidal audio processing and transmission
US8390490B2 (en) * 2011-05-12 2013-03-05 Texas Instruments Incorporated Compressive sensing analog-to-digital converters
EP2688066A1 (en) * 2012-07-16 2014-01-22 Thomson Licensing Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction
US20150312663A1 (en) * 2012-09-19 2015-10-29 Analog Devices, Inc. Source separation using a circular model
US9715880B2 (en) * 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
RU2625444C2 (ru) * 2013-04-05 2017-07-13 Долби Интернэшнл Аб Система обработки аудио
US9576583B1 (en) * 2014-12-01 2017-02-21 Cedar Audio Ltd Restoring audio signals with mask and latent variables
US20180048917A1 (en) * 2015-02-23 2018-02-15 Board Of Regents, The University Of Texas System Systems, apparatus, and methods for bit level representation for data processing and analytics

Also Published As

Publication number Publication date
US20180082693A1 (en) 2018-03-22
RU2017134722A3 (https=) 2019-10-08
RU2716911C2 (ru) 2020-03-17
JP2018513996A (ja) 2018-05-31
BR112017021865A2 (pt) 2018-07-10
KR20170134467A (ko) 2017-12-06
CN107636756A (zh) 2018-01-26
WO2016162165A1 (en) 2016-10-13
MX2017012957A (es) 2018-02-01
RU2017134722A (ru) 2019-04-04
EP3281196A1 (en) 2018-02-14

Similar Documents

Publication Publication Date Title
US8515767B2 (en) Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs
US9514759B2 (en) Method and apparatus for performing an adaptive down- and up-mixing of a multi-channel audio signal
JP4842265B2 (ja) 信号の状況(コンテキスト)ベース符号化及び復号化
US9774975B2 (en) Method and apparatus for decoding a compressed HOA representation, and method and apparatus for encoding a compressed HOA representation
CA2982017A1 (en) Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
JP2017516125A (ja) エンコーダ、デコーダ並びに符号化及び復号方法
RU2678136C1 (ru) Устройство и способ обработки кодированного аудиосигнала
US8914280B2 (en) Method and apparatus for encoding/decoding speech signal
CA3017405C (en) Encoding apparatus for processing an input signal and decoding apparatus for processing an encoded signal
CN105659320B (zh) 音频编码器和解码器
JPWO2018203471A1 (ja) 符号化装置及び符号化方法
WO2011097963A1 (zh) 编码方法、解码方法、编码器和解码器
Rohlfing et al. NMF-based informed source separation
EP2023339A1 (en) A low-delay audio coder
Rohlfing et al. Very low bitrate spatial audio coding with dimensionality reduction
Bilen et al. Compressive sampling-based informed source separation
US11176954B2 (en) Encoding and decoding of multichannel or stereo audio signals
EP3115992A1 (en) Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
CA2914418C (en) Apparatus and method for audio signal envelope encoding, processing and decoding by splitting the audio signal envelope employing distribution quantization and coding
AU2014280258B9 (en) Apparatus and method for audio signal envelope encoding, processing and decoding by modelling a cumulative sum representation employing distribution quantization and coding
KR20230023560A (ko) 부호화 방법 및 복호화 방법, 상기 방법을 수행하는 부호화기 및 복호화기
Bläser et al. Adaptive coding of non-negative factorization parameters with application to informed source separation
Ramirez Intra-predictive switched split vector quantization of speech spectra
JP7318645B2 (ja) 符号化装置および方法、復号装置および方法、並びにプログラム
Yang et al. Multi-stage encoding scheme for multiple audio objects using compressed sensing

Legal Events

Date Code Title Description
FZDE Discontinued

Effective date: 20210910