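CN107636756A - Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation - Google Patents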

Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation

Info

Publication number
CN107636756A
Authority
CN
China
Prior art keywords
audio signals
source
encoding
decoding
estimated
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201680028431.6A
Other languages
English (en)
Chinese (zh)
Inventor
C. Bilen
A. Ozerov
P. Pérez
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
InterDigital CE Patent Holdings SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from EP15306144.5A (EP3115992A1)
Application filed by Thomson Licensing SAS
Publication of CN107636756A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 Voice signal separating
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M1/00 Analogue/digital conversion; Digital/analogue conversion
    • H03M1/12 Analogue/digital converters
    • H03M1/124 Sampling or signal conditioning arrangements specially adapted for A/D converters
    • H03M1/1245 Details of sampling arrangements or methods
    • H03M1/1265 Non-uniform sampling
    • H03M1/128 Non-uniform sampling at random intervals, e.g. digital alias free signal processing [DASP]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
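CN201680028431.6A 2015-04-10 2016-03-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation Pending CN107636756A (zh)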

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
EP15305536.3 2015-04-10
EP15305536 2015-04-10
EP15306144.5 2015-07-10
EP15306144.5A EP3115992A1 (en) 2015-07-10 2015-07-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
EP15306425 2015-09-16
EP15306425.8 2015-09-16
PCT/EP2016/055135 WO2016162165A1 (en) 2015-04-10 2016-03-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation

Publications (1)

Publication Number Publication Date
CN107636756A (zh) 2018-01-26

Family

ID=55521726

Family Applications (1)

Application Number Title Priority Date Filing Date
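CN201680028431.6A Pending CN107636756A (zh) Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation 2015-04-10 2016-03-10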

Country Status (10)

Country Link
US (1) US20180082693A1
EP (1) EP3281196A1
JP (1) JP2018513996A
KR (1) KR20170134467A
CN (1) CN107636756A
BR (1) BR112017021865A2
CA (1) CA2982017A1
MX (1) MX2017012957A
RU (1) RU2716911C2
WO (1) WO2016162165A1

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
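CN112115918A (zh) * 2020-09-29 2020-12-22 Northwestern Polytechnical University Time-frequency atom dictionary for sparse signal representation and reconstruction, and signal processing method
CN113314110B (zh) * 2021-04-25 2022-12-02 Tianjin University Language model based on quantum measurement and unitary transformation techniques, and construction method thereof
CN115116465A (zh) * 2022-05-23 2022-09-27 Foshan Zhiyouren Technology Co., Ltd. (佛山智优人科技有限公司) Sound source separation method and sound source separation device
CN120452467B (zh) * 2025-07-14 2025-09-16 State Grid Fujian Electric Power Co., Ltd. Information and Communication Branch Codec-based method, apparatus, device, and medium for separating speech from background sound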

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
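CN101501759A (zh) * 2006-06-30 2009-08-05 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
CN101742313A (zh) * 2009-12-10 2010-06-16 Beijing University of Posts and Telecommunications Distributed source coding method based on compressed sensing technology
US20110044458A1 (en) * 2005-08-30 2011-02-24 Lg Electronics, Inc. Slot position coding of residual signals of spatial audio coding application
CN102379004A (zh) * 2009-04-03 2012-03-14 NTT Docomo, Inc. Speech encoding device, speech decoding device, speech encoding method, speech decoding method, speech encoding program, and speech decoding program
WO2014047025A1 (en) * 2012-09-19 2014-03-27 Analog Devices, Inc. Source separation using a circular model
WO2014128275A1 (en) * 2013-02-21 2014-08-28 Dolby International Ab Methods for parametric multi-channel encoding
WO2014161996A2 (en) * 2013-04-05 2014-10-09 Dolby International Ab Audio processing system
CN104428833A (zh) * 2012-07-16 2015-03-18 Thomson Licensing Method and device for encoding multi-channel HOA audio signals for noise reduction, and method and device for decoding multi-channel HOA audio signals for noise reduction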

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3622365B2 (ja) * 1996-09-26 2005-02-23 Yamaha Corporation Speech coding transmission system
AU754877B2 (en) * 1998-12-28 2002-11-28 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Method and devices for coding or decoding an audio signal or bit stream
EP1852851A1 (en) * 2004-04-01 2007-11-07 Beijing Media Works Co., Ltd An enhanced audio encoding/decoding device and method
CA2645915C (en) * 2007-02-14 2012-10-23 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
US8489403B1 (en) * 2010-08-25 2013-07-16 Foundation For Research and Technology—Institute of Computer Science ‘FORTH-ICS’ Apparatuses, methods and systems for sparse sinusoidal audio processing and transmission
US8390490B2 (en) * 2011-05-12 2013-03-05 Texas Instruments Incorporated Compressive sensing analog-to-digital converters
US9576583B1 (en) * 2014-12-01 2017-02-21 Cedar Audio Ltd Restoring audio signals with mask and latent variables
US20180048917A1 (en) * 2015-02-23 2018-02-15 Board Of Regents, The University Of Texas System Systems, apparatus, and methods for bit level representation for data processing and analytics

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
ALEXEY OZEROV et al.: "Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach", IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING *
ANTHONY GRIFFIN et al.: "Single-channel and Multi-channel Sinusoidal Audio Coding Using Compressed Sensing", IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING *
JANG G J et al.: "Single-channel signal separation using time-domain basis functions", IEEE SIGNAL PROCESSING LETTERS *
JASON LASKA et al.: "Random Sampling for Analog Conversion of Wideband", DESIGN, APPLICATIONS, INTEGRATION AND SOFTWARE, 2006 *
LIUTKUS A et al.: "Informed source separation using latent components", INFORMED SOURCE SEPARATION USING LATENT COMPONENTS *
MATHIEU PARVAIX et al.: "A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor", IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING *
TUOMAS VIRTANEN et al.: "Compositional Models for Audio Processing: Uncovering the structure of sound mixtures", IEEE SIGNAL PROCESSING MAGAZINE *
SHANG LI: "Research on Sparse Coding Algorithms and Their Applications", China Doctoral Dissertations Full-text Database *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220358940A1 (en) * 2021-05-07 2022-11-10 Electronics And Telecommunications Research Institute Methods of encoding and decoding audio signal using side information, and encoder and decoder for performing the methods
US11783844B2 (en) * 2021-05-07 2023-10-10 Electronics And Telecommunications Research Institute Methods of encoding and decoding audio signal using side information, and encoder and decoder for performing the methods

Also Published As

Publication number Publication date
US20180082693A1 (en) 2018-03-22
RU2017134722A3 2019-10-08
RU2716911C2 (ru) 2020-03-17
JP2018513996A (ja) 2018-05-31
BR112017021865A2 (pt) 2018-07-10
KR20170134467A (ko) 2017-12-06
CA2982017A1 (en) 2016-10-13
WO2016162165A1 (en) 2016-10-13
MX2017012957A (es) 2018-02-01
RU2017134722A (ru) 2019-04-04
EP3281196A1 (en) 2018-02-14

Similar Documents

Publication Publication Date Title
JP4961042B2 (ja) Rounding noise shaping for integer transform based encoding and decoding
CN106415716B (zh) Encoder, decoder, and methods for encoding and decoding
Ozerov et al. Coding-based informed source separation: Nonnegative tensor factorization approach
US9978379B2 (en) Multi-channel encoding and/or decoding using non-negative tensor factorization
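CN113223540B (zh) Method, device, and memory for use in a sound signal encoder and decoder
CN101849258A (zh) Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs
CN107636756A (zh) Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation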
EP2814028B1 (en) Audio and speech coding device, audio and speech decoding device, method for coding audio and speech, and method for decoding audio and speech
JPWO2007088853A1 (ja) Speech encoding device, speech decoding device, speech encoding system, speech encoding method, and speech decoding method
CA3017405C (en) Encoding apparatus for processing an input signal and decoding apparatus for processing an encoded signal
RU2017117896A (ru) Encoding and decoding of audio signals
Bilen et al. Solving time-domain audio inverse problems using nonnegative tensor factorization
CN107004422B (zh) Encoding device, decoding device, methods thereof, and programs
US10593342B2 (en) Method and apparatus for sinusoidal encoding and decoding
Rohlfing et al. NMF-based informed source separation
Rohlfing et al. Very low bitrate spatial audio coding with dimensionality reduction
CN110709925B (zh) Method and apparatus for audio encoding or decoding
Bilen et al. Compressive sampling-based informed source separation
AU2014280258B9 (en) Apparatus and method for audio signal envelope encoding, processing and decoding by modelling a cumulative sum representation employing distribution quantization and coding
EP3008725B1 (en) Apparatus and method for audio signal envelope encoding, processing and decoding by splitting the audio signal envelope employing distribution quantization and coding
KR20230023560A (ko) Encoding method and decoding method, and encoder and decoder performing the methods
JP2008519308A5
EP3115992A1 (en) Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
JP5734519B2 (ja) Encoding method, encoding device, decoding method, decoding device, program, and recording medium
Gao et al. Study on joint speech encoding technology based on compressed sensing

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20190604

Address after: France

Applicant after: InterDigital CE Patent Holdings

Address before: Issy-les-Moulineaux, France

Applicant before: Thomson Licensing SA

WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180126
