RU2716911C2 - Способ и устройство для кодирования множественных аудиосигналов и способ и устройство для декодирования смеси множественных аудиосигналов с улучшенным разделением - Google Patents

Способ и устройство для кодирования множественных аудиосигналов и способ и устройство для декодирования смеси множественных аудиосигналов с улучшенным разделением Download PDF

Info

Publication number
RU2716911C2
RU2716911C2 RU2017134722A RU2017134722A RU2716911C2 RU 2716911 C2 RU2716911 C2 RU 2716911C2 RU 2017134722 A RU2017134722 A RU 2017134722A RU 2017134722 A RU2017134722 A RU 2017134722A RU 2716911 C2 RU2716911 C2 RU 2716911C2
Authority
RU
Russia
Prior art keywords
audio signals
multiple audio
mixture
sources
additional information
Prior art date
Application number
RU2017134722A
Other languages
English (en)
Russian (ru)
Other versions
RU2017134722A (ru
RU2017134722A3 (enExample
Inventor
Джагдас БЫЛЕН
Алексей ОЗЕРОВ
Патрик ПЕРЕС
Original Assignee
Интердиджитал Се Пэйтент Холдингз
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from EP15306144.5A external-priority patent/EP3115992A1/en
Application filed by Интердиджитал Се Пэйтент Холдингз filed Critical Интердиджитал Се Пэйтент Холдингз
Publication of RU2017134722A publication Critical patent/RU2017134722A/ru
Publication of RU2017134722A3 publication Critical patent/RU2017134722A3/ru
Application granted granted Critical
Publication of RU2716911C2 publication Critical patent/RU2716911C2/ru

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M1/00Analogue/digital conversion; Digital/analogue conversion
    • H03M1/12Analogue/digital converters
    • H03M1/124Sampling or signal conditioning arrangements specially adapted for A/D converters
    • H03M1/1245Details of sampling arrangements or methods
    • H03M1/1265Non-uniform sampling
    • H03M1/128Non-uniform sampling at random intervals, e.g. digital alias free signal processing [DASP]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
RU2017134722A 2015-04-10 2016-03-10 Способ и устройство для кодирования множественных аудиосигналов и способ и устройство для декодирования смеси множественных аудиосигналов с улучшенным разделением RU2716911C2 (ru)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
EP15305536 2015-04-10
EP15305536.3 2015-04-10
EP15306144.5A EP3115992A1 (en) 2015-07-10 2015-07-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
EP15306144.5 2015-07-10
EP15306425 2015-09-16
EP15306425.8 2015-09-16
PCT/EP2016/055135 WO2016162165A1 (en) 2015-04-10 2016-03-10 Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation

Publications (3)

Publication Number Publication Date
RU2017134722A RU2017134722A (ru) 2019-04-04
RU2017134722A3 RU2017134722A3 (enExample) 2019-10-08
RU2716911C2 true RU2716911C2 (ru) 2020-03-17

Family

ID=55521726

Family Applications (1)

Application Number Title Priority Date Filing Date
RU2017134722A RU2716911C2 (ru) 2015-04-10 2016-03-10 Способ и устройство для кодирования множественных аудиосигналов и способ и устройство для декодирования смеси множественных аудиосигналов с улучшенным разделением

Country Status (10)

Country Link
US (1) US20180082693A1 (enExample)
EP (1) EP3281196A1 (enExample)
JP (1) JP2018513996A (enExample)
KR (1) KR20170134467A (enExample)
CN (1) CN107636756A (enExample)
BR (1) BR112017021865A2 (enExample)
CA (1) CA2982017A1 (enExample)
MX (1) MX2017012957A (enExample)
RU (1) RU2716911C2 (enExample)
WO (1) WO2016162165A1 (enExample)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112115918A (zh) * 2020-09-29 2020-12-22 西北工业大学 一种信号稀疏表示及重构的时频原子字典及信号处理方法
CN113314110B (zh) * 2021-04-25 2022-12-02 天津大学 一种基于量子测量与酉变换技术的语言模型及构建方法
KR20220151953A (ko) * 2021-05-07 2022-11-15 한국전자통신연구원 부가 정보를 이용한 오디오 신호의 부호화 및 복호화 방법과 그 방법을 수행하는 부호화기 및 복호화기
CN115116465A (zh) * 2022-05-23 2022-09-27 佛山智优人科技有限公司 一种声源分离的方法及声源分离装置
CN120452467B (zh) * 2025-07-14 2025-09-16 国网福建省电力有限公司信息通信分公司 一种基于Codec的语音与背景音分离方法、装置、设备及介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005096274A1 (en) * 2004-04-01 2005-10-13 Beijing Media Works Co., Ltd An enhanced audio encoding/decoding device and method
US6975254B1 (en) * 1998-12-28 2005-12-13 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Methods and devices for coding or decoding an audio signal or bit stream
US8489403B1 (en) * 2010-08-25 2013-07-16 Foundation For Research and Technology—Institute of Computer Science ‘FORTH-ICS’ Apparatuses, methods and systems for sparse sinusoidal audio processing and transmission
US20140297294A1 (en) * 2007-02-14 2014-10-02 Lg Electronics Inc. Methods and Apparatuses for Encoding and Decoding Object-Based Audio Signals
WO2014161996A2 (en) * 2013-04-05 2014-10-09 Dolby International Ab Audio processing system

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3622365B2 (ja) * 1996-09-26 2005-02-23 ヤマハ株式会社 音声符号化伝送方式
WO2007027051A1 (en) * 2005-08-30 2007-03-08 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
US7873511B2 (en) * 2006-06-30 2011-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
JP4932917B2 (ja) * 2009-04-03 2012-05-16 株式会社エヌ・ティ・ティ・ドコモ 音声復号装置、音声復号方法、及び音声復号プログラム
CN101742313B (zh) * 2009-12-10 2011-09-07 北京邮电大学 基于压缩感知技术的分布式信源编码的方法
US8390490B2 (en) * 2011-05-12 2013-03-05 Texas Instruments Incorporated Compressive sensing analog-to-digital converters
EP2688066A1 (en) * 2012-07-16 2014-01-22 Thomson Licensing Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction
US20150312663A1 (en) * 2012-09-19 2015-10-29 Analog Devices, Inc. Source separation using a circular model
US9715880B2 (en) * 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
US9576583B1 (en) * 2014-12-01 2017-02-21 Cedar Audio Ltd Restoring audio signals with mask and latent variables
US20180048917A1 (en) * 2015-02-23 2018-02-15 Board Of Regents, The University Of Texas System Systems, apparatus, and methods for bit level representation for data processing and analytics

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6975254B1 (en) * 1998-12-28 2005-12-13 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Methods and devices for coding or decoding an audio signal or bit stream
WO2005096274A1 (en) * 2004-04-01 2005-10-13 Beijing Media Works Co., Ltd An enhanced audio encoding/decoding device and method
US20140297294A1 (en) * 2007-02-14 2014-10-02 Lg Electronics Inc. Methods and Apparatuses for Encoding and Decoding Object-Based Audio Signals
US8489403B1 (en) * 2010-08-25 2013-07-16 Foundation For Research and Technology—Institute of Computer Science ‘FORTH-ICS’ Apparatuses, methods and systems for sparse sinusoidal audio processing and transmission
WO2014161996A2 (en) * 2013-04-05 2014-10-09 Dolby International Ab Audio processing system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
OZEROV ALEXEY et al. "Coding-based informed source separation: nonnegative tensor factorization approach". *

Also Published As

Publication number Publication date
MX2017012957A (es) 2018-02-01
CA2982017A1 (en) 2016-10-13
RU2017134722A (ru) 2019-04-04
RU2017134722A3 (enExample) 2019-10-08
CN107636756A (zh) 2018-01-26
BR112017021865A2 (pt) 2018-07-10
JP2018513996A (ja) 2018-05-31
EP3281196A1 (en) 2018-02-14
WO2016162165A1 (en) 2016-10-13
US20180082693A1 (en) 2018-03-22
KR20170134467A (ko) 2017-12-06

Similar Documents

Publication Publication Date Title
CN106415716B (zh) 编码器、解码器以及用于编码和解码的方法
Ozerov et al. Coding-based informed source separation: Nonnegative tensor factorization approach
RU2716911C2 (ru) Способ и устройство для кодирования множественных аудиосигналов и способ и устройство для декодирования смеси множественных аудиосигналов с улучшенным разделением
Ozerov et al. Informed source separation: source coding meets source separation
KR101733326B1 (ko) 개선된 확률 분포 추정을 이용한 선형 예측 기반 오디오 코딩
CN101849258A (zh) 在可缩放语音和音频编解码器中的用于经量化的mdct频谱的码簿索引的编码/解码的技术
JP6148342B2 (ja) 低または中ビットレートに対する知覚品質に基づくオーディオ分類
JPWO2007088853A1 (ja) 音声符号化装置、音声復号装置、音声符号化システム、音声符号化方法及び音声復号方法
EP3544005B1 (en) Audio coding with dithered quantization
RU2744485C1 (ru) Ослабление шума в декодере
RU2636126C2 (ru) Устройство для кодирования речевого сигнала с использованием acelp в автокорреляционной области
JPWO2012004998A1 (ja) スペクトル係数コーディングの量子化パラメータを効率的に符号化する装置及び方法
Rohlfing et al. NMF-based informed source separation
Vali et al. End-to-end optimized multi-stage vector quantization of spectral envelopes for speech and audio coding
AU2014280258B9 (en) Apparatus and method for audio signal envelope encoding, processing and decoding by modelling a cumulative sum representation employing distribution quantization and coding
Bilen et al. Compressive sampling-based informed source separation
CA2914418C (en) Apparatus and method for audio signal envelope encoding, processing and decoding by splitting the audio signal envelope employing distribution quantization and coding
Kırbız et al. Perceptual coding-based informed source separation
EP3115992A1 (en) Method and device for encoding multiple audio signals, and method and device for decoding a mixture of multiple audio signals with improved separation
Ramirez Intra-predictive switched split vector quantization of speech spectra
Rohlfing et al. Quantization-aware parameter estimation for audio upmixing
Kim KLT-based adaptive entropy-constrained vector quantization for the speech signals
HK1223725B (en) Apparatus and method for audio signal envelope encoding, processing and decoding by modelling a cumulative sum representation employing distribution quantization and coding

Legal Events

Date Code Title Description
HZ9A Changing address for correspondence with an applicant
MM4A The patent is invalid due to non-payment of fees

Effective date: 20210311