JP7808095B2 - 音コーデックにおける、非相関ステレオコンテンツの分類、クロストーク検出、およびステレオモード選択のための方法およびデバイス - Google Patents

音コーデックにおける、非相関ステレオコンテンツの分類、クロストーク検出、およびステレオモード選択のための方法およびデバイス

Info

Publication number
JP7808095B2
JP7808095B2 JP2023515652A JP2023515652A JP7808095B2 JP 7808095 B2 JP7808095 B2 JP 7808095B2 JP 2023515652 A JP2023515652 A JP 2023515652A JP 2023515652 A JP2023515652 A JP 2023515652A JP 7808095 B2 JP7808095 B2 JP 7808095B2
Authority
JP
Japan
Prior art keywords
stereo
stereo mode
sound signal
mode
previous frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2023515652A
Other languages
English (en)
Japanese (ja)
Other versions
JP2023540377A (ja
JP2023540377A5 (https=
Inventor
ウラジミール・マレノフスキー
トミー・ヴァイヤンクール
Original Assignee
ヴォイスエイジ・コーポレーション
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ヴォイスエイジ・コーポレーション filed Critical ヴォイスエイジ・コーポレーション
Publication of JP2023540377A publication Critical patent/JP2023540377A/ja
Publication of JP2023540377A5 publication Critical patent/JP2023540377A5/ja
Application granted granted Critical
Publication of JP7808095B2 publication Critical patent/JP7808095B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R27/00Public address systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/007Two-channel systems in which the audio signals are in digital form
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
JP2023515652A 2020-09-09 2021-09-08 音コーデックにおける、非相関ステレオコンテンツの分類、クロストーク検出、およびステレオモード選択のための方法およびデバイス Active JP7808095B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202063075984P 2020-09-09 2020-09-09
US63/075,984 2020-09-09
PCT/CA2021/051238 WO2022051846A1 (en) 2020-09-09 2021-09-08 Method and device for classification of uncorrelated stereo content, cross-talk detection, and stereo mode selection in a sound codec

Publications (3)

Publication Number Publication Date
JP2023540377A JP2023540377A (ja) 2023-09-22
JP2023540377A5 JP2023540377A5 (https=) 2024-09-17
JP7808095B2 true JP7808095B2 (ja) 2026-01-28

Family

ID=80629696

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023515652A Active JP7808095B2 (ja) 2020-09-09 2021-09-08 音コーデックにおける、非相関ステレオコンテンツの分類、クロストーク検出、およびステレオモード選択のための方法およびデバイス

Country Status (9)

Country Link
US (1) US12494210B2 (https=)
EP (1) EP4211683B1 (https=)
JP (1) JP7808095B2 (https=)
KR (1) KR20230066056A (https=)
CN (1) CN116438811A (https=)
BR (1) BR112023003311A2 (https=)
CA (1) CA3192085A1 (https=)
MX (1) MX2023002825A (https=)
WO (1) WO2022051846A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12341621B1 (en) * 2022-01-31 2025-06-24 Zoom Communications, Inc. Audio capture device selection for in-person conference participants

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003522965A (ja) 1998-12-21 2003-07-29 クゥアルコム・インコーポレイテッド 周期的スピーチコーディング
JP2004509366A (ja) 2000-09-15 2004-03-25 テレフオンアクチーボラゲツト エル エム エリクソン 複数チャネル信号の符号化及び復号化
JP2009524846A (ja) 2006-01-24 2009-07-02 サムスン エレクトロニクス カンパニー リミテッド 適応的時間/周波数ベース符号化モード決定装置およびこのための符号化モード決定方法
JP2011527762A (ja) 2008-07-09 2011-11-04 サムスン エレクトロニクス カンパニー リミテッド 符号化方式の決定方法及び装置
JP2013033189A (ja) 2011-07-01 2013-02-14 Sony Corp オーディオ符号化装置、オーディオ符号化方法、およびプログラム
JP2018513408A (ja) 2015-04-05 2018-05-24 クゥアルコム・インコーポレイテッドQualcomm Incorporated エンコーダ選択
WO2019058927A1 (ja) 2017-09-25 2019-03-28 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 符号化装置及び符号化方法

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3207281B2 (ja) 1993-02-12 2001-09-10 株式会社東芝 ステレオ音声符号化・復号化方式、ステレオ音声復号化装置及び単独発言/複数同時発言判別装置
AU5663296A (en) * 1995-04-10 1996-10-30 Corporate Computer Systems, Inc. System for compression and decompression of audio signals fo r digital transmission
US6151571A (en) * 1999-08-31 2000-11-21 Andersen Consulting System, method and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters
KR20070065401A (ko) * 2004-09-23 2007-06-22 코닌클리케 필립스 일렉트로닉스 엔.브이. 오디오 데이터를 처리하는 시스템 및 방법, 프로그램구성요소, 및 컴퓨터-판독가능 매체
US7599840B2 (en) * 2005-07-15 2009-10-06 Microsoft Corporation Selectively using multiple entropy models in adaptive coding and decoding
US8041042B2 (en) * 2006-11-30 2011-10-18 Nokia Corporation Method, system, apparatus and computer program product for stereo coding
KR101600082B1 (ko) * 2009-01-29 2016-03-04 삼성전자주식회사 오디오 신호의 음질 평가 방법 및 장치
CN101615910B (zh) * 2009-05-31 2010-12-22 华为技术有限公司 压缩编码的方法、装置和设备以及压缩解码方法
PT2633521T (pt) * 2010-10-25 2018-11-13 Voiceage Corp Codificação de sinais áudio genéricos com baixos débitos binários e pouco atraso
WO2013149671A1 (en) * 2012-04-05 2013-10-10 Huawei Technologies Co., Ltd. Multi-channel audio encoder and method for encoding a multi-channel audio signal
TWI612518B (zh) * 2012-11-13 2018-01-21 Samsung Electronics Co., Ltd. 編碼模式決定方法、音訊編碼方法以及音訊解碼方法
EP3067886A1 (en) * 2015-03-09 2016-09-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
WO2016184958A1 (en) 2015-05-20 2016-11-24 Telefonaktiebolaget Lm Ericsson (Publ) Coding of multi-channel audio signals
US10319385B2 (en) 2015-09-25 2019-06-11 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
US9888318B2 (en) * 2015-11-25 2018-02-06 Mediatek, Inc. Method, system and circuits for headset crosstalk reduction
US11145316B2 (en) 2017-06-01 2021-10-12 Panasonic Intellectual Property Corporation Of America Encoder and encoding method for selecting coding mode for audio channels based on interchannel correlation

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003522965A (ja) 1998-12-21 2003-07-29 クゥアルコム・インコーポレイテッド 周期的スピーチコーディング
JP2004509366A (ja) 2000-09-15 2004-03-25 テレフオンアクチーボラゲツト エル エム エリクソン 複数チャネル信号の符号化及び復号化
JP2009524846A (ja) 2006-01-24 2009-07-02 サムスン エレクトロニクス カンパニー リミテッド 適応的時間/周波数ベース符号化モード決定装置およびこのための符号化モード決定方法
JP2011527762A (ja) 2008-07-09 2011-11-04 サムスン エレクトロニクス カンパニー リミテッド 符号化方式の決定方法及び装置
JP2013033189A (ja) 2011-07-01 2013-02-14 Sony Corp オーディオ符号化装置、オーディオ符号化方法、およびプログラム
JP2018513408A (ja) 2015-04-05 2018-05-24 クゥアルコム・インコーポレイテッドQualcomm Incorporated エンコーダ選択
WO2019058927A1 (ja) 2017-09-25 2019-03-28 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 符号化装置及び符号化方法

Also Published As

Publication number Publication date
MX2023002825A (es) 2023-05-30
EP4211683A1 (en) 2023-07-19
KR20230066056A (ko) 2023-05-12
WO2022051846A1 (en) 2022-03-17
EP4211683A4 (en) 2024-08-07
CN116438811A (zh) 2023-07-14
CA3192085A1 (en) 2022-03-17
JP2023540377A (ja) 2023-09-22
US12494210B2 (en) 2025-12-09
US20240021208A1 (en) 2024-01-18
BR112023003311A2 (pt) 2023-03-21
EP4211683B1 (en) 2026-04-01

Similar Documents

Publication Publication Date Title
US12198705B2 (en) Apparatus, method or computer program for estimating an inter-channel time difference
US8532999B2 (en) Apparatus and method for generating a multi-channel synthesizer control signal, multi-channel synthesizer, method of generating an output signal from an input signal and machine-readable storage medium
CN103403800B (zh) 确定多声道音频信号的声道间时间差
CN108780648A (zh) 用于在时间上失配的信号的音频处理
EP3465681B1 (en) Method and apparatus for voice or sound activity detection for spatial audio
JP7813238B2 (ja) サウンドコーデックにおける音声/音楽分類およびコアエンコーダ選択のための方法およびデバイス
JP7808095B2 (ja) 音コーデックにおける、非相関ステレオコンテンツの分類、クロストーク検出、およびステレオモード選択のための方法およびデバイス
KR101841380B1 (ko) 다중-채널 오디오 신호 분류기
CN108806711A (zh) 一种提取方法及装置
HK40090246A (zh) 用於声音编解码器中的非相关立体声内容的分类、串音检测和立体声模式选择的方法和设备
HK1095195B (en) Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240906

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20240906

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20250807

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20250812

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20251111

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20251223

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20260116

R150 Certificate of patent or registration of utility model

Ref document number: 7808095

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150