KR20230060502A - 신호 처리 장치 및 방법, 학습 장치 및 방법, 그리고 프로그램 - Google Patents

신호 처리 장치 및 방법, 학습 장치 및 방법, 그리고 프로그램 Download PDF

Info

Publication number
KR20230060502A
KR20230060502A KR1020237005227A KR20237005227A KR20230060502A KR 20230060502 A KR20230060502 A KR 20230060502A KR 1020237005227 A KR1020237005227 A KR 1020237005227A KR 20237005227 A KR20237005227 A KR 20237005227A KR 20230060502 A KR20230060502 A KR 20230060502A
Authority
KR
South Korea
Prior art keywords
signal
coefficient
audio signal
information
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
KR1020237005227A
Other languages
English (en)
Korean (ko)
Inventor
히로유키 혼마
도루 치넨
아키후미 고노
Original Assignee
소니그룹주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 소니그룹주식회사 filed Critical 소니그룹주식회사
Publication of KR20230060502A publication Critical patent/KR20230060502A/ko
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/307Frequency adjustment, e.g. tone control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G10L21/0388Details of processing therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/07Synergistic effects of band splitting and sub-band processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Telephone Function (AREA)
KR1020237005227A 2020-09-03 2021-08-20 신호 처리 장치 및 방법, 학습 장치 및 방법, 그리고 프로그램 Withdrawn KR20230060502A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2020148234 2020-09-03
JPJP-P-2020-148234 2020-09-03
PCT/JP2021/030599 WO2022050087A1 (ja) 2020-09-03 2021-08-20 信号処理装置および方法、学習装置および方法、並びにプログラム

Publications (1)

Publication Number Publication Date
KR20230060502A true KR20230060502A (ko) 2023-05-04

Family

ID=80490814

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020237005227A Withdrawn KR20230060502A (ko) 2020-09-03 2021-08-20 신호 처리 장치 및 방법, 학습 장치 및 방법, 그리고 프로그램

Country Status (8)

Country Link
US (1) US20230300557A1 (https=)
EP (1) EP4210048A4 (https=)
JP (1) JPWO2022050087A1 (https=)
KR (1) KR20230060502A (https=)
CN (1) CN116018641A (https=)
BR (1) BR112023003488A2 (https=)
MX (1) MX2023002255A (https=)
WO (1) WO2022050087A1 (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021261235A1 (ja) * 2020-06-22 2021-12-30 ソニーグループ株式会社 信号処理装置および方法、並びにプログラム
EP4202921B1 (en) * 2020-09-28 2026-04-08 Samsung Electronics Co., Ltd. Audio encoding apparatus and audio decoding apparatus
EP4468292A3 (en) * 2020-10-17 2024-12-11 Dolby International AB Method and apparatus for generating an intermediate audio format from an input multichannel audio signal

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2830051A3 (en) * 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
JP6439296B2 (ja) * 2014-03-24 2018-12-19 ソニー株式会社 復号装置および方法、並びにプログラム
US10038966B1 (en) * 2016-10-20 2018-07-31 Oculus Vr, Llc Head-related transfer function (HRTF) personalization based on captured images of user
US11159906B2 (en) 2016-12-12 2021-10-26 Sony Corporation HRTF measurement method, HRTF measurement device, and program
KR102002681B1 (ko) * 2017-06-27 2019-07-23 한양대학교 산학협력단 생성적 대립 망 기반의 음성 대역폭 확장기 및 확장 방법
CN110998721B (zh) * 2017-07-28 2024-04-26 弗劳恩霍夫应用研究促进协会 用于使用宽频带滤波器生成的填充信号对已编码的多声道信号进行编码或解码的装置
US10650806B2 (en) * 2018-04-23 2020-05-12 Cerence Operating Company System and method for discriminative training of regression deep neural networks
JP7442494B2 (ja) * 2018-07-25 2024-03-04 ドルビー ラボラトリーズ ライセンシング コーポレイション 光学式捕捉によるパーソナライズされたhrtf

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
INTERNATIONAL STANDARD ISO/IEC 23008-3 Second edition 2019-02 Information technology-High efficiency coding and media delivery in heterogeneous environments-Part 3: 3D audio

Also Published As

Publication number Publication date
US20230300557A1 (en) 2023-09-21
EP4210048A4 (en) 2024-02-21
BR112023003488A2 (pt) 2023-04-11
WO2022050087A1 (ja) 2022-03-10
MX2023002255A (es) 2023-05-16
JPWO2022050087A1 (https=) 2022-03-10
EP4210048A1 (en) 2023-07-12
CN116018641A (zh) 2023-04-25

Similar Documents

Publication Publication Date Title
KR102837743B1 (ko) 오디오 신호 및 연관된 메타데이터에 의해 공간 오디오를 표현하는 것
Cobos et al. An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction
US10182302B2 (en) Binaural decoder to output spatial stereo sound and a decoding method thereof
KR101325644B1 (ko) 변환 영역에서의 효율적인 바이노럴 사운드 공간화 방법 및장치
US8379868B2 (en) Spatial audio coding based on universal spatial cues
US9055371B2 (en) Controllable playback system offering hierarchical playback options
KR100928311B1 (ko) 오디오 피스 또는 오디오 데이터스트림의 인코딩된스테레오 신호를 생성하는 장치 및 방법
US9219972B2 (en) Efficient audio coding having reduced bit rate for ambient signals and decoding using same
CN114582357B (zh) 一种音频编解码方法和装置
US10764709B2 (en) Methods, apparatus and systems for dynamic equalization for cross-talk cancellation
JP7447798B2 (ja) 信号処理装置および方法、並びにプログラム
WO2018047667A1 (ja) 音声処理装置および方法
KR20230060502A (ko) 신호 처리 장치 및 방법, 학습 장치 및 방법, 그리고 프로그램
CN115376527A (zh) 三维音频信号编码方法、装置和编码器
US8041041B1 (en) Method and system for providing stereo-channel based multi-channel audio coding
CN112567769B (zh) 音频再现装置、音频再现方法和存储介质
EP4171065A1 (en) Signal processing device and method, and program
WO2022034805A1 (ja) 信号処理装置および方法、並びにオーディオ再生システム
Wang Soundfield analysis and synthesis: recording, reproduction and compression.
JP2017143325A (ja) 収音装置、収音方法、プログラム

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20230214

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20240705

Comment text: Request for Examination of Application

PC1202 Submission of document of withdrawal before decision of registration

Comment text: [Withdrawal of Procedure relating to Patent, etc.] Withdrawal (Abandonment)

Patent event code: PC12021R01D

Patent event date: 20250415

WITB Written withdrawal of application