KR20230060502A - Signal processing device and method, learning device and method, and program - Google Patents
Signal processing device and method, learning device and method, and program
- Publication number
- KR20230060502A (application number KR1020237005227A)
- Authority
- KR
- South Korea
- Prior art keywords
- signal
- coefficient
- audio signal
- information
- processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/307—Frequency adjustment, e.g. tone control
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/11—Positioning of individual sound objects, e.g. moving airplane, within a sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/07—Synergistic effects of band splitting and sub-band processing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Telephone Function (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2020148234 | 2020-09-03 | | |
| JPJP-P-2020-148234 | 2020-09-03 | | |
| PCT/JP2021/030599 WO2022050087A1 (ja) | 2020-09-03 | 2021-08-20 | Signal processing device and method, learning device and method, and program |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20230060502A (ko) | 2023-05-04 |
Family
ID=80490814
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020237005227A Withdrawn KR20230060502A (ko) | 2020-09-03 | 2021-08-20 | Signal processing device and method, learning device and method, and program |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US20230300557A1 |
| EP (1) | EP4210048A4 |
| JP (1) | JPWO2022050087A1 |
| KR (1) | KR20230060502A |
| CN (1) | CN116018641A |
| BR (1) | BR112023003488A2 |
| MX (1) | MX2023002255A |
| WO (1) | WO2022050087A1 |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2021261235A1 (ja) * | 2020-06-22 | 2021-12-30 | Sony Group Corporation | Signal processing device and method, and program |
| EP4202921B1 (en) * | 2020-09-28 | 2026-04-08 | Samsung Electronics Co., Ltd. | Audio encoding apparatus and audio decoding apparatus |
| EP4468292A3 (en) * | 2020-10-17 | 2024-12-11 | Dolby International AB | Method and apparatus for generating an intermediate audio format from an input multichannel audio signal |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2830051A3 (en) * | 2013-07-22 | 2015-03-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals |
| JP6439296B2 (ja) * | 2014-03-24 | 2018-12-19 | Sony Corporation | Decoding device and method, and program |
| US10038966B1 (en) * | 2016-10-20 | 2018-07-31 | Oculus Vr, Llc | Head-related transfer function (HRTF) personalization based on captured images of user |
| US11159906B2 (en) | 2016-12-12 | 2021-10-26 | Sony Corporation | HRTF measurement method, HRTF measurement device, and program |
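| KR102002681B1 (ko) * | 2017-06-27 | 2019-07-23 | Industry-University Cooperation Foundation Hanyang University | Generative adversarial network based speech bandwidth extender and extension method |
| CN110998721B (zh) * | 2017-07-28 | 2024-04-26 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus for encoding or decoding an encoded multichannel signal using a filling signal generated by a broadband filter |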
| US10650806B2 (en) * | 2018-04-23 | 2020-05-12 | Cerence Operating Company | System and method for discriminative training of regression deep neural networks |
| JP7442494B2 (ja) * | 2018-07-25 | 2024-03-04 | Dolby Laboratories Licensing Corporation | Personalized HRTF via optical capture |
2021
- 2021-08-20 JP: application JP2022546230A, publication JPWO2022050087A1 (ja), not active: Abandoned
- 2021-08-20 BR: application BR112023003488A, publication BR112023003488A2 (pt), not active: Application Discontinuation
- 2021-08-20 MX: application MX2023002255A, publication MX2023002255A (es), status unknown
- 2021-08-20 CN: application CN202180052388.8A, publication CN116018641A (zh), not active: Withdrawn
- 2021-08-20 KR: application KR1020237005227A, publication KR20230060502A (ko), not active: Withdrawn
- 2021-08-20 WO: application PCT/JP2021/030599, publication WO2022050087A1 (ja), not active: Ceased
- 2021-08-20 US: application US18/023,183, publication US20230300557A1 (en), not active: Abandoned
- 2021-08-20 EP: application EP21864145.4A, publication EP4210048A4 (en), not active: Withdrawn
Non-Patent Citations (1)
| Title |
|---|
| INTERNATIONAL STANDARD ISO/IEC 23008-3 Second edition 2019-02 Information technology-High efficiency coding and media delivery in heterogeneous environments-Part 3: 3D audio |
Also Published As
| Publication number | Publication date |
|---|---|
| US20230300557A1 (en) | 2023-09-21 |
| EP4210048A4 (en) | 2024-02-21 |
| BR112023003488A2 (pt) | 2023-04-11 |
| WO2022050087A1 (ja) | 2022-03-10 |
| MX2023002255A (es) | 2023-05-16 |
| JPWO2022050087A1 (ja) | 2022-03-10 |
| EP4210048A1 (en) | 2023-07-12 |
| CN116018641A (zh) | 2023-04-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR102837743B1 (ko) | | Representing spatial audio by means of an audio signal and associated metadata |
| Cobos et al. | | An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction |
| US10182302B2 (en) | | Binaural decoder to output spatial stereo sound and a decoding method thereof |
| KR101325644B1 (ko) | | Method and device for efficient binaural sound spatialization in the transformed domain |
| US8379868B2 (en) | | Spatial audio coding based on universal spatial cues |
| US9055371B2 (en) | | Controllable playback system offering hierarchical playback options |
| KR100928311B1 (ko) | | Apparatus and method for generating an encoded stereo signal of an audio piece or audio data stream |
| US9219972B2 (en) | | Efficient audio coding having reduced bit rate for ambient signals and decoding using same |
| CN114582357B (zh) | | Audio encoding and decoding method and apparatus |
| US10764709B2 (en) | | Methods, apparatus and systems for dynamic equalization for cross-talk cancellation |
| JP7447798B2 (ja) | | Signal processing device and method, and program |
| WO2018047667A1 (ja) | | Audio processing device and method |
| KR20230060502A (ko) | | Signal processing device and method, learning device and method, and program |
| CN115376527A (zh) | | Three-dimensional audio signal encoding method, apparatus, and encoder |
| US8041041B1 (en) | | Method and system for providing stereo-channel based multi-channel audio coding |
| CN112567769B (zh) | | Audio reproduction apparatus, audio reproduction method, and storage medium |
| EP4171065A1 (en) | | Signal processing device and method, and program |
| WO2022034805A1 (ja) | | Signal processing device and method, and audio reproduction system |
| Wang | | Soundfield analysis and synthesis: recording, reproduction and compression |
| JP2017143325A (ja) | | Sound pickup device, sound pickup method, and program |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PA0105 | International application | Patent event date: 2023-02-14. Patent event code: PA01051R01D. Comment text: International Patent Application |
| | PG1501 | Laying open of application | |
| | PA0201 | Request for examination | Patent event date: 2024-07-05. Patent event code: PA02012R01D. Comment text: Request for Examination of Application |
| | PC1202 | Submission of document of withdrawal before decision of registration | Patent event date: 2025-04-15. Patent event code: PC12021R01D. Comment text: [Withdrawal of Procedure relating to Patent, etc.] Withdrawal (Abandonment) |
| | WITB | Written withdrawal of application | |