DE112021003663T5 - Signal processing device, method, and program - Google Patents
- Publication number
- DE112021003663T5
- Authority
- DE
- Germany
- Prior art keywords
- audio
- audio signal
- auditory
- gain
- metadata
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012545 processing Methods 0.000 title claims abstract description 150
- 238000000034 method Methods 0.000 title abstract description 52
- 238000012937 correction Methods 0.000 claims abstract description 271
- 230000005236 sound signal Effects 0.000 claims abstract description 237
- 238000013139 quantization Methods 0.000 claims abstract description 130
- 238000006243 chemical reaction Methods 0.000 claims description 84
- 230000008859 change Effects 0.000 claims description 53
- 238000003672 processing method Methods 0.000 claims description 14
- 238000001228 spectrum Methods 0.000 claims description 14
- 230000000873 masking effect Effects 0.000 claims description 7
- 238000005516 engineering process Methods 0.000 abstract description 30
- 238000004364 calculation method Methods 0.000 description 72
- 230000008569 process Effects 0.000 description 39
- 238000010586 diagram Methods 0.000 description 24
- 230000035807 sensation Effects 0.000 description 15
- 238000009877 rendering Methods 0.000 description 14
- 239000013598 vector Substances 0.000 description 8
- 230000035945 sensitivity Effects 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 230000001755 vocal effect Effects 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000006866 deterioration Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- NRNCYVBFPDDJNE-UHFFFAOYSA-N pemoline Chemical compound O1C(N)=NC(=O)C1C1=CC=CC=C1 NRNCYVBFPDDJNE-UHFFFAOYSA-N 0.000 description 2
- 230000008447 perception Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002787 reinforcement Effects 0.000 description 2
- BUHVIAUBTBOHAG-FOYDDCNASA-N (2r,3r,4s,5r)-2-[6-[[2-(3,5-dimethoxyphenyl)-2-(2-methylphenyl)ethyl]amino]purin-9-yl]-5-(hydroxymethyl)oxolane-3,4-diol Chemical compound COC1=CC(OC)=CC(C(CNC=2C=3N=CN(C=3N=CN=2)[C@H]2[C@@H]([C@H](O)[C@@H](CO)O2)O)C=2C(=CC=CC=2)C)=C1 BUHVIAUBTBOHAG-FOYDDCNASA-N 0.000 description 1
- 241001342895 Chorus Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- HAORKNGNJCEJBX-UHFFFAOYSA-N cyprodinil Chemical compound N=1C(C)=CC(C2CC2)=NC=1NC1=CC=CC=C1 HAORKNGNJCEJBX-UHFFFAOYSA-N 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000002592 echocardiography Methods 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 238000004091 panning Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000010408 sweeping Methods 0.000 description 1
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
        - G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
          - G10L19/032—Quantisation or dequantisation of spectral components
            - G10L19/035—Scalar quantisation
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
        - G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2020118174 | 2020-07-09 | | |
| JP2020-118174 | 2020-07-09 | | |
| JP2020-170985 | 2020-10-09 | | |
| JP2020170985 | 2020-10-09 | | |
| PCT/JP2021/024098 WO2022009694A1 (ja) | 2020-07-09 | 2021-06-25 | Signal processing device and method, and program |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| DE112021003663T5 (de) | 2023-04-27 |
Family
ID=79553059
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| DE112021003663.7T (pending), DE112021003663T5 (de) | Signal processing device, method, and program | 2020-07-09 | 2021-06-25 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US20230253000A1 |
| JP (1) | JPWO2022009694A1 |
| CN (1) | CN115943461A |
| DE (1) | DE112021003663T5 |
| WO (1) | WO2022009694A1 |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20240032746A (ko) * | 2021-07-12 | 2024-03-12 | Sony Group Corporation | Encoding device and method, decoding device and method, and program |
| EP4693279A1 (en) * | 2023-03-27 | 2026-02-11 | Beijing Xiaomi Mobile Software Co., Ltd. | Quantization coding method, apparatus, device, and storage medium |
| US20250087230A1 (en) * | 2023-09-13 | 2025-03-13 | Microsoft Technology Licensing, Llc | System and Method for Speech Enhancement in Multichannel Audio Processing Systems |
| WO2025084114A1 (ja) * | 2023-10-20 | 2025-04-24 | Sony Group Corporation | Signal processing device and method, and program |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001154695A (ja) * | 1999-11-24 | 2001-06-08 | Victor Co Of Japan Ltd | Audio encoding device and method thereof |
| JP2006139827A (ja) * | 2004-11-10 | 2006-06-01 | Victor Co Of Japan Ltd | Three-dimensional sound field information recording device and program |
| KR102395351B1 (ko) * | 2013-07-31 | 2022-05-10 | Dolby Laboratories Licensing Corporation | Processing of spatially diffuse or large audio objects |
| JP6531649B2 (ja) * | 2013-09-19 | 2019-06-19 | Sony Corporation | Encoding device and method, decoding device and method, and program |
| US10477337B2 (en) * | 2014-01-16 | 2019-11-12 | Sony Corporation | Audio processing device and method therefor |
| KR102258784B1 (ko) * | 2014-04-11 | 2021-05-31 | Samsung Electronics Co., Ltd. | Method and apparatus for rendering an audio signal, and computer-readable recording medium |
| TWI607655B (zh) * | 2015-06-19 | 2017-12-01 | Sony Corp | Coding apparatus and method, decoding apparatus and method, and program |
| CN107710790B (zh) * | 2015-06-24 | 2021-06-22 | Sony Corporation | Device, method, and program for processing sound |
| US9837086B2 (en) * | 2015-07-31 | 2017-12-05 | Apple Inc. | Encoded audio extended metadata-based dynamic range control |
| WO2017212642A1 (ja) * | 2016-06-10 | 2017-12-14 | Mitsubishi Electric Corporation | Operating device |
2021
- 2021-06-25: US, application US18/013,217, publication US20230253000A1, active (pending)
- 2021-06-25: JP, application JP2022535018A, publication JPWO2022009694A1, active (pending)
- 2021-06-25: CN, application CN202180039314.0A, publication CN115943461A, not active (withdrawn)
- 2021-06-25: DE, application DE112021003663.7T, publication DE112021003663T5, active (pending)
- 2021-06-25: WO, application PCT/JP2021/024098, publication WO2022009694A1, not active (ceased)
Non-Patent Citations (2)
| Title |
|---|
| ISO/IEC 23003-3, MPEG-D USAC |
| ISO/IEC 23008-3, MPEG-H 3D Audio |
Also Published As
| Publication number | Publication date |
|---|---|
| JPWO2022009694A1 | 2022-01-13 |
| CN115943461A (zh) | 2023-04-07 |
| US20230253000A1 (en) | 2023-08-10 |
| WO2022009694A1 (ja) | 2022-01-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE112021003663T5 (de) | | Signal processing device, method, and program |
| DE602006000239T2 (de) | | Energy-dependent quantization for efficient coding of spatial audio parameters |
| USRE48045E1 (en) | | Encoding device and decoding device |
| DE69214523T2 (de) | | Decoder for a variable number of channel representations of multidimensional sound fields |
| DE60004814T2 (de) | | Quantization in perceptual audio coders with compensation for noise smeared by the synthesis filter |
| DE69633633T2 (de) | | Multichannel predictive subband coder with adaptive psychoacoustic bit allocation |
| EP1763870B1 (de) | | Generation of an encoded multichannel signal and decoding of an encoded multichannel signal |
| DE60310716T2 (de) | | System for audio coding with filling of spectral gaps |
| DE602004004168T2 (de) | | Compatible multichannel coding/decoding |
| DE602005006385T2 (de) | | Device and method for constructing a multichannel output signal or for generating a downmix signal |
| DE60204038T2 (de) | | Device for encoding or decoding an audio signal |
| DE60002483T2 (de) | | Scalable coding method for high-quality audio |
| DE20321886U1 (de) | | Inverse quantization for audio |
| DE69933119T2 (de) | | Method and device for masking the quantization noise of audio signals |
| DE60318835T2 (de) | | Parametric representation of spatial sound |
| DE60206390T2 (de) | | Efficient and scalable parametric stereo coding for low-bit-rate applications |
| DE69431622T2 (de) | | Method and apparatus for encoding multi-bit coded digital audio by subtracting an adaptive dither signal, inserting hidden channel bits, and filtering, and encoding apparatus for use with this method |
| DE602004010885T2 (de) | | Audio transcoding |
| EP1854334B1 (de) | | Device and method for generating an encoded stereo signal of an audio piece or audio data stream |
| DE69529393T2 (de) | | Method for weighted noise filtering |
| DE60113602T2 (de) | | Audio coder with psychoacoustic bit allocation |
| DE69932861T2 (de) | | Method for coding an audio signal with a quality value for bit allocation |
| DE60303346T2 (de) | | Encoding and/or decoding method for digital audio signals based on time-frequency correlation, and device therefor |
| US7583804B2 (en) | | Music information encoding/decoding device and method |
| DE68927927T2 (de) | | Coding of audio signals taking perceptibility into account |