JP5400954B2 - Audio format transcoder - Google Patents
Audio format transcoder
- Publication number
- JP5400954B2 (application JP2012509049A)
- Authority
- JP
- Japan
- Prior art keywords
- spatial
- signal
- audio
- saoc
- converted signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 claims description 55
- 238000012545 processing Methods 0.000 claims description 33
- 230000005236 sound signal Effects 0.000 claims description 18
- 230000008569 process Effects 0.000 claims description 6
- 238000004590 computer program Methods 0.000 claims description 4
- 238000007476 Maximum Likelihood Methods 0.000 claims description 3
- 238000009792 diffusion process Methods 0.000 description 20
- 230000006870 function Effects 0.000 description 20
- 238000004458 analytical method Methods 0.000 description 19
- 238000009877 rendering Methods 0.000 description 16
- 230000007480 spreading Effects 0.000 description 11
- 238000003892 spreading Methods 0.000 description 11
- 238000004364 calculation method Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 238000001914 filtration Methods 0.000 description 6
- 230000004044 response Effects 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- 230000008447 perception Effects 0.000 description 4
- 230000035945 sensitivity Effects 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 238000013459 approach Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000009795 derivation Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000004091 panning Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000001427 coherent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- ZYXYTGQFPZEUFX-UHFFFAOYSA-N benzpyrimoxan Chemical compound O1C(OCCC1)C=1C(=NC=NC=1)OCC1=CC=C(C=C1)C(F)(F)F ZYXYTGQFPZEUFX-UHFFFAOYSA-N 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000036962 time dependent Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
Claims (12)
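- An audio format transcoder (100) for transcoding an input audio signal having at least two directional audio components into values of at least two spatial audio sources usable in SAOC (Spatial Audio Object Coding), comprising:
a converter (110) for converting the input audio signal into a converted signal having a converted signal representation and a converted signal direction of arrival;
a position provider (120) for providing at least two spatial positions of the at least two spatial audio sources; and
a processor (130) for processing the converted signal representation based on the at least two spatial positions and the converted signal direction of arrival to obtain the values of the at least two spatial audio sources,
wherein the processor (130) determines (303) a weighting factor for each of the at least two spatial audio sources, and
wherein the processor (130) either processes the converted signal representation using at least two spatial filters (311, 312, 31N), depending on the weighting factors, to approximate the at least two spatial audio sources by at least two spatial audio source signals as the values of the at least two spatial audio sources, or estimates (402), depending on the weighting factors, power information for each of the at least two spatial audio sources as the values of the at least two spatial audio sources.
- The audio format transcoder (100) of claim 1, which transcodes an input signal according to a Directional Audio Coding (DirAC) signal, a B-format signal, or a signal from a microphone array.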
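- The audio format transcoder (100) of claim 1 or 2, wherein the converter (110) converts the input signal in terms of a number of frequency bands/subbands and/or time segments/frames.
- The audio format transcoder (100) of claim 3, wherein the converter (110) converts the input signal into a converted signal further having a diffuseness value and/or a reliability value per frequency band.
- The audio format transcoder (100) of claim 1, further comprising an SAOC encoder for encoding the at least two spatial audio source signals to obtain an SAOC-encoded signal comprising an SAOC (Spatial Audio Object Coding) downmix component and an SAOC side information component.
- The audio format transcoder (100) of claim 1, wherein the processor (130) converts the power information of the at least two spatial audio sources into SAOC OLDs (Object Level Differences).
- The audio format transcoder (100) of claim 6, wherein the processor (130) computes an Inter-Object Coherence (IOC) for the at least two spatial audio sources.
- The audio format transcoder (100) of claims 3 to 7, wherein the position provider (120) comprises a detector for detecting the at least two spatial positions of the at least two spatial audio sources based on the converted signal, the detector detecting the at least two spatial positions by combining multiple successive time segments/frames of the input signal.
- The audio format transcoder (100) of claim 8, wherein the detector detects the at least two spatial positions based on a maximum likelihood method on the power spatial density of the converted signal.
- The audio format transcoder (100) of claims 1 to 9, wherein the processor (130) further determines a weighting factor for an additional background object, the weighting factors being set such that the sum of the energies associated with the at least two spatial audio sources and the additional background object equals the energy of the converted signal representation.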
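- A method for transcoding an input audio signal having at least two directional audio components into values of at least two spatial audio sources usable in SAOC (Spatial Audio Object Coding), comprising:
a step of converting the input audio signal into a converted signal having a converted signal representation and a converted signal direction of arrival;
a step of providing at least two spatial positions of the at least two spatial audio sources; and
a processing step of processing the converted signal representation based on the at least two spatial positions and the converted signal direction of arrival to obtain the values of the at least two spatial audio sources,
wherein the processing step includes:
a sub-step of determining (303) a weighting factor for each of the at least two spatial audio sources; and
a sub-step of either processing the converted signal representation using at least two spatial filters (311, 312, 31N), depending on the weighting factors, to approximate the at least two spatial audio sources by at least two spatial audio source signals as the values of the at least two spatial audio sources, or estimating (402), depending on the weighting factors, power information for each of the at least two spatial audio sources as the values of the at least two spatial audio sources.
- A computer program causing a computer or a processor to perform the method of claim 11.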
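The power-estimation branch of the claims (weighting factors per source, power information per source, conversion to OLDs, and a background object that absorbs the remaining energy) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent text: the azimuth-only direction of arrival, the Gaussian angular window and its 30-degree width, the `sqrt(1 - diffuseness)` scaling, and all function names. The OLD normalization to the strongest object follows the way SAOC object level differences are commonly defined, and is likewise stated here as an assumption.

```python
import math


def angular_distance(a, b):
    """Smallest absolute difference between two azimuths, in radians."""
    d = (a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def source_weights(doa, diffuseness, positions, width=math.radians(30)):
    """Weights for each source plus one background object for a single
    time-frequency tile. Hypothetical window: Gaussian in angular distance
    to the source position, scaled by sqrt(1 - diffuseness)."""
    direct = math.sqrt(max(0.0, 1.0 - diffuseness))
    weights = [
        direct * math.exp(-0.5 * (angular_distance(doa, p) / width) ** 2)
        for p in positions
    ]
    energy = sum(w * w for w in weights)
    if energy > 1.0:
        # Rescale so the sources never claim more than the tile's energy.
        scale = 1.0 / math.sqrt(energy)
        weights = [w * scale for w in weights]
        energy = 1.0
    # The background weight takes whatever energy the sources leave over,
    # so source and background energies sum to the tile energy.
    background = math.sqrt(max(0.0, 1.0 - energy))
    return weights, background


def estimate_powers(tiles, positions):
    """Accumulate per-source and background power over tiles.
    Each tile is (pressure, doa, diffuseness); pressure may be complex."""
    powers = [0.0] * len(positions)
    background_power = 0.0
    for pressure, doa, diffuseness in tiles:
        tile_energy = abs(pressure) ** 2
        weights, background = source_weights(doa, diffuseness, positions)
        for i, w in enumerate(weights):
            powers[i] += w * w * tile_energy
        background_power += background * background * tile_energy
    return powers, background_power


def object_level_differences(powers):
    """Normalize powers to the strongest object (assumed OLD convention)."""
    peak = max(powers)
    if peak == 0.0:
        return [0.0] * len(powers)
    return [p / peak for p in powers]
```

Calling `estimate_powers` with a list of `(pressure, doa, diffuseness)` tiles and a list of source azimuths returns one power per source plus the background power; by construction their sum equals the summed tile energy, up to floating-point rounding, which mirrors the energy condition stated for the background object.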
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP09006291.0 | 2009-05-08 | ||
EP09006291A EP2249334A1 (en) | 2009-05-08 | 2009-05-08 | Audio format transcoder |
PCT/EP2010/056252 WO2010128136A1 (en) | 2009-05-08 | 2010-05-07 | Audio format transcoder |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2012526296A (ja) | 2012-10-25 |
JP5400954B2 (ja) | 2014-01-29 |
Family
ID=41170090
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012509049A (Active; JP5400954B2, ja) | 2009-05-08 | 2010-05-07 | Audio format transcoder |
Country Status (13)
Country | Link |
---|---|
US (1) | US8891797B2 (ja) |
EP (2) | EP2249334A1 (ja) |
JP (1) | JP5400954B2 (ja) |
KR (1) | KR101346026B1 (ja) |
CN (1) | CN102422348B (ja) |
AU (1) | AU2010244393B2 (ja) |
BR (1) | BRPI1007730A2 (ja) |
CA (1) | CA2761439C (ja) |
ES (1) | ES2426136T3 (ja) |
MX (1) | MX2011011788A (ja) |
PL (1) | PL2427880T3 (ja) |
RU (1) | RU2519295C2 (ja) |
WO (1) | WO2010128136A1 (ja) |
Families Citing this family (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3093843B1 (en) * | 2009-09-29 | 2020-12-02 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Mpeg-saoc audio signal decoder, mpeg-saoc audio signal encoder, method for providing an upmix signal representation using mpeg-saoc decoding, method for providing a downmix signal representation using mpeg-saoc decoding, and computer program using a time/frequency-dependent common inter-object-correlation parameter value |
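KR101410575B1 (ko) | 2010-02-24 | 2014-06-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus for generating an enhanced downmix signal, method for generating an enhanced downmix signal and computer program |
KR101442446B1 (ko) * | 2010-12-03 | 2014-09-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Sound acquisition via the extraction of geometrical information from direction of arrival estimates |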
US20140226842A1 (en) * | 2011-05-23 | 2014-08-14 | Nokia Corporation | Spatial audio processing apparatus |
TWI816597B (zh) | 2011-07-01 | 2023-09-21 | Dolby Laboratories Licensing Corporation | Apparatus, method and non-transitory medium for enhanced 3D audio authoring and rendering |
EP2600637A1 (en) | 2011-12-02 | 2013-06-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for microphone positioning based on a spatial power density |
EP2805326B1 (en) * | 2012-01-19 | 2015-10-14 | Koninklijke Philips N.V. | Spatial audio rendering and encoding |
US9268522B2 (en) | 2012-06-27 | 2016-02-23 | Volkswagen Ag | Devices and methods for conveying audio information in vehicles |
US9190065B2 (en) | 2012-07-15 | 2015-11-17 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for three-dimensional audio coding using basis function coefficients |
WO2014041067A1 (en) * | 2012-09-12 | 2014-03-20 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for providing enhanced guided downmix capabilities for 3d audio |
US10149048B1 (en) | 2012-09-26 | 2018-12-04 | Foundation for Research and Technology—Hellas (F.O.R.T.H.) Institute of Computer Science (I.C.S.) | Direction of arrival estimation and sound source enhancement in the presence of a reflective surface apparatuses, methods, and systems |
US10175335B1 (en) | 2012-09-26 | 2019-01-08 | Foundation For Research And Technology-Hellas (Forth) | Direction of arrival (DOA) estimation apparatuses, methods, and systems |
US20160210957A1 (en) | 2015-01-16 | 2016-07-21 | Foundation For Research And Technology - Hellas (Forth) | Foreground Signal Suppression Apparatuses, Methods, and Systems |
US10136239B1 (en) | 2012-09-26 | 2018-11-20 | Foundation For Research And Technology—Hellas (F.O.R.T.H.) | Capturing and reproducing spatial sound apparatuses, methods, and systems |
US9955277B1 (en) * | 2012-09-26 | 2018-04-24 | Foundation For Research And Technology-Hellas (F.O.R.T.H.) Institute Of Computer Science (I.C.S.) | Spatial sound characterization apparatuses, methods and systems |
US9554203B1 (en) | 2012-09-26 | 2017-01-24 | Foundation for Research and Technolgy—Hellas (FORTH) Institute of Computer Science (ICS) | Sound source characterization apparatuses, methods and systems |
US9549253B2 (en) | 2012-09-26 | 2017-01-17 | Foundation for Research and Technology—Hellas (FORTH) Institute of Computer Science (ICS) | Sound source localization and isolation apparatuses, methods and systems |
EP2717262A1 (en) * | 2012-10-05 | 2014-04-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding |
EP2733965A1 (en) | 2012-11-15 | 2014-05-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating a plurality of parametric audio streams and apparatus and method for generating a plurality of loudspeaker signals |
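CN109166588B (zh) * | 2013-01-15 | 2022-11-15 | Electronics and Telecommunications Research Institute (Korea) | Encoding/decoding apparatus and method for processing channel signals |
CN110223702B (zh) * | 2013-05-24 | 2023-04-11 | Dolby International AB | Audio decoding system and reconstruction method |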
GB2515089A (en) * | 2013-06-14 | 2014-12-17 | Nokia Corp | Audio Processing |
CN104244164A (zh) | 2013-06-18 | 2014-12-24 | Dolby Laboratories Licensing Corporation | Generating a surround sound field |
GB2521649B (en) * | 2013-12-27 | 2018-12-12 | Nokia Technologies Oy | Method, apparatus, computer program code and storage medium for processing audio signals |
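KR101468357B1 (ko) * | 2014-02-17 | 2014-12-03 | Inha University Industry-Academic Cooperation Foundation | CPU power management method for a transcoding server |
CN106228991B (zh) * | 2014-06-26 | 2019-08-20 | Huawei Technologies Co., Ltd. | Encoding/decoding method, apparatus and system |
CN105657633A (zh) | 2014-09-04 | 2016-06-08 | Dolby Laboratories Licensing Corporation | Generating metadata for audio objects |
RU2696952C2 (ru) * | 2014-10-01 | 2019-08-07 | Dolby International AB | Audio encoder and decoder |
TWI587286B (zh) * | 2014-10-31 | 2017-06-11 | Dolby International AB | Method and system for decoding and encoding audio signals, computer program product, and computer-readable medium |
CN107004421B (zh) * | 2014-10-31 | 2020-07-07 | Dolby International AB | Parametric encoding and decoding of multichannel audio signals |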
US9794721B2 (en) | 2015-01-30 | 2017-10-17 | Dts, Inc. | System and method for capturing, encoding, distributing, and decoding immersive audio |
CN105989852A (zh) | 2015-02-16 | 2016-10-05 | Dolby Laboratories Licensing Corporation | Separating audio sources |
US10176813B2 (en) | 2015-04-17 | 2019-01-08 | Dolby Laboratories Licensing Corporation | Audio encoding and rendering with discontinuity compensation |
EP3318070B1 (en) | 2015-07-02 | 2024-05-22 | Dolby Laboratories Licensing Corporation | Determining azimuth and elevation angles from stereo recordings |
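HK1255002A1 (zh) | 2015-07-02 | 2019-08-02 | Dolby Laboratories Licensing Corporation | Determining azimuth and elevation angles from stereo recordings |
KR102614577B1 (ko) | 2016-09-23 | 2023-12-18 | Samsung Electronics Co., Ltd. | Electronic device and control method thereof |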
EP3324407A1 (en) | 2016-11-17 | 2018-05-23 | Fraunhofer Gesellschaft zur Förderung der Angewand | Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic |
EP3324406A1 (en) | 2016-11-17 | 2018-05-23 | Fraunhofer Gesellschaft zur Förderung der Angewand | Apparatus and method for decomposing an audio signal using a variable threshold |
GB2559765A (en) | 2017-02-17 | 2018-08-22 | Nokia Technologies Oy | Two stage audio focus for spatial audio processing |
EP3392882A1 (en) * | 2017-04-20 | 2018-10-24 | Thomson Licensing | Method for processing an input audio signal and corresponding electronic device, non-transitory computer readable program product and computer readable storage medium |
US10893373B2 (en) * | 2017-05-09 | 2021-01-12 | Dolby Laboratories Licensing Corporation | Processing of a multi-channel spatial audio format input signal |
WO2018208560A1 (en) * | 2017-05-09 | 2018-11-15 | Dolby Laboratories Licensing Corporation | Processing of a multi-channel spatial audio format input signal |
CA3076703C (en) * | 2017-10-04 | 2024-01-02 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding |
US11328735B2 (en) * | 2017-11-10 | 2022-05-10 | Nokia Technologies Oy | Determination of spatial audio parameter encoding and associated decoding |
SG11202004389VA (en) * | 2017-11-17 | 2020-06-29 | Fraunhofer Ges Forschung | Apparatus and method for encoding or decoding directional audio coding parameters using quantization and entropy coding |
WO2019143867A1 (en) * | 2018-01-18 | 2019-07-25 | Dolby Laboratories Licensing Corporation | Methods and devices for coding soundfield representation signals |
EP3762923B1 (en) * | 2018-03-08 | 2024-07-10 | Nokia Technologies Oy | Audio coding |
US11315578B2 (en) | 2018-04-16 | 2022-04-26 | Dolby Laboratories Licensing Corporation | Methods, apparatus and systems for encoding and decoding of directional sound sources |
DE112019003358T5 (de) * | 2018-07-02 | 2021-03-25 | Dolby International Ab | Method and device for encoding and/or decoding immersive audio signals |
SG11202007627RA (en) * | 2018-10-08 | 2020-09-29 | Dolby Laboratories Licensing Corp | Transforming audio signals captured in different formats into a reduced number of formats for simplifying encoding and decoding operations |
BR112021007807A2 (pt) * | 2018-10-26 | 2021-07-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analyzer, similarity evaluator, audio encoder and decoder, format converter, renderer, methods and audio representation |
CN117809663A (zh) * | 2018-12-07 | 2024-04-02 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating a sound field description from a signal comprising at least two channels |
MX2021008616A (es) * | 2019-01-21 | 2021-10-13 | Fraunhofer Ges Forschung | Apparatus and method for encoding a spatial audio representation or apparatus and method for decoding an encoded audio signal using transport metadata and related computer programs. |
WO2020221431A1 (en) * | 2019-04-30 | 2020-11-05 | Huawei Technologies Co., Ltd. | Device and method for rendering a binaural audio signal |
WO2020249480A1 (en) * | 2019-06-12 | 2020-12-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Packet loss concealment for dirac based spatial audio coding |
CN110660401B (zh) * | 2019-09-02 | 2021-09-24 | Wuhan University | Audio object encoding/decoding method based on switching between high and low frequency-domain resolution |
GB2587196A (en) | 2019-09-13 | 2021-03-24 | Nokia Technologies Oy | Determination of spatial audio parameter encoding and associated decoding |
CN113450823B (zh) * | 2020-03-24 | 2022-10-28 | Hisense Visual Technology Co., Ltd. | Audio-based scene recognition method, apparatus, device and storage medium |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2354858A1 (en) * | 2001-08-08 | 2003-02-08 | Dspfactory Ltd. | Subband directional audio signal processing using an oversampled filterbank |
WO2003079330A1 (en) * | 2002-03-12 | 2003-09-25 | Dilithium Networks Pty Limited | Method for adaptive codebook pitch-lag computation in audio transcoders |
RU2335022C2 (ru) * | 2003-07-21 | 2008-09-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio file format conversion |
US20080260048A1 (en) * | 2004-02-16 | 2008-10-23 | Koninklijke Philips Electronics, N.V. | Transcoder and Method of Transcoding Therefore |
US7415117B2 (en) * | 2004-03-02 | 2008-08-19 | Microsoft Corporation | System and method for beamforming using a microphone array |
US20070250308A1 (en) * | 2004-08-31 | 2007-10-25 | Koninklijke Philips Electronics, N.V. | Method and device for transcoding |
FI20055261A0 (fi) | 2005-05-27 | 2005-05-27 | Midas Studios Avoin Yhtioe | Assembly of acoustic transducers, system and method for receiving or reproducing acoustic signals |
FI20055260A0 (fi) * | 2005-05-27 | 2005-05-27 | Midas Studios Avoin Yhtioe | Device, system and method for receiving or reproducing acoustic signals |
CN101238511B (zh) * | 2005-08-11 | 2011-09-07 | Asahi Kasei Corporation | Sound source separation device, speech recognition device, mobile telephone, sound source separation method |
US20080004729A1 (en) * | 2006-06-30 | 2008-01-03 | Nokia Corporation | Direct encoding into a directional audio coding format |
EP1890456B1 (en) * | 2006-08-15 | 2014-11-12 | Nero Ag | Apparatus for transcoding encoded content |
AU2007300813B2 (en) * | 2006-09-29 | 2010-10-14 | Lg Electronics Inc. | Methods and apparatuses for encoding and decoding object-based audio signals |
US9015051B2 (en) * | 2007-03-21 | 2015-04-21 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Reconstruction of audio channels with direction parameters indicating direction of origin |
US20080298610A1 (en) * | 2007-05-30 | 2008-12-04 | Nokia Corporation | Parameter Space Re-Panning for Spatial Audio |
US8509454B2 (en) * | 2007-11-01 | 2013-08-13 | Nokia Corporation | Focusing on a portion of an audio scene for an audio signal |
KR101415026B1 (ko) * | 2007-11-19 | 2014-07-04 | Samsung Electronics Co., Ltd. | Method and apparatus for acquiring multi-channel sound using a microphone array |
2009
- 2009-05-08 EP EP09006291A patent/EP2249334A1/en not_active Withdrawn
2010
- 2010-05-07 MX MX2011011788A patent/MX2011011788A/es active IP Right Grant
- 2010-05-07 EP EP10718175.2A patent/EP2427880B1/en active Active
- 2010-05-07 JP JP2012509049A patent/JP5400954B2/ja active Active
- 2010-05-07 ES ES10718175T patent/ES2426136T3/es active Active
- 2010-05-07 AU AU2010244393A patent/AU2010244393B2/en active Active
- 2010-05-07 CN CN2010800202893A patent/CN102422348B/zh active Active
- 2010-05-07 CA CA2761439A patent/CA2761439C/en active Active
- 2010-05-07 PL PL10718175T patent/PL2427880T3/pl unknown
- 2010-05-07 KR KR1020117027001A patent/KR101346026B1/ko active IP Right Grant
- 2010-05-07 RU RU2011145865/08A patent/RU2519295C2/ru active
- 2010-05-07 BR BRPI1007730A patent/BRPI1007730A2/pt active Search and Examination
- 2010-05-07 WO PCT/EP2010/056252 patent/WO2010128136A1/en active Application Filing
2011
- 2011-11-04 US US13/289,252 patent/US8891797B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
AU2010244393B2 (en) | 2013-02-14 |
CA2761439A1 (en) | 2010-11-11 |
RU2519295C2 (ru) | 2014-06-10 |
BRPI1007730A2 (pt) | 2018-03-06 |
CN102422348A (zh) | 2012-04-18 |
EP2427880B1 (en) | 2013-07-31 |
PL2427880T3 (pl) | 2014-01-31 |
EP2249334A1 (en) | 2010-11-10 |
US20120114126A1 (en) | 2012-05-10 |
AU2010244393A1 (en) | 2011-11-24 |
RU2011145865A (ru) | 2013-05-27 |
ES2426136T3 (es) | 2013-10-21 |
MX2011011788A (es) | 2011-11-29 |
KR20120013986A (ko) | 2012-02-15 |
KR101346026B1 (ko) | 2013-12-31 |
US8891797B2 (en) | 2014-11-18 |
JP2012526296A (ja) | 2012-10-25 |
WO2010128136A1 (en) | 2010-11-11 |
CA2761439C (en) | 2015-04-21 |
EP2427880A1 (en) | 2012-03-14 |
CN102422348B (zh) | 2013-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5400954B2 (ja) | Audio format transcoder | |
RU2759160C2 (ru) | Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC-based spatial audio coding | |
JP6086923B2 (ja) | Apparatus and method for merging geometry-based spatial audio coding streams | |
US9183839B2 (en) | Apparatus, method and computer program for providing a set of spatial cues on the basis of a microphone signal and apparatus for providing a two-channel audio signal and a set of spatial cues | |
KR101619578B1 (ko) | Apparatus and method for geometry-based spatial audio coding | |
AU2020210549B2 (en) | Apparatus and method for encoding a spatial audio representation or apparatus and method for decoding an encoded audio signal using transport metadata and related computer programs | |
AU2021357364B2 (en) | Apparatus, method, or computer program for processing an encoded audio scene using a parameter smoothing | |
RU2792050C2 (ru) | Apparatus and method for encoding a spatial audio representation or apparatus and method for decoding an encoded audio signal using transport metadata and related computer programs | |
AU2021357840B2 (en) | Apparatus, method, or computer program for processing an encoded audio scene using a bandwidth extension |
Legal Events
Code | Title | Description | Effective Date |
---|---|---|---|
A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 | 2013-01-29 |
A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 | 2013-04-26 |
TRDD | Decision of grant or rejection written | | |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 | 2013-10-08 |
A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 | 2013-10-25 |
R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150; Ref document number: 5400954; Country of ref document: JP | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 | |