RU2768224C1 - Двусторонняя медийная аналитика - Google Patents
Двусторонняя медийная аналитика Download PDFInfo
- Publication number
- RU2768224C1 RU2768224C1 RU2021116055A RU2021116055A RU2768224C1 RU 2768224 C1 RU2768224 C1 RU 2768224C1 RU 2021116055 A RU2021116055 A RU 2021116055A RU 2021116055 A RU2021116055 A RU 2021116055A RU 2768224 C1 RU2768224 C1 RU 2768224C1
- Authority
- RU
- Russia
- Prior art keywords
- content
- audio content
- classification information
- file
- type
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/65—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNPCT/CN2018/120923 | 2018-12-13 | ||
| CN2018120923 | 2018-12-13 | ||
| US201962792997P | 2019-01-16 | 2019-01-16 | |
| US62/792,997 | 2019-01-16 | ||
| EP19157080.3 | 2019-02-14 | ||
| EP19157080 | 2019-02-14 | ||
| PCT/US2019/065338 WO2020123424A1 (en) | 2018-12-13 | 2019-12-10 | Dual-ended media intelligence |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| RU2768224C1 true RU2768224C1 (ru) | 2022-03-23 |
Family
ID=69104844
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| RU2021116055A RU2768224C1 (ru) | 2018-12-13 | 2019-12-10 | Двусторонняя медийная аналитика |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US12469500B2 (https=) |
| EP (1) | EP3895164B1 (https=) |
| JP (2) | JP7455836B2 (https=) |
| KR (1) | KR20210102899A (https=) |
| CN (1) | CN113168839B (https=) |
| BR (1) | BR112021009667A2 (https=) |
| RU (1) | RU2768224C1 (https=) |
| WO (1) | WO2020123424A1 (https=) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2023539121A (ja) * | 2020-08-18 | 2023-09-13 | ドルビー ラボラトリーズ ライセンシング コーポレイション | オーディオコンテンツの識別 |
| WO2022115303A1 (en) | 2020-11-27 | 2022-06-02 | Dolby Laboratories Licensing Corporation | Automatic generation and selection of target profiles for dynamic equalization of audio content |
| CN115102931B (zh) * | 2022-05-20 | 2023-12-19 | 阿里巴巴(中国)有限公司 | 自适应调整音频延迟的方法及电子设备 |
| CN116723438A (zh) * | 2023-05-26 | 2023-09-08 | 三星电子(中国)研发中心 | 修正参数生成方法和装置 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120084089A1 (en) * | 2010-09-30 | 2012-04-05 | Google Inc. | Progressive encoding of audio |
| US20150088508A1 (en) * | 2013-09-25 | 2015-03-26 | Verizon Patent And Licensing Inc. | Training speech recognition using captions |
| US20170243596A1 (en) * | 2014-07-31 | 2017-08-24 | Dolby Laboratories Licensing Corporation | Audio Processing Systems and Methods |
| RU2639663C2 (ru) * | 2013-01-28 | 2017-12-21 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Способ и устройство для нормализованного проигрывания аудио медиаданных с вложенными метаданными громкости и без них на новых медиаустройствах |
| US20180182394A1 (en) * | 2016-11-30 | 2018-06-28 | Spotify Ab | Identification of taste attributes from an audio signal |
Family Cites Families (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6360234B2 (en) | 1997-08-14 | 2002-03-19 | Virage, Inc. | Video cataloger system with synchronized encoders |
| US6833865B1 (en) | 1998-09-01 | 2004-12-21 | Virage, Inc. | Embedded metadata engines in digital capture devices |
| CN1284104C (zh) | 2001-05-15 | 2006-11-08 | 皇家菲利浦电子有限公司 | 内容分析设备 |
| US7454331B2 (en) * | 2002-08-30 | 2008-11-18 | Dolby Laboratories Licensing Corporation | Controlling loudness of speech in signals that contain speech and other types of audio material |
| US7895138B2 (en) * | 2004-11-23 | 2011-02-22 | Koninklijke Philips Electronics N.V. | Device and a method to process audio data, a computer program element and computer-readable medium |
| JP4713396B2 (ja) | 2006-05-09 | 2011-06-29 | シャープ株式会社 | 映像音声再生装置、及びその音像移動方法 |
| US8121198B2 (en) | 2006-10-16 | 2012-02-21 | Microsoft Corporation | Embedding content-based searchable indexes in multimedia files |
| US7640272B2 (en) | 2006-12-07 | 2009-12-29 | Microsoft Corporation | Using automated content analysis for audio/video content consumption |
| CA2645915C (en) | 2007-02-14 | 2012-10-23 | Lg Electronics Inc. | Methods and apparatuses for encoding and decoding object-based audio signals |
| US20080208589A1 (en) * | 2007-02-27 | 2008-08-28 | Cross Charles W | Presenting Supplemental Content For Digital Media Using A Multimodal Application |
| US20100138890A1 (en) | 2007-05-07 | 2010-06-03 | Nxp B.V. | Device to allow content analysis in real time |
| EP2144230A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
| CN102089803B (zh) * | 2008-07-11 | 2013-02-27 | 弗劳恩霍夫应用研究促进协会 | 用以将信号的不同段分类的方法与鉴别器 |
| MX2012005723A (es) | 2009-12-07 | 2012-06-13 | Dolby Lab Licensing Corp | Metodo y aparato para decodificar un cuadrado de una señal de audio digital codificada y medio de almacenamiento que graba un programa de instrucciones. |
| TWI581250B (zh) * | 2010-12-03 | 2017-05-01 | 杜比實驗室特許公司 | 利用多媒體處理節點之適應性處理技術 |
| KR102185941B1 (ko) * | 2011-07-01 | 2020-12-03 | 돌비 레버러토리즈 라이쎈싱 코오포레이션 | 적응형 오디오 신호 생성, 코딩 및 렌더링을 위한 시스템 및 방법 |
| US20140056430A1 (en) * | 2012-08-21 | 2014-02-27 | Electronics And Telecommunications Research Institute | System and method for reproducing wave field using sound bar |
| US9805725B2 (en) | 2012-12-21 | 2017-10-31 | Dolby Laboratories Licensing Corporation | Object clustering for rendering object-based audio content based on perceptual criteria |
| JP6041789B2 (ja) | 2013-01-03 | 2016-12-14 | 三菱電機株式会社 | 入力信号を符号化する方法 |
| CN112652316B (zh) | 2013-01-21 | 2023-09-15 | 杜比实验室特许公司 | 利用响度处理状态元数据的音频编码器和解码器 |
| US9609452B2 (en) | 2013-02-08 | 2017-03-28 | Qualcomm Incorporated | Obtaining sparseness information for higher order ambisonic audio renderers |
| US8903186B2 (en) | 2013-02-28 | 2014-12-02 | Facebook, Inc. | Methods and systems for differentiating synthetic and non-synthetic images |
| CN104080024B (zh) * | 2013-03-26 | 2019-02-19 | 杜比实验室特许公司 | 音量校平器控制器和控制方法以及音频分类器 |
| CN104078050A (zh) * | 2013-03-26 | 2014-10-01 | 杜比实验室特许公司 | 用于音频分类和音频处理的设备和方法 |
| US9559651B2 (en) * | 2013-03-29 | 2017-01-31 | Apple Inc. | Metadata for loudness and dynamic range control |
| TWM487509U (zh) | 2013-06-19 | 2014-10-01 | 杜比實驗室特許公司 | 音訊處理設備及電子裝置 |
| US10110911B2 (en) | 2014-11-11 | 2018-10-23 | Cisco Technology, Inc. | Parallel media encoding |
| US10834436B2 (en) | 2015-05-27 | 2020-11-10 | Arris Enterprises Llc | Video classification using user behavior from a network digital video recorder |
| US9837086B2 (en) * | 2015-07-31 | 2017-12-05 | Apple Inc. | Encoded audio extended metadata-based dynamic range control |
| US9934790B2 (en) | 2015-07-31 | 2018-04-03 | Apple Inc. | Encoded audio metadata-based equalization |
| JP7086521B2 (ja) | 2017-02-27 | 2022-06-20 | ヤマハ株式会社 | 情報処理方法および情報処理装置 |
-
2019
- 2019-12-10 KR KR1020217017682A patent/KR20210102899A/ko not_active Withdrawn
- 2019-12-10 US US17/312,011 patent/US12469500B2/en active Active
- 2019-12-10 BR BR112021009667-1A patent/BR112021009667A2/pt unknown
- 2019-12-10 WO PCT/US2019/065338 patent/WO2020123424A1/en not_active Ceased
- 2019-12-10 CN CN201980080866.9A patent/CN113168839B/zh active Active
- 2019-12-10 RU RU2021116055A patent/RU2768224C1/ru active
- 2019-12-10 JP JP2021532235A patent/JP7455836B2/ja active Active
- 2019-12-10 EP EP19831966.7A patent/EP3895164B1/en active Active
-
2024
- 2024-03-13 JP JP2024038518A patent/JP2024081674A/ja active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120084089A1 (en) * | 2010-09-30 | 2012-04-05 | Google Inc. | Progressive encoding of audio |
| RU2639663C2 (ru) * | 2013-01-28 | 2017-12-21 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Способ и устройство для нормализованного проигрывания аудио медиаданных с вложенными метаданными громкости и без них на новых медиаустройствах |
| US20150088508A1 (en) * | 2013-09-25 | 2015-03-26 | Verizon Patent And Licensing Inc. | Training speech recognition using captions |
| US20170243596A1 (en) * | 2014-07-31 | 2017-08-24 | Dolby Laboratories Licensing Corporation | Audio Processing Systems and Methods |
| US20180182394A1 (en) * | 2016-11-30 | 2018-06-28 | Spotify Ab | Identification of taste attributes from an audio signal |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2020123424A1 (en) | 2020-06-18 |
| JP7455836B2 (ja) | 2024-03-26 |
| US12469500B2 (en) | 2025-11-11 |
| CN113168839B (zh) | 2024-01-23 |
| EP3895164B1 (en) | 2022-09-07 |
| KR20210102899A (ko) | 2021-08-20 |
| EP3895164A1 (en) | 2021-10-20 |
| US20220059102A1 (en) | 2022-02-24 |
| BR112021009667A2 (pt) | 2021-08-17 |
| CN113168839A (zh) | 2021-07-23 |
| JP2022513184A (ja) | 2022-02-07 |
| JP2024081674A (ja) | 2024-06-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| RU2768224C1 (ru) | Двусторонняя медийная аналитика | |
| KR101100221B1 (ko) | 오디오 신호의 디코딩 방법 및 그 장치 | |
| KR102686742B1 (ko) | 객체 기반 오디오 신호 균형화 | |
| KR101761041B1 (ko) | 음량 및 동적 범위 제어에 대한 메타데이터 | |
| JP5001384B2 (ja) | オーディオ信号の処理方法及び装置 | |
| CN110890101B (zh) | 用于基于语音增强元数据进行解码的方法和设备 | |
| CN108369810B (zh) | 用于对多声道音频信号进行编码的自适应声道缩减处理 | |
| US8620008B2 (en) | Method and an apparatus for processing an audio signal | |
| AU2011305913B2 (en) | Audio stream mixing with dialog level normalization | |
| CN105814630A (zh) | 用于音频设备的组合动态范围压缩和引导截断防止的构思 | |
| MX2012005781A (es) | Aparato para proporcionar una representacion de señal de mezcla ascendente con base en la representacion de señal de mezcla descendente, aparato para proporcionar un flujo de bits que representa una señal de audio multicanal, metodos, programas informaticos y flujo de bits que representan una señal de audio multicanal usando un parametro de combinacion lineal. | |
| WO2009093867A2 (en) | A method and an apparatus for processing audio signal | |
| CN101479786A (zh) | 用于编码和解码基于对象的音频信号的方法和装置 | |
| WO2009093866A2 (en) | A method and an apparatus for processing an audio signal | |
| CA2712941A1 (en) | A method and an apparatus for processing an audio signal | |
| RU2455708C2 (ru) | Способы и устройства кодирования и декодирования объектно-ориентированных аудиосигналов | |
| US11463833B2 (en) | Method and apparatus for voice or sound activity detection for spatial audio | |
| KR20090110234A (ko) | 오디오 신호 처리 방법 및 이의 장치 | |
| HK40126637A (zh) | 基於对象的音频编解码器中不连续传输的方法和设备 |