RU2768224C1 - Двусторонняя медийная аналитика - Google Patents

Двусторонняя медийная аналитика Download PDF

Info

Publication number
RU2768224C1
RU2768224C1 RU2021116055A RU2021116055A RU2768224C1 RU 2768224 C1 RU2768224 C1 RU 2768224C1 RU 2021116055 A RU2021116055 A RU 2021116055A RU 2021116055 A RU2021116055 A RU 2021116055A RU 2768224 C1 RU2768224 C1 RU 2768224C1
Authority
RU
Russia
Prior art keywords
content
audio content
classification information
file
type
Prior art date
Application number
RU2021116055A
Other languages
English (en)
Russian (ru)
Inventor
Яньнин БАЙ
Марк Уильям ДЖЕРРАРД
Ричард ХАНЬ
Мартин УОЛТЕРС
Original Assignee
Долби Лабораторис Лайсэнзин Корпорейшн
Долби Интернешнл Аб
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Долби Лабораторис Лайсэнзин Корпорейшн, Долби Интернешнл Аб filed Critical Долби Лабораторис Лайсэнзин Корпорейшн
Application granted granted Critical
Publication of RU2768224C1 publication Critical patent/RU2768224C1/ru

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/65Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
RU2021116055A 2018-12-13 2019-12-10 Двусторонняя медийная аналитика RU2768224C1 (ru)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
CNPCT/CN2018/120923 2018-12-13
CN2018120923 2018-12-13
US201962792997P 2019-01-16 2019-01-16
US62/792,997 2019-01-16
EP19157080.3 2019-02-14
EP19157080 2019-02-14
PCT/US2019/065338 WO2020123424A1 (en) 2018-12-13 2019-12-10 Dual-ended media intelligence

Publications (1)

Publication Number Publication Date
RU2768224C1 true RU2768224C1 (ru) 2022-03-23

Family

ID=69104844

Family Applications (1)

Application Number Title Priority Date Filing Date
RU2021116055A RU2768224C1 (ru) 2018-12-13 2019-12-10 Двусторонняя медийная аналитика

Country Status (8)

Country Link
US (1) US12469500B2 (https=)
EP (1) EP3895164B1 (https=)
JP (2) JP7455836B2 (https=)
KR (1) KR20210102899A (https=)
CN (1) CN113168839B (https=)
BR (1) BR112021009667A2 (https=)
RU (1) RU2768224C1 (https=)
WO (1) WO2020123424A1 (https=)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2023539121A (ja) * 2020-08-18 2023-09-13 ドルビー ラボラトリーズ ライセンシング コーポレイション オーディオコンテンツの識別
WO2022115303A1 (en) 2020-11-27 2022-06-02 Dolby Laboratories Licensing Corporation Automatic generation and selection of target profiles for dynamic equalization of audio content
CN115102931B (zh) * 2022-05-20 2023-12-19 阿里巴巴(中国)有限公司 自适应调整音频延迟的方法及电子设备
CN116723438A (zh) * 2023-05-26 2023-09-08 三星电子(中国)研发中心 修正参数生成方法和装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120084089A1 (en) * 2010-09-30 2012-04-05 Google Inc. Progressive encoding of audio
US20150088508A1 (en) * 2013-09-25 2015-03-26 Verizon Patent And Licensing Inc. Training speech recognition using captions
US20170243596A1 (en) * 2014-07-31 2017-08-24 Dolby Laboratories Licensing Corporation Audio Processing Systems and Methods
RU2639663C2 (ru) * 2013-01-28 2017-12-21 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Способ и устройство для нормализованного проигрывания аудио медиаданных с вложенными метаданными громкости и без них на новых медиаустройствах
US20180182394A1 (en) * 2016-11-30 2018-06-28 Spotify Ab Identification of taste attributes from an audio signal

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6360234B2 (en) 1997-08-14 2002-03-19 Virage, Inc. Video cataloger system with synchronized encoders
US6833865B1 (en) 1998-09-01 2004-12-21 Virage, Inc. Embedded metadata engines in digital capture devices
CN1284104C (zh) 2001-05-15 2006-11-08 皇家菲利浦电子有限公司 内容分析设备
US7454331B2 (en) * 2002-08-30 2008-11-18 Dolby Laboratories Licensing Corporation Controlling loudness of speech in signals that contain speech and other types of audio material
US7895138B2 (en) * 2004-11-23 2011-02-22 Koninklijke Philips Electronics N.V. Device and a method to process audio data, a computer program element and computer-readable medium
JP4713396B2 (ja) 2006-05-09 2011-06-29 シャープ株式会社 映像音声再生装置、及びその音像移動方法
US8121198B2 (en) 2006-10-16 2012-02-21 Microsoft Corporation Embedding content-based searchable indexes in multimedia files
US7640272B2 (en) 2006-12-07 2009-12-29 Microsoft Corporation Using automated content analysis for audio/video content consumption
CA2645915C (en) 2007-02-14 2012-10-23 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
US20080208589A1 (en) * 2007-02-27 2008-08-28 Cross Charles W Presenting Supplemental Content For Digital Media Using A Multimodal Application
US20100138890A1 (en) 2007-05-07 2010-06-03 Nxp B.V. Device to allow content analysis in real time
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
CN102089803B (zh) * 2008-07-11 2013-02-27 弗劳恩霍夫应用研究促进协会 用以将信号的不同段分类的方法与鉴别器
MX2012005723A (es) 2009-12-07 2012-06-13 Dolby Lab Licensing Corp Metodo y aparato para decodificar un cuadrado de una señal de audio digital codificada y medio de almacenamiento que graba un programa de instrucciones.
TWI581250B (zh) * 2010-12-03 2017-05-01 杜比實驗室特許公司 利用多媒體處理節點之適應性處理技術
KR102185941B1 (ko) * 2011-07-01 2020-12-03 돌비 레버러토리즈 라이쎈싱 코오포레이션 적응형 오디오 신호 생성, 코딩 및 렌더링을 위한 시스템 및 방법
US20140056430A1 (en) * 2012-08-21 2014-02-27 Electronics And Telecommunications Research Institute System and method for reproducing wave field using sound bar
US9805725B2 (en) 2012-12-21 2017-10-31 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
JP6041789B2 (ja) 2013-01-03 2016-12-14 三菱電機株式会社 入力信号を符号化する方法
CN112652316B (zh) 2013-01-21 2023-09-15 杜比实验室特许公司 利用响度处理状态元数据的音频编码器和解码器
US9609452B2 (en) 2013-02-08 2017-03-28 Qualcomm Incorporated Obtaining sparseness information for higher order ambisonic audio renderers
US8903186B2 (en) 2013-02-28 2014-12-02 Facebook, Inc. Methods and systems for differentiating synthetic and non-synthetic images
CN104080024B (zh) * 2013-03-26 2019-02-19 杜比实验室特许公司 音量校平器控制器和控制方法以及音频分类器
CN104078050A (zh) * 2013-03-26 2014-10-01 杜比实验室特许公司 用于音频分类和音频处理的设备和方法
US9559651B2 (en) * 2013-03-29 2017-01-31 Apple Inc. Metadata for loudness and dynamic range control
TWM487509U (zh) 2013-06-19 2014-10-01 杜比實驗室特許公司 音訊處理設備及電子裝置
US10110911B2 (en) 2014-11-11 2018-10-23 Cisco Technology, Inc. Parallel media encoding
US10834436B2 (en) 2015-05-27 2020-11-10 Arris Enterprises Llc Video classification using user behavior from a network digital video recorder
US9837086B2 (en) * 2015-07-31 2017-12-05 Apple Inc. Encoded audio extended metadata-based dynamic range control
US9934790B2 (en) 2015-07-31 2018-04-03 Apple Inc. Encoded audio metadata-based equalization
JP7086521B2 (ja) 2017-02-27 2022-06-20 ヤマハ株式会社 情報処理方法および情報処理装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120084089A1 (en) * 2010-09-30 2012-04-05 Google Inc. Progressive encoding of audio
RU2639663C2 (ru) * 2013-01-28 2017-12-21 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Способ и устройство для нормализованного проигрывания аудио медиаданных с вложенными метаданными громкости и без них на новых медиаустройствах
US20150088508A1 (en) * 2013-09-25 2015-03-26 Verizon Patent And Licensing Inc. Training speech recognition using captions
US20170243596A1 (en) * 2014-07-31 2017-08-24 Dolby Laboratories Licensing Corporation Audio Processing Systems and Methods
US20180182394A1 (en) * 2016-11-30 2018-06-28 Spotify Ab Identification of taste attributes from an audio signal

Also Published As

Publication number Publication date
WO2020123424A1 (en) 2020-06-18
JP7455836B2 (ja) 2024-03-26
US12469500B2 (en) 2025-11-11
CN113168839B (zh) 2024-01-23
EP3895164B1 (en) 2022-09-07
KR20210102899A (ko) 2021-08-20
EP3895164A1 (en) 2021-10-20
US20220059102A1 (en) 2022-02-24
BR112021009667A2 (pt) 2021-08-17
CN113168839A (zh) 2021-07-23
JP2022513184A (ja) 2022-02-07
JP2024081674A (ja) 2024-06-18

Similar Documents

Publication Publication Date Title
RU2768224C1 (ru) Двусторонняя медийная аналитика
KR101100221B1 (ko) 오디오 신호의 디코딩 방법 및 그 장치
KR102686742B1 (ko) 객체 기반 오디오 신호 균형화
KR101761041B1 (ko) 음량 및 동적 범위 제어에 대한 메타데이터
JP5001384B2 (ja) オーディオ信号の処理方法及び装置
CN110890101B (zh) 用于基于语音增强元数据进行解码的方法和设备
CN108369810B (zh) 用于对多声道音频信号进行编码的自适应声道缩减处理
US8620008B2 (en) Method and an apparatus for processing an audio signal
AU2011305913B2 (en) Audio stream mixing with dialog level normalization
CN105814630A (zh) 用于音频设备的组合动态范围压缩和引导截断防止的构思
MX2012005781A (es) Aparato para proporcionar una representacion de señal de mezcla ascendente con base en la representacion de señal de mezcla descendente, aparato para proporcionar un flujo de bits que representa una señal de audio multicanal, metodos, programas informaticos y flujo de bits que representan una señal de audio multicanal usando un parametro de combinacion lineal.
WO2009093867A2 (en) A method and an apparatus for processing audio signal
CN101479786A (zh) 用于编码和解码基于对象的音频信号的方法和装置
WO2009093866A2 (en) A method and an apparatus for processing an audio signal
CA2712941A1 (en) A method and an apparatus for processing an audio signal
RU2455708C2 (ru) Способы и устройства кодирования и декодирования объектно-ориентированных аудиосигналов
US11463833B2 (en) Method and apparatus for voice or sound activity detection for spatial audio
KR20090110234A (ko) 오디오 신호 처리 방법 및 이의 장치
HK40126637A (zh) 基於对象的音频编解码器中不连续传输的方法和设备