KR20210102899A - 이중 종단 미디어 인텔리전스 - Google Patents

이중 종단 미디어 인텔리전스 Download PDF

Info

Publication number
KR20210102899A
KR20210102899A KR1020217017682A KR20217017682A KR20210102899A KR 20210102899 A KR20210102899 A KR 20210102899A KR 1020217017682 A KR1020217017682 A KR 1020217017682A KR 20217017682 A KR20217017682 A KR 20217017682A KR 20210102899 A KR20210102899 A KR 20210102899A
Authority
KR
South Korea
Prior art keywords
content
audio content
classification information
file
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
KR1020217017682A
Other languages
English (en)
Korean (ko)
Inventor
야닝 바이
마크 윌리엄 제라드
리차드 한
마틴 월터스
Original Assignee
돌비 레버러토리즈 라이쎈싱 코오포레이션
돌비 인터네셔널 에이비
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 돌비 레버러토리즈 라이쎈싱 코오포레이션, 돌비 인터네셔널 에이비 filed Critical 돌비 레버러토리즈 라이쎈싱 코오포레이션
Publication of KR20210102899A publication Critical patent/KR20210102899A/ko
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/65Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
KR1020217017682A 2018-12-13 2019-12-10 이중 종단 미디어 인텔리전스 Withdrawn KR20210102899A (ko)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
CN2018120923 2018-12-13
CNPCT/CN2018/120923 2018-12-13
US201962792997P 2019-01-16 2019-01-16
US62/792,997 2019-01-16
EP19157080 2019-02-14
EP19157080.3 2019-02-14
PCT/US2019/065338 WO2020123424A1 (en) 2018-12-13 2019-12-10 Dual-ended media intelligence

Publications (1)

Publication Number Publication Date
KR20210102899A true KR20210102899A (ko) 2021-08-20

Family

ID=69104844

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020217017682A Withdrawn KR20210102899A (ko) 2018-12-13 2019-12-10 이중 종단 미디어 인텔리전스

Country Status (8)

Country Link
US (1) US12469500B2 (https=)
EP (1) EP3895164B1 (https=)
JP (2) JP7455836B2 (https=)
KR (1) KR20210102899A (https=)
CN (1) CN113168839B (https=)
BR (1) BR112021009667A2 (https=)
RU (1) RU2768224C1 (https=)
WO (1) WO2020123424A1 (https=)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4200845B1 (en) 2020-08-18 2025-05-07 Dolby Laboratories Licensing Corporation Audio content identification
WO2022115303A1 (en) 2020-11-27 2022-06-02 Dolby Laboratories Licensing Corporation Automatic generation and selection of target profiles for dynamic equalization of audio content
CN115102931B (zh) * 2022-05-20 2023-12-19 阿里巴巴(中国)有限公司 自适应调整音频延迟的方法及电子设备
CN116723438A (zh) * 2023-05-26 2023-09-08 三星电子(中国)研发中心 修正参数生成方法和装置

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6360234B2 (en) 1997-08-14 2002-03-19 Virage, Inc. Video cataloger system with synchronized encoders
US6833865B1 (en) 1998-09-01 2004-12-21 Virage, Inc. Embedded metadata engines in digital capture devices
KR20030016406A (ko) 2001-05-15 2003-02-26 코닌클리케 필립스 일렉트로닉스 엔.브이. 콘텐트 분석 장치
US7454331B2 (en) * 2002-08-30 2008-11-18 Dolby Laboratories Licensing Corporation Controlling loudness of speech in signals that contain speech and other types of audio material
CN101065988B (zh) * 2004-11-23 2011-03-02 皇家飞利浦电子股份有限公司 处理音频数据的设备和方法
JP4713396B2 (ja) 2006-05-09 2011-06-29 シャープ株式会社 映像音声再生装置、及びその音像移動方法
US8121198B2 (en) 2006-10-16 2012-02-21 Microsoft Corporation Embedding content-based searchable indexes in multimedia files
US7640272B2 (en) 2006-12-07 2009-12-29 Microsoft Corporation Using automated content analysis for audio/video content consumption
EP2111617B1 (en) * 2007-02-14 2013-09-04 LG Electronics Inc. Audio decoding method and corresponding apparatus
US20080208589A1 (en) * 2007-02-27 2008-08-28 Cross Charles W Presenting Supplemental Content For Digital Media Using A Multimodal Application
US20100138890A1 (en) 2007-05-07 2010-06-03 Nxp B.V. Device to allow content analysis in real time
RU2507609C2 (ru) 2008-07-11 2014-02-20 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Способ и дискриминатор для классификации различных сегментов сигнала
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
WO2011071610A1 (en) 2009-12-07 2011-06-16 Dolby Laboratories Licensing Corporation Decoding of multichannel aufio encoded bit streams using adaptive hybrid transformation
US8965545B2 (en) * 2010-09-30 2015-02-24 Google Inc. Progressive encoding of audio
TWI896112B (zh) * 2010-12-03 2025-09-01 美商杜比實驗室特許公司 音頻解碼裝置、音頻解碼方法及音頻編碼方法
MY207992A (en) * 2011-07-01 2025-04-03 Dolby Laboratories Licensing Corp System and method for adaptive audio signal generation, coding and rendering
US20140056430A1 (en) * 2012-08-21 2014-02-27 Electronics And Telecommunications Research Institute System and method for reproducing wave field using sound bar
US9805725B2 (en) 2012-12-21 2017-10-31 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
JP6041789B2 (ja) 2013-01-03 2016-12-14 三菱電機株式会社 入力信号を符号化する方法
TWM467148U (zh) 2013-01-21 2013-12-01 Dolby Lab Licensing Corp 具響度處理狀態詮釋資料之音訊處理設備
CA2898567C (en) * 2013-01-28 2018-09-18 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Method and apparatus for normalized audio playback of media with and without embedded loudness metadata on new media devices
US9609452B2 (en) 2013-02-08 2017-03-28 Qualcomm Incorporated Obtaining sparseness information for higher order ambisonic audio renderers
US8903186B2 (en) 2013-02-28 2014-12-02 Facebook, Inc. Methods and systems for differentiating synthetic and non-synthetic images
CN104078050A (zh) * 2013-03-26 2014-10-01 杜比实验室特许公司 用于音频分类和音频处理的设备和方法
CN107093991B (zh) * 2013-03-26 2020-10-09 杜比实验室特许公司 基于目标响度的响度归一化方法和设备
US9559651B2 (en) 2013-03-29 2017-01-31 Apple Inc. Metadata for loudness and dynamic range control
TWM487509U (zh) 2013-06-19 2014-10-01 杜比實驗室特許公司 音訊處理設備及電子裝置
US9418650B2 (en) 2013-09-25 2016-08-16 Verizon Patent And Licensing Inc. Training speech recognition using captions
EP3175446B1 (en) 2014-07-31 2019-06-19 Dolby Laboratories Licensing Corporation Audio processing systems and methods
US10110911B2 (en) 2014-11-11 2018-10-23 Cisco Technology, Inc. Parallel media encoding
US10834436B2 (en) 2015-05-27 2020-11-10 Arris Enterprises Llc Video classification using user behavior from a network digital video recorder
US9837086B2 (en) * 2015-07-31 2017-12-05 Apple Inc. Encoded audio extended metadata-based dynamic range control
US9934790B2 (en) 2015-07-31 2018-04-03 Apple Inc. Encoded audio metadata-based equalization
US9934785B1 (en) 2016-11-30 2018-04-03 Spotify Ab Identification of taste attributes from an audio signal
JP7086521B2 (ja) 2017-02-27 2022-06-20 ヤマハ株式会社 情報処理方法および情報処理装置

Also Published As

Publication number Publication date
EP3895164A1 (en) 2021-10-20
CN113168839B (zh) 2024-01-23
US20220059102A1 (en) 2022-02-24
US12469500B2 (en) 2025-11-11
RU2768224C1 (ru) 2022-03-23
EP3895164B1 (en) 2022-09-07
JP2022513184A (ja) 2022-02-07
JP2024081674A (ja) 2024-06-18
WO2020123424A1 (en) 2020-06-18
CN113168839A (zh) 2021-07-23
JP7455836B2 (ja) 2024-03-26
BR112021009667A2 (pt) 2021-08-17

Similar Documents

Publication Publication Date Title
KR102686742B1 (ko) 객체 기반 오디오 신호 균형화
JP2024081674A (ja) デュアルエンドのメディア・インテリジェンス
CN102768835B (zh) 用于编码和解码具有各种声道的多对象音频信号的设备和方法
JP5001384B2 (ja) オーディオ信号の処理方法及び装置
KR101049144B1 (ko) 오디오 신호 처리방법 및 장치
US8620008B2 (en) Method and an apparatus for processing an audio signal
US11096002B2 (en) Energy-ratio signalling and synthesis
CN105637582A (zh) 音频编码装置及音频解码装置
CN114175151A (zh) Ivas比特流的编码和解码
US11463833B2 (en) Method and apparatus for voice or sound activity detection for spatial audio
CN106104684A (zh) 多通道音频信号分类器
WO2009075511A1 (en) A method and an apparatus for processing a signal
KR100740807B1 (ko) 공간정보기반 오디오 부호화에서의 공간정보 추출 방법
HK40126637A (zh) 基於对象的音频编解码器中不连续传输的方法和设备

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

A201 Request for examination
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

D21 Rejection of application intended

Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D21-EXM-PE0902 (AS PROVIDED BY THE NATIONAL OFFICE)

PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

T11 Administrative time limit extension requested

Free format text: ST27 STATUS EVENT CODE: U-3-3-T10-T11-OTH-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

T11-X000 Administrative time limit extension requested

St.27 status event code: U-3-3-T10-T11-oth-X000

T11 Administrative time limit extension requested

Free format text: ST27 STATUS EVENT CODE: U-3-3-T10-T11-OTH-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

T11-X000 Administrative time limit extension requested

St.27 status event code: U-3-3-T10-T11-oth-X000

B11 Application withdrawn

Free format text: ST27 STATUS EVENT CODE: N-1-6-B10-B11-NAP-PC1202 (AS PROVIDED BY THE NATIONAL OFFICE)

PC1202 Submission of document of withdrawal before decision of registration

St.27 status event code: N-1-6-B10-B11-nap-PC1202