PL4035402T3 - Wygładzanie metadanych audio - Google Patents

Wygładzanie metadanych audio

Info

Publication number
PL4035402T3
PL4035402T3 PL20789319.9T PL20789319T PL4035402T3 PL 4035402 T3 PL4035402 T3 PL 4035402T3 PL 20789319 T PL20789319 T PL 20789319T PL 4035402 T3 PL4035402 T3 PL 4035402T3
Authority
PL
Poland
Prior art keywords
smoothing
audio metadata
metadata
audio
metadata smoothing
Prior art date
Application number
PL20789319.9T
Other languages
English (en)
Inventor
Weiguo Zheng
Rex Ching
Weibo Ni
Kensuke Miyagi
Sean Munday
Teresa Tao
Original Assignee
Netflix, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netflix, Inc. filed Critical Netflix, Inc.
Publication of PL4035402T3 publication Critical patent/PL4035402T3/pl

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23611Insertion of stuffing data into a multiplex stream, e.g. to obtain a constant bitrate
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0356Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for synchronising with other signals, e.g. video signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/055Time compression or expansion for synchronising with other signals, e.g. video signals
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44016Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84Generation or processing of descriptive data, e.g. content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Diaphragms For Electromechanical Transducers (AREA)
  • Amplifiers (AREA)
  • Stereophonic System (AREA)
PL20789319.9T 2019-09-23 2020-09-22 Wygładzanie metadanych audio PL4035402T3 (pl)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201962904542P 2019-09-23 2019-09-23
US15/931,442 US11416208B2 (en) 2019-09-23 2020-05-13 Audio metadata smoothing
PCT/US2020/052017 WO2021061656A1 (en) 2019-09-23 2020-09-22 Audio metadata smoothing

Publications (1)

Publication Number Publication Date
PL4035402T3 true PL4035402T3 (pl) 2024-08-26

Family

ID=74880856

Family Applications (1)

Application Number Title Priority Date Filing Date
PL20789319.9T PL4035402T3 (pl) 2019-09-23 2020-09-22 Wygładzanie metadanych audio

Country Status (7)

Country Link
US (1) US11416208B2 (pl)
EP (1) EP4035402B1 (pl)
AU (1) AU2020352977B2 (pl)
BR (1) BR112022005474A2 (pl)
MX (1) MX2022002587A (pl)
PL (1) PL4035402T3 (pl)
WO (1) WO2021061656A1 (pl)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7375002B2 (ja) * 2019-05-14 2023-11-07 AlphaTheta株式会社 音響装置および楽曲再生プログラム
US11758206B1 (en) * 2021-03-12 2023-09-12 Amazon Technologies, Inc. Encoding media content for playback compatibility
CN116959503B (zh) * 2023-07-25 2024-09-10 腾讯科技(深圳)有限公司 滑音音频的模拟方法、装置和存储介质及电子设备
CN119071717B (zh) * 2024-08-10 2025-04-11 汇智声科技(惠州)有限公司 一种数字扬声器的动态音频处理方法及系统

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101479969B (zh) * 2006-06-26 2012-07-04 Nxp股份有限公司 数据封装的方法和设备
US9183885B2 (en) * 2008-05-30 2015-11-10 Echostar Technologies L.L.C. User-initiated control of an audio/video stream to skip interstitial content between program segments
US8326127B2 (en) * 2009-01-30 2012-12-04 Echostar Technologies L.L.C. Methods and apparatus for identifying portions of a video stream based on characteristics of the video stream
US8422699B2 (en) * 2009-04-17 2013-04-16 Linear Acoustic, Inc. Loudness consistency at program boundaries
KR101805212B1 (ko) * 2009-08-14 2017-12-05 디티에스 엘엘씨 객체-지향 오디오 스트리밍 시스템
US8428936B2 (en) * 2010-03-05 2013-04-23 Motorola Mobility Llc Decoder for audio signal including generic audio and speech frames
US9026450B2 (en) * 2011-03-09 2015-05-05 Dts Llc System for dynamically creating and rendering audio objects
US8924580B2 (en) * 2011-08-12 2014-12-30 Cisco Technology, Inc. Constant-quality rate-adaptive streaming
US8990849B2 (en) * 2012-02-14 2015-03-24 Verizon Patent And Licensing Inc. Advertisement insertion into media content for streaming
US8813120B1 (en) * 2013-03-15 2014-08-19 Google Inc. Interstitial audio control
US20140275851A1 (en) * 2013-03-15 2014-09-18 eagleyemed, Inc. Multi-site data sharing platform
US20150199968A1 (en) * 2014-01-16 2015-07-16 CloudCar Inc. Audio stream manipulation for an in-vehicle infotainment system
KR102355472B1 (ko) * 2014-09-12 2022-01-26 소니그룹주식회사 송신 장치, 송신 방법, 수신 장치 및 수신 방법
RU2708942C2 (ru) * 2014-10-24 2019-12-12 Долби Интернешнл Аб Кодирование и декодирование аудиосигналов
ES2733858T3 (es) * 2015-03-09 2019-12-03 Fraunhofer Ges Forschung Codificación de audio alineada por fragmentos
GB2573597B8 (en) * 2015-06-22 2025-08-06 Time Machine Capital Ltd Auditory augmentation system
US10341770B2 (en) * 2015-09-30 2019-07-02 Apple Inc. Encoded audio metadata-based loudness equalization and dynamic equalization during DRC
US10509622B2 (en) * 2015-10-27 2019-12-17 Super Hi-Fi, Llc Audio content production, audio sequencing, and audio blending system and method
EP3185570A1 (en) * 2015-12-22 2017-06-28 Thomson Licensing Method and apparatus for transmission-based smoothing of rendering
US9880803B2 (en) * 2016-04-06 2018-01-30 International Business Machines Corporation Audio buffering continuity
US11183147B2 (en) * 2016-10-07 2021-11-23 Sony Semiconductor Solutions Corporation Device and method for processing video content for display control
GB2557970B (en) * 2016-12-20 2020-12-09 Mashtraxx Ltd Content tracking system and method

Also Published As

Publication number Publication date
BR112022005474A2 (pt) 2022-06-14
WO2021061656A1 (en) 2021-04-01
AU2020352977A1 (en) 2022-02-24
EP4035402B1 (en) 2024-05-01
US11416208B2 (en) 2022-08-16
MX2022002587A (es) 2022-03-22
CA3147190A1 (en) 2021-04-01
AU2020352977B2 (en) 2023-06-01
EP4035402A1 (en) 2022-08-03
US20210089259A1 (en) 2021-03-25

Similar Documents

Publication Publication Date Title
CA197843S (en) Audio eyeglasses
CA185920S (en) Audio eyeglasses
GB2596003B (en) Audio processing
CA185332S (en) Headphone
CA185101S (en) Headphone
CA184601S (en) Loudspeaker
GB2591355B (en) Audio circuitry
SG11202100960RA (en) Transcription factor profiling
PL4035402T3 (pl) Wygładzanie metadanych audio
GB201913726D0 (en) Audio processing
CA188026S (en) Loudspeaker
GB201907601D0 (en) Audio processing
GB201818690D0 (en) Audio processing
AU201815744S (en) Loudspeaker
CA205889S (en) Audio processor
GB201907570D0 (en) Audio processing
GB201803408D0 (en) Audio processing
GB2593136B (en) Rendering audio
GB2573888B (en) Loudspeaker
PL3935630T3 (pl) Miksowanie w dół (downmixing) audio
GB201909715D0 (en) Stereo audio
GB202101515D0 (en) intelligent audio for physical spaces
GB201916335D0 (en) Audio processing
CA207656S (en) Audio eyeglasses
CA207651S (en) Audio eyeglasses