BR112022005474A2 - Suavização de metadados de áudio - Google Patents
Suavização de metadados de áudioInfo
- Publication number
- BR112022005474A2 BR112022005474A2 BR112022005474A BR112022005474A BR112022005474A2 BR 112022005474 A2 BR112022005474 A2 BR 112022005474A2 BR 112022005474 A BR112022005474 A BR 112022005474A BR 112022005474 A BR112022005474 A BR 112022005474A BR 112022005474 A2 BR112022005474 A2 BR 112022005474A2
- Authority
- BR
- Brazil
- Prior art keywords
- audio
- metadata
- segment
- smoothing
- frame
- Prior art date
Links
- 238000009499 grossing Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 abstract 5
- 230000003044 adaptive effect Effects 0.000 abstract 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23611—Insertion of stuffing data into a multiplex stream, e.g. to obtain a constant bitrate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0356—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for synchronising with other signals, e.g. video signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
- G10L21/055—Time compression or expansion for synchronising with other signals, e.g. video signals
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44016—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/84—Generation or processing of descriptive data, e.g. content descriptors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Diaphragms For Electromechanical Transducers (AREA)
- Amplifiers (AREA)
- Stereophonic System (AREA)
Abstract
suavização de metadados de áudio. o método implementado por computador revelado para suavizar lacunas de áudio usando metadados adaptáveis identifica um segmento de áudio inicial e um segmento de áudio subsequente que se segue ao segmento de áudio inicial. o método acessa um primeiro conjunto de metadados que corresponde a um último quadro de áudio do segmento de áudio inicial e acessa um segundo conjunto de metadados que corresponde ao primeiro quadro de áudio do segmento de áudio subsequente. os primeiro e segundo conjuntos de metadados incluem informação de característica de áudio para os dois segmentos de áudio. o método então gera um novo conjunto de metadados que é baseado em ambos os conjuntos de características de áudio. o método adicionalmente insere um novo quadro de áudio entre o último quadro de áudio do segmento de áudio inicial e o primeiro quadro de áudio do segmento de áudio subsequente e aplica o novo conjunto de metadados ao novo quadro de áudio. vários outros métodos, sistemas e meios legíveis por computador também são revelados.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962904542P | 2019-09-23 | 2019-09-23 | |
US15/931,442 US11416208B2 (en) | 2019-09-23 | 2020-05-13 | Audio metadata smoothing |
PCT/US2020/052017 WO2021061656A1 (en) | 2019-09-23 | 2020-09-22 | Audio metadata smoothing |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112022005474A2 true BR112022005474A2 (pt) | 2022-06-14 |
Family
ID=74880856
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112022005474A BR112022005474A2 (pt) | 2019-09-23 | 2020-09-22 | Suavização de metadados de áudio |
Country Status (7)
Country | Link |
---|---|
US (1) | US11416208B2 (pt) |
EP (1) | EP4035402B1 (pt) |
AU (1) | AU2020352977B2 (pt) |
BR (1) | BR112022005474A2 (pt) |
CA (1) | CA3147190A1 (pt) |
MX (1) | MX2022002587A (pt) |
WO (1) | WO2021061656A1 (pt) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11934738B2 (en) * | 2019-05-14 | 2024-03-19 | Alphatheta Corporation | Acoustic device and music piece reproduction program |
US11758206B1 (en) * | 2021-03-12 | 2023-09-12 | Amazon Technologies, Inc. | Encoding media content for playback compatibility |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100042740A1 (en) * | 2006-06-26 | 2010-02-18 | Nxp B.V. | Method and device for data packing |
US9183885B2 (en) * | 2008-05-30 | 2015-11-10 | Echostar Technologies L.L.C. | User-initiated control of an audio/video stream to skip interstitial content between program segments |
US8326127B2 (en) * | 2009-01-30 | 2012-12-04 | Echostar Technologies L.L.C. | Methods and apparatus for identifying portions of a video stream based on characteristics of the video stream |
US8422699B2 (en) * | 2009-04-17 | 2013-04-16 | Linear Acoustic, Inc. | Loudness consistency at program boundaries |
KR20120062758A (ko) * | 2009-08-14 | 2012-06-14 | 에스알에스 랩스, 인크. | 오디오 객체들을 적응적으로 스트리밍하기 위한 시스템 |
US8428936B2 (en) * | 2010-03-05 | 2013-04-23 | Motorola Mobility Llc | Decoder for audio signal including generic audio and speech frames |
WO2012122397A1 (en) * | 2011-03-09 | 2012-09-13 | Srs Labs, Inc. | System for dynamically creating and rendering audio objects |
US8924580B2 (en) * | 2011-08-12 | 2014-12-30 | Cisco Technology, Inc. | Constant-quality rate-adaptive streaming |
US8789090B1 (en) * | 2012-02-14 | 2014-07-22 | Uplynk, LLC | Advertisement insertion into media content for streaming |
US20140275851A1 (en) * | 2013-03-15 | 2014-09-18 | eagleyemed, Inc. | Multi-site data sharing platform |
US8813120B1 (en) * | 2013-03-15 | 2014-08-19 | Google Inc. | Interstitial audio control |
US20150199968A1 (en) * | 2014-01-16 | 2015-07-16 | CloudCar Inc. | Audio stream manipulation for an in-vehicle infotainment system |
JP6809221B2 (ja) * | 2014-09-12 | 2021-01-06 | ソニー株式会社 | 送信装置、送信方法、受信装置および受信方法 |
KR102474541B1 (ko) * | 2014-10-24 | 2022-12-06 | 돌비 인터네셔널 에이비 | 오디오 신호들의 인코딩 및 디코딩 |
RU2681958C1 (ru) * | 2015-03-09 | 2019-03-14 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Выровненное по фрагменту аудиокодирование |
GB2539875B (en) * | 2015-06-22 | 2017-09-20 | Time Machine Capital Ltd | Music Context System, Audio Track Structure and method of Real-Time Synchronization of Musical Content |
US10341770B2 (en) * | 2015-09-30 | 2019-07-02 | Apple Inc. | Encoded audio metadata-based loudness equalization and dynamic equalization during DRC |
CN113220259A (zh) * | 2015-10-27 | 2021-08-06 | 超级保真有限公司 | 音频内容制作、音频排序和音频混合的系统和方法 |
EP3185570A1 (en) * | 2015-12-22 | 2017-06-28 | Thomson Licensing | Method and apparatus for transmission-based smoothing of rendering |
US9880803B2 (en) * | 2016-04-06 | 2018-01-30 | International Business Machines Corporation | Audio buffering continuity |
US11183147B2 (en) * | 2016-10-07 | 2021-11-23 | Sony Semiconductor Solutions Corporation | Device and method for processing video content for display control |
GB2557970B (en) * | 2016-12-20 | 2020-12-09 | Mashtraxx Ltd | Content tracking system and method |
-
2020
- 2020-05-13 US US15/931,442 patent/US11416208B2/en active Active
- 2020-09-22 AU AU2020352977A patent/AU2020352977B2/en active Active
- 2020-09-22 BR BR112022005474A patent/BR112022005474A2/pt unknown
- 2020-09-22 WO PCT/US2020/052017 patent/WO2021061656A1/en unknown
- 2020-09-22 MX MX2022002587A patent/MX2022002587A/es unknown
- 2020-09-22 CA CA3147190A patent/CA3147190A1/en active Pending
- 2020-09-22 EP EP20789319.9A patent/EP4035402B1/en active Active
Also Published As
Publication number | Publication date |
---|---|
EP4035402A1 (en) | 2022-08-03 |
AU2020352977B2 (en) | 2023-06-01 |
AU2020352977A1 (en) | 2022-02-24 |
EP4035402B1 (en) | 2024-05-01 |
US11416208B2 (en) | 2022-08-16 |
MX2022002587A (es) | 2022-03-22 |
US20210089259A1 (en) | 2021-03-25 |
WO2021061656A1 (en) | 2021-04-01 |
CA3147190A1 (en) | 2021-04-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112022005474A2 (pt) | Suavização de metadados de áudio | |
EP3748629A4 (en) | IDENTIFICATION METHOD AND DEVICE FOR LANGUAGE KEYWORDS, COMPUTER READABLE STORAGE MEDIUM AND COMPUTER DEVICE | |
BR112015019526A8 (pt) | MÉTODO E APARELHO PARA APRIMORAR A DIRETIVIDADE DE UM SINAL AMBISONICS DE 1ª ORDEM E MEIO DE ARMAZENAMENTO LEGÍVEL POR COMPUTADOR NÃO TRANSITÓRIO. | |
BR112019013609A8 (pt) | Método e aparelho de processamento de informação | |
BR112019005584A2 (pt) | método e equipamento de usuário (ue) para comunicação de baixa latência”. | |
BR112018077230A2 (pt) | sistemas e métodos para identificar conteúdo correspondente | |
BR112013032923A2 (pt) | aparelho, método implementado por computador e mídia legível por computador | |
BR112018006098A2 (pt) | sistemas e métodos para processamento de vídeo | |
BR112018068098A2 (pt) | unidade de ocultação de erros, decodificador de áudio, método relacionado e programa de computador para diminuição gradual de um quadro de áudio oculto de acordo com diversos fatores de amortecimento para diversas bandas de frequência | |
BR112015004288A2 (pt) | renderização de som refletido para áudio à base de objeto | |
BR112015032010A2 (pt) | métodos, aparelhos e produtos de programa de computador para fornecer informações de reconfiguração dinâmica de uplink-downlink para equipamentos de usuário | |
BR112015029113A2 (pt) | codificação eficiente de cenas de áudio contendo objetos de áudio | |
EP4012978A4 (en) | METHOD AND APPARATUS FOR DETERMINING THE ROOT CAUSE OF A FAILURE, SERVER AND COMPUTER READABLE MEDIA | |
BR112022000187A2 (pt) | Método de processamento de dados de vídeo, aparelho para processar dados de vídeo, meio de armazenamento não transitório legível por computador, meio de gravação não transitório legível por computador | |
BR112021026664A2 (pt) | Corte de vídeo automatizado usando importância relativa de objetos identificados | |
BR112015029129A2 (pt) | codificação eficiente de cenas de áudio contendo objetos de áudio | |
BR112017026743A2 (pt) | aparelho e método de decodificação, programa, e, aparelho e método de codificação | |
BR112021020758A2 (pt) | Codificação de imagem usando índice de transformada | |
BR112022000466A2 (pt) | Controle de cancelamento de eco acústico para dispositivos de áudio distribuído | |
EP4044546A4 (en) | METHOD, DEVICE AND DEVICE FOR MESSAGE PROCESSING, AND COMPUTER-READABLE STORAGE MEDIUM | |
BR112015019056A2 (pt) | sistemas e métodos para realização de controle de ganho | |
BR112022006453A2 (pt) | Método de processamento de dados de dados de vídeo, aparelho para processar dados de vídeo, meio de armazenamento não transitório legível por computador e meio de gravação não transitório legível por computador | |
EP4084415A4 (en) | DATA MANAGEMENT METHOD AND SYSTEM, ASSOCIATED SUBSYSTEM AND COMPUTER READABLE MEDIA | |
BR112022011512A2 (pt) | Método para retroalimentação de harq-ack para pdcch e dispositivo. | |
BR112013032162A2 (pt) | recuperação de falha de hss para acesso não 3gpp |