BR112023003488A2 - Signal processing and learning devices and methods, and program
- Publication number
- BR112023003488A2
- Authority
- BR
- Brazil
- Prior art keywords
- signal processing
- program
- signal
- learning
- audio signal
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/307—Frequency adjustment, e.g. tone control
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/11—Positioning of individual sound objects, e.g. moving airplane, within a sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/07—Synergistic effects of band splitting and sub-band processing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Telephone Function (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2020148234 | 2020-09-03 | | |
| PCT/JP2021/030599 WO2022050087A1 (ja) | 2020-09-03 | 2021-08-20 | Signal processing device and method, learning device and method, and program |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| BR112023003488A2 (pt) | 2023-04-11 |
Family
ID=80490814
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| BR112023003488A BR112023003488A2 (pt) | 2020-09-03 | 2021-08-20 | Signal processing and learning devices and methods, and program |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US20230300557A1 |
| EP (1) | EP4210048A4 |
| JP (1) | JPWO2022050087A1 |
| KR (1) | KR20230060502A |
| CN (1) | CN116018641A |
| BR (1) | BR112023003488A2 |
| MX (1) | MX2023002255A |
| WO (1) | WO2022050087A1 |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2021261235A1 (ja) * | 2020-06-22 | 2021-12-30 | Sony Group Corporation | Signal processing device and method, and program |
| EP4202921B1 (en) * | 2020-09-28 | 2026-04-08 | Samsung Electronics Co., Ltd. | Audio encoding apparatus and audio decoding apparatus |
| EP4468292A3 (en) * | 2020-10-17 | 2024-12-11 | Dolby International AB | Method and apparatus for generating an intermediate audio format from an input multichannel audio signal |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2830051A3 (en) * | 2013-07-22 | 2015-03-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals |
| JP6439296B2 (ja) * | 2014-03-24 | 2018-12-19 | Sony Corporation | Decoding device and method, and program |
| US10038966B1 (en) * | 2016-10-20 | 2018-07-31 | Oculus Vr, Llc | Head-related transfer function (HRTF) personalization based on captured images of user |
| US11159906B2 (en) | 2016-12-12 | 2021-10-26 | Sony Corporation | HRTF measurement method, HRTF measurement device, and program |
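| KR102002681B1 (ko) * | 2017-06-27 | 2019-07-23 | Industry-University Cooperation Foundation Hanyang University | Speech bandwidth extender and extension method based on a generative adversarial network |
| CN110998721B (zh) * | 2017-07-28 | 2024-04-26 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus for encoding or decoding an encoded multichannel signal using a filling signal generated by a broadband filter |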
| US10650806B2 (en) * | 2018-04-23 | 2020-05-12 | Cerence Operating Company | System and method for discriminative training of regression deep neural networks |
| JP7442494B2 (ja) * | 2018-07-25 | 2024-03-04 | Dolby Laboratories Licensing Corporation | Personalized HRTF via optical capture |
2021
- 2021-08-20 JP: application JP2022546230A, publication JPWO2022050087A1 (ja), status: not active (Abandoned)
- 2021-08-20 BR: application BR112023003488A, publication BR112023003488A2 (pt), status: not active (Application Discontinuation)
- 2021-08-20 MX: application MX2023002255A, publication MX2023002255A (es), status: unknown
- 2021-08-20 CN: application CN202180052388.8A, publication CN116018641A (zh), status: not active (Withdrawn)
- 2021-08-20 KR: application KR1020237005227A, publication KR20230060502A (ko), status: not active (Withdrawn)
- 2021-08-20 WO: application PCT/JP2021/030599, publication WO2022050087A1 (ja), status: not active (Ceased)
- 2021-08-20 US: application US18/023,183, publication US20230300557A1 (en), status: not active (Abandoned)
- 2021-08-20 EP: application EP21864145.4A, publication EP4210048A4 (en), status: not active (Withdrawn)
Also Published As
| Publication number | Publication date |
|---|---|
| US20230300557A1 (en) | 2023-09-21 |
| EP4210048A4 (en) | 2024-02-21 |
| WO2022050087A1 (ja) | 2022-03-10 |
| MX2023002255A (es) | 2023-05-16 |
| JPWO2022050087A1 (ja) | 2022-03-10 |
| EP4210048A1 (en) | 2023-07-12 |
| CN116018641A (zh) | 2023-04-25 |
| KR20230060502A (ko) | 2023-05-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| BR112023003488A2 (pt) | | Signal processing and learning devices and methods, and program |
| US9786298B1 (en) | | Audio fingerprinting based on audio energy characteristics |
| KR102304197B1 (ko) | | Audio fingerprinting based on audio energy characteristics |
| MX2021006565A (es) | | Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC-based spatial audio coding using diffuse compensation |
| CN105448312B (zh) | | Audio synchronized playback method, device and system |
| Geng et al. | | Longvale: Vision-audio-language-event benchmark towards time-aware omni-modal perception of long videos |
| BR112022010200A2 (pt) | | Psychoacoustic model for audio processing |
| BR112023020018A2 (pt) | | Video processing method for application and electronic device |
| BR112018077408A2 (pt) | | Sound field forming apparatus and method, and program |
| BR112021018423A2 (pt) | | Method and apparatus for playing multimedia data, electronic device and storage medium |
| RU2015150055A (ru) | | Efficient coding of audio scenes comprising audio objects |
| BR112012025570A2 (pt) | | Signal processing apparatus and method, recording medium, decoder, encoder, and decoding and encoding methods |
| MX2023006300A (es) | | Audio encoding and decoding method and apparatus |
| BR112022024820A2 (pt) | | Synchronized mode transition |
| CN101620856A (zh) | | Method for time scaling of a sequence of input signal values |
| CN112331188A (zh) | | Voice data processing method, system and terminal device |
| CL2024000852A1 (es) | | Method for decoding and encoding (div. 3256-22) |
| MX2024004852A (es) | | Bitstream representing audio in an environment |
| Zhao et al. | | Towards expressive video dubbing with multiscale multimodal context interaction |
| Bhandari et al. | | Reverb: Open-source ASR and diarization from Rev |
| Wang et al. | | Listening Between the Frames: Bridging Temporal Gaps in Large Audio-Language Models |
| CL2024000531A1 (es) | | Method and apparatus for metadata-based dynamic processing of audio data |
| BR112021019942A2 (pt) | | Information processing and reproduction devices and methods, and program |
| BR112021019728A2 (pt) | | Method and apparatus for listening-resume processing of a music application, and device |
| MX2024006931A (es) | | Method and apparatus for processing audio data |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | B11A | Dismissal acc. art. 33 of IPL - examination not requested within 36 months of filing | |
| | B11Y | Definitive dismissal - extension of time limit for request of examination expired [chapter 11.1.1 patent gazette] | |