BR112023003488A2 - Signal processing and learning devices and methods, and program - Google Patents
Signal processing and learning devices and methods, and program
- Publication number
- BR112023003488A2
- Authority
- BR
- Brazil
- Prior art keywords
- signal processing
- program
- signal
- learning
- audio signal
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/307—Frequency adjustment, e.g. tone control
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/11—Positioning of individual sound objects, e.g. moving airplane, within a sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/07—Synergistic effects of band splitting and sub-band processing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Telephone Function (AREA)
Abstract
SIGNAL PROCESSING AND LEARNING DEVICES AND METHODS, AND PROGRAM. The present technology relates to a signal processing device and method, a learning device and method, and a program that make high-quality audio reproduction possible even on an inexpensive device. The signal processing device comprises: a decoding processing unit that demultiplexes an input bitstream into a first audio signal, metadata of the first audio signal, and first high-frequency band information for expanding a band; and a band expansion unit that performs band expansion processing based on a second audio signal, obtained by performing signal processing based on the first audio signal and the metadata, and on second high-frequency band information generated from the first high-frequency band information, thereby generating an output audio signal. The present technology is applicable to smartphones.
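The data flow described in the abstract (demultiplex, process the audio with its metadata, derive second high-band information, expand the band) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the length-prefixed container, the `num_objects` and `gains` metadata fields, the gain-weighted mix standing in for the metadata-driven signal processing, the weighted mean used to derive the second high-band information, and the spectral-mirroring step standing in for band expansion.

```python
import json
import struct


def pack_chunks(chunks):
    """Build a toy bitstream: each chunk is prefixed by its 32-bit length."""
    return b"".join(struct.pack("<I", len(c)) + c for c in chunks)


def demultiplex(bitstream):
    """Decoding step: split the stream into audio, metadata and high-band info."""
    chunks, pos = [], 0
    for _ in range(3):
        (size,) = struct.unpack_from("<I", bitstream, pos)
        chunks.append(bitstream[pos + 4:pos + 4 + size])
        pos += 4 + size
    metadata = json.loads(chunks[1].decode("utf-8"))
    flat = struct.unpack(f"<{len(chunks[0]) // 4}f", chunks[0])
    n = len(flat) // metadata["num_objects"]
    # Planar layout (assumed): all samples of object 0, then object 1, ...
    objects = [list(flat[i * n:(i + 1) * n]) for i in range(metadata["num_objects"])]
    hf_info = list(struct.unpack(f"<{len(chunks[2]) // 4}f", chunks[2]))
    return objects, metadata, hf_info


def render(objects, metadata):
    """Signal processing from metadata: a gain-weighted mix (assumed stand-in)."""
    gains = metadata["gains"]
    return [sum(g * obj[i] for g, obj in zip(gains, objects))
            for i in range(len(objects[0]))]


def derive_hf_info(hf_info, metadata):
    """Second high-band info: gain-weighted mean of per-object ratios (assumption)."""
    gains = metadata["gains"]
    return sum(g * r for g, r in zip(gains, hf_info)) / sum(gains)


def expand_band(signal, hf_ratio):
    """Toy band expansion: add a spectrally mirrored copy scaled by hf_ratio.

    Multiplying by (-1)**n shifts the spectrum by half the sample rate, so
    low-band content reappears mirrored in the high band.
    """
    return [s + hf_ratio * (-1) ** n * s for n, s in enumerate(signal)]


def decode(bitstream):
    """Run the whole chain: demultiplex, render, derive info, expand."""
    objects, metadata, hf1 = demultiplex(bitstream)
    second_signal = render(objects, metadata)
    hf2 = derive_hf_info(hf1, metadata)
    return expand_band(second_signal, hf2)


# Example stream with two objects of four samples each (all values hypothetical).
EXAMPLE = pack_chunks([
    struct.pack("<8f", 1.0, 0.5, -0.5, -1.0, 0.0, 0.5, 0.0, -0.5),
    json.dumps({"num_objects": 2, "gains": [0.5, 0.5]}).encode("utf-8"),
    struct.pack("<2f", 0.5, 0.25),
])
OUTPUT = decode(EXAMPLE)
```

The point of the sketch is the ordering: the high-band information is carried alongside the first audio signal, but it is applied only after the metadata-driven processing, so it has to be converted into a form that matches the second audio signal. The mirroring trick places energy in the high band only when the input is limited to the lower half of the spectrum; a real decoder would use a proper filter bank.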
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2020148234 | 2020-09-03 | ||
PCT/JP2021/030599 WO2022050087A1 (ja) | 2020-09-03 | 2021-08-20 | Signal processing device and method, learning device and method, and program |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112023003488A2 (pt) | 2023-04-11 |
Family
ID=80490814
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112023003488A BR112023003488A2 (pt) | 2020-09-03 | 2021-08-20 | Signal processing and learning devices and methods, and program |
Country Status (8)
Country | Link |
---|---|
US (1) | US20230300557A1 (pt) |
EP (1) | EP4210048A4 (pt) |
JP (1) | JPWO2022050087A1 (pt) |
KR (1) | KR20230060502A (pt) |
CN (1) | CN116018641A (pt) |
BR (1) | BR112023003488A2 (pt) |
MX (1) | MX2023002255A (pt) |
WO (1) | WO2022050087A1 (pt) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021261235A1 (ja) * | 2020-06-22 | 2021-12-30 | Sony Group Corporation | Signal processing device and method, and program |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2830052A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension |
JP6439296B2 (ja) * | 2014-03-24 | 2018-12-19 | Sony Corporation | Decoding device and method, and program |
US10038966B1 (en) * | 2016-10-20 | 2018-07-31 | Oculus Vr, Llc | Head-related transfer function (HRTF) personalization based on captured images of user |
WO2018110269A1 (ja) | 2016-12-12 | 2018-06-21 | Sony Corporation | HRTF measurement method, HRTF measurement device, and program |
KR102002681B1 (ko) * | 2017-06-27 | 2019-07-23 | Industry-University Cooperation Foundation Hanyang University | Speech bandwidth extender and extension method based on a generative adversarial network |
ES2965741T3 (es) * | 2017-07-28 | 2024-04-16 | Fraunhofer Ges Forschung | Apparatus for encoding or decoding an encoded multichannel signal using a filling signal generated by a broadband filter |
US10650806B2 (en) * | 2018-04-23 | 2020-05-12 | Cerence Operating Company | System and method for discriminative training of regression deep neural networks |
EP3827603A1 (en) * | 2018-07-25 | 2021-06-02 | Dolby Laboratories Licensing Corporation | Personalized hrtfs via optical capture |
-
2021
- 2021-08-20 JP JP2022546230A patent/JPWO2022050087A1/ja active Pending
- 2021-08-20 WO PCT/JP2021/030599 patent/WO2022050087A1/ja active Application Filing
- 2021-08-20 EP EP21864145.4A patent/EP4210048A4/en active Pending
- 2021-08-20 CN CN202180052388.8A patent/CN116018641A/zh active Pending
- 2021-08-20 KR KR1020237005227A patent/KR20230060502A/ko unknown
- 2021-08-20 US US18/023,183 patent/US20230300557A1/en active Pending
- 2021-08-20 BR BR112023003488A patent/BR112023003488A2/pt not_active Application Discontinuation
- 2021-08-20 MX MX2023002255A patent/MX2023002255A/es unknown
Also Published As
Publication number | Publication date |
---|---|
WO2022050087A1 (ja) | 2022-03-10 |
MX2023002255A (es) | 2023-05-16 |
CN116018641A (zh) | 2023-04-25 |
KR20230060502A (ko) | 2023-05-04 |
JPWO2022050087A1 (pt) | 2022-03-10 |
EP4210048A1 (en) | 2023-07-12 |
US20230300557A1 (en) | 2023-09-21 |
EP4210048A4 (en) | 2024-02-21 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US9786298B1 (en) | Audio fingerprinting based on audio energy characteristics | |
KR102304197B1 (ko) | Audio fingerprinting based on audio energy characteristics | |
CN105118520B (zh) | Method and device for eliminating a pop at the start of audio | |
CN105448312B (zh) | Audio synchronous playback method, device and system | |
BR112023003488A2 (pt) | Signal processing and learning devices and methods, and program | |
BR112012025570A2 (pt) | Signal processing apparatus and method, recording medium, decoder, encoder, and decoding and encoding methods | |
RU2015150055A (ru) | Efficient coding of audio scenes comprising audio objects | |
BR112018068643A2 (pt) | Audio signal decoding | |
CN101620856B (zh) | Method and device for time scaling a sequence of input signal values | |
BR112022010200A2 (pt) | Psychoacoustic model for audio processing | |
RU2008140142A (ru) | Methods and devices for encoding and decoding object-based audio signals | |
RU2016114565A (ru) | Information processing device, method and program | |
CN105898556A (zh) | Method and device for automatically synchronizing external subtitles | |
BR112023020018A2 (pt) | Video processing method for an application, and electronic device | |
MX2021006572A (es) | Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC-based spatial audio coding using low-order, mid-order and high-order component generators | |
EP4033483A3 (en) | Method and apparatus for testing vehicle-mounted voice device, electronic device and storage medium | |
BR112022024820A2 (pt) | Synchronized mode transition | |
BR112018076546A2 (pt) | Audio decoding using an intermediate sampling rate | |
BR112021019942A2 (pt) | Information processing and reproduction devices and methods, and program | |
MX2022014771A (es) | Integration of music with physical activity instruction | |
US10284952B2 (en) | Audio processing apparatus and control method thereof | |
Bhandari et al. | Reverb: Open-Source ASR and Diarization from Rev | |
Barba et al. | Voice and audio signal processing using the WSOLA Algorithm MATLAB software | |
JP2024108613A (ja) | Subtitle generation device, video transmission device, subtitle generation method, and program | |
EP4407436A3 (en) | Bitstream representing audio in an environment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B11A | Dismissal according to Art. 33 of the IPL: examination not requested within 36 months of filing |