BR112023003488A2 - Dispositivos e métodos de processamento de sinal e de aprendizado, e, programa - Google Patents

Dispositivos e métodos de processamento de sinal e de aprendizado, e, programa

Info

Publication number
BR112023003488A2
BR112023003488A2 BR112023003488A BR112023003488A BR112023003488A2 BR 112023003488 A2 BR112023003488 A2 BR 112023003488A2 BR 112023003488 A BR112023003488 A BR 112023003488A BR 112023003488 A BR112023003488 A BR 112023003488A BR 112023003488 A2 BR112023003488 A2 BR 112023003488A2
Authority
BR
Brazil
Prior art keywords
signal processing
program
signal
learning
audio signal
Prior art date
Application number
BR112023003488A
Other languages
English (en)
Portuguese (pt)
Inventor
Honma Hiroyuki
Chinen Toru
Kono Akifumi
Original Assignee
Sony Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Group Corp filed Critical Sony Group Corp
Publication of BR112023003488A2 publication Critical patent/BR112023003488A2/pt

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/307Frequency adjustment, e.g. tone control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G10L21/0388Details of processing therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/07Synergistic effects of band splitting and sub-band processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Telephone Function (AREA)
BR112023003488A 2020-09-03 2021-08-20 Dispositivos e métodos de processamento de sinal e de aprendizado, e, programa BR112023003488A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2020148234 2020-09-03
PCT/JP2021/030599 WO2022050087A1 (ja) 2020-09-03 2021-08-20 信号処理装置および方法、学習装置および方法、並びにプログラム

Publications (1)

Publication Number Publication Date
BR112023003488A2 true BR112023003488A2 (pt) 2023-04-11

Family

ID=80490814

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023003488A BR112023003488A2 (pt) 2020-09-03 2021-08-20 Dispositivos e métodos de processamento de sinal e de aprendizado, e, programa

Country Status (8)

Country Link
US (1) US20230300557A1 (https=)
EP (1) EP4210048A4 (https=)
JP (1) JPWO2022050087A1 (https=)
KR (1) KR20230060502A (https=)
CN (1) CN116018641A (https=)
BR (1) BR112023003488A2 (https=)
MX (1) MX2023002255A (https=)
WO (1) WO2022050087A1 (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021261235A1 (ja) * 2020-06-22 2021-12-30 ソニーグループ株式会社 信号処理装置および方法、並びにプログラム
EP4202921B1 (en) * 2020-09-28 2026-04-08 Samsung Electronics Co., Ltd. Audio encoding apparatus and audio decoding apparatus
EP4468292A3 (en) * 2020-10-17 2024-12-11 Dolby International AB Method and apparatus for generating an intermediate audio format from an input multichannel audio signal

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2830051A3 (en) * 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
JP6439296B2 (ja) * 2014-03-24 2018-12-19 ソニー株式会社 復号装置および方法、並びにプログラム
US10038966B1 (en) * 2016-10-20 2018-07-31 Oculus Vr, Llc Head-related transfer function (HRTF) personalization based on captured images of user
US11159906B2 (en) 2016-12-12 2021-10-26 Sony Corporation HRTF measurement method, HRTF measurement device, and program
KR102002681B1 (ko) * 2017-06-27 2019-07-23 한양대학교 산학협력단 생성적 대립 망 기반의 음성 대역폭 확장기 및 확장 방법
CN110998721B (zh) * 2017-07-28 2024-04-26 弗劳恩霍夫应用研究促进协会 用于使用宽频带滤波器生成的填充信号对已编码的多声道信号进行编码或解码的装置
US10650806B2 (en) * 2018-04-23 2020-05-12 Cerence Operating Company System and method for discriminative training of regression deep neural networks
JP7442494B2 (ja) * 2018-07-25 2024-03-04 ドルビー ラボラトリーズ ライセンシング コーポレイション 光学式捕捉によるパーソナライズされたhrtf

Also Published As

Publication number Publication date
US20230300557A1 (en) 2023-09-21
EP4210048A4 (en) 2024-02-21
WO2022050087A1 (ja) 2022-03-10
MX2023002255A (es) 2023-05-16
JPWO2022050087A1 (https=) 2022-03-10
EP4210048A1 (en) 2023-07-12
CN116018641A (zh) 2023-04-25
KR20230060502A (ko) 2023-05-04

Similar Documents

Publication Publication Date Title
BR112023003488A2 (pt) Dispositivos e métodos de processamento de sinal e de aprendizado, e, programa
US9786298B1 (en) Audio fingerprinting based on audio energy characteristics
KR102304197B1 (ko) 오디오 에너지 특성에 기초한 오디오 핑거프린팅
MX2021006565A (es) Aparato, metodo y programa de computadora para codificacion, decodificacion, procesamiento de escenas y otros procedimientos relacionados con codificacion de audio espacial basada en dirac que utiliza compensacion difusa.
CN105448312B (zh) 音频同步播放方法、装置及系统
Geng et al. Longvale: Vision-audio-language-event benchmark towards time-aware omni-modal perception of long videos
BR112022010200A2 (pt) Modelo psicoacústico para processamento de áudio
BR112023020018A2 (pt) Método de processamento de vídeo para aplicação e dispositivo eletrônico
BR112018077408A2 (pt) aparelho e método de formação do campo de som, e, programa.
BR112021018423A2 (pt) Método e aparelho para reproduzir dados multimídia, dispositivo eletrônico e meio de armazenamento
RU2015150055A (ru) Эффективное кодирование звуковых сцен, содержащих звуковые объекты
BR112012025570A2 (pt) aparelho e método de processamento de sinal, meio de gravação, decodificador, codificador, métodos de decodificação e de codificação.
MX2023006300A (es) Método y aparato de codificación y decodificación de audio.
BR112022024820A2 (pt) Transição de modo sincronizada
CN101620856A (zh) 对输入信号值序列进行时间缩放的方法
CN112331188A (zh) 一种语音数据处理方法、系统及终端设备
CL2024000852A1 (es) Método para decodificar y codificar. (div. 3256-22)
MX2024004852A (es) Corriente de bits que representa audio en un entorno.
Zhao et al. Towards expressive video dubbing with multiscale multimodal context interaction
Bhandari et al. Reverb: Open-source ASR and diarization from Rev
Wang et al. Listening Between the Frames: Bridging Temporal Gaps in Large Audio-Language Models
CL2024000531A1 (es) Método y aparato para procesamiento dinámico basado en metadatos de datos de audio
BR112021019942A2 (pt) Dispositivos e métodos de processamento de informações e reprodução, e, programa
BR112021019728A2 (pt) Método e aparelho para processamento de retomada de escuta de aplicação de música e dispositivo
MX2024006931A (es) Metodo y aparato para procesar datos de audio.

Legal Events

Date Code Title Description
B11A Dismissal acc. art.33 of ipl - examination not requested within 36 months of filing
B11Y Definitive dismissal - extension of time limit for request of examination expired [chapter 11.1.1 patent gazette]