MX2023002255A - Signal processing device and method, learning device and method, and program. - Google Patents

Signal processing device and method, learning device and method, and program.

Info

Publication number
MX2023002255A
Authority
MX
Mexico
Prior art keywords
audio signal
signal processing
basis
program
processing device
Prior art date
Application number
MX2023002255A
Other languages
English (en)
Inventor
Hiroyuki Honma
Toru Chinen
Akifumi Kono
Original Assignee
Sony Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Group Corp
Publication of MX2023002255A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/307 Frequency adjustment, e.g. tone control
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G10L21/0388 Details of processing therefor
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/302 Electronic adaptation of stereophonic sound system to listener position or orientation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01 Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03 Application of parametric coding in stereophonic audio systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/07 Synergistic effects of band splitting and sub-band processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Telephone Function (AREA)

Abstract

The present technology relates to a signal processing device and method, a learning device and method, and a program that make it possible to achieve high-quality audio reproduction even with a low-cost device. The signal processing device comprises: a decoding processing unit that demultiplexes an input bitstream into a first audio signal, metadata of the first audio signal, and first high-frequency band information for band expansion; and a band expansion unit that performs band expansion processing on the basis of a second audio signal, obtained by performing signal processing on the basis of the first audio signal and the metadata, and second high-frequency band information generated on the basis of the first high-frequency band information, thereby generating an output audio signal. The present technology is applicable to smartphones.
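The data flow in the abstract can be sketched in a few lines of Python. This is only an illustrative toy under stated assumptions, not the method claimed in the patent: the dictionary "bitstream", the single `gain` metadata field, the scalar high-band gain, and every function name below are hypothetical, and the band expansion step is replaced by simple spectral folding.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class DecodedFrame:
    first_audio: List[float]    # decoded (band-limited) samples
    metadata: Dict[str, float]  # hypothetical: only a "gain" entry is used
    first_hb_info: float        # hypothetical: one high-band gain per frame


def demultiplex(bitstream: dict) -> DecodedFrame:
    """Split the toy 'bitstream' into audio, metadata and high-band info."""
    return DecodedFrame(
        first_audio=list(bitstream["audio"]),
        metadata=dict(bitstream["metadata"]),
        first_hb_info=float(bitstream["hb_gain"]),
    )


def process_signal(frame: DecodedFrame) -> List[float]:
    """Stand-in for the signal processing step: apply the metadata gain."""
    gain = frame.metadata.get("gain", 1.0)
    return [gain * x for x in frame.first_audio]


def derive_second_hb_info(frame: DecodedFrame) -> float:
    """Assumption: the high-band gain is rescaled by the same metadata gain."""
    return frame.first_hb_info * frame.metadata.get("gain", 1.0)


def expand_band(second_audio: List[float], second_hb_info: float) -> List[float]:
    """Toy band expansion by spectral folding.

    Multiplying by (-1)^n shifts the spectrum by the Nyquist frequency,
    which mirrors low-band content into the high band; the mirrored copy
    is scaled by the high-band gain and added to the input.
    """
    return [
        x + second_hb_info * (x if n % 2 == 0 else -x)
        for n, x in enumerate(second_audio)
    ]


def decode(bitstream: dict) -> List[float]:
    """Run the whole chain: demultiplex, process, derive info, expand."""
    frame = demultiplex(bitstream)
    second_audio = process_signal(frame)
    second_hb_info = derive_second_hb_info(frame)
    return expand_band(second_audio, second_hb_info)
```

The point of the sketch is the ordering: band expansion runs last, on the already processed second audio signal, and uses high-band information derived from what the bitstream carried rather than that information directly.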
MX2023002255A 2020-09-03 2021-08-20 Signal processing device and method, learning device and method, and program. MX2023002255A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2020148234 2020-09-03
PCT/JP2021/030599 WO2022050087A1 (ja) 2020-09-03 2021-08-20 Signal processing device and method, learning device and method, and program

Publications (1)

Publication Number Publication Date
MX2023002255A (es) 2023-05-16

Family

ID=80490814

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2023002255A MX2023002255A (es) 2020-09-03 2021-08-20 Signal processing device and method, learning device and method, and program.

Country Status (8)

Country Link
US (1) US20230300557A1 (es)
EP (1) EP4210048A4 (es)
JP (1) JPWO2022050087A1 (es)
KR (1) KR20230060502A (es)
CN (1) CN116018641A (es)
BR (1) BR112023003488A2 (es)
MX (1) MX2023002255A (es)
WO (1) WO2022050087A1 (es)

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2830051A3 (en) * 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
JP6439296B2 (ja) * 2014-03-24 2018-12-19 ソニー株式会社 Decoding device and method, and program
US10038966B1 (en) * 2016-10-20 2018-07-31 Oculus Vr, Llc Head-related transfer function (HRTF) personalization based on captured images of user
JP6992767B2 (ja) 2016-12-12 2022-01-13 ソニーグループ株式会社 HRTF measurement method, HRTF measurement device, and program
KR102002681B1 (ko) * 2017-06-27 2019-07-23 한양대학교 산학협력단 Speech bandwidth extender and extension method based on a generative adversarial network
RU2741379C1 (ru) * 2017-07-28 2021-01-25 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Apparatus for encoding or decoding an encoded multichannel signal using a filling signal generated by a broadband filter
US10650806B2 (en) * 2018-04-23 2020-05-12 Cerence Operating Company System and method for discriminative training of regression deep neural networks
EP3827603A1 (en) * 2018-07-25 2021-06-02 Dolby Laboratories Licensing Corporation Personalized hrtfs via optical capture

Also Published As

Publication number Publication date
KR20230060502A (ko) 2023-05-04
US20230300557A1 (en) 2023-09-21
EP4210048A4 (en) 2024-02-21
CN116018641A (zh) 2023-04-25
JPWO2022050087A1 (es) 2022-03-10
BR112023003488A2 (pt) 2023-04-11
WO2022050087A1 (ja) 2022-03-10
EP4210048A1 (en) 2023-07-12

Similar Documents

Publication Publication Date Title
EP3968321A3 (en) Multi-channel acoustic echo cancellation
US9786298B1 (en) Audio fingerprinting based on audio energy characteristics
MX2017009378A (es) Speech reproduction device configured to mask reproduced speech in a masked speech zone.
AU2018344830A8 (en) Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC based spatial audio coding
SG10201707702YA (en) Collaborative Voice Controlled Devices
MX2016005224A (es) Method and device for achieving target audio recording, and electronic apparatus.
EP4283616A3 (en) Computer program product for encoding a signal
MY189000A (en) Audio processing device and method, and program therefor
MX2020009581A (es) Methods and devices for encoding and/or decoding immersive audio signals.
MY179136A (en) Apparatus and method for multichannel direct-ambient decomposition for audio signal processing
EA202090186A2 (ru) Audio encoding and decoding using presentation transform parameters
BRPI0802614A2 (pt) Methods and apparatuses for encoding and decoding object-based audio signals
CA2699004A1 (en) A method and an apparatus of decoding an audio signal
WO2009128666A3 (ko) Method and apparatus for processing an audio signal
KR102304197B1 (ko) Audio fingerprinting based on audio energy characteristics
GB201108885D0 (en) Processing audio signals
MX2017016228A (es) Encoding apparatus, encoding method, decoding apparatus, decoding method, and program.
RU2016114565A (ru) Information processing device, method, and program
AU2019394097A8 (en) Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC based spatial audio coding using diffuse compensation
MX2018003911A (es) Reception apparatus, transmission apparatus, and data processing method.
RU2009116276A (ru) Methods and apparatuses for encoding and decoding object-based audio signals
MX2023002255A (es) Signal processing device and method, learning device and method, and program.
BR112018013526A2 (pt) Audio processing apparatus and method, and program
MY173513A (en) Coding/decoding method, apparatus, and system
MX2018009145A (es) Apparatus and method for improving a transition from a concealed audio signal portion to a subsequent audio signal portion of an audio signal.