BR112017018145A2 - aparelho e método para processamento de um sinal de áudio para obter um sinal de áudio processado utilizando um envelope de domínio de tempo alvo - Google Patents

aparelho e método para processamento de um sinal de áudio para obter um sinal de áudio processado utilizando um envelope de domínio de tempo alvo

Info

Publication number
BR112017018145A2
BR112017018145A2 BR112017018145A BR112017018145A BR112017018145A2 BR 112017018145 A2 BR112017018145 A2 BR 112017018145A2 BR 112017018145 A BR112017018145 A BR 112017018145A BR 112017018145 A BR112017018145 A BR 112017018145A BR 112017018145 A2 BR112017018145 A2 BR 112017018145A2
Authority
BR
Brazil
Prior art keywords
audio signal
target time
processing
processed audio
time domain
Prior art date
Application number
BR112017018145A
Other languages
English (en)
Other versions
BR112017018145B1 (pt
Inventor
Sascha Disch
Christian Dittmar
Meinard Müller
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of BR112017018145A2 publication Critical patent/BR112017018145A2/pt
Publication of BR112017018145B1 publication Critical patent/BR112017018145B1/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/03Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G10L21/0388Details of processing therefor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
  • Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)

Abstract

o assunto da presente invenção é um aparelho 2, descrito por um diagrama de blocos esquemático, para processamento de um sinal de áudio 4 para obter um sinal de áudio processado 6. o aparelho 2 compreende um calculador de fase 8 para calcular valores de fase 10 para valores espectrais de uma sequência de estruturas de domínio de frequência 12, representando estruturas sobrepostas do sinal de áudio 4. além disso, o calculador de fase 8 é configurado para calcular os valores de fase 10 com base em informações em um envelope de domínio de tempo alvo 14 relacionadas ao sinal de áudio processado 6, de modo que o sinal de áudio processado 6 tenha, pelo menos em uma aproximação, o envelope de domínio de tempo alvo 14 e um envelope espectral determinado pela sequência de estruturas de domínio de frequência 12.
BR112017018145-2A 2015-02-26 2016-02-23 Aparelho e método para processamento de um sinal de áudio para obter um sinal de áudio processado utilizando um envelope de domínio de tempo alvo BR112017018145B1 (pt)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
EP15156704.7 2015-02-26
EP15156704 2015-02-26
EP15181118 2015-08-14
EP15181118.9 2015-08-14
PCT/EP2016/053752 WO2016135132A1 (en) 2015-02-26 2016-02-23 Apparatus and method for processing an audio signal to obtain a processed audio signal using a target time-domain envelope

Publications (2)

Publication Number Publication Date
BR112017018145A2 true BR112017018145A2 (pt) 2018-04-10
BR112017018145B1 BR112017018145B1 (pt) 2023-11-28

Family

ID=55409840

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112017018145-2A BR112017018145B1 (pt) 2015-02-26 2016-02-23 Aparelho e método para processamento de um sinal de áudio para obter um sinal de áudio processado utilizando um envelope de domínio de tempo alvo

Country Status (11)

Country Link
US (1) US10373623B2 (pt)
EP (1) EP3262639B1 (pt)
JP (1) JP6668372B2 (pt)
KR (1) KR102125410B1 (pt)
CN (1) CN107517593B (pt)
BR (1) BR112017018145B1 (pt)
CA (1) CA2976864C (pt)
ES (1) ES2837107T3 (pt)
MX (1) MX2017010593A (pt)
RU (1) RU2679254C1 (pt)
WO (1) WO2016135132A1 (pt)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6445417B2 (ja) * 2015-10-30 2018-12-26 日本電信電話株式会社 信号波形推定装置、信号波形推定方法、プログラム
WO2017143095A1 (en) * 2016-02-16 2017-08-24 Red Pill VR, Inc. Real-time adaptive audio source separation
US10224042B2 (en) * 2016-10-31 2019-03-05 Qualcomm Incorporated Encoding of multiple audio signals
EP3382701A1 (en) 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for post-processing an audio signal using prediction based shaping
EP3382700A1 (en) * 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for post-processing an audio signal using a transient location detection
EP3382703A1 (en) * 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and methods for processing an audio signal
EP3457401A1 (en) * 2017-09-18 2019-03-20 Thomson Licensing Method for modifying a style of an audio object, and corresponding electronic device, computer readable program products and computer readable storage medium
WO2019083130A1 (ko) * 2017-10-25 2019-05-02 삼성전자주식회사 전자 장치 및 그 제어 방법
EP3550561A1 (en) * 2018-04-06 2019-10-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Downmixer, audio encoder, method and computer program applying a phase value to a magnitude value
US10529349B2 (en) * 2018-04-16 2020-01-07 Mitsubishi Electric Research Laboratories, Inc. Methods and systems for end-to-end speech separation with unfolded iterative phase reconstruction
EP3576088A1 (en) * 2018-05-30 2019-12-04 Fraunhofer Gesellschaft zur Förderung der Angewand Audio similarity evaluator, audio encoder, methods and computer program
EP3841821B1 (en) * 2018-08-20 2023-06-28 Telefonaktiebolaget Lm Ericsson (Publ) Physical random access channel signal generation optimization for 5g new radio
WO2020094263A1 (en) 2018-11-05 2020-05-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and audio signal processor, for providing a processed audio signal representation, audio decoder, audio encoder, methods and computer programs
US10659099B1 (en) * 2018-12-12 2020-05-19 Samsung Electronics Co., Ltd. Page scanning devices, computer-readable media, and methods for bluetooth page scanning using a wideband receiver
US11456007B2 (en) * 2019-01-11 2022-09-27 Samsung Electronics Co., Ltd End-to-end multi-task denoising for joint signal distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) optimization
CN109753943B (zh) * 2019-01-14 2023-09-19 沈阳化工大学 一种自适应分配变模态分解方法
CN110411439B (zh) * 2019-07-15 2021-07-09 北京控制工程研究所 一种根据星能量等级生成仿真星点的方法、装置及介质
KR102294639B1 (ko) * 2019-07-16 2021-08-27 한양대학교 산학협력단 다중 디코더를 이용한 심화 신경망 기반의 비-자동회귀 음성 합성 방법 및 시스템
CN110838299B (zh) * 2019-11-13 2022-03-25 腾讯音乐娱乐科技(深圳)有限公司 一种瞬态噪声的检测方法、装置及设备
CN111402858B (zh) * 2020-02-27 2024-05-03 平安科技(深圳)有限公司 一种歌声合成方法、装置、计算机设备及存储介质
CN112133319A (zh) * 2020-08-31 2020-12-25 腾讯音乐娱乐科技(深圳)有限公司 音频生成的方法、装置、设备及存储介质
WO2022076404A1 (en) * 2020-10-05 2022-04-14 The Trustees Of Columbia University In The City Of New York Systems and methods for brain-informed speech separation
CN112257577A (zh) * 2020-10-21 2021-01-22 华北电力大学 一种利用线性流形投影的微震信号重构方法和系统
CN113191317B (zh) * 2021-05-21 2022-09-27 江西理工大学 一种基于极点构造低通滤波器的信号包络提取方法和装置
US11682411B2 (en) 2021-08-31 2023-06-20 Spotify Ab Wind noise suppresor
CN113835065B (zh) * 2021-09-01 2024-05-17 深圳壹秘科技有限公司 基于深度学习的声源方向确定方法、装置、设备及介质
CN113903355B (zh) * 2021-12-09 2022-03-01 北京世纪好未来教育科技有限公司 语音获取方法、装置、电子设备及存储介质
CN115116460B (zh) * 2022-06-17 2024-03-12 腾讯科技(深圳)有限公司 音频信号增强方法、装置、设备、存储介质及程序产品
CN115691541B (zh) * 2022-12-27 2023-03-21 深圳元象信息科技有限公司 语音分离方法、装置及存储介质
CN117745551B (zh) * 2024-02-19 2024-04-26 电子科技大学 一种图像信号相位恢复的方法

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10513282A (ja) * 1995-11-22 1998-12-15 フィリップス エレクトロニクス ネムローゼ フェンノートシャップ 言語信号再合成方法および装置
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion
WO2004008437A2 (en) * 2002-07-16 2004-01-22 Koninklijke Philips Electronics N.V. Audio coding
DE10313875B3 (de) * 2003-03-21 2004-10-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Analysieren eines Informationssignals
US7415392B2 (en) 2004-03-12 2008-08-19 Mitsubishi Electric Research Laboratories, Inc. System for separating multiple sound sources from monophonic input with non-negative matrix factor deconvolution
DE102004021403A1 (de) * 2004-04-30 2005-11-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalverarbeitung durch Modifikation in der Spektral-/Modulationsspektralbereichsdarstellung
JP5129115B2 (ja) * 2005-04-01 2013-01-23 クゥアルコム・インコーポレイテッド 高帯域バーストの抑制のためのシステム、方法、および装置
TWI324336B (en) * 2005-04-22 2010-05-01 Qualcomm Inc Method of signal processing and apparatus for gain factor smoothing
CN101140759B (zh) * 2006-09-08 2010-05-12 华为技术有限公司 语音或音频信号的带宽扩展方法及系统
CN101197577A (zh) * 2006-12-07 2008-06-11 展讯通信(上海)有限公司 一种用于音频处理框架中的编码和解码方法
US7715342B2 (en) * 2007-06-22 2010-05-11 Research In Motion Limited Location of packet data convergence protocol in a long-term evolution multimedia broadcast multicast service
CN101521010B (zh) * 2008-02-29 2011-10-05 华为技术有限公司 一种音频信号的编解码方法和装置
CN101662288B (zh) * 2008-08-28 2012-07-04 华为技术有限公司 音频编码、解码方法及装置、系统
WO2010028297A1 (en) * 2008-09-06 2010-03-11 GH Innovation, Inc. Selective bandwidth extension
CN101770776B (zh) 2008-12-29 2011-06-08 华为技术有限公司 瞬态信号的编码方法和装置、解码方法和装置及处理系统
PL2234103T3 (pl) * 2009-03-26 2012-02-29 Fraunhofer Ges Forschung Urządzenie i sposób manipulacji sygnałem audio
WO2011039668A1 (en) * 2009-09-29 2011-04-07 Koninklijke Philips Electronics N.V. Apparatus for mixing a digital audio
JP5651980B2 (ja) * 2010-03-31 2015-01-14 ソニー株式会社 復号装置、復号方法、およびプログラム
US9546924B2 (en) * 2011-06-30 2017-01-17 Telefonaktiebolaget Lm Ericsson (Publ) Transform audio codec and methods for encoding and decoding a time segment of an audio signal
CN103258539B (zh) * 2012-02-15 2015-09-23 展讯通信(上海)有限公司 一种语音信号特性的变换方法和装置
SG11201405196VA (en) * 2012-02-27 2014-09-26 Ecole Polytech Sample processing device with detachable slide
EP2631906A1 (en) * 2012-02-27 2013-08-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Phase coherence control for harmonic signals in perceptual audio codecs
JP5997592B2 (ja) * 2012-04-27 2016-09-28 株式会社Nttドコモ 音声復号装置
WO2014021318A1 (ja) * 2012-08-01 2014-02-06 独立行政法人産業技術総合研究所 音声分析合成のためのスペクトル包絡及び群遅延の推定システム及び音声信号の合成システム
CN104103276B (zh) * 2013-04-12 2017-04-12 北京天籁传音数字技术有限公司 一种声音编解码装置及其方法
KR101732059B1 (ko) * 2013-05-15 2017-05-04 삼성전자주식회사 오디오 신호의 부호화, 복호화 방법 및 장치
EP3080640B1 (en) * 2013-12-11 2019-11-20 Airbus (Sas) Phase retrieval algorithm for generation of constant time envelope with prescribed fourier transform magnitude signal

Also Published As

Publication number Publication date
CN107517593A (zh) 2017-12-26
CN107517593B (zh) 2021-03-12
EP3262639B1 (en) 2020-10-07
ES2837107T3 (es) 2021-06-29
US10373623B2 (en) 2019-08-06
JP2018510374A (ja) 2018-04-12
RU2679254C1 (ru) 2019-02-06
MX2017010593A (es) 2018-05-07
CA2976864C (en) 2020-07-14
EP3262639A1 (en) 2018-01-03
WO2016135132A1 (en) 2016-09-01
KR20170125058A (ko) 2017-11-13
JP6668372B2 (ja) 2020-03-18
US20170345433A1 (en) 2017-11-30
CA2976864A1 (en) 2016-09-01
KR102125410B1 (ko) 2020-06-22
BR112017018145B1 (pt) 2023-11-28

Similar Documents

Publication Publication Date Title
BR112017018145A2 (pt) aparelho e método para processamento de um sinal de áudio para obter um sinal de áudio processado utilizando um envelope de domínio de tempo alvo
BR112016029895A2 (pt) decodificador e método para decodificação de um sinal de áudio, codificador e método para codificação de um sinal de áudio
EP3419200B8 (en) Method, apparatus, computer program and system for determining information related to the audience of an audio-visual content program
EP3520318A4 (en) CONFIDENCE CALCULATION METHOD AND APPARATUS
GB2546906A (en) Data processing apparatus and method using programmable significance data
BR112018076658A2 (pt) método e dispositivo de processamento distribuído de dados em streaming
BR112016021654A2 (pt) Sistema para determinar a condição física de uma usuária, dispositivo, e, método para determinar uma condição física de uma usuária
BR112018002979A2 (pt) controle de sinal-alvo de banda alta
BR112015018912A2 (pt) método e dispositivo para identificar comportamento de usuário
BR112015007625A2 (pt) aparelho, método de geração de uma medida de interferência de áudio e produto de programa de computador
BR112013020378A2 (pt) método de precaução de medição de rsrq de ue para coordenação de interferência
BR112017008901A2 (pt) método implementado por computador, sistema, aparelho e meio legível por computador não-transitório
BR112018003599A2 (pt) método de coleta de dados de sonda e dispositivo para coleta de dados de sonda
MX2016004865A (es) Metodo y dispositivo para analizar relacion social.
BR112015032174A2 (pt) escalador de tempo, descodificador de áudio, método e um programa de computador utilizando um controle de qualidade
EP4246926A3 (en) Domain name operation verification code generation and/or verification
WO2014200912A3 (en) Mathematical processes for determination of peptidase cleavage
BR112016017406A2 (pt) Método e dispositivo para determinar um modelo ambiental de dimensão n+1 e aparelho de prospecção
BR112017000852A2 (pt) ?aparelho e método para gerar um sinal melhorado utilizando enchimento de ruído independente?.
BR112017007304A2 (pt) método e dispositivo para processar dinheiro eletrônico
BR112017023431A2 (pt) método para estimar saturação de água em medições eletromagnéticas
BR112017020011A2 (pt) método e aparelho para realizar detecção de posição de passagem de entrada em tempo real
GB2532940A8 (en) Method of and apparatus for providing an output surface in a data processing system
BR112015016294A2 (pt) dispositivo e método para criação de terminal
MX2017001239A (es) Procesador y metodo para el procesamiento de una senal de audio por el uso del analisis truncado o las porciones de solapamiento de la ventana de sintesis.

Legal Events

Date Code Title Description
B06U Preliminary requirement: requests with searches performed by other patent offices: procedure suspended [chapter 6.21 patent gazette]
B09A Decision: intention to grant [chapter 9.1 patent gazette]
B16A Patent or certificate of addition of invention granted [chapter 16.1 patent gazette]

Free format text: PRAZO DE VALIDADE: 20 (VINTE) ANOS CONTADOS A PARTIR DE 23/02/2016, OBSERVADAS AS CONDICOES LEGAIS