MX2015009171A - Ajuste de nivel en el dominio del tiempo para descodificacion o codificacion de señales de audio. - Google Patents

Ajuste de nivel en el dominio del tiempo para descodificacion o codificacion de señales de audio.

Info

Publication number
MX2015009171A
MX2015009171A MX2015009171A MX2015009171A MX2015009171A MX 2015009171 A MX2015009171 A MX 2015009171A MX 2015009171 A MX2015009171 A MX 2015009171A MX 2015009171 A MX2015009171 A MX 2015009171A MX 2015009171 A MX2015009171 A MX 2015009171A
Authority
MX
Mexico
Prior art keywords
audio signal
time
level shift
representation
frequency band
Prior art date
Application number
MX2015009171A
Other languages
English (en)
Other versions
MX346358B (es
Inventor
Matthias Neusinger
Markus Lohwasser
Manuel Jander
Bernhard Neugebauer
Stephan Schreiner
Arne Borsum
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of MX2015009171A publication Critical patent/MX2015009171A/es
Publication of MX346358B publication Critical patent/MX346358B/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0017Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0018Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0224Processing in the time domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • G10L21/0332Details of processing therefor involving modification of waveforms
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • G10L21/034Automatic adjustment

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)

Abstract

Un descodificador de señales de audio (100) para proporcionar una representación de señal de audio descodificada con base en una representación de señal de audio codificada comprende una etapa de pre-procesamiento de descodificador (110) para obtener una pluralidad de señales de banda de frecuencia de la representación de la señal de audio codificada, un estimador de recorte (120), un cambiador de nivel (130), un convertidor del dominio de la frecuencia al dominio del tiempo (140), y un compensador de cambio de nivel (150). El estimador de recorte (120) analiza la representación de señal de audio codificada y/o la información adicional con respecto a una ganancia de las señales de banda de frecuencia a fin de determinar un factor de cambio de nivel actual. El cambiador de nivel (130) cambia los niveles de las señales de banda de frecuencia de acuerdo al factor de cambio de nivel. El convertidor del dominio de la frecuencia al dominio del tiempo (140) convierte las señales de banda de frecuencia cambiadas de nivel a una representación en el dominio del tiempo. El compensador de cambio de nivel (150) actúa sobre la representación en el dominio del tiempo para compensar al menos parcialmente un cambio de nivel correspondiente y para obtener una representación en el dominio del tiempo sustancialmente compensada.
MX2015009171A 2013-01-18 2014-01-07 Ajuste de nivel en el dominio del tiempo para descodificacion o codificacion de señales de audio. MX346358B (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP13151910.0A EP2757558A1 (en) 2013-01-18 2013-01-18 Time domain level adjustment for audio signal decoding or encoding
PCT/EP2014/050171 WO2014111290A1 (en) 2013-01-18 2014-01-07 Time domain level adjustment for audio signal decoding or encoding

Publications (2)

Publication Number Publication Date
MX2015009171A true MX2015009171A (es) 2015-11-09
MX346358B MX346358B (es) 2017-03-15

Family

ID=47603376

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2015009171A MX346358B (es) 2013-01-18 2014-01-07 Ajuste de nivel en el dominio del tiempo para descodificacion o codificacion de señales de audio.

Country Status (11)

Country Link
US (1) US9830915B2 (es)
EP (2) EP2757558A1 (es)
JP (1) JP6184519B2 (es)
KR (2) KR101953648B1 (es)
CN (1) CN105210149B (es)
BR (1) BR112015017293B1 (es)
CA (1) CA2898005C (es)
ES (1) ES2604983T3 (es)
MX (1) MX346358B (es)
RU (1) RU2608878C1 (es)
WO (1) WO2014111290A1 (es)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006047600A1 (en) 2004-10-26 2006-05-04 Dolby Laboratories Licensing Corporation Calculating and adjusting the perceived loudness and/or the perceived spectral balance of an audio signal
TWI447709B (zh) 2010-02-11 2014-08-01 Dolby Lab Licensing Corp 用以非破壞地正常化可攜式裝置中音訊訊號響度之系統及方法
CN103325380B (zh) 2012-03-23 2017-09-12 杜比实验室特许公司 用于信号增强的增益后处理
US10844689B1 (en) 2019-12-19 2020-11-24 Saudi Arabian Oil Company Downhole ultrasonic actuator system for mitigating lost circulation
CN112185400A (zh) 2012-05-18 2021-01-05 杜比实验室特许公司 用于维持与参数音频编码器相关联的可逆动态范围控制信息的系统
EP2757558A1 (en) * 2013-01-18 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Time domain level adjustment for audio signal decoding or encoding
EP2901449B1 (en) 2013-01-21 2018-01-03 Dolby Laboratories Licensing Corporation Audio encoder and decoder with program loudness and boundary metadata
BR112015017064B1 (pt) 2013-01-21 2022-03-22 Dolby Laboratories Licensing Corporation Método, meio legível em computador e aparelho para otimizar o nível de intensidade do som e a faixa dinâmica através de dispositivos de reprodução diferentes
CN110379434B (zh) 2013-02-21 2023-07-04 杜比国际公司 用于参数化多声道编码的方法
CN104080024B (zh) 2013-03-26 2019-02-19 杜比实验室特许公司 音量校平器控制器和控制方法以及音频分类器
CN110083714B (zh) 2013-04-05 2024-02-13 杜比实验室特许公司 用于自动文件检测的对来自基于文件的媒体的特有信息的获取、恢复和匹配
TWM487509U (zh) 2013-06-19 2014-10-01 杜比實驗室特許公司 音訊處理設備及電子裝置
CN108364657B (zh) 2013-07-16 2020-10-30 超清编解码有限公司 处理丢失帧的方法和解码器
WO2015038522A1 (en) 2013-09-12 2015-03-19 Dolby Laboratories Licensing Corporation Loudness adjustment for downmixed audio content
EP3044876B1 (en) 2013-09-12 2019-04-10 Dolby Laboratories Licensing Corporation Dynamic range control for a wide variety of playback environments
KR20160090796A (ko) * 2013-11-27 2016-08-01 마이크로칩 테크놀로지 인코포레이티드 메인 클록의 높은 정밀 발진기
CN110808723A (zh) 2014-05-26 2020-02-18 杜比实验室特许公司 音频信号响度控制
CN106683681B (zh) * 2014-06-25 2020-09-25 华为技术有限公司 处理丢失帧的方法和装置
EP4060661B1 (en) 2014-10-10 2024-04-24 Dolby Laboratories Licensing Corporation Transmission-agnostic presentation-based program loudness
WO2016129412A1 (ja) * 2015-02-10 2016-08-18 ソニー株式会社 送信装置、送信方法、受信装置および受信方法
CN104795072A (zh) * 2015-03-25 2015-07-22 无锡天脉聚源传媒科技有限公司 一种音频数据的编码方法及装置
CN105662706B (zh) * 2016-01-07 2018-06-05 深圳大学 增强时域表达的人工耳蜗信号处理方法及系统
CN109328382B (zh) * 2016-06-22 2023-06-16 杜比国际公司 用于将数字音频信号从第一频域变换到第二频域的音频解码器及方法
KR102565447B1 (ko) * 2017-07-26 2023-08-08 삼성전자주식회사 청각 인지 속성에 기반하여 디지털 오디오 신호의 이득을 조정하는 전자 장치 및 방법
US10942914B2 (en) * 2017-10-19 2021-03-09 Adobe Inc. Latency optimization for digital asset compression
US11120363B2 (en) 2017-10-19 2021-09-14 Adobe Inc. Latency mitigation for encoding data
US11086843B2 (en) 2017-10-19 2021-08-10 Adobe Inc. Embedding codebooks for resource optimization
EP3483879A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis windowing function for modulated lapped transformation
EP3483886A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selecting pitch lag
WO2019091576A1 (en) 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits
EP3483882A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Controlling bandwidth in encoders and/or decoders
EP3483884A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Signal filtering
EP3483878A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder supporting a set of different loss concealment tools
US10331400B1 (en) * 2018-02-22 2019-06-25 Cirrus Logic, Inc. Methods and apparatus for soft clipping
CN109286922B (zh) * 2018-09-27 2021-09-17 珠海市杰理科技股份有限公司 蓝牙提示音处理方法、系统、可读存储介质和蓝牙设备
US11930347B2 (en) * 2019-02-13 2024-03-12 Dolby Laboratories Licensing Corporation Adaptive loudness normalization for audio object clustering
US11322127B2 (en) * 2019-07-17 2022-05-03 Silencer Devices, LLC. Noise cancellation with improved frequency resolution
CN111342937B (zh) * 2020-03-17 2022-05-06 北京百瑞互联技术有限公司 一种动态调整编解码处理器电压和/或频率的方法和装置

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU4265796A (en) 1994-12-15 1996-07-03 British Telecommunications Public Limited Company Speech processing
US6280309B1 (en) 1995-10-19 2001-08-28 Norton Company Accessories and attachments for angle grinder
US5796842A (en) * 1996-06-07 1998-08-18 That Corporation BTSC encoder
US6289309B1 (en) * 1998-12-16 2001-09-11 Sarnoff Corporation Noise spectrum tracking for speech enhancement
JP3681105B2 (ja) * 2000-02-24 2005-08-10 アルパイン株式会社 データ処理方式
JP4907826B2 (ja) * 2000-02-29 2012-04-04 クゥアルコム・インコーポレイテッド 閉ループのマルチモードの混合領域の線形予測音声コーダ
US6651040B1 (en) * 2000-05-31 2003-11-18 International Business Machines Corporation Method for dynamic adjustment of audio input gain in a speech system
CA2359771A1 (en) * 2001-10-22 2003-04-22 Dspfactory Ltd. Low-resource real-time audio synthesis system and method
JP2003280691A (ja) * 2002-03-19 2003-10-02 Sanyo Electric Co Ltd 音声処理方法および音声処理装置
US20050004793A1 (en) * 2003-07-03 2005-01-06 Pasi Ojala Signal adaptation for higher band coding in a codec utilizing band split coding
DE10345995B4 (de) 2003-10-02 2005-07-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Verarbeiten eines Signals mit einer Sequenz von diskreten Werten
US7751572B2 (en) * 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
RU2008112137A (ru) * 2005-09-30 2009-11-10 Панасоник Корпорэйшн (Jp) Устройство кодирования речи и способ кодирования речи
DE102006022346B4 (de) * 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalcodierung
WO2008100100A1 (en) * 2007-02-14 2008-08-21 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
US9653088B2 (en) * 2007-06-13 2017-05-16 Qualcomm Incorporated Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding
US8126578B2 (en) * 2007-09-26 2012-02-28 University Of Washington Clipped-waveform repair in acoustic signals using generalized linear prediction
EP2225827B1 (en) * 2007-12-11 2013-05-01 Nxp B.V. Prevention of audio signal clipping
CN101350199A (zh) * 2008-07-29 2009-01-21 北京中星微电子有限公司 音频编码器及音频编码方法
EP4293665A3 (en) * 2008-10-29 2024-01-10 Dolby International AB Signal clipping protection using pre-existing audio gain metadata
US8346547B1 (en) * 2009-05-18 2013-01-01 Marvell International Ltd. Encoder quantization architecture for advanced audio coding
AU2011311543B2 (en) * 2010-10-07 2015-05-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V Apparatus and method for level estimation of coded audio frames in a bit stream domain
TW202339510A (zh) * 2011-07-01 2023-10-01 美商杜比實驗室特許公司 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
RU2586874C1 (ru) * 2011-12-15 2016-06-10 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Устройство, способ и компьютерная программа для устранения артефактов амплитудного ограничения
EP2757558A1 (en) * 2013-01-18 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Time domain level adjustment for audio signal decoding or encoding

Also Published As

Publication number Publication date
EP2946384A1 (en) 2015-11-25
KR101953648B1 (ko) 2019-05-23
RU2608878C1 (ru) 2017-01-25
US20160019898A1 (en) 2016-01-21
EP2946384B1 (en) 2016-11-02
CA2898005A1 (en) 2014-07-24
CA2898005C (en) 2018-08-14
US9830915B2 (en) 2017-11-28
ES2604983T3 (es) 2017-03-10
EP2757558A1 (en) 2014-07-23
CN105210149A (zh) 2015-12-30
JP6184519B2 (ja) 2017-08-23
KR20150106929A (ko) 2015-09-22
BR112015017293A2 (pt) 2018-05-15
WO2014111290A1 (en) 2014-07-24
MX346358B (es) 2017-03-15
CN105210149B (zh) 2019-08-30
BR112015017293B1 (pt) 2021-12-21
JP2016505168A (ja) 2016-02-18
KR20170104661A (ko) 2017-09-15

Similar Documents

Publication Publication Date Title
MX346358B (es) Ajuste de nivel en el dominio del tiempo para descodificacion o codificacion de señales de audio.
MY160467A (en) Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
MY194835A (en) Audio or Video Encoder, Audio or Video Decoder and Related Methods for Processing Multi-Channel Audio of Video Signals Using a Variable Prediction Direction
MX2011007925A (es) Codificador de audio, decodificador de audio, información de audio codificada, métodos para la codificación y decodificación de una señal de audio y programa de computadora.
WO2011049416A3 (en) Apparatus and method encoding/decoding with phase information and residual information
WO2012016128A3 (en) Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals
TR201908748T4 (tr) Ses cihazları için kombine dinamik aralıklı sıkıştırma ve kılavuzlu kırpma önlemeye ilişkin konsept.
MX2019005799A (es) Estimacion del ruido de fondo en las señales de audio.
MX358306B (es) Decodificador, codificador y método para la estimación informada de sonoridad en sistemas de codificación basada en objetos.
WO2009131703A3 (en) Coding of depth signal
MY166169A (en) Audio signal encoder,audio signal decoder,method for encoding or decoding an audio signal using an aliasing-cancellation
WO2014009878A3 (en) Encoding and decoding of audio signals
MX356334B (es) Decodificador de audio y método para proveer una información de audio decodificada usando un ocultamiento de error sobre la base de una señal de excitación de dominio de tiempo.
WO2011029048A3 (en) Pitch estimation and audio source separation
WO2008129645A1 (ja) ピーク抑圧方法
GB2532379A (en) Wind noise reduction
MY187728A (en) Method and system for encoding audio data with adaptive low frequency compensation
MX345622B (es) Decodificador para generar una señal de audio mejorada en frecuencia, método de decodificación, codificador para generar una señal codificada y metodo de codificación utilizando informacion secundaria de selección compacta.
PH12016500470A1 (en) Gain shape estimation for improved tracking of high-band temporal characteristics
TR201815245T4 (tr) Armonik ses sinyallerinin dönüşüm kodlaması / kod çözümü.
WO2010090427A3 (ko) 오디오 신호의 부호화 및 복호화 방법 및 그 장치
GB2562818A (en) Non-linearity cancellation in a dual-path ADC
MX2015010225A (es) Sistemas y metodos para realizar modulacion de ruido y ajuste de ganancia.
MX2016010129A (es) Decodificacion de señal de frecuencia modulada y amplitud modulada combinadas.
FI20115760A (fi) Paikan arviointi- ja tiedonsiirtomenetelmä

Legal Events

Date Code Title Description
FG Grant or registration